2025-07-03

Title: A Systematic Review of Security Vulnerabilities in Smart Home Devices and Mitigation Techniques

Title: Few-Shot Inspired Generative Zero-Shot Learning

Title: Dual Perspectives on Non-Contrastive Self-Supervised Learning

Title: PathCoT: Chain-of-Thought Prompting for Zero-shot Pathology Visual Reasoning

Title: Sensing Cardiac Health Across Scenarios and Devices: A Multi-Modal Foundation Model Pretrained on Heterogeneous Data from 1.7 Million Individuals

Title: XxaCT-NN: Structure Agnostic Multimodal Learning for Materials Science

Title: Good Enough to Learn: LLM-based Anomaly Detection in ECU Logs without Reliable Labels

Title: Event-based evaluation of abstractive news summarization

Title: Diffusion Explorer: Interactive Exploration of Diffusion Models

Title: Are Large Brainwave Foundation Models Capable Yet? Insights from Fine-tuning

Title: Escaping Plato's Cave: JAM for Aligning Independently Trained Vision and Language Models

Title: Frequency Domain-Based Diffusion Model for Unpaired Image Dehazing

Title: DiffusionLight-Turbo: Accelerated Light Probes for Free via Single-Pass Chrome Ball Inpainting

Title: ICLShield: Exploring and Mitigating In-Context Learning Backdoor Attacks

Title: Reasoner for Real-World Event Detection: Scaling Reinforcement Learning via Adaptive Perplexity-Aware Sampling Strategy

Title: Learning from Random Subspace Exploration: Generalized Test-Time Augmentation with Self-supervised Distillation

Title: Efficient Kilometer-Scale Precipitation Downscaling with Conditional Wavelet Diffusion

Title: Activation Reward Models for Few-Shot Model Alignment

Title: Distributional Soft Actor-Critic with Diffusion Policy

Title: Medical-Knowledge Driven Multiple Instance Learning for Classifying Severe Abdominal Anomalies on Prenatal Ultrasound

Title: CaptionSmiths: Flexibly Controlling Language Pattern in Image Captioning

Title: Decomposing Prediction Mechanisms for In-Context Recall

Title: DocShaDiffusion: Diffusion Model in Latent Space for Document Image Shadow Removal

Title: DiffMark: Diffusion-based Robust Watermark Against Deepfakes

Title: OoDDINO: A Multi-level Framework for Anomaly Segmentation on Complex Road Scenes

Title: NOCTIS: Novel Object Cyclic Threshold based Instance Segmentation

Title: Representation Entanglement for Generation: Training Diffusion Transformers Is Much Easier Than You Think

Title: Evaluating the Effectiveness of Direct Preference Optimization for Personalizing German Automatic Text Simplifications for Persons with Intellectual Disabilities

Title: AVC-DPO: Aligned Video Captioning via Direct Preference Optimization

Title: ReFlex: Text-Guided Editing of Real Images in Rectified Flow via Mid-Step Feature Extraction and Attention Adaptation

Title: Loss Functions in Diffusion Models: A Comparative Study

Title: MARVIS: Modality Adaptive Reasoning over VISualizations

Title: A Gift from the Integration of Discriminative and Diffusion-based Generative Learning: Boundary Refinement Remote Sensing Semantic Segmentation

Title: SketchColour: Channel Concat Guided DiT-based Sketch-to-Colour Pipeline for 2D Animation

Title: DepthSync: Diffusion Guidance-Based Depth Synchronization for Scale- and Geometry-Consistent Video Depth Estimation

Title: Perception-Oriented Latent Coding for High-Performance Compressed Domain Semantic Inference

Title: SAILViT: Towards Robust and Generalizable Visual Backbones for MLLMs via Gradual Feature Refinement

Title: RobuSTereo: Robust Zero-Shot Stereo Matching under Adverse Weather

Title: Graph Representation-based Model Poisoning on Federated LLMs in CyberEdge Networks

Title: LLMs for Legal Subsumption in German Employment Contracts

Title: HOI-Dyn: Learning Interaction Dynamics for Human-Object Motion Diffusion

Title: Calibrated Self-supervised Vision Transformers Improve Intracranial Arterial Calcification Segmentation from Clinical CT Head Scans

Title: SSL4SAR: Self-Supervised Learning for Glacier Calving Front Extraction from SAR Imagery

Title: Enhanced Generative Model Evaluation with Clipped Density and Coverage

Title: FreeLoRA: Enabling Training-Free LoRA Fusion for Autoregressive Multi-Subject Personalization

Title: Towards Decentralized and Sustainable Foundation Model Training with the Edge

Title: Out-of-Distribution Detection Methods Answer the Wrong Questions

Title: Towards Foundation Auto-Encoders for Time-Series Anomaly Detection

Title: Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning

Title: Exploring a Hybrid Deep Learning Approach for Anomaly Detection in Mental Healthcare Provider Billing: Addressing Label Scarcity through Semi-Supervised Anomaly Detection

Title: IC-Custom: Diverse Image Customization via In-Context Learning

Title: Kwai Keye-VL Technical Report

Title: Test-Time Scaling with Reflective Generative Model

Title: FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model

Title: How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks