2025-09-23

Title: Stabilizing Information Flow Entropy: Regularization for Safe and Interpretable Autonomous Driving Perception

Title: Estimating Clinical Lab Test Result Trajectories from PPG using Physiological Foundation Model and Patient-Aware State Space Model -- a UNIPHY+ Approach

Title: From Canopy to Ground via ForestGen3D: Learning Cross-Domain Generation of 3D Forest Structure from Aerial-to-Terrestrial LiDAR

Title: Guided Sequence-Structure Generative Modeling for Iterative Antibody Optimization

Title: Introducing Resizable Region Packing Problem in Image Generation, with a Heuristic Solution

Title: StereoAdapter: Adapting Stereo Depth Estimation to Underwater Scenes

Title: TractoTransformer: Diffusion MRI Streamline Tractography using CNN and Transformer Networks

Title: Improved mmFormer for Liver Fibrosis Staging via Missing-Modality Compensation

Title: Local Mechanisms of Compositional Generalization in Conditional Diffusion

Title: Implicit Behavioral Alignment of Language Agents in High-Stakes Crowd Simulations

Title: Cross-Corpus and Cross-domain Handwriting Assessment of NeuroDegenerative Diseases via Time-Series-to-Image Conversion

Title: Octree Latent Diffusion for Semantic 3D Scene Generation and Completion

Title: FairTune: A Bias-Aware Fine-Tuning Framework Towards Fair Heart Rate Prediction from PPG

Title: A Closer Look at Model Collapse: From a Generalization-to-Memorization Perspective

Title: RLGF: Reinforcement Learning with Geometric Feedback for Autonomous Driving Video Generation

Title: OS-DiffVSR: Towards One-step Latent Diffusion Model for High-detailed Real-world Video Super-Resolution

Title: SlowFast-SCI: Slow-Fast Deep Unfolding Learning for Spectral Compressive Imaging

Title: FG-Attn: Leveraging Fine-Grained Sparsity In Diffusion Transformers

Title: Efficient Rectified Flow for Image Fusion

Title: ViTCAE: ViT-based Class-conditioned Autoencoder

Title: V-CECE: Visual Counterfactual Explanations via Conceptual Edits

Title: A Novel Metric for Detecting Memorization in Generative Models for Brain MRI Synthesis

Title: Near-Optimal Sample Complexity Bounds for Constrained Average-Reward MDPs

Title: SQS: Enhancing Sparse Perception Models via Query-based Splatting in Autonomous Driving

Title: FakeChain: Exposing Shallow Cues in Multi-Step Deepfake Detection

Title: Detection and Simulation of Urban Heat Islands Using a Fine-Tuned Geospatial Foundation Model

Title: Self-Supervised Learning of Graph Representations for Network Intrusion Detection

Title: Follow-Your-Emoji-Faster: Towards Efficient, Fine-Controllable, and Expressive Freestyle Portrait Animation

Title: Unlocking Hidden Potential in Point Cloud Networks with Attention-Guided Grouping-Feature Coordination

Title: InstanceAssemble: Layout-Aware Image Generation via Instance Assembling Attention

Title: Animalbooth: multimodal feature enhancement for animal subject personalization

Title: A Multi-Level Benchmark for Causal Language Understanding in Social Media Discourse

Title: Pain in 3D: Generating Controllable Synthetic Faces for Automated Pain Assessment

Title: Discrete Diffusion Models: Novel Analysis and New Sampler Guarantees

Title: DiffEye: Diffusion-Based Continuous Eye-Tracking Data Generation Conditioned on Natural Images

Title: MMPart: Harnessing Multi-Modal Large Language Models for Part-Aware 3D Generation

Title: Looking in the mirror: A faithful counterfactual explanation method for interpreting deep image classification models

Title: $\mathtt{M^3VIR}$: A Large-Scale Multi-Modality Multi-View Synthesized Benchmark Dataset for Image Restoration and Content Creation

Title: PRISM: Precision-Recall Informed Data-Free Knowledge Distillation via Generative Diffusion

Title: Parameter-efficient fine-tuning (PEFT) of Vision Foundation Models for Atypical Mitotic Figure Classification

Title: VidCLearn: A Continual Learning Approach for Text-to-Video Generation

Title: Penalizing Boundary Activation for Object Completeness in Diffusion Models

Title: VCE: Safe Autoregressive Image Generation via Visual Contrast Exploitation

Title: Advancing Speech Understanding in Speech-Aware Language Models with GRPO

Title: When Color-Space Decoupling Meets Diffusion for Adverse-Weather Image Restoration

Title: Geodesic Prototype Matching via Diffusion Maps for Interpretable Fine-Grained Recognition

Title: TSGym: Design Choices for Deep Multivariate Time-Series Forecasting

Title: Informative Text-Image Alignment for Visual Affordance Learning with Foundation Models

Title: AlignedGen: Aligning Style Across Generated Images

Title: Uncertainty-Supervised Interpretable and Robust Evidential Segmentation

Title: ScenGAN: Attention-Intensive Generative Model for Uncertainty-Aware Renewable Scenario Forecasting

Title: Stencil: Subject-Driven Generation with Context Guidance

Title: SynergyNet: Fusing Generative Priors and State-Space Models for Facial Beauty Prediction

Title: Ambiguous Medical Image Segmentation Using Diffusion Schrödinger Bridge

Title: Echo-Path: Pathology-Conditioned Echo Video Generation

Title: SignalLLM: A General-Purpose LLM Agent Framework for Automated Signal Processing

Title: Conditional Policy Generator for Dynamic Constraint Satisfaction and Optimization

Title: Guided and Unguided Conditional Diffusion Mechanisms for Structured and Semantically-Aware 3D Point Cloud Generation

Title: DT-NeRF: A Diffusion and Transformer-Based Optimization Approach for Neural Radiance Fields in 3D Reconstruction

Title: Prospective Multi-Graph Cohesion for Multivariate Time Series Anomaly Detection

Title: SPFSplatV2: Efficient Self-Supervised Pose-Free 3D Gaussian Splatting from Sparse Views

Title: Graph Signal Generative Diffusion Models

Title: GraphWeave: Interpretable and Robust Graph Generation via Random Walk Trajectories

Title: DepTR-MOT: Unveiling the Potential of Depth-Informed Trajectory Refinement for Multi-Object Tracking

Title: Scale-free Characteristics of Multilingual Legal Texts and the Limitations of LLMs

Title: Diff-GNSS: Diffusion-based Pseudorange Error Estimation

Title: Robust Anomaly Detection Under Normality Distribution Shift in Dynamic Graphs

Title: Efficient Sliced Wasserstein Distance Computation via Adaptive Bayesian Optimization

Title: Single-Image Depth from Defocus with Coded Aperture and Diffusion Posterior Sampling

Title: Emergent 3D Correspondence from Neural Shape Representation

Title: MedFact: A Large-scale Chinese Dataset for Evidence-based Medical Fact-checking of LLM Responses

Title: Training-Free Label Space Alignment for Universal Domain Adaptation

Title: CARINOX: Inference-time Scaling with Category-Aware Reward-based Initial Noise Optimization and Exploration

Title: Periodic Graph-Enhanced Multivariate Time Series Anomaly Detector

Title: Stable Video-Driven Portraits

Title: Multimodal Medical Image Classification via Synergistic Learning Pre-training

Title: Leveraging Audio-Visual Data to Reduce the Multilingual Gap in Self-Supervised Speech Models

Title: Can LLMs Reason Over Non-Text Modalities in a Training-Free Manner? A Case Study with In-Context Representation Learning

Title: Visual Instruction Pretraining for Domain-Specific Foundation Models

Title: MRN: Harnessing 2D Vision Foundation Models for Diagnosing Parkinson's Disease with Limited 3D MR Data

Title: From Benchmarks to Reality: Advancing Visual Anomaly Detection by the VAND 3.0 Challenge

Title: OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models

Title: SISMA: Semantic Face Image Synthesis with Mamba

Title: Clothing agnostic Pre-inpainting Virtual Try-ON

Title: Development and validation of an AI foundation model for endoscopic diagnosis of esophagogastric junction adenocarcinoma: a cohort and deep learning study

Title: Tailored Transformation Invariance for Industrial Anomaly Detection

Title: DINOv3-Diffusion Policy: Self-Supervised Large Visual Model for Visuomotor Diffusion Policy Learning

Title: A Generative Conditional Distribution Equality Testing Framework and Its Minimax Analysis

Title: Breaking Token Into Concepts: Exploring Extreme Compression in Token Representation Via Compositional Shared Semantics

Title: WISE: Weak-Supervision-Guided Step-by-Step Explanations for Multimodal LLMs in Image Classification

Title: GEM-T: Generative Tabular Data via Fitting Moments

Title: Multi-Agent Amodal Completion: Direct Synthesis with Fine-Grained Semantic Guidance

Title: Qwen3-Omni Technical Report

Title: I2VWM: Robust Watermarking for Image to Video Generation

Title: Elucidating the Design Space of FP4 training

Title: Enhancing Semantic Segmentation with Continual Self-Supervised Pre-training

Title: ContextFlow: Training-Free Video Object Editing via Adaptive Context Enrichment

Title: Semantic and Visual Crop-Guided Diffusion Models for Heterogeneous Tissue Synthesis in Histopathology

Title: Unsupervised Learning and Representation of Mandarin Tonal Categories by a Generative CNN

Title: Deep Hierarchical Learning with Nested Subspace Networks

Title: Optimizing Inference in Transformer-Based Models: A Multi-Method Benchmark

Title: SingLEM: Single-Channel Large EEG Model

Title: Medical priority fusion: achieving dual optimization of sensitivity and interpretability in nipt anomaly detection

Title: StefaLand: An Efficient Geoscience Foundation Model That Improves Dynamic Land-Surface Predictions

Title: Can multimodal representation learning by alignment preserve modality-specific information?

Title: Budgeted Adversarial Attack against Graph-Based Anomaly Detection in Sensor Networks

Title: StableGuard: Towards Unified Copyright Protection and Tamper Localization in Latent Diffusion Models

Title: Variation in Verification: Understanding Verification Dynamics in Large Language Models

Title: Synth-MIA: A Testbed for Auditing Privacy Leakage in Tabular Data Synthesis

Title: Hybrid Reputation Aggregation: A Robust Defense Mechanism for Adversarial Federated Learning in 5G and Edge Network Environments

Title: Spiffy: Multiplying Diffusion LLM Acceleration via Lossless Speculative Decoding

Title: ComposeMe: Attribute-Specific Image Prompts for Controllable Human Image Generation

Title: Seg4Diff: Unveiling Open-Vocabulary Segmentation in Text-to-Image Diffusion Transformers