2025-12-25

Title: Dominating vs. Dominated: Generative Collapse in Diffusion Models

Title: SA-DiffuSeq: Addressing Computational and Scalability Challenges in Long-Document Generation with Sparse Attention

Title: VL4Gaze: Unleashing Vision-Language Models for Gaze Following

Title: TS-Arena Technical Report -- A Pre-registered Live Forecasting Platform

Title: Improving Matrix Exponential for Generative AI Flows: A Taylor-Based Approach Beyond Paterson--Stockmeyer

Title: Memory-Efficient Acceleration of Block Low-Rank Foundation Models on Resource Constrained GPUs

Title: Beyond Weight Adaptation: Feature-Space Domain Injection for Cross-Modal Ship Re-Identification

Title: DiEC: Diffusion Embedded Clustering

Title: Self-supervised Multiplex Consensus Mamba for General Image Fusion

Title: Beyond Artifacts: Real-Centric Envelope Modeling for Reliable AI-Generated Image Detection

Title: Foundation Model-based Evaluation of Neuropsychiatric Disorders: A Lifespan-Inclusive, Multi-Modal, and Multi-Lingual Study

Title: Generalization of Diffusion Models Arises with a Balanced Representation Space

Title: Neutralization of IMU-Based GPS Spoofing Detection using external IMU sensor and feedback methodology

Title: X-ray Insights Unleashed: Pioneering the Enhancement of Multi-Label Long-Tail Data

Title: PUFM++: Point Cloud Upsampling via Enhanced Flow Matching

Title: Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations

Title: FluencyVE: Marrying Temporal-Aware Mamba with Bypass Attention for Video Editing

Title: Multi-Attribute guided Thermal Face Image Translation based on Latent Diffusion Model

Title: Next-Scale Prediction: A Self-Supervised Approach for Real-World Image Denoising

Title: DexAvatar: 3D Sign Language Reconstruction with Hand and Body Pose Priors

Title: Beyond Pixel Simulation: Pathology Image Generation via Diagnostic Semantic Tokens and Prototype Control

Title: Multimodal Skeleton-Based Action Representation Learning via Decomposition and Composition

Title: FreeInpaint: Tuning-free Prompt Alignment and Visual Rationality Enhancement in Image Inpainting

Title: STLDM: Spatio-Temporal Latent Diffusion Model for Precipitation Nowcasting

Title: A Turn Toward Better Alignment: Few-Shot Generative Adaptation with Equivariant Feature Rotation

Title: UltraShape 1.0: High-Fidelity 3D Shape Generation via Scalable Geometric Refinement

Title: SpidR-Adapt: A Universal Speech Representation Model for Few-Shot Adaptation

Title: AnyAD: Unified Any-Modality Anomaly Detection in Incomplete Multi-Sequence MRI

Title: ACD: Direct Conditional Control for Video Diffusion Models via Attention Supervision

Title: GriDiT: Factorized Grid-Based Diffusion for Efficient Long Image Sequence Generation

Title: Surgical Scene Segmentation using a Spike-Driven Video Transformer with Real-Time Potential

Title: Transcriptome-Conditioned Personalized De Novo Drug Generation for AML Using Metaheuristic Assembly and Target-Driven Filtering

Title: TICON: A Slide-Level Tile Contextualizer for Histopathology Representation Learning

Title: Fast SAM2 with Text-Driven Token Pruning

Title: Optimizing Decoding Paths in Masked Diffusion Models by Quantifying Uncertainty

Title: HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming