2025-12-30

Title: Towards Unsupervised Causal Representation Learning via Latent Additive Noise Model Causal Autoencoders

Title: Characterizing Motion Encoding in Video Diffusion Timesteps

Title: Wireless Traffic Prediction with Large Language Model

Title: Latent Sculpting for Zero-Shot Generalization: A Manifold Learning Approach to Out-of-Distribution Anomaly Detection

Title: DiRL: An Efficient Post-Training Framework for Diffusion Language Models

Title: Meta-information Guided Cross-domain Synergistic Diffusion Model for Low-dose PET Reconstruction

Title: Interpretable Perturbation Modeling Through Biomedical Knowledge Graphs

Title: Graph Attention-based Adaptive Transfer Learning for Link Prediction

Title: Human-Aligned Generative Perception: Bridging Psychophysics and Generative Models

Title: The Illusion of Clinical Reasoning: A Benchmark Reveals the Pervasive Gap in Vision-Language Models for Clinical Competency

Title: Hierarchical Stacking Optimization Using Dirichlet's Process (SoDip): Towards Accelerated Design for Graft Polymerization

Title: Cluster Aggregated GAN (CAG): A Cluster-Based Hybrid Model for Appliance Pattern Generation

Title: Co-GRPO: Co-Optimized Group Relative Policy Optimization for Masked Diffusion Model

Title: Multi-Head Spectral-Adaptive Graph Anomaly Detection

Title: LLA: Enhancing Security and Privacy for Generative Models with Logic-Locked Accelerators

Title: LangPrecip: Language-Aware Multimodal Precipitation Nowcasting

Title: SpotEdit: Selective Region Editing in Diffusion Transformers

Title: DeMoGen: Towards Decompositional Human Motion Generation with Energy-Based Diffusion Models

Title: Self-Evaluation Unlocks Any-Step Text-to-Image Generation

Title: The Syntax of qulk-clauses in Yemeni Ibbi Arabic: A Minimalist Approach

Title: DeFloMat: Detection with Flow Matching for Stable and Efficient Generative Object Localization

Title: Bright 4B: Scaling Hyperspherical Learning for Segmentation in 3D Brightfield Microscopy

Title: SAM 3D for 3D Object Reconstruction from Remote Sensing Images

Title: Pose-Guided Residual Refinement for Interpretable Text-to-Motion Generation and Editing

Title: Tracking by Predicting 3-D Gaussians Over Time

Title: Decomposing Task Vectors for Refined Model Editing

Title: Energy-Guided Flow Matching Enables Few-Step Conformer Generation and Ground-State Identification

Title: Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone

Title: Rethinking Memory Design in SAM-Based Visual Object Tracking

Title: Envision: Embodied Visual Planning via Goal-Imagery Video Diffusion

Title: On the Role of Discreteness in Diffusion LLMs

Title: Visual Autoregressive Modelling for Monocular Depth Estimation

Title: Quantum Generative Models for Computational Fluid Dynamics: A First Exploration of Latent Space Learning in Lattice Boltzmann Simulations

Title: CritiFusion: Semantic Critique and Spectral Alignment for Faithful Text-to-Image Generation

Title: SCPainter: A Unified Framework for Realistic 3D Asset Insertion and Novel View Synthesis

Title: Improved cystic hygroma detection from prenatal imaging using ultrasound-specific self-supervised representation learning

Title: WeDLM: Reconciling Diffusion Language Models with Standard Causal Attention for Fast Inference

Title: GRExplainer: A Universal Explanation Method for Temporal Graph Neural Networks

Title: Parallel Diffusion Solver via Residual Dirichlet Policy Optimization

Title: ReDiF: Reinforced Distillation for Few Step Diffusion

Title: EgoReAct: Egocentric Video-Driven 3D Human Reaction Generation

Title: ByteLoom: Weaving Geometry-Consistent Human-Object Interactions through Progressive Curriculum Learning

Title: Learning Anatomy from Multiple Perspectives via Self-supervision in Chest Radiographs

Title: M-ErasureBench: A Comprehensive Multimodal Evaluation Benchmark for Concept Erasure in Diffusion Models

Title: Guided Path Sampling: Steering Diffusion Models Back on Track with Principled Path Guidance

Title: DECEPTICON: How Dark Patterns Manipulate Web Agents

Title: Multiple Token Divergence: Measuring and Steering In-Context Computation Density

Title: Reverse Personalization

Title: Toward Stable Semi-Supervised Remote Sensing Segmentation via Co-Guidance and Co-Fusion

Title: 3D sans 3D Scans: Scalable Pre-training from Video-Generated Point Clouds

Title: PI-MFM: Physics-informed multimodal foundation model for solving partial differential equations

Title: TabiBERT: A Large-Scale ModernBERT Foundation Model and Unified Benchmarking Framework for Turkish

Title: Multimodal Functional Maximum Correlation for Emotion Recognition

Title: MedSAM-based lung masking for multi-label chest X-ray classification

Title: Osmotic Learning: A Self-Supervised Paradigm for Decentralized Contextual Data Representation

Title: How Much Data Is Enough? Uniform Convergence Bounds for Generative & Vision-Language Models under Low-Dimensional Structure

Title: PathoSyn: Imaging-Pathology MRI Synthesis via Disentangled Deviation Diffusion

Title: Multi-Agent Framework for Threat Mitigation and Resilience in AI-Based Systems

Title: Diffusion-based Decentralized Federated Multi-Task Representation Learning

Title: GaussianDWM: 3D Gaussian Driving World Model for Unified Scene Understanding and Multi-Modal Generation

Title: Task-oriented Learnable Diffusion Timesteps for Universal Few-shot Learning of Dense Tasks

Title: Anka: A Domain-Specific Language for Reliable LLM Code Generation

Title: Anomaly Detection by Effectively Leveraging Synthetic Images

Title: SURE Guided Posterior Sampling: Trajectory Correction for Diffusion-Based Inverse Problems

Title: Physics-Inspired Modeling and Content Adaptive Routing in an Infrared Gas Leak Detection Network

Title: RS-Prune: Training-Free Data Pruning at High Ratios for Efficient Remote Sensing Diffusion Foundation Models

Title: ASemConsist: Adaptive Semantic Feature Control for Training-Free Identity-Consistent Generation

Title: Plug-and-Play Fidelity Optimization for Diffusion Transformer Acceleration via Cumulative Error Minimization

Title: On the Inverse Flow Matching Problem in the One-Dimensional and Gaussian Cases

Title: Diffusion priors enhanced velocity model building from time-lag images using a neural operator

Title: SoulX-LiveTalk Technical Report

Title: A unified framework for detecting point and collective anomalies in operating system logs via collaborative transformers

Title: SOFTooth: Semantics-Enhanced Order-Aware Fusion for Tooth Instance Segmentation

Title: DriveLaW:Unifying Planning and Video Generation in a Latent Driving World

Title: Direct Diffusion Score Preference Optimization via Stepwise Contrastive Policy-Pair Supervision

Title: Towards Integrating Uncertainty for Domain-Agnostic Segmentation

Title: Stochastic Siamese MAE Pretraining for Longitudinal Medical Images

Title: CoFi-Dec: Hallucination-Resistant Decoding via Coarse-to-Fine Generative Feedback in Large Vision-Language Models

Title: Automated river gauge plate reading using a hybrid object detection and generative AI framework in the Limpopo River Basin

Title: Deterministic Image-to-Image Translation via Denoising Brownian Bridge Models with Dual Approximators

Title: HY-Motion 1.0: Scaling Flow Matching Models for Text-To-Motion Generation

Title: FRoD: Full-Rank Efficient Fine-Tuning with Rotational Degrees for Fast Convergence

Title: IdentityStory: Taming Your Identity-Preserving Generator for Human-Centric Story Generation

Title: Iterative Inference-time Scaling with Adaptive Frequency Steering for Image Super-Resolution

Title: AnyMS: Bottom-up Attention Decoupling for Layout-guided and Training-free Multi-subject Customization

Title: PathFound: An Agentic Multimodal Model Activating Evidence-seeking Pathological Diagnosis

Title: PurifyGen: A Risk-Discrimination and Semantic-Purification Model for Safe Text-to-Image Generation

Title: ThinkGen: Generalized Thinking for Visual Generation

Title: ProGuard: Towards Proactive Multimodal Safeguard

Title: LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation

Title: Distribution-Free Process Monitoring with Conformal Prediction

Title: Memorization in 3D Shape Generation: An Empirical Study

Title: IDT: A Physically Grounded Transformer for Feed-Forward Multi-View Intrinsic Decomposition

Title: Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation

Title: Stream-DiffVSR: Low-Latency Streamable Video Super-Resolution via Auto-Regressive Diffusion