2025-12-17

Title: Composite Classifier-Free Guidance for Multi-Modal Conditioning in Wind Dynamics Super-Resolution

Title: PIS: A Generalized Physical Inversion Solver for Arbitrary Sparse Observations via Set-Conditioned Diffusion

Title: DARTs: A Dual-Path Robust Framework for Anomaly Detection in High-Dimensional Multivariate Time Series

Title: TF-MCL: Time-frequency Fusion and Multi-domain Cross-Loss for Self-supervised Depression Detection

Title: Why Text Prevails: Vision May Undermine Multimodal Medical Decision Making

Title: STAR: STacked AutoRegressive Scheme for Unified Multimodal Learning

Title: MoLingo: Motion-Language Alignment for Text-to-Motion Generation

Title: Coarse-to-Fine Hierarchical Alignment for UAV-based Human Detection using Diffusion Models

Title: A Complete Guide to Spherical Equivariant Graph Transformers

Title: Informing Acquisition Functions via Foundation Models for Molecular Discovery

Title: Pattern-Guided Diffusion Models

Title: An evaluation of SVBRDF Prediction from Generative Image Models for Appearance Modeling of 3D Scenes

Title: From Unlearning to UNBRANDING: A Benchmark for Trademark-Safe Text-to-Image Generation

Title: Quality-Driven and Diversity-Aware Sample Expansion for Robust Marine Obstacle Segmentation

Title: Repurposing 2D Diffusion Models for 3D Shape Completion

Title: Sparse-LaViDa: Sparse Multimodal Discrete Diffusion Language Models

Title: EXAONE Path 2.5: Pathology Foundation Model with Multi-Omics Alignment

Title: Unleashing the Power of Image-Tabular Self-Supervised Learning via Breaking Cross-Tabular Barriers

Title: A Deep Dive into Function Inlining and its Security Implications for ML-based Binary Analysis

Title: FacEDiT: Unified Talking Face Editing and Generation via Facial Motion Infilling

Title: Bridging Fidelity-Reality with Controllable One-Step Diffusion for Image Super-Resolution

Title: Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed

Title: SDAR-VL: Stable and Efficient Block-wise Diffusion for Vision-Language Understanding

Title: FusAD: Time-Frequency Fusion with Adaptive Denoising for General Time Series Analysis

Title: Derivative-Informed Fourier Neural Operator: Universal Approximation and Applications to PDE-Constrained Optimization

Title: ProtoFlow: Interpretable and Robust Surgical Workflow Modeling with Learned Dynamic Scene Graph Prototypes

Title: AnchorHOI: Zero-shot Generation of 4D Human-Object Interaction via Anchor-based Prior Distillation

Title: OUSAC: Optimized Guidance Scheduling with Adaptive Caching for DiT Acceleration

Title: Cornserve: Efficiently Serving Any-to-Any Multimodal Models

Title: ViewMask-1-to-3: Multi-View Consistent Image Generation via Multimodal Diffusion Models

Title: Neurosymbolic Inference On Foundation Models For Remote Sensing Text-to-image Retrieval With Complex Queries

Title: MFE-GAN: Efficient GAN-based Framework for Document Image Enhancement and Binarization with Multi-scale Feature Extraction

Title: FastDDHPose: Towards Unified, Efficient, and Disentangled 3D Human Pose Estimation

Title: Random-Bridges as Stochastic Transports for Generative Models

Title: DRAW2ACT: Turning Depth-Encoded Trajectories into Robotic Demonstration Videos

Title: OmniGen: Unified Multimodal Sensor Generation for Autonomous Driving

Title: 4D-RaDiff: Latent Diffusion for 4D Radar Point Cloud Generation

Title: Elastic3D: Controllable Stereo Video Conversion with Guided Latent Decoding

Title: Two CFG Nahuatl for automatic corpora expansion

Title: Physically consistent model learning for reaction-diffusion systems

Title: Beyond MMD: Evaluating Graph Generative Models with Geometric Deep Learning

Title: FLAME: Flow Enhanced Legendre Memory Models for General Time Series Forecasting

Title: SS4D: Native 4D Generative Model via Structured Spacetime Latents

Title: PSMamba: Progressive Self-supervised Vision Mamba for Plant Disease Recognition

Title: Semantic Mismatch and Perceptual Degradation: A New Perspective on Image Editing Immunity

Title: Dual Attention Guided Defense Against Malicious Edits

Title: Towards Transferable Defense Against Malicious Image Edits

Title: RePo: Language Models with Context Re-Positioning

Title: LCMem: A Universal Model for Robust Image Memorization Detection

Title: The Devil is in Attention Sharing: Improving Complex Non-rigid Image Editing Faithfulness via Attention Synergy

Title: Score-Based Turbo Message Passing for Plug-and-Play Compressive Imaging

Title: A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning

Title: Reasoning-Style Poisoning of LLM Agents via Stealthy Style Transfer: Process-Level Attacks and Runtime Monitoring in RSV Space

Title: Native Intelligence Emerges from Large-Scale Clinical Practice: A Retinal Foundation Model with Deployment Efficiency

Title: Linguists should learn to love speech-based deep learning models

Title: Improving Slow Transfer Predictions: Generative Methods Compared

Title: DASP: Self-supervised Nighttime Monocular Depth Estimation with Domain Adaptation of Spatiotemporal Priors

Title: Synthetic Electrogram Generation with Variational Autoencoders for ECGI

Title: HiFi-Portrait: Zero-shot Identity-preserved Portrait Generation with High-fidelity Multi-face Fusion

Title: Dual Language Models: Balancing Training Efficiency and Overfitting Resilience

Title: Polypersona: Persona-Grounded LLM for Synthetic Survey Responses

Title: WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling

Title: Hierarchical Persistence Velocity for Network Anomaly Detection: Theory and Applications to Cryptocurrency Markets

Title: JMMMU-Pro: Image-based Japanese Multi-discipline Multimodal Understanding Benchmark via Vibe Benchmark Construction

Title: A Multicenter Benchmark of Multiple Instance Learning Models for Lymphoma Subtyping from HE-stained Whole Slide Images

Title: Fast and Accurate Causal Parallel Decoding using Jacobi Forcing

Title: MMGR: Multi-Modal Generative Reasoning

Title: Native and Compact Structured Latents for 3D Generation