2025-12-17

Title: Composite Classifier-Free Guidance for Multi-Modal Conditioning in Wind Dynamics Super-Resolution

Title: STAR: STacked AutoRegressive Scheme for Unified Multimodal Learning

Title: Time-aware UNet and super-resolution deep residual networks for spatial downscaling

Title: The Double Life of Code World Models: Provably Unmasking Malicious Behavior Through Execution Traces

Title: MoLingo: Motion-Language Alignment for Text-to-Motion Generation

Title: Coarse-to-Fine Hierarchical Alignment for UAV-based Human Detection using Diffusion Models

Title: SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning

Title: A Complete Guide to Spherical Equivariant Graph Transformers

Title: An evaluation of SVBRDF Prediction from Generative Image Models for Appearance Modeling of 3D Scenes

Title: From Unlearning to UNBRANDING: A Benchmark for Trademark-Safe Text-to-Image Generation

Title: Repurposing 2D Diffusion Models for 3D Shape Completion

Title: Sparse-LaViDa: Sparse Multimodal Discrete Diffusion Language Models

Title: EXAONE Path 2.5: Pathology Foundation Model with Multi-Omics Alignment

Title: FacEDiT: Unified Talking Face Editing and Generation via Facial Motion Infilling

Title: Bridging Fidelity-Reality with Controllable One-Step Diffusion for Image Super-Resolution

Title: SDAR-VL: Stable and Efficient Block-wise Diffusion for Vision-Language Understanding

Title: AnchorHOI: Zero-shot Generation of 4D Human-Object Interaction via Anchor-based Prior Distillation

Title: OUSAC: Optimized Guidance Scheduling with Adaptive Caching for DiT Acceleration

Title: ViewMask-1-to-3: Multi-View Consistent Image Generation via Multimodal Diffusion Models

Title: A First-Order Logic-Based Alternative to Reward Models in RLHF

Title: MFE-GAN: Efficient GAN-based Framework for Document Image Enhancement and Binarization with Multi-scale Feature Extraction

Title: SketchAssist: A Practical Assistant for Semantic Edits and Precise Local Redrawing

Title: TorchTraceAP: A New Benchmark Dataset for Detecting Performance Anti-Patterns in Computer Vision Models

Title: Random-Bridges as Stochastic Transports for Generative Models

Title: DRAW2ACT: Turning Depth-Encoded Trajectories into Robotic Demonstration Videos

Title: Estimating problem difficulty without ground truth using Large Language Model comparisons

Title: OmniGen: Unified Multimodal Sensor Generation for Autonomous Driving

Title: Understanding the Gain from Data Filtering in Multimodal Contrastive Learning

Title: ViBES: A Conversational Agent with Behaviorally-Intelligent 3D Virtual Body

Title: 4D-RaDiff: Latent Diffusion for 4D Radar Point Cloud Generation

Title: Beyond MMD: Evaluating Graph Generative Models with Geometric Deep Learning

Title: FLAME: Flow Enhanced Legendre Memory Models for General Time Series Forecasting

Title: Zoom-Zero: Reinforced Coarse-to-Fine Video Understanding via Temporal Zoom-in

Title: SS4D: Native 4D Generative Model via Structured Spacetime Latents

Title: Dual Attention Guided Defense Against Malicious Edits

Title: Vector Prism: Animating Vector Graphics by Stratifying Semantic Structure

Title: Towards Transferable Defense Against Malicious Image Edits

Title: Broadening View Synthesis of Dynamic Scenes from Constrained Monocular Videos

Title: LCMem: A Universal Model for Robust Image Memorization Detection

Title: Score-Based Turbo Message Passing for Plug-and-Play Compressive Imaging

Title: A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning

Title: Improving Slow Transfer Predictions: Generative Methods Compared

Title: Synthetic Electrogram Generation with Variational Autoencoders for ECGI

Title: HiFi-Portrait: Zero-shot Identity-preserved Portrait Generation with High-fidelity Multi-face Fusion

Title: TAT: Task-Adaptive Transformer for All-in-One Medical Image Restoration

Title: FakeRadar: Probing Forgery Outliers to Detect Unknown Deepfake Videos

Title: gridfm-datakit-v1: A Python Library for Scalable and Realistic Power Flow and Optimal Power Flow Data Generation

Title: VASA-3D: Lifelike Audio-Driven Gaussian Head Avatars from a Single Image

Title: Native and Compact Structured Latents for 3D Generation

Title: Spherical Leech Quantization for Visual Tokenization and Generation

Title: MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives