2025-12-16

Title: Active Inference with Reusable State-Dependent Value Profiles

Title: On the Design of One-step Diffusion via Shortcutting Flow Paths

Title: Adaptive Path Integral Diffusion: AdaPID

Title: Generative Stochastic Optimal Transport: Guided Harmonic Path-Integral Diffusion

Title: An Operator-Consistent Graph Neural Network for Learning Diffusion Dynamics on Irregular Meshes

Title: Hierarchical Task Offloading and Trajectory Optimization in Low-Altitude Intelligent Networks Via Auction and Diffusion-based MARL

Title: On the Dangers of Bootstrapping Generation for Continual Learning and Beyond

Title: Generalization vs. Specialization: Evaluating Segment Anything Model (SAM3) Zero-Shot Segmentation Against Fine-Tuned YOLO Detectors

Title: MONET -- Virtual Cell Painting of Brightfield Images and Time Lapses Using Reference Consistent Diffusion

Title: Learning to Extract Context for Context-Aware LLM Inference

Title: CARI4D: Category Agnostic 4D Reconstruction of Human-Object Interaction

Title: CreativeVR: Diffusion-Prior-Guided Approach for Structure and Motion Restoration in Generative and Real Videos

Title: Rethinking Jailbreak Detection of Large Vision Language Models with Representational Contrastive Scoring

Title: BAgger: Backwards Aggregation for Mitigating Drift in Autoregressive Video Diffusion Models

Title: RePack: Representation Packing of Vision Foundation Model Features Enhances Diffusion Transformer

Title: CLOAK: Contrastive Guidance for Latent Diffusion-Based Data Obfuscation

Title: SPDMark: Selective Parameter Displacement for Robust Video Watermarking

Title: EchoVLM: Measurement-Grounded Multimodal Learning for Echocardiography

Title: High-Dimensional Tensor Discriminant Analysis: Low-Rank Discriminant Structure, Representation Synergy, and Theoretical Guarantees

Title: BaRISTA: Brain Scale Informed Spatiotemporal Representation of Human Intracranial Neural Activity

Title: Diffusion Language Model Inference with Monte Carlo Tree Search

Title: HydroDiffusion: Diffusion-Based Probabilistic Streamflow Forecasting with a State Space Backbone

Title: SMRABooth: Subject and Motion Representation Alignment for Customized Video Generation

Title: MolGuidance: Advanced Guidance Strategies for Conditional Molecular Generation with Flow Matching

Title: A Multi-Year Urban Streetlight Imagery Dataset for Visual Monitoring and Spatio-Temporal Drift Detection

Title: EEG-DLite: Dataset Distillation for Efficient Large EEG Model Training

Title: Ultra-Low Bitrate Perceptual Image Compression with Shallow Encoder

Title: Semantic Distance Measurement based on Multi-Kernel Gaussian Processes

Title: Moment and Highlight Detection via MLLM Frame Segmentation

Title: MetaTPT: Meta Test-time Prompt Tuning for Vision-Language Models

Title: MRD: Using Physically Based Differentiable Rendering to Probe Vision Models for 3D Scene Understanding

Title: Unified Control for Inference-Time Guidance of Denoising Diffusion Models

Title: STAGE: Storyboard-Anchored Generation for Cinematic Multi-shot Narrative

Title: V-Warper: Appearance-Consistent Video Diffusion Personalization via Value Warping

Title: The Data Efficiency Frontier of Financial Foundation Models: Scaling Laws from Continued Pretraining

Title: Speedrunning ImageNet Diffusion

Title: Anchoring Values in Temporal and Group Dimensions for Flow Matching Model Alignment

Title: ArtGen: Conditional Generative Modeling of Articulated Objects in Arbitrary Part-Level States

Title: DeepVekua: Geometric-Spectral Representation Learning for Physics-Informed Fields

Title: Can Graphs Improve Tabular Foundation Models?

Title: BokehDepth: Enhancing Monocular Depth Estimation through Bokeh Generation

Title: Knowledge-Guided Masked Autoencoder with Linear Spectral Mixing and Spectral-Angle-Aware Reconstruction

Title: Exploring the Design Space of Transition Matching

Title: The American Ghost in the Machine: How language models align culturally and the effects of cultural prompting

Title: Generative Spatiotemporal Data Augmentation

Title: Animus3D: Text-driven 3D Animation via Motion Score Distillation

Title: NagaNLP: Bootstrapping NLP for Low-Resource Nagamese Creole with Human-in-the-Loop Synthetic Data

Title: Skillful Subseasonal-to-Seasonal Forecasting of Extreme Events with a Multi-Sphere Coupled Probabilistic Model

Title: Differentiable Energy-Based Regularization in GANs: A Simulator-Based Exploration of VQE-Inspired Auxiliary Losses

Title: Vision-Enhanced Large Language Models for High-Resolution Image Synthesis and Multimodal Data Interpretation

Title: No Cache Left Idle: Accelerating diffusion model via Extreme-slimming Caching

Title: InteracTalker: Prompt-Based Human-Object Interaction with Co-Speech Gesture Generation

Title: DynaGen: Unifying Temporal Knowledge Graph Reasoning with Dynamic Subgraphs and Generative Regularization

Title: On Approaches to Building Surrogate ODE Models for Diffusion Bridges

Title: GenieDrive: Towards Physics-Aware Driving World Model with 4D Occupancy Guided Video Generation

Title: Fast 2DGS: Efficient Image Representation with Deep Gaussian Prior

Title: Learning Common and Salient Generative Factors Between Two Image Datasets

Title: On the continuity of flows

Title: Adapting Multimodal Foundation Models for Few-Shot Learning: A Comprehensive Study on Contrastive Captioners

Title: Information-Consistent Language Model Recommendations through Group Relative Policy Optimization

Title: Revisiting 2D Foundation Models for Scalable 3D Medical Image Classification

Title: Distillation of Discrete Diffusion by Exact Conditional Distribution Matching

Title: Investigating Data Pruning for Pretraining Biological Foundation Models at Scale

Title: SCAdapter: Content-Style Disentanglement for Diffusion Style Transfer

Title: Scaling Up AI-Generated Image Detection via Generator-Aware Prototypes

Title: Few-Step Distillation for Text-to-Image Generation: A Practical Guide

Title: Light Field Based 6DoF Tracking of Previously Unobserved Objects

Title: JoDiffusion: Jointly Diffusing Image with Pixel-Level Annotations for Semantic Segmentation Promotion

Title: SneakPeek: Future-Guided Instructional Streaming Video Generation

Title: Motus: A Unified Latent Action World Model

Title: Bi-Erasing: A Bidirectional Framework for Concept Removal in Diffusion Models

Title: Understanding Structured Financial Data with LLMs: A Case Study on Fraud Detection

Title: Towards Test-time Efficient Visual Place Recognition via Asymmetric Query Processing

Title: Forging a Dynamic Memory: Retrieval-Guided Continual Learning for Generalist Medical Foundation Models

Title: UniVCD: A New Method for Unsupervised Change Detection in the Open-Vocabulary Era

Title: Harmonizing Generalization and Specialization: Uncertainty-Informed Collaborative Learning for Semi-supervised Medical Image Segmentation

Title: Diffusion-Based Restoration for Multi-Modal 3D Object Detection in Adverse Weather

Title: Intrinsic Image Fusion for Multi-View 3D Material Reconstruction

Title: A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis

Title: POLAR: A Portrait OLAT Dataset and Generative Framework for Illumination-Aware Face Modeling

Title: CORE: Contrastive Masked Feature Reconstruction on Graphs

Title: STARCaster: Spatio-Temporal AutoRegressive Video Diffusion for Identity- and View-Aware Talking Portraits

Title: BézierFlow: Bézier Stochastic Interpolant Schedulers for Few-Step Generation

Title: CogniEdit: Dense Gradient Flow Optimization for Fine-Grained Image Editing

Title: CausalCLIP: Causally-Informed Feature Disentanglement and Filtering for Generalizable Detection of Generated Images

Title: LINA: Learning INterventions Adaptively for Physical Alignment and Generalization in Diffusion Models

Title: ShowTable: Unlocking Creative Table Visualization with Collaborative Reflection and Refinement

Title: ALIGN-FL: Architecture-independent Learning through Invariant Generative component sharing in Federated Learning

Title: FIN-bench-v2: A Unified and Robust Benchmark Suite for Evaluating Finnish Large Language Models

Title: Beyond the Visible: Disocclusion-Aware Editing via Proxy Dynamic Graphs

Title: RecTok: Reconstruction Distillation along Rectified Flow

Title: Test-Time Modification: Inverse Domain Transformation for Robust Perception

Title: On-Device Continual Learning for Unsupervised Visual Anomaly Detection in Dynamic Manufacturing

Title: Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model

Title: Pancakes: Consistent Multi-Protocol Image Segmentation Across Biomedical Domains

Title: PrahokBART: A Pre-trained Sequence-to-Sequence Model for Khmer Natural Language Generation

Title: 3D Human-Human Interaction Anomaly Detection

Title: Memory in the Age of AI Agents

Title: ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

Title: Image Diffusion Preview with Consistency Solver

Title: Lighting in Motion: Spatiotemporal HDR Lighting Estimation

Title: DA-SSL: self-supervised domain adaptor to leverage foundational models in TURBT histopathology slides

Title: DBT-DINO: Towards Foundation model based analysis of Digital Breast Tomosynthesis

Title: Do-Undo: Generating and Reversing Physical Actions in Vision-Language Models

Title: SCR2-ST: Combine Single Cell with Spatial Transcriptomics for Efficient Active Sampling via Reinforcement Learning

Title: Grab-3D: Detecting AI-Generated Videos from 3D Geometric Temporal Consistency

Title: AgentIAD: Tool-Augmented Single-Agent for Industrial Anomaly Detection

Title: Feedforward 3D Editing via Text-Steerable Image-to-3D

Title: I-Scene: 3D Instance Models are Implicit Generalizable Spatial Learners

Title: Beyond surface form: A pipeline for semantic analysis in Alzheimer's Disease detection from spontaneous speech

Title: Towards Scalable Pre-training of Visual Tokenizers for Generation

Title: DiffusionBrowser: Interactive Diffusion Previews via Multi-Branch Decoders