2025-02-11

Title: Survey on AI-Generated Media Detection: From Non-MLLM to MLLM

Title: Parameter Symmetry Breaking and Restoration Determines the Hierarchical Learning in AI Systems

Title: fMoE: Fine-Grained Expert Offloading for Large Mixture-of-Experts Serving

Title: Coarse-to-Fine Structure-Aware Artistic Style Transfer

Title: Beyond and Free from Diffusion: Invertible Guided Consistency Training

Title: Show-o Turbo: Towards Accelerated Unified Multimodal Understanding and Generation

Title: Deep Generative Models with Hard Linear Equality Constraints

Title: APE: Faster and Longer Context-Augmented Generation via Adaptive Parallel Encoding

Title: AdaFlow: Efficient Long Video Editing via Adaptive Attention Slimming And Keyframe Selection

Title: Stochastic Forward-Backward Deconvolution: Training Diffusion Models with Finite Noisy Datasets

Title: Block Graph Neural Networks for tumor heterogeneity prediction

Title: Gen-DFL: Decision-Focused Generative Learning for Robust Decision Making

Title: A Physical Coherence Benchmark for Evaluating Video Generation Models via Optical Flow-guided Frame Prediction

Title: Do Spikes Protect Privacy? Investigating Black-Box Model Inversion Attacks in Spiking Neural Networks

Title: Fg-T2M++: LLMs-Augmented Fine-Grained Text Driven Human Motion Generation

Title: SSH: Sparse Spectrum Adaptation via Discrete Hartley Transformation

Title: 4DR P2T: 4D Radar Tensor Synthesis with Point Clouds

Title: FreeBlend: Advancing Concept Blending with Staged Feedback-Driven Interpolation Diffusion

Title: Training-Free Constrained Generation With Stable Diffusion Models

Title: TrackDiffuser: Nearly Model-Free Bayesian Filtering with Diffusion Model

Title: Mol-MoE: Training Preference-Guided Routers for Molecule Generation

Title: The Evolution of Dataset Distillation: Toward Scalable and Generalizable Solutions

Title: SSDD-GAN: Single-Step Denoising Diffusion GAN for Cochlear Implant Surgical Scene Completion

Title: Understanding Representation Dynamics of Diffusion Models via Low-Dimensional Modeling

Title: UniDB: A Unified Diffusion Bridge Framework via Stochastic Optimal Control

Title: Effective Black-Box Multi-Faceted Attacks Breach Vision Large Language Model Guardrails

Title: Predictive Crash Analytics for Traffic Safety using Deep Learning

Title: GOLD: Graph Out-of-Distribution Detection via Implicit Adversarial Latent Generation

Title: Divide-and-Conquer: Tree-structured Strategy with Answer Distribution Estimator for Goal-Oriented Visual Dialogue

Title: Devil is in the Details: Density Guidance for Detail-Aware Generation with Flow Models

Title: MMGDreamer: Mixed-Modality Graph for Geometry-Controllable 3D Indoor Scene Generation

Title: NeuralPrefix: A Zero-shot Sensory Data Imputation Plugin

Title: Beyond Fine-Tuning: A Systematic Study of Sampling Techniques in Personalized Image Generation

Title: Fast Omni-Directional Image Super-Resolution: Adapting the Implicit Image Function with Pixel and Semantic-Wise Spherical Geometric Priors

Title: Multi-Branch Collaborative Learning Network for Video Quality Assessment in Industrial Video Search

Title: Acceleration Multiple Heads Decoding for LLM via Dynamic Tree Attention

Title: VFX Creator: Animated Visual Effect Generation with Controllable Diffusion Transformer

Title: Generating 3D Binding Molecules Using Shape-Conditioned Diffusion Models with Guidance

Title: Online Reward-Weighted Fine-Tuning of Flow Matching with Wasserstein Regularization

Title: Debiasing Guidance for Discrete Diffusion with Sequential Monte Carlo

Title: Graph Pseudotime Analysis and Neural Stochastic Differential Equations for Analyzing Retinal Degeneration Dynamics and Beyond

Title: Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models

Title: Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile

Title: Universal Approximation of Visual Autoregressive Transformers

Title: An Interpretable Implicit-Based Approach for Modeling Local Spatial Effects: A Case Study of Global Gross Primary Productivity

Title: Uncertainty-Aware Adaptation of Large Language Models for Protein-Protein Interaction Analysis

Title: CANeRV: Content Adaptive Neural Representation for Video Compression

Title: Comparing Image Segmentation Algorithms

Title: DGNO: A Novel Physics-aware Neural Operator for Solving Forward and Inverse PDE Problems based on Deep, Generative Probabilistic Modeling

Title: Enhancing Ground-to-Aerial Image Matching for Visual Misinformation Detection Using Semantic Segmentation

Title: UniDemoiré: Towards Universal Image Demoiréing with Data Generation and Synthesis

Title: LANTERN++: Enhanced Relaxed Speculative Decoding with Static Tree Drafting for Visual Auto-regressive Models

Title: Solving Linear-Gaussian Bayesian Inverse Problems with Decoupled Diffusion Sequential Monte Carlo

Title: How Humans Help LLMs: Assessing and Incentivizing Human Preference Annotators

Title: TANGLED: Generating 3D Hair Strands from Images with Arbitrary Styles and Viewpoints

Title: Hybrid State-Space and GRU-based Graph Tokenization Mamba for Hyperspectral Image Classification

Title: FCVSR: A Frequency-aware Method for Compressed Video Super-Resolution

Title: Prompt-SID: Learning Structural Representation Prompt via Latent Diffusion for Single-Image Denoising

Title: Rethinking Large-scale Dataset Compression: Shifting Focus From Labels to Images

Title: Low-dimensional Functions are Efficiently Learnable under Randomly Biased Distributions

Title: UniMoD: Efficient Unified Multimodal Transformers with Mixture-of-Depths

Title: Image Intrinsic Scale Assessment: Bridging the Gap Between Quality and Resolution

Title: Model-Based Offline Reinforcement Learning with Reliability-Guaranteed Sequence Modeling

Title: Boost-and-Skip: A Simple Guidance-Free Diffusion for Minority Generation

Title: CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers

Title: Dimension-free Regret for Learning Asymmetric Linear Dynamical Systems

Title: Diffusion Models for Computational Neuroimaging: A Survey

Title: Is API Access to LLMs Useful for Generating Private Synthetic Tabular Data?

Title: A Large-scale AI-generated Image Inpainting Benchmark

Title: TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models

Title: Automatic Annotation Augmentation Boosts Translation between Molecules and Natural Language

Title: MoETuner: Optimized Mixture of Expert Serving with Balanced Expert Placement and Token Routing

Title: Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene

Title: No Trick, No Treat: Pursuits and Challenges Towards Simulation-free Training of Neural Samplers

Title: Señorita-2M: A High-Quality Instruction-based Dataset for General Video Editing by Video Specialists

Title: VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data

Title: ViSIR: Vision Transformer Single Image Reconstruction Method for Earth System Models

Title: History-Guided Video Diffusion

Title: Train for the Worst, Plan for the Best: Understanding Token Ordering in Masked Diffusions

Title: Enhancing Performance of Explainable AI Models with Constrained Concept Refinement

Title: Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT