2025-05-27

Title: GenAI Security: Outsmarting the Bots with a Proactive Testing Framework

Title: GAIA: A Foundation Model for Operational Atmospheric Dynamics

Title: Riemannian Flow Matching for Brain Connectivity Matrices via Pullback Geometry

Title: Token Reduction Should Go Beyond Efficiency in Generative Models -- From Vision, Language to Multimodality

Title: Follow the Energy, Find the Path: Riemannian Metrics from Energy-Based Models

Title: Taming LLMs with Negative Samples: A Reference-Free Framework to Evaluate Presentation Content with Actionable Feedback

Title: Decomposition of Water Demand Patterns Using Skewed Gaussian Distributions for Behavioral Insights and Operational Planning

Title: InstructPart: Task-Oriented Part Segmentation with Instruction Reasoning

Title: Sample Complexity of Diffusion Model Training Without Empirical Risk Minimizer Access

Title: Diffusion Self-Weighted Guidance for Offline Reinforcement Learning

Title: CONCORD: Concept-Informed Diffusion for Dataset Distillation

Title: Weakly-supervised Mamba-Based Mastoidectomy Shape Prediction for Cochlear Implant Surgery Using 3D T-Distribution Loss

Title: Next-token pretraining implies in-context learning

Title: Dynamic Risk Assessments for Offensive Cybersecurity Agents

Title: Applications of Modular Co-Design for De Novo 3D Molecule Generation

Title: Taming Diffusion for Dataset Distillation with High Representativeness

Title: LatentLLM: Attention-Aware Joint Tensor Compression

Title: OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data

Title: $μ$-MoE: Test-Time Pruning as Micro-Grained Mixture-of-Experts

Title: Hybrid Latent Reasoning via Reinforcement Learning

Title: Anchored Diffusion Language Model

Title: BiomechGPT: Towards a Biomechanically Fluent Multimodal Foundation Model for Clinically Relevant Motion Tasks

Title: Measuring South Asian Biases in Large Language Models

Title: HonestFace: Towards Honest Face Restoration with One-Step Diffusion Model

Title: Syn3DTxt: Embedding 3D Cues for Scene Text Generation

Title: The Prompt is Mightier than the Example

Title: FedHL: Federated Learning for Heterogeneous Low-Rank Adaptation via Unbiased Aggregation

Title: Beyond Masked and Unmasked: Discrete Diffusion Models via Partial Masking

Title: G1: Teaching LLMs to Reason on Graphs with Reinforcement Learning

Title: Improved Immiscible Diffusion: Accelerate Diffusion Training by Reducing Its Miscibility

Title: Joint-stochastic-approximation Autoencoders with Application to Semi-supervised Learning

Title: On Denoising Walking Videos for Gait Recognition

Title: Unleashing Diffusion Transformers for Visual Correspondence by Modulating Massive Activations

Title: EvdCLIP: Improving Vision-Language Retrieval with Entity Visual Descriptions from Large Language Models

Title: Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment

Title: Flex-Judge: Think Once, Judge Anywhere

Title: Rethinking Causal Mask Attention for Vision-Language Inference

Title: Mod-Adapter: Tuning-Free and Versatile Multi-concept Personalization via Modulation Adapter

Title: Multilingual Question Answering in Low-Resource Settings: A Dzongkha-English Benchmark for Foundation Models

Title: ThanoRA: Task Heterogeneity-Aware Multi-Task Low-Rank Adaptation

Title: Flow Matching for Geometric Trajectory Simulation

Title: ProphetDWM: A Driving World Model for Rolling Out Future Actions and Videos

Title: On the Emergence of Linear Analogies in Word Embeddings

Title: So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection

Title: DVD-Quant: Data-free Video Diffusion Transformers Quantization

Title: Self-Supervised Evolution Operator Learning for High-Dimensional Dynamical Systems

Title: Restoring Real-World Images with an Internal Detail Enhancement Diffusion Model

Title: From Generation to Detection: A Multimodal Multi-Task Dataset for Benchmarking Health Misinformation

Title: Towards Semantic Integration of Opinions: Unified Opinion Concepts Ontology and Extraction Task

Title: Neural Parameter Search for Slimmer Fine-Tuned Models and Better Transfer

Title: Reward-Driven Interaction: Enhancing Proactive Dialogue Agents through User Satisfaction Prediction

Title: MADCAT: Combating Malware Detection Under Concept Drift with Test-Time Adaptation

Title: Rethinking Direct Preference Optimization in Diffusion Models

Title: Unifying Attention Heads and Task Vectors via Hidden State Geometry in In-Context Learning

Title: GenPO: Generative Diffusion Models Meet On-Policy Reinforcement Learning

Title: Multiple Wasserstein Gradient Descent Algorithm for Multi-Objective Distributional Optimization

Title: StyleGuard: Preventing Text-to-Image-Model-based Style Mimicry Attacks by Style Perturbations

Title: OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ Tasks

Title: VORTA: Efficient Video Diffusion via Routing Sparse Attention

Title: Self-Supervised and Generalizable Tokenization for CLIP-Based 3D Understanding

Title: How to build a consistency model: Learning flow maps via self-distillation

Title: Localizing Knowledge in Diffusion Transformers

Title: Smoothie: Smoothing Diffusion on Token Embeddings for Text Generation

Title: Eye-See-You: Reverse Pass-Through VR and Head Avatars

Title: Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation

Title: SD-OVON: A Semantics-aware Dataset and Benchmark Generation Pipeline for Open-Vocabulary Object Navigation in Dynamic Scenes

Title: Partition Generative Modeling: Masked Modeling Without Masks

Title: PromptWise: Online Learning for Cost-Aware Prompt Assignment in Generative Models

Title: Graph-Based Operator Learning from Limited Data on Irregular Domains

Title: Words as Geometric Features: Estimating Homography using Optical Character Recognition as Compressed Image Representation

Title: Hybrid Neural-MPM for Interactive Fluid Simulations in Real-Time

Title: WeedNet: A Foundation Model-Based Global-to-Local AI Approach for Real-Time Weed Species Identification and Classification

Title: Chi-Square Wavelet Graph Neural Networks for Heterogeneous Graph Anomaly Detection

Title: OpenHOI: Open-World Hand-Object Interaction Synthesis with Multimodal Large Language Model

Title: CDPDNet: Integrating Text Guidance with Hybrid Vision Encoders for Medical Image Segmentation

Title: MGD$^3$: Mode-Guided Dataset Distillation using Diffusion Models

Title: Protein Design with Dynamic Protein Vocabulary

Title: GhostPrompt: Jailbreaking Text-to-image Generative Models based on Dynamic Optimization

Title: STRICT: Stress Test of Rendering Images Containing Text

Title: Kernel Space Diffusion Model for Efficient Remote Sensing Pansharpening

Title: Rethinking Metrics and Benchmarks of Video Anomaly Detection

Title: Training-free Stylized Text-to-Image Generation with Fast Inference

Title: Jodi: Unification of Visual Generation and Understanding via Joint Modeling

Title: Plug-and-Play Context Feature Reuse for Efficient Masked Generation

Title: Optimization-Inspired Few-Shot Adaptation for Large Language Models

Title: An Interpretable Representation Learning Approach for Diffusion Tensor Imaging

Title: CreatiDesign: A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design

Title: Exploring Magnitude Preservation and Rotation Modulation in Diffusion Transformers

Title: MIND-Edit: MLLM Insight-Driven Editing via Language-Vision Projection

Title: JEDI: The Force of Jensen-Shannon Divergence in Disentangling Diffusion Models

Title: Step-level Reward for Free in RL-based T2I Diffusion Model Fine-tuning

Title: Domain and Task-Focused Example Selection for Data-Efficient Contrastive Medical Image Segmentation

Title: Towards Understanding the Mechanisms of Classifier-Free Guidance

Title: Advancing Video Self-Supervised Learning via Image Foundation Models

Title: LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models

Title: RAISE: Realness Assessment for Image Synthesis and Evaluation

Title: DriveX: Omni Scene Modeling for Learning Generalizable World Knowledge in Autonomous Driving

Title: Enhancing Text-to-Image Diffusion Transformer via Split-Text Conditioning

Title: Improving Novel view synthesis of 360$^\circ$ Scenes in Extremely Sparse Views by Jointly Training Hemisphere Sampled Synthetic Images

Title: TextDiffuser-RL: Efficient and Robust Text Layout Optimization for High-Fidelity Text-to-Image Synthesis

Title: Alchemist: Turning Public Text-to-Image Data into Generative Gold

Title: Concept Reachability in Diffusion Models: Beyond Dataset Constraints

Title: Beyond Editing Pairs: Fine-Grained Instructional Image Editing via Multi-Scale Learnable Regions

Title: Belief Attribution as Mental Explanation: The Role of Accuracy, Informativity, and Causality

Title: Absolute Coordinates Make Motion Generation Easy

Title: Advancing Limited-Angle CT Reconstruction Through Diffusion-Based Sinogram Completion

Title: Are Time-Series Foundation Models Deployment-Ready? A Systematic Study of Adversarial Robustness Across Domains

Title: Erasing Concepts, Steering Generations: A Comprehensive Survey of Concept Suppression

Title: LlamaSeg: Image Segmentation via Autoregressive Mask Generation

Title: Structure Disruption: Subverting Malicious Diffusion-Based Inpainting via Self-Attention Query Perturbation

Title: The Role of Diversity in In-Context Learning for Large Language Models

Title: Importance Weighted Score Matching for Diffusion Samplers with Enhanced Mode Coverage

Title: Your Classifier Can Do More: Towards Bridging the Gaps in Classification, Robustness, and Generation

Title: Diversity-Driven Generative Dataset Distillation Based on Diffusion Model with Self-Adaptive Memory

Title: Continuous Self-Improvement of Large Language Models by Test-time Training with Verifier-Driven Sample Selection

Title: Language of Network: A Generative Pre-trained Model for Encrypted Traffic Comprehension

Title: The Role of Video Generation in Enhancing Data-Limited Action Understanding

Title: Regularized Personalization of Text-to-Image Diffusion Models without Distributional Drift

Title: Applications and Effect Evaluation of Generative Adversarial Networks in Semi-Supervised Learning

Title: On scalable and efficient training of diffusion samplers

Title: Aggregated Structural Representation with Large Language Models for Human-Centric Layout Generation

Title: What You Perceive Is What You Conceive: A Cognition-Inspired Framework for Open Vocabulary Image Segmentation

Title: TailorKV: A Hybrid Framework for Long-Context Inference via Tailored KV Cache Optimization

Title: Languages in Multilingual Speech Foundation Models Align Both Phonetically and Semantically

Title: Rotation-Equivariant Self-Supervised Method in Image Denoising

Title: SESaMo: Symmetry-Enforcing Stochastic Modulation for Normalizing Flows

Title: HomeBench: Evaluating LLMs in Smart Homes with Valid and Invalid Instructions Across Single and Multiple Devices

Title: Energy-based generator matching: A neural sampler for general state space

Title: ReDDiT: Rehashing Noise for Discrete Visual Generation

Title: Calibrating Pre-trained Language Classifiers on LLM-generated Noisy Labels via Iterative Refinement

Title: Graph Guided Diffusion: Unified Guidance for Conditional Graph Generation

Title: DriveCamSim: Generalizable Camera Simulation via Explicit Camera Modeling for Autonomous Driving

Title: Knowledge-Aligned Counterfactual-Enhancement Diffusion Perception for Unsupervised Cross-Domain Visual Emotion Recognition

Title: JEDI: Latent End-to-end Diffusion Mitigates Agent-Human Performance Asymmetry in Model-Based Reinforcement Learning

Title: Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed Environments

Title: On the Relation between Rectified Flows and Optimal Transport

Title: Graceful Forgetting in Generative Language Models

Title: Cross-Sequence Semi-Supervised Learning for Multi-Parametric MRI-Based Visual Pathway Delineation

Title: HAODiff: Human-Aware One-Step Diffusion via Dual-Prompt Guidance

Title: SuperAD: A Training-free Anomaly Classification and Segmentation Method for CVPR 2025 VAND 3.0 Workshop Challenge Track 1: Adapt & Detect

Title: SAIL: Self-supervised Albedo Estimation from Real Images with a Latent Diffusion Model

Title: Discrete Markov Bridge

Title: A Regularization-Guided Equivariant Approach for Image Restoration

Title: Foundation Models for Tabular Data within Systemic Contexts Need Grounding

Title: Editing as Unlearning: Are Knowledge Editing Methods Strong Baselines for Large Language Model Unlearning?

Title: FruitNeRF++: A Generalized Multi-Fruit Counting Method Utilizing Contrastive Learning and Neural Radiance Fields

Title: Deep Active Inference Agents for Delayed and Long-Horizon Environments

Title: Harnessing the Power of Training-Free Techniques in Text-to-2D Generation for Text-to-3D Generation via Score Distillation Sampling

Title: StyleAR: Customizing Multimodal Autoregressive Model for Style-Aligned Text-to-Image Generation

Title: Vad-R1: Towards Video Anomaly Reasoning via Perception-to-Cognition Chain-of-Thought

Title: Generalized and Personalized Federated Learning with Foundation Models via Orthogonal Transformations

Title: Underwater Diffusion Attention Network with Contrastive Language-Image Joint Learning for Underwater Image Enhancement

Title: Dynamic-I2V: Exploring Image-to-Video Generaion Models via Multimodal LLM

Title: Task-Oriented Low-Label Semantic Communication With Self-Supervised Learning

Title: An Explainable Diagnostic Framework for Neurodegenerative Dementias via Reinforcement-Optimized LLM Reasoning

Title: UltraVSR: Achieving Ultra-Realistic Video Super-Resolution with Efficient One-Step Diffusion Space

Title: Learning to Select In-Context Demonstration Preferred by Large Language Model

Title: Rethinking Probabilistic Circuit Parameter Learning

Title: TabPFN: One Model to Rule Them All?

Title: ReasonPlan: Unified Scene Prediction and Decision Reasoning for Closed-loop Autonomous Driving

Title: Gradient Inversion Transcript: Leveraging Robust Generative Priors to Reconstruct Training Data from Gradient Leakage

Title: Graph Wave Networks

Title: Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion

Title: PAMD: Plausibility-Aware Motion Diffusion Model for Long Dance Generation

Title: Refining Few-Step Text-to-Multiview Diffusion via Reinforcement Learning

Title: Proxy-Free GFlowNet

Title: Understanding Generalization in Diffusion Models via Probability Flow Distance

Title: MolEditRL: Structure-Preserving Molecular Editing via Discrete Diffusion and Reinforcement Learning

Title: HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters

Title: Exploring Generative Error Correction for Dysarthric Speech Recognition

Title: Long-Context State-Space Video World Models

Title: Monocle: Hybrid Local-Global In-Context Evaluation for Long-Text Generation with Uncertainty-Based Active Learning

Title: Adaptive Classifier-Free Guidance via Dynamic Low-Confidence Masking

Title: PathBench: A comprehensive comparison benchmark for pathology foundation models towards precision oncology

Title: Fine-grained List-wise Alignment for Generative Medication Recommendation

Title: Multimodal Federated Learning With Missing Modalities through Feature Imputation Network

Title: AniCrafter: Customizing Realistic Human-Centric Animation via Avatar-Background Conditioning in Video Diffusion Models

Title: Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration

Title: In-Context Brush: Zero-shot Customized Subject Insertion with Context-Aware Latent Space Manipulation

Title: ImgEdit: A Unified Image Editing Dataset and Benchmark

Title: MASKSEARCH: A Universal Pre-Training Framework to Enhance Agentic Search Capability

Title: Hierarchical Masked Autoregressive Models with Low-Resolution Token Pivots

Title: DiSA: Diffusion Step Annealing in Autoregressive Image Generation