2025-05-23

Title: Generative AI for Autonomous Driving: A Review

Title: Satellites Reveal Mobility: A Commuting Origin-destination Flow Generator for Global Cities

Title: Challenger: Affordable Adversarial Driving Video Generation

Title: MoRE-Brain: Routed Mixture of Experts for Interpretable and Generalizable Cross-Subject fMRI Visual Decoding

Title: OViP: Online Vision-Language Preference Learning

Title: Analyzing Hierarchical Structure in Vision Models with Sparse Autoencoders

Title: Image-to-Image Translation with Diffusion Transformers and CLIP-Based Image Conditioning

Title: Toward Theoretical Insights into Diffusion Trajectory Distillation via Operator Merging

Title: OpenEthics: A Comprehensive Ethical Evaluation of Open-Source Generative Large Language Models

Title: An Exploratory Approach Towards Investigating and Explaining Vision Transformer and Transfer Learning for Brain Disease Detection

Title: Mesh-free sparse identification of nonlinear dynamics

Title: Scalable Graph Generative Modeling via Substructure Sequences

Title: Distilling the Implicit Multi-Branch Structure in LLMs' Reasoning via Reinforcement Learning

Title: TRAIL: Transferable Robust Adversarial Images via Latent diffusion

Title: Automated Feedback Loops to Protect Text Simplification with Generative AI from Information Loss

Title: Erased or Dormant? Rethinking Concept Erasure Through Reversibility

Title: Understanding Generative AI Capabilities in Everyday Image Editing Tasks

Title: An Empirical Study on Configuring In-Context Learning Demonstrations for Unleashing MLLMs' Sentimental Perception Capability

Title: DOVE: Efficient One-Step Diffusion Model for Real-World Video Super-Resolution

Title: IRONIC: Coherence-Aware Reasoning Chains for Multi-Modal Sarcasm Detection

Title: Interpretable Anomaly Detection in Encrypted Traffic Using SHAP with Machine Learning Models

Title: Think-RM: Enabling Long-Horizon Reasoning in Generative Reward Models

Title: SAMba-UNet: Synergizing SAM2 and Mamba in UNet with Heterogeneous Aggregation for Cardiac MRI Segmentation

Title: Paired and Unpaired Image to Image Translation using Generative Adversarial Networks

Title: NTIRE 2025 challenge on Text to Image Generation Model Quality Assessment

Title: TensorAR: Refinement is All You Need in Autoregressive Image Generation

Title: FPQVAR: Floating Point Quantization for Visual Autoregressive Model with FPGA Hardware Co-design

Title: Fusion of Foundation and Vision Transformer Model Features for Dermatoscopic Image Classification

Title: Style Transfer with Diffusion Models for Synthetic-to-Real Domain Adaptation

Title: A collaborative constrained graph diffusion model for the generation of realistic synthetic molecules

Title: Privacy-Aware Cyberterrorism Network Analysis using Graph Neural Networks and Federated Learning

Title: Pose-invariant face recognition via feature-space pose frontalization

Title: Investigating Fine- and Coarse-grained Structural Correspondences Between Deep Neural Networks and Human Object Image Similarity Judgments Using Unsupervised Alignment

Title: $I^2G$: Generating Instructional Illustrations via Text-Conditioned Diffusion

Title: MAGIC: Motion-Aware Generative Inference via Confidence-Guided LLM

Title: Reading Between the Prompts: How Stereotypes Shape LLM's Implicit Personalization

Title: Consistent World Models via Foresight Diffusion

Title: Beyond Face Swapping: A Diffusion-Based Digital Human Benchmark for Multimodal Deepfake Detection

Title: Joint Relational Database Generation via Graph-Conditional Diffusion Models

Title: HOFT: Householder Orthogonal Fine-tuning

Title: SHaDe: Compact and Consistent Dynamic 3D Reconstruction via Tri-Plane Deformation and Latent Diffusion

Title: TextureSAM: Towards a Texture Aware Foundation Model for Segmentation

Title: Towards Coordinate- and Dimension-Agnostic Machine Learning for Partial Differential Equations

Title: M2SVid: End-to-End Inpainting and Refinement for Monocular-to-Stereo Video Conversion

Title: Unsupervised Network Anomaly Detection with Autoencoders and Traffic Images

Title: Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding

Title: SD-MAD: Sign-Driven Few-shot Multi-Anomaly Detection in Medical Images

Title: Quantum Feature Optimization for Enhanced Clustering of Blockchain Transaction Data

Title: Zero-Shot Anomaly Detection in Battery Thermal Images Using Visual Question Answering with Prior Knowledge

Title: On the Out-of-Distribution Generalization of Self-Supervised Learning

Title: Semantic Compression of 3D Objects for Open and Collaborative Virtual Worlds

Title: Learning Genomic Structure from $k$-mers

Title: One-Step Diffusion-Based Image Compression with Semantic Distillation

Title: Beyond Induction Heads: In-Context Meta Learning Induces Multi-Phase Circuit Emergence

Title: KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models

Title: Advancing Brainwave Modeling with a Codebook-Based Foundation Model

Title: Masked Conditioning for Deep Generative Models

Title: Forward-only Diffusion Probabilistic Models

Title: Mitigating Overfitting in Medical Imaging: Self-Supervised Pretraining vs. ImageNet Transfer Learning for Dermatological Diagnosis

Title: Learning Flexible Forward Trajectories for Masked Molecular Diffusion

Title: Cohort-Based Active Modality Acquisition

Title: REPA Works Until It Doesn't: Early-Stopped, Holistic Alignment Supercharges Diffusion Training

Title: REOBench: Benchmarking Robustness of Earth Observation Foundation Models

Title: Learning Beyond Limits: Multitask Learning and Synthetic Data for Low-Resource Canonical Morpheme Segmentation

Title: LaViDa: A Large Diffusion Language Model for Multimodal Understanding

Title: Conditional Panoramic Image Generation via Masked Autoregressive Modeling

Title: Training-Free Efficient Video Generation via Dynamic Token Carving

Title: A Multi-Step Comparative Framework for Anomaly Detection in IoT Data Streams

Title: T2I-ConBench: Text-to-Image Benchmark for Continual Post-training

Title: CASTILLO: Characterizing Response Length Distributions of Large Language Models

Title: Shadows in the Attention: Contextual Perturbation and Representation Drift in the Dynamics of Hallucination in LLMs

Title: Unsupervised Prompting for Graph Neural Networks

Title: Backdoor Cleaning without External Guidance in MLLM Fine-tuning

Title: LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning

Title: In-Context Watermarks for Large Language Models

Title: SPAR: Self-supervised Placement-Aware Representation Learning for Multi-Node IoT Systems

Title: FoMoH: A clinically meaningful foundation model evaluation for structured electronic health records

Title: Bigger Isn't Always Memorizing: Early Stopping Overparameterized Diffusion Models

Title: Creatively Upscaling Images with Global-Regional Priors

Title: Incorporating Visual Correspondence into Diffusion Model for Virtual Try-On

Title: Pursuing Temporal-Consistent Video Virtual Try-On via Dynamic Pose Interaction

Title: Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding

Title: Guided Diffusion Sampling on Function Spaces with Applications to PDEs

Title: CoMo: Learning Continuous Latent Motion from Internet Videos for Scalable Robot Learning

Title: Understanding Prompt Tuning and In-Context Learning via Meta-Learning

Title: Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space

Title: When Are Concepts Erased From Diffusion Models?