2025-05-20

Title: Tool-Aided Evolutionary LLM for Generative Policy Toward Efficient Resource Management in Wireless Federated Learning

Title: Concept-Guided Interpretability via Neural Chunking

Title: Spatiotemporal Field Generation Based on Hybrid Mamba-Transformer with Physics-informed Fine-tuning

Title: Flash Invariant Point Attention

Title: Continuous Optimization for Feature Selection with Permutation-Invariant Embedding and Policy-Guided Search

Title: Enhancing Network Anomaly Detection with Quantum GANs and Successive Data Injection for Multivariate Time Series

Title: The Gaussian-Multinoulli Restricted Boltzmann Machine: A Potts Model Extension of the GRBM

Title: BandRC: Band Shifted Raised Cosine Activated Implicit Neural Representations

Title: Joint Graph Estimation and Signal Restoration for Robust Federated Learning

Title: DPSeg: Dual-Prompt Cost Volume Learning for Open-Vocabulary Semantic Segmentation

Title: Mollifier Layers: Enabling Efficient High-Order Derivatives in Inverse PDE Learning

Title: Qronos: Correcting the Past by Shaping the Future... in Post-Training Quantization

Title: LoFT: LoRA-fused Training Dataset Generation with Few-shot Guidance

Title: Attend to Not Attended: Structure-then-Detail Token Merging for Post-training DiT Acceleration

Title: UGoDIT: Unsupervised Group Deep Image Prior Via Transferable Weights

Title: Semantically-Aware Game Image Quality Assessment

Title: Token-Level Uncertainty Estimation for Large Language Model Reasoning

Title: Redefining Neural Operators in $d+1$ Dimensions

Title: Generative and Contrastive Graph Representation Learning

Title: Self-NPO: Negative Preference Optimization of Diffusion Models by Simply Learning from Itself without Explicit Preference Annotations

Title: JULI: Jailbreak Large Language Models by Self-Introspection

Title: CL-CaGAN: Capsule differential adversarial continuous learning for cross-domain hyperspectral anomaly detection

Title: CL-BioGAN: Biologically-Inspired Cross-Domain Continual Learning for Hyperspectral Anomaly Detection

Title: Diffmv: A Unified Diffusion Framework for Healthcare Predictions with Random Missing Views and View Laziness

Title: SGD-Mix: Enhancing Domain-Specific Image Classification with Label-Preserving Data Augmentation

Title: RVTBench: A Benchmark for Visual Reasoning Tasks

Title: Learning Pareto-Optimal Rewards from Noisy Preferences: A Framework for Multi-Objective Inverse Reinforcement Learning

Title: GenZSL: Generative Zero-Shot Learning Via Inductive Variational Autoencoder

Title: Facial Recognition Leveraging Generative Adversarial Networks

Title: Are Multimodal Large Language Models Ready for Omnidirectional Spatial Reasoning?

Title: SafeVid: Toward Safety Aligned Video Large Multimodal Models

Title: How can Diffusion Models Evolve into Continual Generators?

Title: AoP-SAM: Automation of Prompts for Efficient Segmentation

Title: Online Iterative Self-Alignment for Radiology Report Generation

Title: Approximation theory for 1-Lipschitz ResNets

Title: Black-box Adversaries from Latent Space: Unnoticeable Attacks on Human Pose and Shape Estimation

Title: Improving regional weather forecasts with neural interpolation

Title: FIGhost: Fluorescent Ink-based Stealthy and Flexible Backdoor Attacks on Physical Traffic Sign Recognition

Title: Accelerating Diffusion-based Super-Resolution with Dynamic Time-Spatial Sampling

Title: VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption

Title: LOVE: Benchmarking and Evaluating Text-to-Video Generation and Video-to-Text Interpretation

Title: EarthSynth: Generating Informative Earth Observation with Diffusion Models

Title: Learning to Highlight Audio by Watching Movies

Title: Always Clear Depth: Robust Monocular Depth Estimation under Adverse Weather

Title: CompBench: Benchmarking Complex Instruction-guided Image Editing

Title: NOFT: Test-Time Noise Finetune via Information Bottleneck for Highly Correlated Asset Creation

Title: PMQ-VE: Progressive Multi-Frame Quantization for Video Enhancement

Title: Context-Aware Autoregressive Models for Multi-Conditional Image Generation

Title: Model alignment using inter-modal bridges

Title: Is Artificial Intelligence Generated Image Detection a Solved Problem?

Title: Towards Open-world Generalized Deepfake Detection: General Feature Extraction via Unsupervised Domain Adaptation

Title: AbFlowNet: Optimizing Antibody-Antigen Binding Energy via Diffusion-GFlowNet Fusion

Title: CLIP-aware Domain-Adaptive Super-Resolution

Title: Few-Shot Concept Unlearning with Low Rank Adaptation

Title: DPCD: A Quality Assessment Database for Dynamic Point Clouds

Title: Joint Embedding vs Reconstruction: Provable Benefits of Latent Space Prediction for Self Supervised Learning

Title: Guiding Diffusion with Deep Geometric Moments: Balancing Fidelity and Variation

Title: Video-GPT via Next Clip Diffusion

Title: Unsupervised Invariant Risk Minimization

Title: Towards Budget-Friendly Model-Agnostic Explanation Generation for Large Language Models

Title: Exploring Sparsity for Parameter Efficient Fine Tuning Using Wavelets

Title: ChemPile: A 250GB Diverse and Curated Dataset for Chemical Foundation Models

Title: Event-based Star Tracking under Spacecraft Jitter: the e-STURT Dataset

Title: SurveillanceVQA-589K: A Benchmark for Comprehensive Surveillance Video-Language Understanding with Large Models

Title: Diff-MM: Exploring Pre-trained Text-to-Image Generation Model for Unified Multi-modal Object Tracking

Title: BusterX: MLLM-Powered AI-Generated Video Forgery Detection and Explanation

Title: Dual-Agent Reinforcement Learning for Automated Feature Generation

Title: Degradation-Aware Feature Perturbation for All-in-One Image Restoration

Title: Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents

Title: MVPainter: Accurate and Detailed 3D Texture Generation via Multi-View Diffusion with Geometric Control

Title: Safe-Sora: Safe Text-to-Video Generation via Graphical Watermarking

Title: Few-Step Diffusion via Score identity Distillation

Title: CURE: Concept Unlearning via Orthogonal Representation Editing in Diffusion Models

Title: PLAICraft: Large-Scale Time-Aligned Vision-Speech-Action Dataset for Embodied AI

Title: Confidence-Regulated Generative Diffusion Models for Reliable AI Agent Migration in Vehicular Metaverses

Title: Any-to-Any Learning in Computational Pathology via Triplet Multimodal Pretraining

Title: MVAR: Visual Autoregressive Modeling with Scale and Spatial Markovian Conditioning

Title: ProDS: Preference-oriented Data Selection for Instruction Tuning

Title: A Study on the Refining Handwritten Font by Mixing Font Styles

Title: Accelerate TarFlow Sampling with GS-Jacobi Iteration

Title: Towards a Universal Image Degradation Model via Content-Degradation Disentanglement

Title: PhyDA: Physics-Guided Diffusion Models for Data Assimilation in Atmospheric Systems

Title: TinyAlign: Boosting Lightweight Vision-Language Models by Mitigating Modal Alignment Bottlenecks

Title: ORQA: A Benchmark and Foundation Model for Holistic Operating Room Modeling

Title: Active Learning on Synthons for Molecular Design

Title: LatentINDIGO: An INN-Guided Latent Diffusion Algorithm for Image Restoration

Title: Leveraging LLM Inconsistency to Boost Pass@k Performance

Title: Generative Modeling of Random Fields from Limited Data via Constrained Latent Flow Matching

Title: LiBOG: Lifelong Learning for Black-Box Optimizer Generation

Title: RGB-to-Polarization Estimation: A New Task and Benchmark Study

Title: Touch2Shape: Touch-Conditioned 3D Diffusion for Shape Exploration and Reconstruction

Title: Why Knowledge Distillation Works in Generative Models: A Minimal Working Explanation

Title: Just Dance with $π$! A Poly-modal Inductor for Weakly-supervised Video Anomaly Detection

Title: Adaptive Image Restoration for Video Surveillance: A Real-Time Approach

Title: CacheFlow: Fast Human Motion Prediction by Cached Normalizing Flow

Title: Emergence of Fixational and Saccadic Movements in a Multi-Level Recurrent Attention Model for Vision

Title: True Zero-Shot Inference of Dynamical Systems Preserving Long-Term Statistics

Title: A Physics-Inspired Optimizer: Velocity Regularized Adam

Title: MAGI-1: Autoregressive Video Generation at Scale

Title: Swin DiT: Diffusion Transformer using Pseudo Shifted Windows

Title: WriteViT: Handwritten Text Generation with Vision Transformer

Title: RN-F: A Novel Approach for Mitigating Contaminated Data in Large Language Models

Title: eStonefish-scenes: A synthetically generated dataset for underwater event-based optical flow prediction tasks

Title: Denoising Diffusion Probabilistic Model for Point Cloud Compression at Low Bit-Rates

Title: VesselGPT: Autoregressive Modeling of Vascular Geometry

Title: RoPECraft: Training-Free Motion Transfer with Trajectory-Guided RoPE Optimization on Diffusion Transformers

Title: One-Step Offline Distillation of Diffusion-based Models via Koopman Modeling

Title: Restoration Score Distillation: From Corrupted Diffusion Pretraining to One-Step High-Quality Generation

Title: Faster Video Diffusion with Trainable Sparse Attention

Title: Understanding Complexity in VideoQA via Visual Program Generation

Title: Synthetic-Powered Predictive Inference

Title: FinePhys: Fine-grained Human Action Generation by Explicitly Incorporating Physical Laws for Effective Skeletal Guidance

Title: VTBench: Evaluating Visual Tokenizers for Autoregressive Image Generation

Title: Mean Flows for One-step Generative Modeling