2025-05-30

Title: Update Your Transformer to the Latest Release: Re-Basin of Task Vectors

Title: HiDream-I1: A High-Efficient Image Generative Foundation Model with Sparse Diffusion Transformer

Title: Pre-Training Curriculum for Multi-Token Prediction in Language Models

Title: FAMA: The First Large-Scale Open-Science Speech Foundation Model for English and Italian

Title: MIAS-SAM: Medical Image Anomaly Segmentation without thresholding

Title: Multivariate de Bruijn Graphs: A Symbolic Graph Framework for Time Series Forecasting

Title: Automated Essay Scoring Incorporating Annotations from Automated Feedback Systems

Title: Navigating the Latent Space Dynamics of Neural Models

Title: Rhetorical Text-to-Image Generation via Two-layer Diffusion Policy Optimization

Title: IMTS is Worth Time $\times$ Channel Patches: Visual Masked Autoencoders for Irregular Multivariate Time Series Prediction

Title: Preference Learning with Response Time

Title: How Do Diffusion Models Improve Adversarial Robustness?

Title: Kernel-Smoothed Scores for Denoising Diffusion: A Bias-Variance Study

Title: Security Benefits and Side Effects of Labeling AI-Generated Images

Title: RocqStar: Leveraging Similarity-driven Retrieval and Agentic Systems for Rocq generation

Title: CLIPGaussian: Universal and Multimodal Style Transfer Based on Gaussian Splatting

Title: A Probabilistic Jump-Diffusion Framework for Open-World Egocentric Activity Recognition

Title: Scaling Offline RL via Efficient and Expressive Shortcut Models

Title: CFP-Gen: Combinatorial Functional Protein Generation via Diffusion Language Models

Title: Defining Foundation Models for Computational Science: A Call for Clarity and Rigor

Title: Re-ttention: Ultra Sparse Visual Generation via Attention Statistical Reshape

Title: Leveraging Diffusion Models for Synthetic Data Augmentation in Protein Subcellular Localization Classification

Title: Is Noise Conditioning Necessary? A Unified Theory of Unconditional Graph Diffusion Models

Title: ATI: Any Trajectory Instruction for Controllable Video Generation

Title: Directed Graph Grammars for Sequence-based Learning

Title: LLM-based HSE Compliance Assessment: Benchmark, Performance, and Advancements

Title: Exploring Scaling Laws for EHR Foundation Models

Title: EquiReg: Equivariance Regularized Diffusion for Inverse Problems

Title: HyperMotion: DiT-Based Pose-Guided Human Image Animation of Complex Motions

Title: MOVi: Training-free Text-conditioned Multi-Object Video Generation

Title: LLM Agents for Bargaining with Utility-based Feedback

Title: Spatio-Temporal Joint Density Driven Learning for Skeleton-Based Action Recognition

Title: $K^2$VAE: A Koopman-Kalman Enhanced Variational AutoEncoder for Probabilistic Time Series Forecasting

Title: EL4NER: Ensemble Learning for Named Entity Recognition via Multiple Small-Parameter Large Language Models

Title: From Theory to Application: Fine-Tuning Large EEG Model with Real-World Stress Data

Title: ProDiff: Prototype-Guided Diffusion for Minimal Information Trajectory Imputation

Title: Zero-P-to-3: Zero-Shot Partial-View Images to 3D Object

Title: DINGO: Constrained Inference for Diffusion LLMs

Title: GeoMan: Temporally Consistent Human Geometry Estimation using Image-to-Video Diffusion

Title: Weight Spectra Induced Efficient Model Adaptation

Title: Generating Diverse Training Samples for Relation Extraction with Large Language Models

Title: Diffusion-Based Generative Models for 3D Occupancy Prediction in Autonomous Driving

Title: TextSR: Diffusion Super-Resolution with Multilingual OCR Guidance

Title: MMGT: Motion Mask Guided Two-Stage Network for Co-Speech Gesture Video Generation

Title: HMAD: Advancing E2E Driving with Anchored Offset Proposals and Simulation-Supervised Multi-target Scoring

Title: Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editing

Title: FlowAlign: Trajectory-Regularized, Inversion-Free Flow-based Image Editing

Title: Implicit Inversion turns CLIP into a Decoder

Title: Tell, Don't Show: Leveraging Language Models' Abstractive Retellings to Model Literary Themes

Title: RoboTransfer: Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer

Title: FreRA: A Frequency-Refined Augmentation for Contrastive Learning on Time Series Classification

Title: HiGarment: Cross-modal Harmony Based Diffusion Model for Flat Sketch to Realistic Garment Image

Title: Less is More: Unlocking Specialization of Time Series Foundation Models via Structured Pruning

Title: UniTEX: Universal High Fidelity Generative Texturing for 3D Shapes

Title: Efficiently Access Diffusion Fisher: Within the Outer Product Span Space

Title: Image Aesthetic Reasoning: A New Benchmark for Medical Image Screening with MLLMs

Title: Does Machine Unlearning Truly Remove Model Knowledge? A Framework for Auditing Unlearning in LLMs

Title: RSFAKE-1M: A Large-Scale Dataset for Detecting Diffusion-Generated Remote Sensing Forgeries

Title: GenCAD-Self-Repairing: Feasibility Enhancement for 3D CAD Generation

Title: Federated Unsupervised Semantic Segmentation

Title: Score-based Generative Modeling for Conditional Independence Testing

Title: TRACE: Trajectory-Constrained Concept Erasure in Diffusion Models

Title: Neither Stochastic Parroting nor AGI: LLMs Solve Tasks through Context-Directed Extrapolation from Training Data Priors

Title: Dimension-Reduction Attack! Video Generative Models are Experts on Controllable Image Synthesis

Title: Fine-Tuning Next-Scale Visual Autoregressive Models with Group Relative Policy Optimization

Title: Diffusion Sampling Path Tells More: An Efficient Plug-and-Play Strategy for Sample Filtering

Title: Graph Positional Autoencoders as Self-supervised Learners

Title: Sentinel: Scheduling Live Streams with Proactive Anomaly Detection in Crowdsourced Cloud-Edge Platforms

Title: Discriminative Policy Optimization for Token-Level Reward Models

Title: Bridging Geometric and Semantic Foundation Models for Generalized Monocular Depth Estimation

Title: From Parameters to Prompts: Understanding and Mitigating the Factuality Gap between Fine-Tuned LLMs

Title: Bidirectional predictive coding

Title: Enhanced DACER Algorithm with High Diffusion Efficiency

Title: UrbanCraft: Urban View Extrapolation via Hierarchical Sem-Geometric Priors

Title: CryoCCD: Conditional Cycle-consistent Diffusion with Biophysical Modeling for Cryo-EM Synthesis

Title: Diffusion Guidance Is a Controllable Policy Improvement Operator

Title: LAFR: Efficient Diffusion-based Blind Face Restoration via Latent Codebook Alignment Adapter

Title: TimePoint: Accelerated Time Series Alignment via Self-Supervised Keypoint and Descriptor Learning

Title: Spoken Language Modeling with Duration-Penalized Self-Supervised Units

Title: VAU-R1: Advancing Video Anomaly Understanding via Reinforcement Fine-Tuning

Title: CLIP-AE: CLIP-assisted Cross-view Audio-Visual Enhancement for Unsupervised Temporal Action Localization

Title: Hallo4: High-Fidelity Dynamic Portrait Animation via Direct Preference Optimization and Temporal Motion Modulation

Title: Normalizing Flows are Capable Models for RL

Title: Subgraph Gaussian Embedding Contrast for Self-Supervised Graph Representation Learning

Title: Maximum Likelihood Learning of Latent Dynamics Without Reconstruction

Title: BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model

Title: Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model

Title: Inference-time Scaling of Diffusion Models through Classical Search

Title: MCP Safety Training: Learning to Refuse Falsely Benign MCP Exploits using Improved Preference Alignment

Title: VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models

Title: D-AR: Diffusion via Autoregressive Models

Title: OpenUni: A Simple Baseline for Unified Multimodal Understanding and Generation

Title: ImmunoDiff: A Diffusion Model for Immunotherapy Response Prediction in Lung Cancer

Title: Automatic classification of stop realisation with wav2vec2.0

Title: DiCoFlex: Model-agnostic diverse counterfactuals with flexible control

Title: Knowledge Insulating Vision-Language-Action Models: Train Fast, Run Fast, Generalize Better

Title: SenWiCh: Sense-Annotation of Low-Resource Languages for WiC using Hybrid Methods

Title: TiRex: Zero-Shot Forecasting Across Long and Short Horizons with Enhanced In-Context Learning

Title: DiffER: Categorical Diffusion for Chemical Retrosynthesis

Title: Label-Guided In-Context Learning for Named Entity Recognition

Title: FMG-Det: Foundation Model Guided Robust Object Detection

Title: ATLAS: Learning to Optimally Memorize the Context at Test Time

Title: How Animals Dance (When You're Not Looking)

Title: LayerPeeler: Autoregressive Peeling for Layer-wise Image Vectorization

Title: MAGREF: Masked Guidance for Any-Reference Video Generation

Title: DarkDiff: Advancing Low-Light Raw Enhancement by Retasking Diffusion Models for Camera ISP

Title: Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence

Title: LoRAShop: Training-Free Multi-Concept Image Generation and Editing with Rectified Flow Transformers