2025-06-06

Title: Backbone Augmented Training for Adaptations

Title: Relational reasoning and inductive bias in transformers trained on a transitive inference task

Title: GEM: Empowering LLM for both Embedding Generation and Language Understanding

Title: HuGeDiff: 3D Human Generation via Diffusion with Gaussian Splatting

Title: WorldPrediction: A Benchmark for High-level World Modeling and Long-horizon Procedural Planning

Title: Visualizing and Controlling Cortical Responses Using Voxel-Weighted Activation Maximization

Title: The Hashed Fractal Key Recovery (HFKR) Problem: From Symbolic Path Inversion to Post-Quantum Cryptographic Keys

Title: MELABenchv1: Benchmarking Large Language Models against Smaller Fine-Tuned Models for Low-Resource Maltese NLP

Title: Is Perturbation-Based Image Protection Disruptive to Image Editing?

Title: Self-Supervised Contrastive Learning is Approximately Supervised Contrastive Learning

Title: HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation

Title: RETRO SYNFLOW: Discrete Flow Matching for Accurate and Diverse Single-Step Retrosynthesis

Title: Neural MJD: Neural Non-Stationary Merton Jump Diffusion for Time Series Prediction

Title: BESA: Boosting Encoder Stealing Attack with Perturbation Recovery

Title: Are LLMs Reliable Translators of Logical Reasoning Across Lexically Diversified Contexts?

Title: Selecting Demonstrations for Many-Shot In-Context Learning via Gradient Matching

Title: Follow-Your-Creation: Empowering 4D Creation through Video Inpainting

Title: Scaling Laws for Robust Comparison of Open Foundation Language-Vision Models and Datasets

Title: SmartAvatar: Text- and Image-Guided Human Avatar Generation with VLM AI Agents

Title: Exploring bidirectional bounds for minimax-training of Energy-based models

Title: Revisiting Test-Time Scaling: A Survey and a Diversity-Aware Method for Efficient Reasoning

Title: Perfecting Depth: Uncertainty-Aware Enhancement of Metric Depth

Title: Text-Aware Real-World Image Super-Resolution via Diffusion Model with Joint Segmentation Decoders

Title: FPSAttention: Training-Aware FP8 and Sparsity Co-Design for Fast Video Diffusion

Title: Neural Network Reprogrammability: A Unified Theme on Model Reprogramming, Prompt Tuning, and Prompt Instruction

Title: Gen-n-Val: Agentic Image Data Generation and Validation

Title: Explicit Density Approximation for Neural Implicit Samplers Using a Bernstein-Based Convex Divergence

Title: UNO: Unlearning via Orthogonalization in Generative models

Title: Towards Holistic Visual Quality Assessment of AI-Generated Videos: A LLM-Based Multi-Dimensional Evaluation Model

Title: Learning dissection trajectories from expert surgical videos via imitation learning with equivariant diffusion

Title: Using In-Context Learning for Automatic Defect Labelling of Display Manufacturing Data

Title: SRD: Reinforcement-Learned Semantic Perturbation for Backdoor Defense in VLMs

Title: On Automating Security Policies with Contemporary LLMs

Title: Sparse Autoencoders, Again?

Title: Invisible Backdoor Triggers in Image Editing Model via Deep Watermarking

Title: ICPC-Eval: Probing the Frontiers of LLM Reasoning with Competitive Programming Contests

Title: A Practitioner's Guide to Building ASR Models for Low-Resource Languages: A Case Study on Scottish Gaelic

Title: CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx

Title: From Struggle (06-2024) to Mastery (02-2025) LLMs Conquer Advanced Algorithm Exams and Pave the Way for Editorial Generation

Title: Attack Effect Model based Malicious Behavior Detection

Title: UAV4D: Dynamic Neural Rendering of Human-Centric UAV Imagery using Gaussian Splatting

Title: Tuning the Right Foundation Models is What you Need for Partial Label Learning

Title: FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing

Title: SeedEdit 3.0: Fast and High-Quality Generative Image Editing

Title: Astraea: A GPU-Oriented Token-wise Acceleration Framework for Video Diffusion Transformers

Title: Privacy Amplification Through Synthetic Data: Insights from Linear Regression

Title: DIMCIM: A Quantitative Evaluation Framework for Default-mode Diversity and Generalization in Text-to-Image Generative Models

Title: Federated Isolation Forest for Efficient Anomaly Detection on Edge IoT Systems

Title: Track Any Anomalous Object: A Granular Video Anomaly Detection Pipeline

Title: Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Title: Associative Memory and Generative Diffusion in the Zero-noise Limit

Title: Single GPU Task Adaptation of Pathology Foundation Models for Whole Slide Image Analysis

Title: Counterfactual reasoning: an analysis of in-context emergence

Title: Locality Preserving Markovian Transition for Instance Retrieval

Title: Quantifying Cross-Modality Memorization in Vision-Language Models

Title: Transformers Meet In-Context Learning: A Universal Approximation Theory

Title: OGGSplat: Open Gaussian Growing for Generalizable Reconstruction with Expanded Field-of-View

Title: RELIC: Evaluating Compositional Instruction Following via Language Recognition

Title: Follow-Your-Motion: Video Motion Transfer via Efficient Spatial-Temporal Decoupled Finetuning

Title: Towards Vision-Language-Garment Models For Web Knowledge Garment Understanding and Generation

Title: DSG-World: Learning a 3D Gaussian World Model from Dual State Videos

Title: Improving Low-Resource Morphological Inflection via Self-Supervised Objectives

Title: Progressive Tempering Sampler with Diffusion

Title: MesaNet: Sequence Modeling by Locally Optimal Test-Time Training

Title: Aligning Latent Spaces with Flow Priors

Title: Spatiotemporal Contrastive Learning for Cross-View Video Localization in Unstructured Off-road Terrains

Title: Conservative classifiers do consistently well with improving agents: characterizing statistical and online learning

Title: Can Foundation Models Generalise the Presentation Attack Detection Capabilities on ID Cards?

Title: How to Unlock Time Series Editing? Diffusion-Driven Approach with Multi-Grained Control

Title: Rectified Point Flow: Generic Point Cloud Pose Estimation

Title: Stable Vision Concept Transformers for Medical Diagnosis

Title: AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model

Title: A Smooth Sea Never Made a Skilled SAILOR: Robust Imitation via Learning to Search

Title: SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training

Title: Learning normalized image densities via dual score matching

Title: LSM-2: Learning from Incomplete Wearable Sensor Data

Title: Exploring Diffusion Transformer Designs via Grafting

Title: Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning

Title: Contrastive Flow Matching