2025-06-11

Title: ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving

Title: Eliciting Fine-Tuned Transformer Capabilities via Inference-Time Techniques

Title: CuRe: Cultural Gaps in the Long Tail of Text-to-Image Systems

Title: Benchmarking Pre-Trained Time Series Models for Electricity Price Forecasting

Title: BLUR: A Bi-Level Optimization Approach for LLM Unlearning

Title: Surgeon Style Fingerprinting and Privacy Risk Quantification via Discrete Diffusion Models in a Vision-Language-Action Framework

Title: Generative Learning of Differentiable Object Models for Compositional Interpretation of Complex Scenes

Title: GIQ: Benchmarking 3D Geometric Reasoning of Vision Foundation Models with Simulated and Real Polyhedra

Title: A Comprehensive Study of Decoder-Only LLMs for Text-to-Image Generation

Title: Using Satellite Images And Self-supervised Machine Learning Networks To Detect Water Hidden Under Vegetation

Title: Highly Compressed Tokenizer Can Generate Without Training

Title: Seeing Voices: Generating A-Roll Video from Audio with Mirage

Title: H$^2$GFM: Towards unifying Homogeneity and Heterogeneity on Text-Attributed Graphs

Title: Why Masking Diffusion Works: Condition on the Jump Schedule for Improved Discrete Diffusion

Title: How Good LLM-Generated Password Policies Are?

Title: Graph Prompting for Graph Learning Models: Recent Advances and Future Directions

Title: A Simple Analysis of Discretization Error in Diffusion Models

Title: Dynamical System Optimization

Title: How Much To Guide: Revisiting Adaptive Guidance in Classifier-Free Guidance Text-to-Vision Diffusion Models

Title: Learning to Hear Broken Motors: Signature-Guided Data Augmentation for Induction-Motor Diagnostics

Title: MARMOT: Masked Autoencoder for Modeling Transient Imaging

Title: Context-aware TFL: A Universal Context-aware Contrastive Learning Framework for Temporal Forgery Localization

Title: LiftVSR: Lifting Image Diffusion to Video Super-Resolution via Hybrid Temporal Modeling with Only 4$\times$RTX 4090s

Title: TrajFlow: Multi-modal Motion Prediction via Flow Matching

Title: Neighbors and relatives: How do speech embeddings reflect linguistic connections across the world?

Title: Generating Vision-Language Navigation Instructions Incorporated Fine-Grained Alignment Annotations

Title: Diffusion-based Time Series Forecasting for Sewerage Systems

Title: Flow Matching Meets PDEs: A Unified Framework for Physics-Constrained Generation

Title: Sample Efficient Demonstration Selection for In-Context Learning

Title: RoboSwap: A GAN-driven Video Diffusion Framework For Unsupervised Robot Arm Swapping

Title: Orientation Matters: Making 3D Generative Models Orientation-Aligned

Title: Time Series Representations for Classification Lie Hidden in Pretrained Vision Transformers

Title: Summarization for Generative Relation Extraction in the Microbiome Domain

Title: MoSiC: Optimal-Transport Motion Trajectory for Dense Self-Supervised Learning

Title: Breaking the ICE: Exploring promises and challenges of benchmarks for Inference Carbon & Energy estimation for LLMs

Title: Factors affecting the in-context learning abilities of LLMs for dialogue state tracking

Title: AraReasoner: Evaluating Reasoning-Based LLMs for Arabic NLP

Title: RS-MTDF: Multi-Teacher Distillation and Fusion for Remote Sensing Semi-Supervised Semantic Segmentation

Title: Gaussian2Scene: 3D Scene Representation Learning via Self-supervised Learning with 3D Gaussian Splatting

Title: Landsat-Bench: Datasets and Benchmarks for Landsat Foundation Models

Title: HomographyAD: Deep Anomaly Detection Using Self Homography Learning

Title: A PDE-Based Image Dehazing Method via Atmospheric Scattering Theory

Title: Flow Diverse and Efficient: Learning Momentum Flow Matching via Stochastic Velocity Field Sampling

Title: HunyuanVideo-HOMA: Generic Human-Object Interaction in Multimodal Driven Human Animation

Title: HiSin: Efficient High-Resolution Sinogram Inpainting via Resolution-Guided Progressive Inference

Title: IMAGIC-500: IMputation benchmark on A Generative Imaginary Country (500k samples)

Title: Adapting Vision-Language Foundation Model for Next Generation Medical Ultrasound Image Analysis

Title: InfoDPCCA: Information-Theoretic Dynamic Probabilistic Canonical Correlation Analysis

Title: Product of Experts for Visual Generation

Title: MIRAGE: Multimodal foundation model and benchmark for comprehensive retinal OCT image analysis

Title: Intention-Conditioned Flow Occupancy Models

Title: Quantifying Mix Network Privacy Erosion with Generative Models

Title: BioLangFusion: Multimodal Fusion of DNA, mRNA, and Protein Language Models

Title: SSS: Semi-Supervised SAM-2 with Efficient Prompting for Medical Imaging Segmentation

Title: Segment Concealed Objects with Incomplete Supervision

Title: ORIDa: Object-centric Real-world Image Composition Dataset

Title: GFRIEND: Generative Few-shot Reward Inference through EfficieNt DPO

Title: On Finetuning Tabular Foundation Models

Title: Do Concept Replacement Techniques Really Erase Unacceptable Concepts?

Title: Employing self-supervised learning models for cross-linguistic child speech maturity classification

Title: Branched Schrödinger Bridge Matching

Title: Edit Flows: Flow Matching with Edit Operations

Title: Comparing human and LLM proofreading in L2 writing: Impact on lexical and syntactic features

Title: Do MIL Models Transfer?

Title: e3: Learning to Explore Enables Extrapolation of Test-Time Compute for LLMs

Title: Diffuse and Disperse: Image Generation with Representation Regularization

Title: Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models

Title: MagCache: Fast Video Generation with Magnitude-Aware Cache

Title: Understanding Task Vectors in In-Context Learning: Emergence, Functionality, and Limitations