2025-05-21

Title: Optimal Control for Transformer Architectures: Enhancing Generalization, Robustness and Efficiency

Title: OMGPT: A Sequence Modeling Framework for Data-driven Operational Decision Making

Title: Incentivizing Truthful Language Models via Peer Elicitation Games

Title: Improving Compositional Generation with Diffusion Models Using Lift Scores

Title: Synthetic Non-stationary Data Streams for Recognition of the Unknown

Title: Scalable Autoregressive 3D Molecule Generation

Title: Context-Free Synthetic Data Mitigates Forgetting

Title: SuperMapNet for Long-Range and High-Accuracy Vectorized HD Map Construction

Title: Exploring Causes of Representational Similarity in Machine Learning Models

Title: Blind Restoration of High-Resolution Ultrasound Video

Title: LoVR: A Benchmark for Long Video Retrieval in Multimodal Contexts

Title: RLVR-World: Training World Models with Reinforcement Learning

Title: CLEVER: A Curated Benchmark for Formally Verified Code Generation

Title: Every Pixel Tells a Story: End-to-End Urdu Newspaper OCR

Title: UHD Image Dehazing via anDehazeFormer with Atmospheric-aware KV Cache

Title: OmniStyle: Filtering High Quality Style Transfer Data at Scale

Title: Adaptive Cyclic Diffusion for Inference Scaling

Title: MAS-KCL: Knowledge component graph structure learning with large language model-based agentic workflow

Title: Hunyuan-Game: Industrial-grade Intelligent Game Creation Model

Title: FlowQ: Energy-Guided Flow Policies for Offline Reinforcement Learning

Title: ReactDiff: Latent Diffusion for Facial Reaction Generation

Title: Unify Graph Learning with Text: Unleashing LLM Potentials for Session Search

Title: LMP: Leveraging Motion Prior in Zero-Shot Video Generation with Diffusion Transformer

Title: $\alpha$-GAN by Rényi Cross Entropy

Title: MSDformer: Multi-scale Discrete Transformer For Time Series Generation

Title: Challenges and Limitations in the Synthetic Generation of mHealth Sensor Data

Title: Flexible-weighted Chamfer Distance: Enhanced Objective Function for Point Cloud Completion

Title: Instructing Text-to-Image Diffusion Models via Classifier-Guided Semantic Optimization

Title: Towards Generating Realistic Underwater Images

Title: RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection

Title: Handloom Design Generation Using Generative Networks

Title: Vid2World: Crafting Video Diffusion Models to Interactive World Models

Title: Dual Data Alignment Makes AI-Generated Image Detector Easier Generalizable

Title: Vision-Language Modeling Meets Remote Sensing: Models, Datasets and Perspectives

Title: ViC-Bench: Benchmarking Visual-Interleaved Chain-of-Thought Capability in MLLMs with Free-Style Intermediate State Representations

Title: Video Compression Commander: Plug-and-Play Inference Acceleration for Video Large Language Models

Title: VisualQuality-R1: Reasoning-Induced Image Quality Assessment via Reinforcement Learning to Rank

Title: RAVENEA: A Benchmark for Multimodal Retrieval-Augmented Visual Culture Understanding

Title: Enhancing Interpretability of Sparse Latent Representations with Class Information

Title: Latent Flow Transformer

Title: SparC: Sparse Representation and Construction for High-Resolution 3D Shapes Modeling

Title: Personalize Your Gaussian: Consistent 3D Scene Personalization from a Single Image

Title: Dynadiff: Single-stage Decoding of Images from Continuously Evolving fMRI

Title: CSTS: A Benchmark for the Discovery of Correlation Structures in Time Series Clustering

Title: KERL: Knowledge-Enhanced Personalized Recipe Recommendation using Large Language Models

Title: CAD-Coder: An Open-Source Vision-Language Model for Computer-Aided Design Code Generation

Title: UniCTokens: Boosting Personalized Understanding and Generation via Unified Concept Tokens

Title: Training-Free Watermarking for Autoregressive Image Generation

Title: UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation

Title: Emerging Properties in Unified Multimodal Pretraining

Title: Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers