2025-07-15

Title: Recurrent Expansion: A Pathway Toward the Next Generation of Deep Learning

Title: GUIDE: Towards Scalable Advising for Research Ideas

Title: Next-Generation Travel Demand Modeling with a Generative Framework for Household Activity Coordination

Title: Detecting Deepfake Talking Heads from Facial Biometric Anomalies

Title: Beyond Scores: Proximal Diffusion Models

Title: Theory-Informed Improvements to Classifier-Free Guidance for Discrete Diffusion Models

Title: Learning Diffusion Models with Flexible Representation Guidance

Title: Exploiting Leaderboards for Large-Scale Distribution of Malicious Models

Title: On Evaluating Performance of LLM Inference Serving Systems

Title: Behavioral Exploration: Learning to Explore via In-Context Adaptation

Title: Shortening the Trajectories: Identity-Aware Gaussian Approximation for Efficient 3D Molecular Generation

Title: Can Contrastive Learning Improve Class-Imbalanced Diffusion Model?

Title: From Physics to Foundation Models: A Review of AI-Driven Quantitative Remote Sensing Inversion

Title: Taming generative video models for zero-shot optical flow extraction

Title: RadEyeVideo: Enhancing general-domain Large Vision Language Model for chest X-ray analysis with video representations of eye gaze

Title: Harnessing Text-to-Image Diffusion Models for Point Cloud Self-Supervised Learning

Title: Hybrid Autoregressive-Diffusion Model for Real-Time Streaming Sign Language Production

Title: SnapMoGen: Human Motion Generation from Expressive Texts

Title: $I^{2}$-World: Intra-Inter Tokenization for Efficient Dynamic 4D Scene Forecasting

Title: THYME: Temporal Hierarchical-Cyclic Interactivity Modeling for Video Scene Graphs in Aerial Footage

Title: Capturing Unseen Spatial Extremes Through Knowledge-Informed Generative Modeling

Title: Warm Starts Accelerate Generative Modelling

Title: EgoAnimate: Generating Human Animations from Egocentric top-down Views

Title: Generative Latent Kernel Modeling for Blind Motion Deblurring

Title: Geo-RepNet: Geometry-Aware Representation Learning for Surgical Phase Recognition in Endoscopic Submucosal Dissection

Title: AlphaVAE: Unified End-to-End RGBA Image Reconstruction and Generation with Alpha-Aware Representation Learning

Title: Geometric Generative Modeling with Noise-Conditioned Graph Networks

Title: Domain Adaptation and Multi-view Attention for Learnable Landmark Tracking with Sparse Data

Title: Toward Developing Machine-Learning-Aided Tools for the Thermomechanical Monitoring of Nuclear Reactor Components

Title: La-Proteina: Atomistic Protein Generation via Partially Latent Flow Matching

Title: Assessing reliability of explanations in unbalanced datasets: a use-case on the occurrence of frost events

Title: WordCraft: Interactive Artistic Typography with Attention Awareness and Noise Blending

Title: MENTOR: Efficient Multimodal-Conditioned Tuning for Autoregressive Vision Generation Models

Title: Demystifying Flux Architecture

Title: Generate Aligned Anomaly: Region-Guided Few-Shot Anomaly Image-Mask Pair Synthesis for Industrial Inspection

Title: Brain Stroke Detection and Classification Using CT Imaging with Transformer Models and Explainable AI

Title: Prompt2DEM: High-Resolution DEMs for Urban and Open Environments from Global Prompts Using a Monocular Foundation Model

Title: Post-Training Quantization of Generative and Discriminative LSTM Text Classifiers: A Study of Calibration, Class Balance, and Robustness

Title: ExpStar: Towards Automatic Commentary Generation for Multi-discipline Scientific Experiments

Title: Continental scale habitat modelling with artificial intelligence and multimodal earth observation

Title: Universal Physics Simulation: A Foundational Diffusion Approach

Title: Advancing Text-to-3D Generation with Linearized Lookahead Variational Score Distillation

Title: Do we need equivariant models for molecule generation?

Title: Efficient Molecular Conformer Generation with SO(3)-Averaged Flow Matching and Reflow

Title: Generative Cognitive Diagnosis

Title: A Pre-training Framework for Relational Data with Information-theoretic Principles

Title: SpeakerVid-5M: A Large-Scale High-Quality Dataset for Audio-Visual Dyadic Interactive Human Generation

Title: Counterfactual Visual Explanation via Causally-Guided Adversarial Steering

Title: IGD: Instructional Graphic Design with Multimodal Layer Generation

Title: Crucial-Diff: A Unified Diffusion Model for Crucial Image and Annotation Synthesis in Data-scarce Scenarios

Title: Long-Tailed Data Classification by Increasing and Decreasing Neurons During Training

Title: Iceberg: Enhancing HLS Modeling with Synthetic Data

Title: 4D-MISR: A unified model for low-dose super-resolution imaging via feature fusion

Title: Compliance Minimization via Physics-Informed Gaussian Processes

Title: Latent Diffusion Models with Masked AutoEncoders

Title: 3DGAA: Realistic and Robust 3D Gaussian-based Adversarial Attack for Autonomous Driving

Title: Frequency Regulation for Exposure Bias Mitigation in Diffusion Models

Title: Towards High Supervised Learning Utility Training Data Generation: Data Pruning and Column Reordering

Title: A Training-Free, Task-Agnostic Framework for Enhancing MLLM Performance on High-Resolution Images

Title: From Wardrobe to Canvas: Wardrobe Polyptych LoRA for Part-level Controllable Human Image Generation

Title: Straighten Viscous Rectified Flow via Noise Optimization

Title: Spatial Lifting for Dense Prediction

Title: Navigating the Challenges of AI-Generated Image Detection in the Wild: What Truly Matters?

Title: Conditional Chemical Language Models are Versatile Tools in Drug Discovery

Title: Show and Polish: Reference-Guided Identity Preservation in Face Video Restoration

Title: Mind the Gap: Aligning Vision Foundation Models to Image Feature Matching

Title: MoCap-Impute: A Comprehensive Benchmark and Comparative Analysis of Imputation Methods for IMU-based Motion Capture Data

Title: Text Embedding Knows How to Quantize Text-Guided Diffusion Models

Title: Text-Visual Semantic Constrained AI-Generated Image Quality Assessment

Title: RefSTAR: Blind Facial Image Restoration with Reference Selection, Transfer, and Reconstruction

Title: Graph World Model