2025-09-05

Title: Towards Efficient General Feature Prediction in Masked Skeleton Modeling

Title: treeX: Unsupervised Tree Instance Segmentation in Dense Forest Point Clouds

Title: CEHR-GPT: A Scalable Multi-Task Foundation Model for Electronic Health Records

Title: AutoGrid AI: Deep Reinforcement Learning Framework for Autonomous Microgrid Management

Title: Learning an Adversarial World Model for Automated Curriculum Generation in MARL

Title: Fitting Image Diffusion Models on Video Datasets

Title: EGTM: Event-guided Efficient Turbulence Mitigation

Title: Machine Learning for LiDAR-Based Indoor Surface Classification in Intelligent Wireless Environments

Title: Human Motion Video Generation: A Survey

Title: OccTENS: 3D Occupancy World Model via Temporal Next-Scale Prediction

Title: A Generative Foundation Model for Chest Radiography

Title: On Aligning Prediction Models with Clinical Experiential Learning: A Prostate Cancer Case Study

Title: TaleDiffusion: Multi-Character Story Generation with Dialogue Rendering

Title: MEPG: Multi-Expert Planning and Generation for Compositionally-Rich Image Generation

Title: TAGAL: Tabular Data Generation using Agentic LLM Methods

Title: Set Block Decoding is a Language Model Inference Accelerator

Title: DUDE: Diffusion-Based Unsupervised Cross-Domain Image Retrieval

Title: Synthetic Survival Data Generation for Heart Failure Prognosis Using Deep Generative Models

Title: From Editor to Dense Geometry Estimator

Title: AnomalyLMM: Bridging Generative Knowledge and Discriminative Retrieval for Text-Based Person Anomaly Search

Title: PagedEviction: Structured Block-wise KV Cache Pruning for Efficient Large Language Model Inference

Title: Transition Models: Rethinking the Generative Learning Objective

Title: Few-step Flow for 3D Generation via Marginal-Data Transport Distillation

Title: Durian: Dual Reference-guided Portrait Animation with Attribute Transfer

Title: From Lines to Shapes: Geometric-Constrained Segmentation of X-Ray Collimators via Hough Transform

Title: The Telephone Game: Evaluating Semantic Drift in Unified Models

Title: One Flight Over the Gap: A Survey from Perspective to Panoramic Vision

Title: Plot'n Polish: Zero-shot Story Visualization and Disentangled Editing with Text-to-Image Diffusion Models

Title: TRUST-VL: An Explainable News Assistant for General Multimodal Misinformation Detection

Title: Virtual Fitting Room: Generating Arbitrarily Long Videos of Virtual Try-On from a Single Image -- Technical Preview