2025-12-15

Title: MolSculpt: Sculpting 3D Molecular Geometries from Chemical Syntax

Title: PIAST: Rapid Prompting with In-context Augmentation for Scarce Training data

Title: SoccerMaster: A Vision Foundation Model for Soccer Understanding

Title: VDAWorld: World Modelling via VLM-Directed Abstraction and Simulation

Title: Vision-Language Models for Infrared Industrial Sensing in Additive Manufacturing Scene Description

Title: Information-driven Fusion of Pathology Foundation Models for Enhanced Disease Characterization

Title: In-Context Multi-Objective Optimization

Title: Learning from a Generative Oracle: Domain Adaptation for Restoration

Title: Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching

Title: Beyond Memorization: Gradient Projection Enables Selective Learning in Diffusion Models

Title: CADKnitter: Compositional CAD Generation from Text and Geometry Guidance

Title: AutoRefiner: Improving Autoregressive Video Diffusion Models via Reflective Refinement Over the Stochastic Sampling Path

Title: VFMF: World Modeling by Forecasting Vision Foundation Model Features

Title: REST: Diffusion-based Real-time End-to-end Streaming Talking Head Generation via ID-Context Caching and Asynchronous Streaming Distillation

Title: WildCap: Facial Appearance Capture in the Wild via Hybrid Inverse Rendering

Title: PersonaLive! Expressive Portrait Image Animation for Live Streaming

Title: A Simple Generalisation of the Implicit Dynamics of In-Context Learning

Title: FilmWeaver: Weaving Consistent Multi-Shot Videos with Cache-Guided Autoregressive Diffusion

Title: RcAE: Recursive Reconstruction Framework for Unsupervised Industrial Anomaly Detection

Title: QGEC: Quantum Golay Code Error Correction

Title: Benchmarking the Generality of Vision-Language-Action Models

Title: Symmetry-Aware Steering of Equivariant Diffusion Policies: Benefits and Limits

Title: Collaborative Reconstruction and Repair for Multi-class Industrial Anomaly Detection

Title: Sliced ReLU attention: Quasi-linear contextual expressivity via sorting

Title: JoyAvatar: Real-time and Infinite Audio-Driven Avatar Generation with Autoregressive Diffusion

Title: Exploring MLLM-Diffusion Information Transfer with MetaCanvas

Title: DOS: Distilling Observable Softmaps of Zipfian Prototypes for Self-Supervised Point Representation

Title: CADMorph: Geometry-Driven Parametric CAD Editing via a Plan-Generate-Verify Loop

Title: Mistake Notebook Learning: Selective Batch-Wise Context Optimization for In-Context Learning

Title: VLM2GeoVec: Toward Universal Multimodal Embeddings for Remote Sensing

Title: SSA3D: Text-Conditioned Assisted Self-Supervised Framework for Automatic Dental Abutment Design

Title: On Geometric Understanding and Learned Data Priors in VGGT

Title: NeuralOGCM: Differentiable Ocean Modeling with Learnable Physics

Title: xGR: Efficient Generative Recommendation Serving at Scale

Title: A Multi-Criteria Automated MLOps Pipeline for Cost-Effective Cloud-Based Classifier Retraining in Response to Data Distribution Shifts

Title: Infinity and Beyond: Compositional Alignment in VAR and Diffusion T2I Models

Title: SSL-MedSAM2: A Semi-supervised Medical Image Segmentation Framework Powered by Few-shot Learning of SAM2

Title: 3DTeethSAM: Taming SAM2 for 3D Teeth Segmentation

Title: Fully Inductive Node Representation Learning via Graph View Transformation

Title: Evaluating Foundation Models' 3D Understanding Through Multi-View Correspondence Analysis

Title: In-Context Learning for Seismic Data Processing

Title: Brain-Semantoks: Learning Semantic Tokens of Brain Dynamics with a Self-Distilled Foundation Model

Title: Fast and Explicit: Slice-to-Volume Reconstruction via 3D Gaussian Primitives with Analytic Point Spread Function Modeling

Title: FactorPortrait: Controllable Portrait Animation via Disentangled Expression, Pose, and Viewpoint

Title: Kinetic Mining in Context: Few-Shot Action Synthesis via Text-to-Motion Distillation

Title: Bridging Streaming Continual Learning via In-Context Large Tabular Models

Title: EditMGT: Unleashing Potentials of Masked Generative Transformers in Image Editing

Title: Referring Change Detection in Remote Sensing Imagery

Title: SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder

Title: Reducing Domain Gap with Diffusion-Based Domain Adaptation for Cell Counting

Title: Softmax as Linear Attention in the Large-Prompt Regime: a Measure-based Perspective

Title: Structure From Tracking: Distilling Structure-Preserving Motion for Video Generation