2026-02-27

Title: Zatom-1: A Multimodal Flow Foundation Model for 3D Molecules and Materials

Title: BrepCoder: A Unified Multimodal Large Language Model for Multi-task B-rep Reasoning

Title: MammoWise: Multi-Model Local RAG Pipeline for Mammography Report Generation

Title: Space Syntax-guided Post-training for Residential Floor Plan Generation

Title: DrivePTS: A Progressive Learning Framework with Textual and Structural Enhancement for Driving Scene Generation

Title: Autoregressive Visual Decoding from EEG Signals

Title: Guidance Matters: Rethinking the Evaluation Pitfall for Text-to-Image Generation

Title: GIFSplat: Generative Prior-Guided Iterative Feed-Forward 3D Gaussian Splatting from Sparse Views

Title: TabDLM: Free-Form Tabular Data Generation via Joint Numerical-Language Diffusion

Title: Causal Motion Diffusion Models for Autoregressive Motion Generation

Title: BetterScene: 3D Scene Synthesis with Representation-Aligned Generative Model

Title: Transformers converge to invariant algorithmic cores

Title: LoR-LUT: Learning Compact 3D Lookup Tables via Low-Rank Residuals

Title: Instruction-based Image Editing with Planning, Reasoning, and Generation

Title: CRAG: Can 3D Generative Models Help 3D Assembly?

Title: Denoising as Path Planning: Training-Free Acceleration of Diffusion Models with DPCache

Title: Scaling Audio-Visual Quality Assessment Dataset via Crowdsourcing

Title: Forecasting Antimicrobial Resistance Trends Using Machine Learning on WHO GLASS Surveillance Data: A Retrieval-Augmented Generation Approach for Policy Decision Support

Title: SUPERGLASSES: Benchmarking Vision Language Models as Intelligent Agents for AI Smart Glasses

Title: No Caption, No Problem: Caption-Free Membership Inference via Model-Fitted Embeddings

Title: Enhancing Geometric Perception in VLMs via Translator-Guided Reinforcement Learning

Title: IRSDE-Despeckle: A Physics-Grounded Diffusion Model for Generalizable Ultrasound Despeckling

Title: SPATIALALIGN: Aligning Dynamic Spatial Relationships in Video Generation

Title: Beyond Detection: Multi-Scale Hidden-Code for Natural Image Deepfake Recovery and Factual Retrieval

Title: SceneTransporter: Optimal Transport-Guided Compositional Latent Diffusion for Single-Image Structured 3D Scene Generation

Title: GSTurb: Gaussian Splatting for Atmospheric Turbulence Mitigation

Title: PhotoAgent: Agentic Photo Editing with Exploratory Visual Aesthetic Planning

Title: A data- and compute-efficient chest X-ray foundation model beyond aggressive scaling

Title: MEDNA-DFM: A Dual-View FiLM-MoE Model for Explainable DNA Methylation Prediction

Title: From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models

Title: Chain of Flow: A Foundational Generative Framework for ECG-to-4D Cardiac Digital Twins

Title: OSDaR-AR: Enhancing Railway Perception Datasets via Multi-modal Augmented Reality

Title: MSJoE: Jointly Evolving MLLM and Sampler for Efficient Long-Form Video Understanding

Title: ToProVAR: Efficient Visual Autoregressive Modeling via Tri-Dimensional Entropy-Aware Semantic Analysis and Sparsity Optimization

Title: MM-NeuroOnco: A Multimodal Benchmark and Instruction Dataset for MRI-Based Brain Tumor Diagnosis

Title: UCM: Unifying Camera Control and Memory with Time-aware Positional Encoding Warping for World Models

Title: DMAligner: Enhancing Image Alignment via Diffusion Model Based View Synthesis

Title: RhythmBERT: A Self-Supervised Language Model Based on Latent Representations of ECG Waveforms for Heart Disease Detection

Title: Benchmarking Temporal Web3 Intelligence: Lessons from the FinSurvival 2025 Challenge

Title: MetaOthello: A Controlled Study of Multiple World Models in Transformers

Title: DyaDiT: A Multi-Modal Diffusion Transformer for Socially Favorable Dyadic Gesture Generation

Title: Learning Continuous Wasserstein Barycenter Space for Generalized All-in-One Image Restoration

Title: Efficient Real-Time Adaptation of ROMs for Unsteady Flows Using Data Assimilation

Title: InnerQ: Hardware-aware Tuning-free Quantization of KV Cache for Large Language Models

Title: ColoDiff: Integrating Dynamic Consistency With Content Awareness for Colonoscopy Video Generation

Title: Through BrokenEyes: How Eye Disorders Impact Face Detection?

Title: Plug-and-Play Diffusion Meets ADMM: Dual-Variable Coupling for Robust Medical Image Reconstruction

Title: MovieTeller: Tool-augmented Movie Synopsis with ID Consistent Progressive Abstraction

Title: Large Multimodal Models as General In-Context Classifiers

Title: Risk-Aware World Model Predictive Control for Generalizable End-to-End Autonomous Driving

Title: Decomposing Private Image Generation via Coarse-to-Fine Wavelet Modeling

Title: ManifoldGD: Training-Free Hierarchical Manifold Guidance for Diffusion-Based Dataset Distillation

Title: PRIMA: Pre-training with Risk-integrated Image-Metadata Alignment for Medical Diagnosis via LLM

Title: A Proper Scoring Rule for Virtual Staining

Title: ParamMem: Augmenting Language Agents with Parametric Reflective Memory

Title: SeeThrough3D: Occlusion Aware 3D Control in Text-to-Image Generation