2025-10-27

Title: Video-As-Prompt: Unified Semantic Control for Video Generation

Title: Code-enabled language models can outperform reasoning models on diverse tasks

Title: Generative Point Tracking with Flow Matching

Title: VESSA: Video-based objEct-centric Self-Supervised Adaptation for Visual Foundation Models

Title: BioDet: Boosting Industrial Object Detection with Image Preprocessing Strategies

Title: Distilled Decoding 2: One-step Sampling of Image Auto-regressive Models with Conditional Score Distillation

Title: Can Current Detectors Catch Face-to-Voice Deepfake Attacks?

Title: From Information to Generative Exponent: Learning Rate Induces Phase Transitions in SGD

Title: Physically consistent and uncertainty-aware learning of spatiotemporal dynamics

Title: Amortized Active Generation of Pareto Sets

Title: Dynamic Retriever for In-Context Knowledge Editing via Policy Optimization

Title: Scalable Machine Learning Analysis of Parker Solar Probe Solar Wind Data

Title: ZING-3D: Zero-shot Incremental 3D Scene Graphs via Vision-Language Models

Title: Controllable-LPMoE: Adapting to Challenging Object Segmentation via Dynamic Local Priors from Mixture-of-Experts

Title: Digital Contrast CT Pulmonary Angiography Synthesis from Non-contrast CT for Pulmonary Vascular Disease

Title: Uncertainty-Aware Multi-Objective Reinforcement Learning-Guided Diffusion Models for 3D De Novo Molecular Design

Title: Towards Physics-informed Spatial Intelligence with Human Priors: An Autonomous Driving Pilot Study

Title: Blockwise Flow Matching: Improving Flow Matching Models For Efficient High-Quality Generation

Title: TokenCLIP: Token-wise Prompt Learning for Zero-shot Anomaly Detection

Title: PLAN: Proactive Low-Rank Allocation for Continual Learning

Title: 3rd Place Solution to ICCV LargeFineFoodAI Retrieval

Title: Mitra: Mixed Synthetic Priors for Enhancing Tabular Foundation Models

Title: On the flow matching interpretability

Title: Model Merging with Functional Dual Anchors

Title: Improved Training Technique for Shortcut Models

Title: Correlation Dimension of Auto-Regressive Large Language Models

Title: Topology Sculptor, Shape Refiner: Discrete Diffusion Model for High-Fidelity 3D Meshes Generation

Title: An Evidence-Based Post-Hoc Adjustment Framework for Anomaly Detection Under Data Contamination

Title: Efficient semantic uncertainty quantification in language models via diversity-steered sampling

Title: VL-SAE: Interpreting and Enhancing Vision-Language Alignment with a Unified Concept Set

Title: Morphologically Intelligent Perturbation Prediction with FORM

Title: Compositional Monte Carlo Tree Diffusion for Extendable Planning

Title: FairImagen: Post-Processing for Bias Mitigation in Text-to-Image Models

Title: BADiff: Bandwidth Adaptive Diffusion Model

Title: TerraGen: A Unified Multi-Task Layout Generation Framework for Remote Sensing Data Augmentation

Title: Large Language Models as Model Organisms for Human Associative Learning

Title: Self-diffusion for Solving Inverse Problems

Title: Vision Language Models for Dynamic Human Activity Recognition in Healthcare Settings

Title: ArtiLatent: Realistic Articulated 3D Object Generation via Structured Latents

Title: REMONI: An Autonomous System Integrating Wearables and Multimodal Large Language Models for Enhanced Remote Health Monitoring

Title: MoniTor: Exploiting Large Language Models with Instruction for Online Video Anomaly Detection

Title: VidSplice: Towards Coherent Video Inpainting via Explicit Spaced Frame Guidance

Title: MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization

Title: ITC-RWKV: Interactive Tissue-Cell Modeling with Recurrent Key-Value Aggregation for Histopathological Subtyping

Title: Towards a Golden Classifier-Free Guidance Path via Foresight Fixed Point Iterations

Title: Head Pursuit: Probing Attention Specialization in Multimodal Transformers

Title: Surrogate-based quantification of policy uncertainty in generative flow networks

Title: FrameShield: Adversarially Robust Video Anomaly Detection

Title: Document Understanding, Measurement, and Manipulation Using Category Theory

Title: Automated Quality Control for Language Documentation: Detecting Phonotactic Inconsistencies in a Kokborok Wordlist

Title: REVE: A Foundation Model for EEG -- Adapting to Any Setup with Large-Scale Pretraining on 25,000 Subjects

Title: Restore Text First, Enhance Image Later: Two-Stage Scene Text Image Super-Resolution with Glyph Structure Guidance

Title: S3OD: Towards Generalizable Salient Object Detection with Synthetic Data

Title: Generalised Flow Maps for Few-Step Generative Modelling on Riemannian Manifolds

Title: Generative Correlation Manifolds: Generating Synthetic Data with Preserved Higher-Order Correlations

Title: Epipolar Geometry Improves Video Generation Models

Title: DEEDEE: Fast and Scalable Out-of-Distribution Dynamics Detection

Title: Self-Supervised Learning of Synapse Types from EM Images

Title: Foundation Models in Dermatopathology: Skin Tissue Classification

Title: WorldGrow: Generating Infinite 3D World

Title: BachVid: Training-Free Video Generation with Consistent Background and Character

Title: Visual Diffusion Models are Geometric Solvers