2025-10-30

Title: Towards Fine-Grained Human Motion Video Captioning

Title: ESCA: Enabling Seamless Codec Avatar Execution through Algorithm and Hardware Co-Optimization for Virtual Reality

Title: SafeEditor: Unified MLLM for Efficient Post-hoc T2I Safety Editing

Title: Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation

Title: The Generation Phases of Flow Matching: a Denoising Perspective

Title: VividCam: Learning Unconventional Camera Motions from Virtual Synthetic Videos

Title: Resource-Efficient and Robust Inference of Deep and Bayesian Neural Networks on Embedded and Analog Computing Platforms

Title: Sequences of Logits Reveal the Low Rank Structure of Language Models

Title: PSTF-AttControl: Per-Subject-Tuning-Free Personalized Image Generation with Controllable Face Attributes

Title: Continual Low-Rank Adapters for LLM-based Generative Recommender Systems

Title: The Neural Differential Manifold: An Architecture with Explicit Geometric Structure

Title: Revisiting Reconstruction-based AI-generated Image Detection: A Geometric Perspective

Title: EA3D: Online Open-World 3D Object Extraction from Streaming Videos

Title: Machine Learning Guided Optimal Transmission Switching to Mitigate Wildfire Ignition Risk

Title: Target-Guided Bayesian Flow Networks for Quantitatively Constrained CAD Generation

Title: Balanced conic rectified flow

Title: DeepShield: Fortifying Deepfake Video Detection with Local and Global Forgery Analysis

Title: VADB: A Large-Scale Video Aesthetic Database with Professional and Multi-Dimensional Annotations

Title: Diffusion-Driven Progressive Target Manipulation for Source-Free Domain Adaptation

Title: Seeing Clearly and Deeply: An RGBD Imaging Approach with a Bio-inspired Monocentric Design

Title: CDFlow: Building Invertible Layers with Circulant and Diagonal Matrices

Title: StreamingCoT: A Dataset for Temporal Dynamics and Multimodal Chain-of-Thought Reasoning in Streaming VideoQA

Title: More than a Moment: Towards Coherent Sequences of Audio Descriptions

Title: TempoPFN: Synthetic Pre-training of Linear RNNs for Zero-shot Time Series Forecasting

Title: RegionE: Adaptive Region-Aware Generation for Efficient Image Editing

Title: BOLT-GAN: Bayes-Optimal Loss for Stable GAN Training

Title: Hawk: Leveraging Spatial Context for Faster Autoregressive Text-to-Image Generation

Title: FreeArt3D: Training-Free Articulated Object Generation using 3D Diffusion

Title: VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning