2025-12-18

Title: LLM as a Neural Architect: Controlled Generation of Image Captioning Models Under Strict API Contracts

Title: How a Bit Becomes a Story: Semantic Steering via Differentiable Fault Injection

Title: Automatic Extraction of Rules for Generating Synthetic Patient Data From Real-World Population Data Using Glioblastoma as an Example

Title: Generative Urban Flow Modeling: From Geometry to Airflow with Graph Diffusion

Title: Entropy-Reservoir Bregman Projection: An Information-Geometric Unification of Model Collapse

Title: Imitation Learning for Multi-turn LM Agents via On-policy Expert Corrections

Title: TalkVerse: Democratizing Minute-Long Audio-Driven Video Generation

Title: Softly Constrained Denoisers for Diffusion Models

Title: Where is the Watermark? Interpretable Watermark Detection at the Block Level

Title: DreamPRM-Code: Function-as-Step Process Reward Model with Label Correction for LLM Coding

Title: Evaluating the Capability of Video Question Generation for Expert Knowledge Elicitation

Title: MVGSR: Multi-View Consistent 3D Gaussian Super-Resolution via Epipolar Guidance

Title: EMFusion: Conditional Diffusion Framework for Trustworthy Frequency Selective EMF Forecasting in Wireless Networks

Title: The Semantic Illusion: Certified Limits of Embedding-Based Hallucination Detection in RAG Systems

Title: PMMD: A pose-guided multi-view multi-modal diffusion for person generation

Title: The Semantic Architect: How FEAML Bridges Structured Data and LLMs for Multi-Label Tasks

Title: Uni-Parser Technical Report

Title: Is Nano Banana Pro a Low-Level Vision All-Rounder? A Comprehensive Evaluation on 14 Tasks and 40 Datasets

Title: FADTI: Fourier and Attention Driven Diffusion for Multivariate Time Series Imputation

Title: 3DProxyImg: Controllable 3D-Aware Animation Synthesis from Single Image via 2D-3D Aligned Proxy Embedding

Title: Explainable Action Form Assessment by Exploiting Multimodal Chain-of-Thoughts Reasoning

Title: Robust and Calibrated Detection of Authentic Multimedia Content

Title: SLCFormer: Spectral-Local Context Transformer with Physics-Grounded Flare Synthesis for Nighttime Flare Removal

Title: Accelerating High-Throughput Catalyst Screening by Direct Generation of Equilibrium Adsorption Structures

Title: MMMamba: A Versatile Cross-Modal In Context Fusion Framework for Pan-Sharpening and Zero-Shot Image Enhancement

Title: Quantum Machine Learning for Cybersecurity: A Taxonomy and Future Directions

Title: SynthSeg-Agents: Multi-Agent Synthetic Data Generation for Zero-Shot Weakly Supervised Semantic Segmentation

Title: Automated Motion Artifact Check for MRI (AutoMAC-MRI): An Interpretable Framework for Motion Artifact Detection and Severity Assessment

Title: A Masked Reverse Knowledge Distillation Method Incorporating Global and Local Information for Image Anomaly Detection

Title: Towards Seamless Interaction: Causal Turn-Level Modeling of Interactive 3D Conversational Head Dynamics

Title: Expand and Prune: Maximizing Trajectory Diversity for Effective GRPO in Generative Models

Title: Robustness Evaluation of Machine Learning Models for Fault Classification and Localization In Power System Protection

Title: FlowBind: Efficient Any-to-Any Generation with Bidirectional Flows

Title: Copyright Infringement Risk Reduction via Chain-of-Thought and Task Instruction Prompting

Title: Multi-stage Bayesian optimisation for dynamic decision-making in self-driving labs

Title: Off The Grid: Detection of Primitives for Feed-Forward 3D Gaussian Splatting

Title: VAAS: Vision-Attention Anomaly Scoring for Image Manipulation Detection in Digital Forensics

Title: DeX-Portrait: Disentangled and Expressive Portrait Animation via Explicit and Latent Motion Representations

Title: An Efficient and Effective Encoder Model for Vision and Language Tasks in the Remote Sensing Domain

Title: GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models

Title: Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition

Title: InpaintDPO: Mitigating Spatial Relationship Hallucinations in Foreground-conditioned Inpainting via Diverse Preference Optimization

Title: SoFlow: Solution Flow Models for One-Step Generative Modeling

Title: Skyra: AI-Generated Video Detection via Grounded Artifact Reasoning

Title: End-to-End Training for Autoregressive Video Diffusion via Self-Resampling

Title: DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models

Title: Spatia: Video Generation with Updatable Spatial Memory