2026-02-11

Title: Distributed Hybrid Parallelism for Large Language Models: Comparative Study and System Design Guide

Title: Epistemic Throughput: Fundamental Limits of Attention-Constrained Inference

Title: Counterfactual Maps: What They Are and How to Find Them

Title: A Hybrid Deterministic Framework for Named Entity Extraction in Broadcast News Video

Title: What do Geometric Hallucination Detection Metrics Actually Measure?

Title: All-in-One Conditioning for Text-to-Image Synthesis

Title: Gradient Residual Connections

Title: Rethinking Global Text Conditioning in Diffusion Transformers

Title: Measuring Privacy Risks and Tradeoffs in Financial Synthetic Data Generation

Title: Stabilizing Physics-Informed Consistency Models via Structure-Preserving Training

Title: Empowering Contrastive Federated Sequential Recommendation with LLMs

Title: Learning with Multiple Correct Answers -- A Trichotomy of Regret Bounds under Different Feedback Models

Title: K-Sort Eval: Efficient Preference Evaluation for Visual Generation via Corrected VLM-as-a-Judge

Title: Reward-Guided Discrete Diffusion via Clean-Sample Markov Chain for Molecule and Biological Sequence Design

Title: Bridging the Modality Gap in Roadside LiDAR: A Training-Free Vision-Language Model Framework for Vehicle Classification

Title: SceneReVis: A Self-Reflective Vision-Grounded Framework for 3D Indoor Scene Synthesis via Multi-turn RL

Title: Fine-T2I: An Open, Large-Scale, and Diverse Dataset for High-Quality T2I Fine-Tuning

Title: Look-Ahead and Look-Back Flows: Training-Free Image Generation with Trajectory Smoothing

Title: FD-DB: Frequency-Decoupled Dual-Branch Network for Unpaired Synthetic-to-Real Domain Translation

Title: Beyond Next-Token Alignment: Distilling Multimodal Large Language Models via Token Interactions

Title: Towards Uniformity and Alignment for Multimodal Representation Learning

Title: Robust Depth Super-Resolution via Adaptive Diffusion Sampling

Title: SchröMind: Mitigating Hallucinations in Multimodal Large Language Models via Solving the Schrödinger Bridge Problem

Title: DR.Experts: Differential Refinement of Distortion-Aware Experts for Blind Image Quality Assessment

Title: AUHead: Realistic Emotional Talking Head Generation via Action Units Control

Title: ECG-IMN: Interpretable Mesomorphic Neural Networks for 12-Lead Electrocardiogram Interpretation

Title: Mitigating the Likelihood Paradox in Flow-based OOD Detection via Entropy Manipulation

Title: MieDB-100k: A Comprehensive Dataset for Medical Image Editing

Title: Why the Counterintuitive Phenomenon of Likelihood Rarely Appears in Tabular Anomaly Detection with Deep Generative Models?

Title: Hand2World: Autoregressive Egocentric Interaction Generation via Free-Space Hand Gestures

Title: Tele-Omni: a Unified Multimodal Framework for Video Generation and Editing

Title: AGMark: Attention-Guided Dynamic Watermarking for Large Vision-Language Models

Title: Blind denoising diffusion models and the blessings of dimensionality

Title: TreeCUA: Efficiently Scaling GUI Automation with Tree-Structured Verifiable Evolution

Title: Resilient Class-Incremental Learning: on the Interplay of Drifting, Unlabelled and Imbalanced Data Streams

Title: Physics-informed diffusion models in spectral space

Title: Stroke3D: Lifting 2D strokes into rigged 3D model via latent diffusion models

Title: Allure of Craquelure: A Variational-Generative Approach to Crack Detection in Paintings

Title: Toward Fine-Grained Facial Control in 3D Talking Head Generation

Title: Towards Poisoning Robustness Certification for Natural Language Generation

Title: Where Do Images Come From? Analyzing Captions to Geographically Profile Datasets

Title: Explainability in Generative Medical Diffusion Models: A Faithfulness-Based Analysis on MRI Synthesis

Title: When Less is More: The LLM Scaling Paradox in Context Compression

Title: Fully-automated sleep staging: multicenter validation of a generalizable deep neural network for Parkinson's disease and isolated REM sleep behavior disorder

Title: SciFlow-Bench: Evaluating Structure-Aware Scientific Diagram Generation via Inverse Parsing

Title: SAKED: Mitigating Hallucination in Large Vision-Language Models via Stability-Aware Knowledge Enhanced Decoding

Title: Kelix Technique Report

Title: CoFEH: LLM-driven Feature Engineering Empowered by Collaborative Bayesian Hyperparameter Optimization

Title: Code2World: A GUI World Model via Renderable Code Generation

Title: Free-GVC: Towards Training-Free Extreme Generative Video Compression with Temporal Coherence

Title: MVISTA-4D: View-Consistent 4D World Model with Test-Time Action Inference for Robotic Manipulation

Title: AdaTSQ: Pushing the Pareto Frontier of Diffusion Transformers via Temporal-Sensitivity Quantization

Title: Monocular Normal Estimation via Shading Sequence Estimation

Title: A Task-Centric Theory for Iterative Self-Improvement with Easy-to-Hard Curricula

Title: Fake-HR1: Rethinking reasoning of vision language model for synthetic image detection

Title: WildCat: Near-Linear Attention in Theory and Practice

Title: Causality in Video Diffusers is Separable from Denoising

Title: Learning on the Manifold: Unlocking Standard Diffusion Transformers with Representation Encoders

Title: VideoWorld 2: Learning Transferable Knowledge from Real-world Videos

Title: ConsID-Gen: View-Consistent and Identity-Preserving Image-to-Video Generation

Title: SAGE: Scalable Agentic 3D Scene Generation for Embodied AI