2026-02-17

Title: Directional Concentration Uncertainty: A representational approach to uncertainty quantification for generative models

Title: MFN Decomposition and Related Metrics for High-Resolution Range Profiles Generative Models

Title: Conditional Generative Models for High-Resolution Range Profiles: Capturing Geometry-Driven Trends in a Large-Scale Maritime Dataset

Title: Spectral Collapse in Diffusion Inversion

Title: Sim2Radar: Toward Bridging the Radar Sim-to-Real Gap with VLM-Guided Scene Reconstruction

Title: HiST-VLA: A Hierarchical Spatio-Temporal Vision-Language-Action Model for End-to-End Autonomous Driving

Title: FireRed-Image-Edit-1.0 Techinical Report

Title: Visual Foresight for Robotic Stow: A Diffusion-Based World Model from Sparse Snapshots

Title: AdaCorrection: Adaptive Offset Cache Correction for Accurate Diffusion Transformers

Title: The Diffusion Duet: Harmonizing Dual Channels with Wavelet Suppression for Image Separation

Title: An Online Reference-Free Evaluation Framework for Flowchart Image-to-Code Generation

Title: High-Resolution Climate Projections Using Diffusion-Based Downscaling of a Lightweight Climate Emulator

Title: Text Has Curvature

Title: SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning

Title: Diff-Aid: Inference-time Adaptive Interaction Denoising for Rectified Text-to-Image Generation

Title: AdaVBoost: Mitigating Hallucinations in LVLMs via Token-Level Adaptive Visual Attention Boosting

Title: DCDM: Divide-and-Conquer Diffusion Models for Consistency-Preserving Video Generation

Title: EchoTorrent: Towards Swift, Sustained, and Streaming Multi-Modal Video Generation

Title: A WDLoRA-Based Multimodal Generative Framework for Clinically Guided Corneal Confocal Microscopy Image Synthesis in Diabetic Neuropathy

Title: Attention Head Entropy of LLMs Predicts Answer Correctness

Title: HBVLA: Pushing 1-Bit Post-Training Quantization for Vision-Language-Action Models

Title: Generative Latent Representations of 3D Brain MRI for Multi-Task Downstream Analysis in Down Syndrome

Title: Data-driven Bi-level Optimization of Thermal Power Systems with embedded Artificial Neural Networks

Title: T2MBench: A Benchmark for Out-of-Distribution Text-to-Motion Generation

Title: Skeleton2Stage: Reward-Guided Fine-Tuning for Physically Plausible Dance Generation

Title: MEMTS: Internalizing Domain Knowledge via Parameterized Memory for Retrieval-Free Domain Adaptation of Time Series Foundation Models

Title: Mean Flow Policy with Instantaneous Velocity Constraint for One-step Action Generation

Title: VAR-3D: View-aware Auto-Regressive Model for Text-to-3D Generation via a 3D Tokenizer

Title: Embed-RL: Reinforcement Learning for Reasoning-Driven Multimodal Embeddings

Title: Prior-guided Hierarchical Instance-pixel Contrastive Learning for Ultrasound Speckle Noise Suppression

Title: High-Fidelity Causal Video Diffusion Models for Real-Time Ultra-Low-Bitrate Semantic Communication

Title: Synthetic Dataset Generation and Validation for Robotic Surgery Instrument Segmentation

Title: Low-Pass Filtering Improves Behavioral Alignment of Vision Models

Title: Parameter-Efficient Fine-Tuning of DINOv2 for Large-Scale Font Classification

Title: Why Code, Why Now: Learnability, Computability, and the Real Limits of Machine Learning

Title: A Multi-Agent Framework for Code-Guided, Modular, and Verifiable Automated Machine Learning

Title: Chemical Language Models for Natural Products: A State-Space Model Approach

Title: MarsRetrieval: Benchmarking Vision-Language Models for Planetary-Scale Geospatial Retrieval on Mars

Title: Elastic Diffusion Transformer

Title: Inject Where It Matters: Training-Free Spatially-Adaptive Identity Preservation for Text-to-Image Personalization

Title: Train Short, Inference Long: Training-free Horizon Extension for Autoregressive Video Generation

Title: BitDance: Scaling Autoregressive Generative Models with Binary Tokens

Title: Restoration Adaptation for Semantic Segmentation on Low Quality Images

Title: CoCoEdit: Content-Consistent Image Editing via Region Regularized Reinforcement Learning

Title: EgoSound: Benchmarking Sound Understanding in Egocentric Videos

Title: LaViDa-R1: Advancing Reasoning for Unified Multimodal Diffusion Language Models

Title: UniWeTok: An Unified Binary Tokenizer with Codebook Size $\mathit{2^{128}}$ for Unified Multimodal Large Language Model

Title: UniRef-Image-Edit: Towards Scalable and Consistent Multi-Reference Image Editing

Title: MAGE: All-[MASK] Block Already Knows Where to Look in Diffusion LLM

Title: KernelBlaster: Continual Cross-Task CUDA Optimization via Memory-Augmented In-Context Reinforcement Learning

Title: Machine Learning as a Tool (MLAT): A Framework for Integrating Statistical ML Models as Callable Tools within LLM Agent Workflows

Title: DeepFusion: Accelerating MoE Training via Federated Knowledge Distillation from Heterogeneous Edge Devices

Title: A Generative AI Approach for Reducing Skin Tone Bias in Skin Cancer Classification

Title: Adapting VACE for Real-Time Autoregressive Video Diffusion

Title: Controlling Your Image via Simplified Vector Graphics

Title: CoCoDiff: Correspondence-Consistent Diffusion Model for Fine-grained Style Transfer

Title: TikArt: Aperture-Guided Observation for Fine-Grained Visual Reasoning via Reinforcement Learning

Title: MedVAR: Towards Scalable and Efficient Medical Image Generation via Next-scale Autoregressive Prediction

Title: Efficient Text-Guided Convolutional Adapter for the Diffusion Model

Title: MoRL: Reinforced Reasoning for Unified Motion Understanding and Generation

Title: DriveFine: Refining-Augmented Masked Diffusion VLA for Precise and Robust Driving

Title: VIGIL: Tackling Hallucination Detection in Image Recontextualization

Title: SketchingReality: From Freehand Scene Sketches To Photorealistic Images

Title: Exposing Diversity Bias in Deep Generative Models: Statistical Origins and Correction of Diversity Error

Title: D2-LoRA: A Synergistic Approach to Differential and Directional Low-Rank Adaptation

Title: Picking the Right Specialist: Attentive Neural Process-based Selection of Task-Specialized Models as Tools for Agentic Healthcare Systems

Title: AnchorWeave: World-Consistent Video Generation with Retrieved Local Spatial Memories

Title: PAct: Part-Decomposed Single-View Articulated Object Generation

Title: MacroGuide: Topological Guidance for Macrocycle Generation

Title: Scaling Beyond Masked Diffusion Language Models

Title: Rethinking Diffusion Models with Symmetries through Canonicalization with Applications to Molecular Graph Generation

Title: Image Generation with a Sphere Encoder

Title: EditCtrl: Disentangled Local and Global Control for Real-Time Generative Video Editing