2026-01-09

Title: Generation of synthetic delay time series for air transport applications

Title: LEGATO: Good Identity Unlearning Is Continuous

Title: ArtCognition: A Multimodal AI Framework for Affective State Sensing from Visual and Kinematic Drawing Cues

Title: Beyond Binary Preference: Aligning Diffusion Models to Fine-grained Criteria by Decoupling Attributes

Title: Quantifying the Effect of Test Set Contamination on Generative Evaluations

Title: Embedding Textual Information in Images Using Quinary Pixel Combinations

Title: Unified Text-Image Generation with Weakness-Targeted Post-Training

Title: ReHyAt: Recurrent Hybrid Attention for Video Diffusion Transformers

Title: PackCache: A Training-Free Acceleration Method for Unified Autoregressive Video Generation via Compact KV-Cache

Title: Addressing Overthinking in Large Vision-Language Models via Gated Perception-Reasoning Optimization

Title: UniDrive-WM: Unified Understanding, Planning and Generation World Model For Autonomous Driving

Title: Meta-probabilistic Modeling

Title: IGenBench: Benchmarking the Reliability of Text-to-Infographic Generation

Title: Surface-based Molecular Design with Multi-modal Flow Matching

Title: FaceRefiner: High-Fidelity Facial Texture Refinement with Differentiable Rendering-based Style Transfer

Title: TSSR: Two-Stage Swap-Reward-Driven Reinforcement Learning for Character-Level SMILES Generation

Title: GEnSHIN: Graphical Enhanced Spatio-temporal Hierarchical Inference Network for Traffic Flow Prediction

Title: A Vision for Multisensory Intelligence: Sensing, Synergy, and Science

Title: Spatial-Temporal Feedback Diffusion Guidance for Controlled Traffic Imputation

Title: 3D Conditional Image Synthesis of Left Atrial LGE MRI from Composite Semantic Masks

Title: MiLDEdit: Reasoning-Based Multi-Layer Design Document Editing

Title: HyperAlign: Hyperbolic Entailment Cones for Adaptive Text-to-Image Alignment Assessment

Title: Agri-R1: Empowering Generalizable Agricultural Reasoning in Vision-Language Models with Reinforcement Learning

Title: HATIR: Heat-Aware Diffusion for Turbulent Infrared Video Super-Resolution

Title: Do LLMs Benefit from User and Item Embeddings in Recommendation Tasks?

Title: Forge-and-Quench: Enhancing Image Generation for Higher Fidelity in Unified Multimodal Models

Title: MQ-GNN: A Multi-Queue Pipelined Architecture for Scalable and Efficient GNN Training

Title: AIVD: Adaptive Edge-Cloud Collaboration for Accurate and Efficient Industrial Visual Detection

Title: Intraday spatiotemporal PV power prediction at national scale using satellite-based solar forecast models

Title: CounterVid: Counterfactual Video Generation for Mitigating Action and Temporal Hallucinations in Video-Language Models

Title: SRU-Pix2Pix: A Fusion-Driven Generator Network for Medical Image Translation with Few-Shot Learning

Title: Measurement-Consistent Langevin Corrector: A Remedy for Latent Diffusion Inverse Solvers

Title: On the Definition and Detection of Cherry-Picking in Counterfactual Explanations

Title: OceanSplat: Object-aware Gaussian Splatting with Trinocular View Consistency for Underwater Scene Reconstruction

Title: DeepWeightFlow: Re-Basined Flow Matching for Generating Neural Network Weights

Title: From Understanding to Engagement: Personalized pharmacy Video Clips via Vision Language Models (VLMs)

Title: Re-Align: Structured Reasoning-guided Alignment for In-Context Image Generation and Editing

Title: VERSE: Visual Embedding Reduction and Space Exploration. Clustering-Guided Insights for Training Data Enhancement in Visually-Rich Document Understanding

Title: VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control

Title: A Lightweight and Explainable Vision-Language Framework for Crop Disease Visual Question Answering

Title: Multi-Scale Local Speculative Decoding for Image Generation

Title: FlowLet: Conditional 3D Brain MRI Synthesis using Wavelet Flow Matching

Title: Plenoptic Video Generation

Title: RoboVIP: Multi-View Video Generation with Visual Identity Prompting Augments Robot Manipulation

Title: GREx: Generalized Referring Expression Segmentation, Comprehension, and Generation

Title: Pixel-Perfect Visual Geometry Estimation