2026-01-13

Title: CrossTrafficLLM: A Human-Centric Framework for Interpretable Traffic Intelligence via Large Language Model

Title: Semantic Event Graphs for Long-Form Video Question Answering

Title: Stress Testing Machine Learning at $10^{10}$ Scale: A Comprehensive Study of Adversarial Robustness on Algebraically Structured Integer Streams

Title: AIS-CycleGen: A CycleGAN-Based Framework for High-Fidelity Synthetic AIS Data Generation and Augmentation

Title: Forget Many, Forget Right: Scalable and Precise Concept Unlearning in Diffusion Models

Title: Forget-It-All: Multi-Concept Machine Unlearning via Concept-Aware Neuron Masking

Title: Think Bright, Diffuse Nice: Enhancing T2I-ICL via Inductive-Bias Hint Instruction and Query Contrastive Decoding

Title: CEEMDAN-Based Multiscale CNN for Wind Turbine Gearbox Fault Detection

Title: Synthetic FMCW Radar Range Azimuth Maps Augmentation with Generative Diffusion Model

Title: Perception Test 2025: Challenge Summary and a Unified VQA Extension

Title: Teach Diffusion Language Models to Learn from Their Own Mistakes

Title: 3D CoCa v2: Contrastive Learners with Test-Time Search for Generalizable Spatial Intelligence

Title: A novel RF-enabled Non-Destructive Inspection Method through Machine Learning and Programmable Wireless Environments

Title: Bridging Robustness and Efficiency: Real-Time Low-Light Enhancement via Attention U-Net GAN

Title: BabyVision: Visual Reasoning Beyond Language

Title: Mosaic: Unlocking Long-Context Inference for Diffusion LLMs via Global Memory Planning and Dynamic Peak Taming

Title: Hellinger Multimodal Variational Autoencoders

Title: APEX: Learning Adaptive Priorities for Multi-Objective Alignment in Vision-Language Generation

Title: Sissi: Zero-shot Style-guided Image Synthesis via Semantic-style Integration

Title: CEDAR: Context Engineering for Agentic Data Science

Title: When Humans Judge Irises: Pupil Size Normalization as an Aid and Synthetic Irises as a Challenge

Title: Cross-Modal Computational Model of Brain-Heart Interactions via HRV and EEG Feature

Title: WFR-FM: Simulation-Free Dynamic Unbalanced Optimal Transport

Title: OSCAR: Optical-aware Semantic Control for Aleatoric Refinement in Sar-to-Optical Translation

Title: Speak While Watching: Unleashing TRUE Real-Time Video Understanding Capability of Multimodal Large Language Models

Title: Variational decomposition autoencoding improves disentanglement of latent representations

Title: U-MASK: User-adaptive Spatio-Temporal Masking for Personalized Mobile AI Applications

Title: DaQ-MSA: Denoising and Qualifying Diffusion Augmentations for Multimodal Sentiment Analysis

Title: RenderFlow: Single-Step Neural Rendering via Flow Matching

Title: Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning

Title: Unified Personalized Understanding, Generating and Editing

Title: When Should We Introduce Safety Interventions During Pretraining?

Title: 3D Wavelet-Based Structural Priors for Controlled Diffusion in Whole-Body Low-Dose PET Denoising

Title: Few-shot Class-Incremental Learning via Generative Co-Memory Regularization

Title: Generating readily synthesizable small molecule fluorophore scaffolds with reinforcement learning

Title: Stable On-Policy Distillation through Adaptive Target Reformulation

Title: Offline Meta-Reinforcement Learning with Flow-Based Task Inference and Adaptive Correction of Feature Overgeneralization

Title: Forward versus Backward: Comparing Reasoning Objectives in Direct Preference Optimization

Title: MAESTRO: Meta-learning Adaptive Estimation of Scalarization Trade-offs for Reward Optimization

Title: SIRR-LMM: Single-image Reflection Removal via Large Multimodal Model

Title: SceneNAT: Masked Generative Modeling for Language-Guided Indoor Scene Synthesis

Title: Language-Grounded Multi-Domain Image Translation via Semantic Difference Guidance

Title: Innovation Capacity of Dynamical Learning Systems

Title: GenDet: Painting Colored Bounding Boxes on Images via Diffusion Model for Object Detection

Title: Focal Guidance: Unlocking Controllability from Semantic-Weak Layers in Video Diffusion Models

Title: Inference-Time Scaling for Visual AutoRegressive modeling by Searching Representative Samples

Title: HiVid-Narrator: Hierarchical Video Narrative Generation with Scene-Primed ASR-anchored Compression

Title: OceanSAR-2: A Universal Feature Extractor for SAR Ocean Observation

Title: Forecast the Principal, Stabilize the Residual: Subspace-Aware Feature Caching for Efficient Diffusion Transformers

Title: From Sketch to Fresco: Efficient Diffusion Transformer with Progressive Resolution

Title: Graph Inference Towards ICD Coding

Title: FROAV: A Framework for RAG Observation and Agent Verification - Lowering the Barrier to LLM Agent Research

Title: Land-then-transport: A Flow Matching-Based Generative Decoder for Wireless Image Transmission

Title: d3LLM: Ultra-Fast Diffusion LLM using Pseudo-Trajectory Distillation

Title: StdGEN++: A Comprehensive System for Semantic-Decomposed 3D Character Generation

Title: Advancing Multinational License Plate Recognition Through Synthetic and Real Data Fusion: A Comprehensive Evaluation

Title: Leveraging 3D Representation Alignment and RGB Pretrained Priors for LiDAR Scene Generation

Title: Evaluating the encoding competence of visual language models using uncommon actions

Title: Beyond External Guidance: Unleashing the Semantic Richness Inside Diffusion Transformers for Improved Training

Title: More Images, More Problems? A Controlled Analysis of VLM Failure Modes

Title: MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head