2026-01-12

Title: Coding the Visual World: From Image to Simulation Using Vision Language Models

Title: Efficient Inference for Noisy LLM-as-a-Judge Evaluation

Title: Prediction of Fault Slip Tendency in CO$_2$ Storage using Data-space Inversion

Title: RingSQL: Generating Synthetic Data with Schema-Independent Templates for Text-to-SQL Reasoning Models

Title: MaxCode: A Max-Reward Reinforcement Learning Framework for Automated Code Optimization

Title: Hippocampal Atrophy Patterns Across the Alzheimer's Disease Spectrum: A Voxel-Based Morphometry Analysis

Title: Hi-ZFO: Hierarchical Zeroth- and First-Order LLM Fine-Tuning via Importance-Guided Tensor Selection

Title: GaussianSwap: Animatable Video Face Swapping with 3D Gaussian Splatting

Title: MoGen: A Unified Collaborative Framework for Controllable Multi-Object Image Generation

Title: Towards Generalized Multi-Image Editing for Unified Multimodal Models

Title: Orient Anything V2: Unifying Orientation and Rotation Understanding

Title: Generalizable and Adaptive Continual Learning Framework for AI-generated Image Detection

Title: Learn to Evolve: Self-supervised Neural JKO Operator for Wasserstein Gradient Flow

Title: AGDC: Autoregressive Generation of Variable-Length Sequences with Joint Discrete and Continuous Spaces

Title: Rotate Your Character: Revisiting Video Diffusion Models for High-Quality 3D Character Generation

Title: TAGRPO: Boosting GRPO on Image-to-Video Generation with Direct Trajectory Alignment

Title: ViTNT-FIQA: Training-Free Face Image Quality Assessment with Vision Transformers

Title: Adaptive Disentangled Representation Learning for Incomplete Multi-View Multi-Label Classification

Title: SceneFoundry: Generating Interactive Infinite 3D Worlds

Title: Boosting Latent Diffusion Models via Disentangled Representation Alignment

Title: Goal Force: Teaching Video Models To Accomplish Physics-Conditioned Goals

Title: Kidney Cancer Detection Using 3D-Based Latent Diffusion Models

Title: Phase4DFD: Multi-Domain Phase-Aware Attention for Deepfake Detection

Title: Context-Aware Decoding for Faithful Vision-Language Generation

Title: WaveRNet: Wavelet-Guided Frequency Learning for Multi-Source Domain-Generalized Retinal Vessel Segmentation

Title: VideoAR: Autoregressive Video Generation via Next-Frame & Scale Prediction