2026-02-03

Title: OGD4All: A Framework for Accessible Interaction with Geospatial Open Government Data Based on Large Language Models

Title: ELLMPEG: An Edge-based Agentic LLM Video Processing Tool

Title: RAPTOR-AI for Disaster OODA Loop: Hierarchical Multimodal RAG with Experience-Driven Agentic Decision-Making

Title: Enhancing few-shot time series forecasting with LLM-guided diffusion

Title: TextBFGS: Quasi-Newton Optimization for Discrete Executable Text via Gradient-Operator Retrieval

Title: The Impact of Machine Learning Uncertainty on the Robustness of Counterfactual Explanations

Title: Generative AI-enhanced Probabilistic Multi-Fidelity Surrogate Modeling Via Transfer Learning

Title: Interpreting and Controlling Model Behavior via Constitutions for Atomic Concept Edits

Title: Mirage2Matter: A Physically Grounded Gaussian World Model from Video

Title: R3G: A Reasoning–Retrieval–Reranking Framework for Vision-Centric Answer Generation

Title: AI-Driven Three-Dimensional Reconstruction and Quantitative Analysis for Burn Injury Assessment

Title: 1S-DAug: One-Shot Data Augmentation for Robust Few-Shot Generalization

Title: ALIGN: Aligned Delegation with Performance Guarantees for Multi-Agent LLM Reasoning

Title: Monte Carlo Tree Search for Execution-Guided Program Repair with Large Language Models

Title: Learning Physics-Grounded 4D Dynamics with Neural Gaussian Force Fields

Title: SDCM: Simulated Densifying and Compensatory Modeling Fusion for Radar-Vision 3-D Object Detection in Internet of Vehicles

Title: Joint Continual Learning of Local Language Models and Cloud Offloading Decisions with Budget Constraints

Title: Stabilizing Diffusion Posterior Sampling by Noise–Frequency Continuation

Title: Reducing Memorisation in Generative Models via Riemannian Bayesian Inference

Title: SANEval: Open-Vocabulary Compositional Benchmarks with Failure-mode Diagnosis

Title: TABES: Trajectory-Aware Backward-on-Entropy Steering for Masked Diffusion Models

Title: World-Shaper: A Unified Framework for 360° Panoramic Editing

Title: PLACID: Identity-Preserving Multi-Object Compositing via Video Diffusion with Synthetic Trajectories

Title: TokenTrim: Inference-Time Token Pruning for Autoregressive Long Video Generation

Title: Generation Order and Parallel Decoding in Masked Diffusion Models: An Information-Theoretic Perspective

Title: TimeBlind: A Spatio-Temporal Compositionality Benchmark for Video LLMs

Title: Self-Attention at Constant Cost per Token via Symmetry-Aware Taylor Approximation

Title: Agentic Framework for Epidemiological Modeling

Title: Bridging the Semantic Chasm: Synergistic Conceptual Anchoring for Generalized Few-Shot and Zero-Shot OOD Perception

Title: When RAG Hurts: Diagnosing and Mitigating Attention Distraction in Retrieval-Augmented LVLMs

Title: ReLAPSe: Reinforcement-Learning-trained Adversarial Prompt Search for Erased concepts in unlearned diffusion models

Title: Planning with Language and Generative Models: Toward General Reward-Guided Wireless Network Design

Title: RePaint-Enhanced Conditional Diffusion Model for Parametric Engineering Designs under Performance and Parameter Constraints

Title: A Fragile Guardrail: Diffusion LLM's Safety Blessing and Its Failure Mode

Title: Brazilian Portuguese Image Captioning with Transformers: A Study on Cross-Native-Translated Dataset

Title: Toward Autonomous Laboratory Safety Monitoring with Vision Language Models: Learning to See Hazards Through Scene Structure

Title: Federated-inspired Single-cell Batch Integration in Latent Space

Title: Open Materials Generation with Inference-Time Reinforcement Learning

Title: LLMs as High-Dimensional Nonlinear Autoregressive Models with Attention: Training, Alignment and Inference

Title: FedMOA: Federated GRPO for Personalized Reasoning LLMs under Heterogeneous Rewards

Title: LatentTrack: Sequential Weight Generation via Latent Filtering

Title: PSGS: Text-driven Panorama Sliding Scene Generation via Gaussian Splatting

Title: ZS-TreeSeg: A Zero-Shot Framework for Tree Crown Instance Segmentation

Title: Diffusion LMs Can Approximate Optimal Infilling Lengths Implicitly

Title: Quality-Diversity Optimization as Multi-Objective Optimization

Title: OD-DEAL: Dynamic Expert-Guided Adversarial Learning with Online Decomposition for Scalable Capacitated Vehicle Routing

Title: Sparse Shortcuts: Facilitating Efficient Fusion in Multimodal Large Language Models

Title: DuoGen: Towards General Purpose Interleaved Multimodal Generation

Title: SAGE: Accelerating Vision-Language Models via Entropy-Guided Adaptive Speculative Decoding

Title: Physiology as Language: Translating Respiration to Sleep EEG

Title: Convergent World Representations and Divergent Tasks

Title: SADER: Structure-Aware Diffusion Framework with DEterministic Resampling for Multi-Temporal Remote Sensing Cloud Removal

Title: GLAD: Generative Language-Assisted Visual Tracking for Low-Semantic Templates

Title: Bridging Degradation Discrimination and Generation for Universal Image Restoration

Title: MAUGen: A Unified Diffusion Approach for Multi-Identity Facial Expression and AU Label Generation

Title: From Pixels to Facts (Pix2Fact): Benchmarking Multi-Hop Reasoning for Fine-Grained Visual Fact Checking

Title: Towards Interpretable Hallucination Analysis and Mitigation in LVLMs via Contrastive Neuron Steering

Title: FaceSnap: Enhanced ID-fidelity Network for Tuning-free Portrait Customization

Title: S³POT: Contrast-Driven Face Occlusion Segmentation via Self-Supervised Prompt Learning

Title: VIZOR: Viewpoint-Invariant Zero-Shot Scene Graph Generation for 3D Scene Reasoning

Title: Riemannian Flow Matching for Disentangled Graph Domain Adaptation

Title: Improving Neuropathological Reconstruction Fidelity via AI Slice Imputation

Title: LocalV: Exploiting Information Locality for IP-level Verilog Generation

Title: Supervised makeup transfer with a curated dataset: Decoupling identity and makeup features for enhanced transformation

Title: HSI-VAR: Rethinking Hyperspectral Restoration through Spatial-Spectral Visual Autoregression

Title: Latent Shadows: The Gaussian-Discrete Duality in Masked Diffusion

Title: Edge-Native Generative De-identification: Inversion-Free Flow for Privacy-Preserving Federated Skin Image Analysis

Title: RMFlow: Refined Mean Flow by a Noise-Injection Step for Multimodal Generation

Title: Improving Flow Matching by Aligning Flow Divergence

Title: Dynamic Expert Sharing: Decoupling Memory from Parallelism in Mixture-of-Experts Diffusion LLMs

Title: DIAMOND: Directed Inference for Artifact Mitigation in Flow Matching Models

Title: Data Augmentation for High-Fidelity Generation of CAR-T/NK Immunological Synapse Images

Title: SAGE: Agentic Framework for Interpretable and Clinically Translatable Computational Pathology Biomarker Discovery

Title: Multimodal Scientific Learning Beyond Diffusions and Flows

Title: Toward Universal and Transferable Jailbreak Attacks on Vision-Language Models

Title: FUSE-Flow: Scalable Real-Time Multi-View Point Cloud Reconstruction Using Confidence

Title: PISA: Piecewise Sparse Attention Is Wiser for Efficient Diffusion Transformers

Title: Differential Vector Erasure: Unified Training-Free Concept Erasure for Flow Matching Models

Title: Self-Generative Adversarial Fine-Tuning for Large Language Models

Title: Generalized Radius and Integrated Codebook Transforms for Differentiable Vector Quantization

Title: Improving Robustness of Vision-Language-Action Models by Restoring Corrupted Visual Inputs

Title: EEmo-Logic: A Unified Dataset and Multi-Stage Framework for Comprehensive Image-Evoked Emotion Assessment

Title: Q-DiT4SR: Exploration of Detail-Preserving Diffusion Transformer Quantization for Real-World Image Super-Resolution

Title: EDIS: Diagnosing LLM Reasoning via Entropy Dynamics

Title: ReDiStory: Region-Disentangled Diffusion for Consistent Visual Story Generation

Title: StoryState: Agent-Based State Control for Consistent and Editable Storybooks

Title: FlowCast: Trajectory Forecasting for Scalable Zero-Cost Speculative Flow Matching

Title: Beyond Pixels: Visual Metaphor Transfer via Schema-Driven Agentic Reasoning

Title: MTC-VAE: Multi-Level Temporal Compression with Content Awareness

Title: Adaptive Visual Autoregressive Acceleration via Dual-Linkage Entropy Analysis

Title: T2M Mamba: Motion Periodicity-Saliency Coupling Approach for Stable Text-Driven Motion Generation

Title: PolyGen: Fully Synthetic Vision-Language Training via Multi-Generator Ensembles

Title: PromptRL: Prompt Matters in RL for Flow-Based Image Generation

Title: Cross-Paradigm Evaluation of Gaze-Based Semantic Object Identification for Intelligent Vehicles

Title: P-EAGLE: Parallel-Drafting EAGLE with Scalable Training

Title: OpInf-LLM: Parametric PDE Solving with LLMs via Operator Inference

Title: A Relative-Budget Theory for Reinforcement Learning with Verifiable Rewards in Large Language Model Reasoning

Title: The Inlet Rank Collapse in Implicit Neural Representations: Diagnosis and Unified Remedy

Title: Making Avatars Interact: Towards Text-Driven Human-Object Interaction for Controllable Talking Avatars

Title: InfoTok: Regulating Information Flow for Capacity-Constrained Shared Visual Tokenization in Unified MLLMs

Title: Combined Flicker-banding and Moire Removal for Screen-Captured Images

Title: Generative Visual Code Mobile World Models

Title: Know Your Step: Faster and Better Alignment for Flow Matching Models via Step-aware Advantages

Title: Boosting Maximum Entropy Reinforcement Learning via One-Step Flow Matching

Title: Token Pruning for In-Context Generation in Diffusion Transformers

Title: Omni-Judge: Can Omni-LLMs Serve as Human-Aligned Judges for Text-Conditioned Audio-Video Generation?

Title: PISCES: Annotation-free Text-to-Video Post-Training via Optimal Transport-Aligned Rewards

Title: Chance-Constrained Inference for Hallucination Risk Control in Large Language Models

Title: ReCALL: Recalibrating Capability Degradation for MLLM-based Composed Image Retrieval

Title: De Novo Molecular Generation from Mass Spectra via Many-Body Enhanced Diffusion

Title: From Perception to Action: Spatial AI Agents and World Models

Title: Efficient Adversarial Attacks on High-dimensional Offline Bandits

Title: Moonworks Lunara Aesthetic II: An Image Variation Dataset

Title: Beyond Mode Elicitation: Diversity-Preserving Reinforcement Learning via Latent Diffusion Reasoner

Title: Physics Informed Generative AI Enabling Labour Free Segmentation For Microscopy Analysis

Title: MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration

Title: Probability-Entropy Calibration: An Elastic Indicator for Adaptive Fine-tuning

Title: Mind-Brush: Integrating Agentic Cognitive Search and Reasoning into Image Generation

Title: MagicFuse: Single Image Fusion for Visual and Semantic Reinforcement

Title: IRIS: Implicit Reward-Guided Internal Sifting for Mitigating Multimodal Hallucination

Title: Fast Autoregressive Video Diffusion and World Models with Temporal Cache Compression and Sparse Attention

Title: GPD: Guided Progressive Distillation for Fast and High-Quality Video Generation

Title: Prism: Efficient Test-Time Scaling via Hierarchical Search and Self-Verification for Discrete Diffusion Language Models

Title: No Generation without Representation: Efficient Causal Protein Language Models Enable Zero-Shot Fitness Estimation

Title: Self-Rewarding Sequential Monte Carlo for Masked Diffusion Language Models

Title: WS-IMUBench: Can Weakly Supervised Methods from Audio, Image, and Video Be Adapted for IMU-based Temporal Action Localization?

Title: How Well Do Models Follow Visual Instructions? VIBE: A Systematic Benchmark for Visual Instruction-Driven Image Editing

Title: Time2Vec-Integrated Transformer for Robust Gesture Recognition from Low-Density sEMG

Title: Trust but Verify: Adaptive Conditioning for Reference-Based Diffusion Super-Resolution via Implicit Reference Correlation Modeling

Title: ProxyImg: Towards Highly-Controllable Image Representation via Hierarchical Disentangled Proxy Embedding

Title: Internal Flow Signatures for Self-Checking and Refinement in LLMs

Title: Q Cache: Visual Attention is Valuable in Less than Half of Decode Layers for Multimodal Large Language Model

Title: Learning Sparse Visual Representations via Spatial-Semantic Factorization

Title: Bayesian Integration of Nonlinear Incomplete Clinical Data

Title: Boundary-Constrained Diffusion Models for Floorplan Generation: Balancing Realism and Diversity

Title: Grounding Generated Videos in Feasible Plans via World Models

Title: Your AI-Generated Image Detector Can Secretly Achieve SOTA Accuracy, If Calibrated

Title: Leveraging Latent Vector Prediction for Localized Control in Image Generation via Diffusion Models

Title: On the Limits of Layer Pruning for Generative Reasoning in LLMs

Title: UniDriveDreamer: A Single-Stage Multimodal World Model for Autonomous Driving

Title: Logic-Guided Vector Fields for Constrained Generative Modeling

Title: One Size, Many Fits: Aligning Diverse Group-Wise Click Preferences in Large-Scale Advertising Image Generation

Title: On Stability and Robustness of Diffusion Posterior Sampling for Bayesian Inverse Problems

Title: AICD Bench: A Challenging Benchmark for AI-Generated Code Detection

Title: FSVideo: Fast Speed Video Diffusion Model in a Highly-Compressed Latent Space

Title: Unifying Masked Diffusion Models with Various Generation Orders and Beyond

Title: Enhancing Diffusion-Based Quantitatively Controllable Image Generation via Matrix-Form EDM and Adaptive Vicinal Training

Title: Scalable Spatio-Temporal SE(3) Diffusion for Long-Horizon Protein Dynamics

Title: Eliminating Registration Bias in Synthetic CT Generation: A Physics-Based Simulation Framework

Title: DCoPilot: Generative AI-Empowered Policy Adaptation for Dynamic Data Center Operations

Title: Learning Generative Selection for Best-of-N

Title: Lung Nodule Image Synthesis Driven by Two-Stage Generative Adversarial Networks

Title: ECHO-2: A Large Scale Distributed Rollout Framework for Cost-efficient Reinforcement Learning

Title: Hierarchical Adaptive Eviction for KV Cache Management in Multimodal Language Models

Title: Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation

Title: MIRROR: Manifold Ideal Reference ReconstructOR for Generalizable AI-Generated Image Detection

Title: Show, Don't Tell: Morphing Latent Reasoning into Image Generation

Title: Geometry- and Relation-Aware Diffusion for EEG Super-Resolution

Title: Learning While Staying Curious: Entropy-Preserving Supervised Fine-Tuning via Adaptive Self-Distillation for Large Reasoning Models

Title: Unlocking the Duality between Flow and Field Matching

Title: MoLF: Mixture-of-Latent-Flow for Pan-Cancer Spatial Gene Expression Prediction from Histology

Title: Implicit neural representation of textures

Title: Unified Personalized Reward Model for Vision Generation

Title: Self-Supervised Learning from Structural Invariance

Title: SLIME: Stabilized Likelihood Implicit Margin Enforcement for Preference Optimization

Title: Personalized Image Generation via Human-in-the-loop Bayesian Optimization

Title: Infinite-World: Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory

Title: Superman: Unifying Skeleton and Vision for Human Motion Perception and Generation

Title: Trust Region Continual Learning as an Implicit Meta-Learner

Title: Repurposing Protein Language Models for Latent Flow-Based Fitness Optimization

Title: Embedding Perturbation may Better Reflect the Uncertainty in LLM Reasoning

Title: UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing

Title: Certain Head, Uncertain Tail: Expert-Sample for Test-Time Scaling in Fine-Grained MoE

Title: Expanding the Capabilities of Reinforcement Learning via Text Feedback

Title: PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss