2026-03-18

Title: Improving Generative Adversarial Network Generalization for Facial Expression Synthesis

Title: Transition Flow Matching

Title: Embedding-Aware Feature Discovery: Bridging Latent Representations and Interpretable Features in Event Sequences

Title: Meta-TTRL: A Metacognitive Framework for Self-Improving Test-Time Reinforcement Learning in Unified Multimodal Models

Title: Time-Aware Prior Fitted Networks for Zero-Shot Forecasting with Exogenous Variables

Title: Mask Is What DLLM Needs: A Masked Data Training Paradigm for Diffusion LLMs

Title: Feed-forward Gaussian Registration for Head Avatar Creation and Editing

Title: Beyond the Embedding Bottleneck: Adaptive Retrieval-Augmented 3D CT Report Generation

Title: EvoIQA - Explaining Image Distortions with Evolved White-Box Logic

Title: Generative Inverse Design with Abstention via Diagonal Flow Matching

Title: GASP: Guided Asymmetric Self-Play For Coding LLMs

Title: UMO: Unified In-Context Learning Unlocks Motion Foundation Model Priors

Title: FlatLands: Generative Floormap Completion From a Single Egocentric View

Title: Collaborative Temporal Feature Generation via Critic-Free Reinforcement Learning for Cross-User Sensor-Based Activity Recognition

Title: Interact3D: Compositional 3D Generation of Interactive Objects

Title: LICA: Layered Image Composition Annotations for Graphic Design Research

Title: OneWorld: Taming Scene Generation with 3D Unified Representation Autoencoder

Title: Out-of-Distribution Object Detection in Street Scenes via Synthetic Outlier Exposure and Transfer Learning

Title: When Generative Augmentation Hurts: A Benchmark Study of GAN and Diffusion Models for Bias Correction in AI Classification Systems

Title: Rethinking UMM Visual Generation: Masked Modeling for Efficient Image-Only Pre-training

Title: EFF-Grasp: Energy-Field Flow Matching for Physics-Aware Dexterous Grasp Generation

Title: Execution-Grounded Credit Assignment for GRPO in Code Generation

Title: AI-Generated Figures in Academic Publishing: Policies, Tools, and Practical Guidelines

Title: 360° Image Perception with MLLMs: A Comprehensive Benchmark and a Training-Free Method

Title: ECHO: Edge-Cloud Humanoid Orchestration for Language-to-Motion Control

Title: Reliable Reasoning in SVG-LLMs via Multi-Task Multi-Reward Reinforcement Learning

Title: S-VAM: Shortcut Video-Action Model by Self-Distilling Geometric and Semantic Foresight

Title: Leveling3D: Leveling Up 3D Reconstruction with Feed-Forward 3D Gaussian Splatting and Geometry-Aware Generation

Title: RASLF: Representation-Aware State Space Model for Light Field Super-Resolution

Title: Synergizing Deep Learning and Biological Heuristics for Extreme Long-Tail White Blood Cell Classification

Title: Visual Prompt Discovery via Semantic Exploration

Title: Point-to-Mask: From Arbitrary Point Annotations to Mask-Level Infrared Small Target Detection

Title: VIGOR: VIdeo Geometry-Oriented Reward for Temporal Generative Alignment

Title: Persistent Story World Simulation with Continuous Character Customization

Title: DriveFix: Spatio-Temporally Coherent Driving Scene Restoration

Title: Iris: Bringing Real-World Priors into Diffusion Model for Monocular Depth Estimation

Title: $D^3$-RSMDE: 40$\times$ Faster and High-Fidelity Remote Sensing Monocular Depth Estimation

Title: Advancing Visual Reliability: Color-Accurate Underwater Image Enhancement for Real-Time Underwater Missions

Title: FederatedFactory: Generative One-Shot Learning for Extremely Non-IID Distributed Scenarios

Title: InViC: Intent-aware Visual Cues for Medical Visual Question Answering

Title: Semantic One-Dimensional Tokenizer for Image Reconstruction and Generation

Title: DermaFlux: Synthetic Skin Lesion Generation with Rectified Flows for Enhanced Image Classification

Title: DISCOVER: A Solver for Distributional Counterfactual Explanations

Title: Unified Removal of Raindrops and Reflections: A New Benchmark and A Novel Pipeline

Title: Unlearning for One-Step Generative Models via Unbalanced Optimal Transport

Title: VIEW2SPACE: Studying Multi-View Visual Reasoning from Sparse Observations

Title: CompDiff: Hierarchical Compositional Diffusion for Fair and Zero-Shot Intersectional Medical Image Generation

Title: VideoMatGen: PBR Materials through Joint Generative Modeling

Title: Face2Scene: Using Facial Degradation as an Oracle for Diffusion-Based Scene Restoration

Title: REFORGE: Multi-modal Attacks Reveal Vulnerable Concept Unlearning in Image Generation Models

Title: When and Why Does Unsupervised RL Succeed in Mathematical Reasoning? A Manifold Envelopment Perspective

Title: Rationale Matters: Learning Transferable Rubrics via Proxy-Guided Critique for VLMReward Models

Title: ACPV-Net: All-Class Polygonal Vectorization for Seamless Vector Map Generation from Aerial Imagery

Title: Fast-WAM: Do World Action Models Need Test-time Future Imagination?

Title: Search2Motion: Training-Free Object-Level Motion Control via Attention-Consensus Search

Title: Emotion-Aware Classroom Quality Assessment Leveraging IoT-Based Real-Time Student Monitoring

Title: GeMA: Learning Latent Manifold Frontiers for Benchmarking Complex Systems

Title: Bayesian Inference of Psychometric Variables From Brain and Behavior in Implicit Association Tests

Title: Semi-supervised Latent Disentangled Diffusion Model for Textile Pattern Generation

Title: pADAM: A Plug-and-Play All-in-One Diffusion Architecture for Multi-Physics Learning

Title: GDPO-SR: Group Direct Preference Optimization for One-Step Generative Image Super-Resolution

Title: IOSVLM: A 3D Vision-Language Model for Unified Dental Diagnosis from Intraoral Scans

Title: V-Co: A Closer Look at Visual Representation Alignment via Co-Denoising

Title: Adaptive Moments are Surprisingly Effective for Plug-and-Play Diffusion Sampling

Title: RaDAR: Relation-aware Diffusion-Asymmetric Graph Contrastive Learning for Recommendation

Title: SparkVSR: Interactive Video Super-Resolution via Sparse Keyframe Propagation

Title: Efficient Reasoning on the Edge

Title: SegviGen: Repurposing 3D Generative Model for Part Segmentation

Title: Demystifing Video Reasoning

Title: WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation