2025-11-24

Title: Motion Transfer-Enhanced StyleGAN for Generating Diverse Macaque Facial Expressions

Title: PairHuman: A High-Fidelity Photographic Dataset for Customized Dual-Person Generation

Title: SVG360: Multi-View SVG Generation with Geometric and Color Consistency from a Single SVG

Title: Mesh RAG: Retrieval Augmentation for Autoregressive Mesh Generation

Title: WorldGen: From Text to Traversable and Interactive 3D Worlds

Title: Towards Unified Vision Language Models for Forest Ecological Analysis in Earth Observation

Title: The use of vocal biomarkers in the detection of Parkinson's disease: a robust statistical performance comparison of classic machine learning models

Title: BOP-ASK: Object-Interaction Reasoning for Vision-Language Models

Title: Align & Invert: Solving Inverse Problems with Diffusion and Flow-based Models via Representational Alignment

Title: Warm Diffusion: Recipe for Blur-Noise Mixture Diffusion Models

Title: Q-REAL: Towards Realism and Plausibility Evaluation for AI-Generated Content

Title: PepEVOLVE: Position-Aware Dynamic Peptide Optimization via Group-Relative Advantage

Title: UniModel: A Visual-Only Framework for Unified Multimodal Understanding and Generation

Title: DeltaDeno: Zero-Shot Anomaly Generation via Delta-Denoising Attribution

Title: Rethinking Diffusion Model-Based Video Super-Resolution: Leveraging Dense Guidance from Aligned Features

Title: Neighbor GRPO: Contrastive ODE Policy Optimization Aligns Flow Models

Title: MatPedia: A Universal Generative Foundation for High-Fidelity Material Synthesis

Title: Two Heads Better than One: Dual Degradation Representation for Blind Super-Resolution

Title: Real-Time Cooked Food Image Synthesis and Visual Cooking Progress Monitoring on Edge Devices

Title: VLM-Augmented Degradation Modeling for Image Restoration Under Adverse Weather Conditions

Title: Vision Language Models are Confused Tourists

Title: Mask the Redundancy: Evolving Masking Representation Learning for Multivariate Time-Series Clustering

Title: Energy Scaling Laws for Diffusion Models: Quantifying Compute and Carbon Emissions in Image Generation

Title: RoomPlanner: Explicit Layout Planner for Easier LLM-Driven 3D Room Generation

Title: ReBrain: Brain MRI Reconstruction from Sparse CT Slice via Retrieval-Augmented Diffusion

Title: Diversity Has Always Been There in Your Visual Autoregressive Models

Title: Spanning Tree Autoregressive Visual Generation

Title: Four decades of circumpolar super-resolved satellite land surface temperature data

Title: One-Step Diffusion Transformer for Controllable Real-World Image Super-Resolution

Title: DiffRefiner: Coarse to Fine Trajectory Planning via Diffusion Refinement with Semantic Interaction for End to End Autonomous Driving

Title: FireScope: Wildfire Risk Prediction with a Chain-of-Thought Oracle

Title: PostCam: Camera-Controllable Novel-View Video Generation with Query-Shared Cross-Attention

Title: Dual-domain Adaptation Networks for Realistic Image Super-resolution

Title: FlexiFlow: decomposable flow matching for generation of flexible molecular ensemble

Title: Intervene-All-Paths: Unified Mitigation of LVLM Hallucinations across Alignment Formats

Title: A Little More Like This: Text-to-Image Retrieval with Vision-Language Models Using Relevance Feedback

Title: Range-Edit: Semantic Mask Guided Outdoor LiDAR Scene Editing

Title: Where Culture Fades: Revealing the Cultural Gap in Text-to-Image Generation

Title: Refracting Reality: Generating Images with Realistic Transparent Objects

Title: Loomis Painter: Reconstructing the Painting Process

Title: Designing and Generating Diverse, Equitable Face Image Datasets for Face Verification Tasks

Title: MCMoE: Completing Missing Modalities with Mixture of Experts for Incomplete Multimodal Action Quality Assessment

Title: Planning with Sketch-Guided Verification for Physics-Aware Video Generation

Title: Illustrator's Depth: Monocular Layer Index Prediction for Image Decomposition

Title: PersonaAgent with GraphRAG: Community-Aware Knowledge Graphs for Personalized LLM

Title: An Artificial Intelligence Framework for Measuring Human Spine Aging Using MRI

Title: EvDiff: High Quality Video with an Event Camera

Title: Native 3D Editing with Full Attention