2025-09-09

Title: A Dataset Generation Scheme Based on Video2EEG-SPGN-Diffusion for SEED-VD

Title: RT-VLM: Re-Thinking Vision Language Model with 4-Clues for Real-World Object Recognition Robustness

Title: FAVAE-Effective Frequency Aware Latent Tokenizer

Title: EditIDv2: Editable ID Customization with Data-Lubricated ID Feature Integration for Text-to-Image Generation

Title: Context-Aware Multi-Turn Visual-Textual Reasoning in LVLMs via Dynamic Memory and Adaptive Visual Guidance

Title: Depth-Aware Super-Resolution via Distance-Adaptive Variational Formulation

Title: CRAB: Camera-Radar Fusion for Reducing Depth Ambiguity in Backward Projection based View Transformation

Title: A Probabilistic Segment Anything Model for Ambiguity-Aware Medical Image Segmentation

Title: X-SQL: Expert Schema Linking and Understanding of Text-to-SQL with Multi-LLMs

Title: Coefficients-Preserving Sampling for Reinforcement Learning with Flow Matching

Title: OmniStyle2: Scalable and High Quality Artistic Style Transfer Data Generation via Destylization

Title: Multi-Strategy Guided Diffusion via Sparse Masking Temporal Reweighting Distribution Correction

Title: BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models

Title: PolicyEvolve: Evolving Programmatic Policies by LLMs for multi-player games via Population-Based Training

Title: Home-made Diffusion Model from Scratch to Hatch

Title: If generative AI is the answer, what is the question?

Title: SpecSwin3D: Generating Hyperspectral Imagery from Multispectral Data via Transformer Networks

Title: RetinaGuard: Obfuscating Retinal Age in Fundus Images for Biometric Privacy Preserving

Title: UniVerse-1: Unified Audio-Video Generation via Stitching of Experts

Title: UNO: Unifying One-stage Video Scene Graph Generation via Object-Centric Visual Representation Learning

Title: UrbanMIMOMap: A Ray-Traced MIMO CSI Dataset with Precoding-Aware Maps and Benchmarks

Title: WindFM: An Open-Source Foundation Model for Zero-Shot Wind Power Forecasting

Title: Evaluating the Efficiency of Latent Spaces via the Coupling-Matrix

Title: Text4Seg++: Advancing Image Segmentation via Generative Language Modeling

Title: Towards scalable organ level 3D plant segmentation: Bridging the data algorithm computing gap

Title: A Fragile Number Sense: Probing the Elemental Limits of Numerical Reasoning in LLMs

Title: Your Super Resolution Model is not Enough for Tackling Real-World Scenarios

Title: VQualA 2025 Challenge on Image Super-Resolution Generated Content Quality Assessment: Methods and Results

Title: CAPMix: Robust Time Series Anomaly Detection Based on Abnormal Assumptions with Dual-Space Mixup

Title: Perception-oriented Bidirectional Attention Network for Image Super-resolution Quality Assessment

Title: A Statistical 3D Stomach Shape Model for Anatomical Analysis

Title: TIDE: Achieving Balanced Subject-Driven Image Generation via Target-Instructed Diffusion Enhancement

Title: On optimal solutions of classical and sliced Wasserstein GANs with non-Gaussian data

Title: Predicting Fetal Outcomes from Cardiotocography Signals Using a Supervised Variational Autoencoder

Title: Group Effect Enhanced Generative Adversarial Imitation Learning for Individual Travel Behavior Modeling under Incentives

Title: STAGE: Segmentation-oriented Industrial Anomaly Synthesis via Graded Diffusion with Explicit Mask Alignment

Title: Nested Optimal Transport Distances

Title: Zero-shot 3D-Aware Trajectory-Guided image-to-video generation via Test-Time Training

Title: Raw2Event: Converting Raw Frame Camera into Event Camera

Title: P3-SAM: Native 3D Part Segmentation

Title: SynthDrive: Scalable Real2Sim2Real Sensor Simulation Pipeline for High-Fidelity Asset Generation and Driving Data Synthesis

Title: MIORe & VAR-MIORe: Benchmarks to Push the Boundaries of Restoration

Title: UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward

Title: floq: Training Critics via Flow-Matching for Scaling Compute in Value-Based RL

Title: A New Hybrid Model of Generative Adversarial Network and You Only Look Once Algorithm for Automatic License-Plate Recognition

Title: BIR-Adapter: A Low-Complexity Diffusion Model Adapter for Blind Image Restoration

Title: From Noise to Narrative: Tracing the Origins of Hallucinations in Transformers

Title: Outcome-based Exploration for LLM Reasoning

Title: Interleaving Reasoning for Better Text-to-Image Generation