2025-03-26

Title: Generative Data Imputation for Sparse Learner Performance Data Using Generative Adversarial Imputation Networks

Title: DisentTalk: Cross-lingual Talking Face Generation via Semantic Disentangled Diffusion Model

Title: RomanTex: Decoupling 3D-aware Rotary Positional Embedded Multi-Attention Network for Texture Synthesis

Title: DiffV2IR: Visible-to-Infrared Diffusion Model via Vision-Language Understanding

Title: Color Conditional Generation with Sliced Wasserstein Guidance

Title: HingeRLC-GAN: Combating Mode Collapse with Hinge Loss and RLC Regularization

Title: Paving the way for scientific foundation models: enhancing generalization and robustness in PDEs with constraint-aware pre-training

Title: Anomaly Detection Using Computer Vision: A Comparative Analysis of Class Distinction and Performance Metrics

Title: MIRAGE: Multimodal Immersive Reasoning and Guided Exploration for Red-Team Jailbreak Attacks

Title: Risk-Based Thresholding for Reliable Anomaly Detection in Concentrated Solar Power Plants

Title: HOIGPT: Learning Long Sequence Hand-Object Interaction with Language Models

Title: SoK: How Robust is Audio Watermarking in Generative AI models?

Title: FDS: Frequency-Aware Denoising Score for Text-Guided Latent Diffusion Image Editing

Title: Open-Vocabulary Functional 3D Scene Graphs for Real-World Indoor Spaces

Title: Learning Hazing to Dehazing: Towards Realistic Haze Generation for Real-World Image Dehazing

Title: ISPDiffuser: Learning RAW-to-sRGB Mappings with Texture-Aware Diffusion Models and Histogram-Guided Color Consistency

Title: Exploring Semantic Feature Discrimination for Perceptual Image Super-Resolution and Opinion-Unaware No-Reference Image Quality Assessment

Title: UniMoMo: Unified Generative Modeling of 3D Molecules for De Novo Binder Design

Title: LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer Text

Title: ImageGen-CoT: Enhancing Text-to-Image In-context Learning with Chain-of-Thought Reasoning

Title: Efficient Adversarial Detection Frameworks for Vehicle-to-Microgrid Services in Edge Computing

Title: Long-Context Autoregressive Video Modeling with Next-Frame Prediction

Title: BADGR: Bundle Adjustment Diffusion Conditioned by GRadients for Wide-Baseline Floor Plan Reconstruction

Title: Data-driven Mesoscale Weather Forecasting Combining Swin-Unet and Diffusion Models

Title: Correcting Deviations from Normality: A Reformulated Diffusion Model for Multi-Class Unsupervised Anomaly Detection

Title: Show and Segment: Universal Medical Image Segmentation via In-Context Learning

Title: ImageSet2Text: Describing Sets of Images through Text

Title: VGAT: A Cancer Survival Analysis Framework Transitioning from Generative Visual Question Answering to Genomic Reconstruction

Title: EfficientMT: Efficient Temporal Adaptation for Motion Transfer in Text-to-Video Diffusion Models

Title: DeClotH: Decomposable 3D Cloth and Human Body Reconstruction from a Single Image

Title: Interpretable Generative Models through Post-hoc Concept Bottlenecks

Title: Social Network User Profiling for Anomaly Detection Based on Graph Neural Networks

Title: MVPortrait: Text-Guided Motion and Emotion Control for Multi-view Vivid Portrait Animation

Title: Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing

Title: Quantifying the Ease of Reproducing Training Data in Unconditional Diffusion Models

Title: Towards Robust Time-of-Flight Depth Denoising with Confidence-Aware Diffusion Model

Title: SparseGS-W: Sparse-View 3D Gaussian Splatting in the Wild with Generative Priors

Title: G-DexGrasp: Generalizable Dexterous Grasping Synthesis Via Part-Aware Prior Retrieval and Prior-Assisted Generation

Title: AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset

Title: Noisier2Inverse: Self-Supervised Learning for Image Reconstruction with Correlated Noise

Title: GenHancer: Imperfect Generative Models are Secretly Strong Vision-Centric Enhancers

Title: KSHSeek: Data-Driven Approaches to Mitigating and Detecting Knowledge-Shortcut Hallucinations in Generative Models

Title: Exploring Disentangled and Controllable Human Image Synthesis: From End-to-End to Stage-by-Stage

Title: SparSamp: Efficient Provably Secure Steganography Based on Sparse Sampling

Title: VectorFit : Adaptive Singular & Bias Vector Fine-Tuning of Pre-trained Foundation Models

Title: Dance Like a Chicken: Low-Rank Stylization for Human Motion Diffusion

Title: Post-Hoc Calibrated Anomaly Detection

Title: Video Anomaly Detection with Contours - A Study

Title: Optimization through In-Context Learning and Iterative LLM Prompting for Nuclear Engineering Design Problems

Title: Show or Tell? Effectively prompting Vision-Language Models for semantic segmentation

Title: OpenSDI: Spotting Diffusion-Generated Images in the Open World

Title: CoSimGen: Controllable Diffusion Model for Simultaneous Image and Mask Generation

Title: PCM : Picard Consistency Model for Fast Parallel Sampling of Diffusion Models

Title: FUSE: Label-Free Image-Event Joint Monocular Depth Estimation via Frequency-Decoupled Alignment and Degradation-Robust Fusion

Title: Surg-3M: A Dataset and Foundation Model for Perception in Surgical Settings

Title: ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation

Title: Fine-Grained Erasure in Text-to-Image Diffusion-based Foundation Models

Title: SITA: Structurally Imperceptible and Transferable Adversarial Attacks for Stylized Image Generation

Title: In the Blink of an Eye: Instant Game Map Editing using a Generative-AI Smart Brush

Title: Unpaired Object-Level SAR-to-Optical Image Translation for Aircraft with Keypoints-Guided Diffusion Models

Title: SeLIP: Similarity Enhanced Contrastive Language Image Pretraining for Multi-modal Head MRI

Title: Domain-incremental White Blood Cell Classification with Privacy-aware Continual Learning

Title: FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model

Title: An Overview of Low-Rank Structures in the Training and Adaptation of Large Models

Title: Mask$^2$DiT: Dual Mask-based Diffusion Transformer for Multi-Scene Long Video Generation

Title: Scaling Down Text Encoders of Text-to-Image Diffusion Models

Title: CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning

Title: ICE: Intrinsic Concept Extraction from a Single Image via Diffusion Models

Title: AvatarArtist: Open-Domain 4D Avatarization

Title: FullDiT: Multi-Task Video Generative Foundation Model with Full Attention

Title: SuperFlow++: Enhanced Spatiotemporal Consistency for Cross-Modal Data Pretraining

Title: PartRM: Modeling Part-Level Dynamics with Large Cross-State Reconstruction Model

Title: Learning 3D Object Spatial Relationships from Pre-trained 2D Diffusion Models