2025-03-03

Title: EgoNormia: Benchmarking Physical Social Norm Understanding

Title: Unified Kernel-Segregated Transpose Convolution Operation

Title: CoCa-CXR: Contrastive Captioners Learn Strong Temporal Structures for Chest X-Ray Vision-Language Understanding

Title: Towards Statistical Factuality Guarantee for Large Vision-Language Models

Title: LISArD: Learning Image Similarity to Defend Against Gray-box Adversarial Attacks

Title: InstaFace: Identity-Preserving Facial Editing with Single Image Inference

Title: RTGen: Real-Time Generative Detection Transformer

Title: Are LLMs Ready for Practical Adoption for Assertion Generation?

Title: Gungnir: Exploiting Stylistic Features in Images for Backdoor Attacks on Diffusion Models

Title: Advancing AI-Powered Medical Image Synthesis: Insights from MedVQA-GI Challenge Using CLIP, Fine-Tuned Stable Diffusion, and Dream-Booth + LoRA

Title: Diffusion Restoration Adapter for Real-World Image Restoration

Title: WorldModelBench: Judging Video Generation Models As World Models

Title: Towards General Visual-Linguistic Face Forgery Detection(V2)

Title: Generating Clinically Realistic EHR Data via a Hierarchy- and Semantics-Guided Transformer

Title: CADDreamer: CAD object Generation from Single-view Images

Title: Two-Stream Spatial-Temporal Transformer Framework for Person Identification via Natural Conversational Keypoints

Title: HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models

Title: MFSR-GAN: Multi-Frame Super-Resolution with Handheld Motion Modeling

Title: LADs: Leveraging LLMs for AI-Driven DevOps

Title: Oscillation-Reduced MXFP4 Training for Vision Transformers

Title: Adaptive Identification of Blurred Regions for Accurate Image Deblurring

Title: DiffBrush:Just Painting the Art by Your Hands

Title: BadRefSR: Backdoor Attacks Against Reference-based Image Super Resolution

Title: Generative Uncertainty in Diffusion Models

Title: Retrieval Augmented Generation for Topic Modeling in Organizational Research: An Introduction with Empirical Demonstration

Title: Fine-Grained Retrieval-Augmented Generation for Visual Question Answering

Title: Synthesizing Tabular Data Using Selectivity Enhanced Generative Adversarial Networks

Title: Synthesizing Individualized Aging Brains in Health and Disease with Generative Models and Parallel Transport

Title: Spatial Reasoning with Denoising Models

Title: Training-free and Adaptive Sparse Attention for Efficient Long Video Generation

Title: Rare event modeling with self-regularized normalizing flows: what can we learn from a single failure?

Title: A Review on Generative AI For Text-To-Image and Image-To-Image Generation and Implications To Scientific Images

Title: Autonomous Curriculum Design via Relative Entropy Based Task Modifications

Title: QFAL: Quantum Federated Adversarial Learning

Title: SYN-LUNGS: Towards Simulating Lung Nodules with Anatomy-Informed Digital Twins for AI Training

Title: BAnG: Bidirectional Anchored Generation for Conditional RNA Design

Title: Does Generation Require Memorization? Creative Diffusion Models using Ambient Diffusion

Title: MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing

Title: Raccoon: Multi-stage Diffusion Training with Coarse-to-Fine Curating Videos

Title: How far can we go with ImageNet for Text-to-Image generation?