2025-05-05

Title: Fast2comm:Collaborative perception combined with prior knowledge

Title: InstructAttribute: Fine-grained Object Attributes editing with Instruction

Title: Multi-Modal Language Models as Text-to-Image Model Evaluators

Title: Scalable Unit Harmonization in Medical Informatics Using Bi-directional Transformers and Bayesian-Optimized BM25 and Sentence Embedding Retrieval

Title: Data-Driven Optical To Thermal Inference in Pool Boiling Using Generative Adversarial Networks

Title: The Comparability of Model Fusion to Measured Data in Confuser Rejection

Title: NeMo-Inspector: A Visualization Tool for LLM Generation Analysis

Title: Tree-Sliced Wasserstein Distance with Nonlinear Projection

Title: Generating Animated Layouts as Structured Text Representations

Title: Deterministic-to-Stochastic Diverse Latent Feature Mapping for Human Motion Synthesis

Title: Where's the liability in the Generative Era? Recovery-based Black-Box Detection of AI-Generated Content

Title: Multi-Step Consistency Models: Fast Generation with Theoretical Guarantees

Title: Efficient Vocabulary-Free Fine-Grained Visual Recognition in the Age of Multimodal LLMs

Title: Improving Editability in Image Generation with Layer-wise Memory

Title: Any-to-Any Vision-Language Model for Multimodal X-ray Imaging and Radiological Report Generation

Title: Evaluating Vision Language Model Adaptations for Radiology Report Generation in Low-Resource Languages

Title: VSC: Visual Search Compositional Text-to-Image Diffusion Model

Title: Incorporating Inductive Biases to Energy-based Generative Models

Title: Aggregation of Dependent Expert Distributions in Multimodal Variational Autoencoders

Title: Harmonizing Intra-coherence and Inter-divergence in Ensemble Attacks for Adversarial Transferability

Title: Distilling Two-Timed Flow Models by Separately Matching Initial and Terminal Velocities

Title: FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Principal Component Analysis

Title: TSTMotion: Training-free Scene-awarenText-to-motion Generation

Title: Enhancing Obsolescence Forecasting with Deep Generative Data Augmentation: A Semi-Supervised Framework for Low-Data Industrial Applications

Title: FreeInsert: Disentangled Text-Guided Object Insertion in 3D Gaussian Scene without Spatial Priors

Title: VIDSTAMP: A Temporally-Aware Watermark for Ownership and Integrity in Video Diffusion Models