2025-01-07

Title: INFELM: In-depth Fairness Evaluation of Large Text-To-Image Models

Title: Gender Bias in Text-to-Video Generation Models: A case study of Sora

Title: CRRG-CLIP: Automatic Generation of Chest Radiology Reports and Classification of Chest Radiographs

Title: Towards Sustainable Large Language Model Serving

Title: SmartSpatial: Enhancing the 3D Spatial Arrangement Capabilities of Stable Diffusion Models and Introducing a Novel 3D Spatial Evaluation Framework

Title: On the Utility of Equivariance and Symmetry Breaking in Deep Learning Architectures on Point Clouds

Title: Communication Efficient Cooperative Edge AI via Event-Triggered Computation Offloading

Title: Information Subtraction: Learning Representations for Conditional Entropy

Title: Machine Learning-Based Differential Diagnosis of Parkinson's Disease Using Kinematic Feature Extraction and Selection

Title: Spot Risks Before Speaking! Unraveling Safety Attention Heads in Large Vision-Language Models

Title: MRG: A Multi-Robot Manufacturing Digital Scene Generation Method Using Multi-Instance Point Cloud Registration

Title: DreamMask: Boosting Open-vocabulary Panoptic Segmentation with Synthetic Data

Title: Active Learning Enables Extrapolation in Molecular Generative Models

Title: Plasma-CycleGAN: Plasma Biomarker-Guided MRI to PET Cross-modality Translation Using Conditional CycleGAN

Title: Generating Multimodal Images with GAN: Integrating Text, Image, and Style

Title: Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey

Title: Unsupervised Class Generation to Expand Semantic Segmentation Datasets

Title: TDM: Temporally-Consistent Diffusion Model for All-in-One Real-World Video Restoration

Title: DiffGraph: Heterogeneous Graph Diffusion Model

Title: Optimizing Small Language Models for In-Vehicle Function-Calling

Title: Generalizable Origin Identification for Text-Guided Image-to-Image Diffusion Models

Title: Guiding Medical Vision-Language Models with Explicit Visual Prompts: Framework Design and Comprehensive Exploration of Prompt Variations

Title: MedSegDiffNCA: Diffusion Models With Neural Cellular Automata for Skin Lesion Segmentation

Title: Noise-Tolerant Hybrid Prototypical Learning with Noisy Web Data

Title: ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling

Title: Layout2Scene: 3D Semantic Layout Guided Scene Generation via Geometry and Appearance Diffusion Priors

Title: Face-MakeUp: Multimodal Facial Prompts for Text-to-Image Generation

Title: Vision-Driven Prompt Optimization for Large Language Models in Multimodal Generative Tasks

Title: LeetDecoding: A PyTorch Library for Exponentially Decaying Causal Linear Attention with CUDA Implementations

Title: DepthMaster: Taming Diffusion Models for Monocular Depth Estimation

Title: Multispectral Pedestrian Detection with Sparsely Annotated Label

Title: A New Interpretation of the Certainty-Equivalence Approach for PAC Reinforcement Learning with a Generative Model

Title: GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking

Title: Underwater Image Restoration Through a Prior Guided Hybrid Sense Approach and Extensive Benchmark Analysis

Title: Persistence of Backdoor-based Watermarks for Neural Networks: A Comprehensive Evaluation

Title: Multilevel Semantic-Aware Model for AI-Generated Video Quality Assessment

Title: Holistic Semantic Representation for Navigational Trajectory Generation

Title: Brick-Diffusion: Generating Long Videos with Brick-to-Wall Denoising

Title: LDMapNet-U: An End-to-End System for City-Scale Lane-Level Map Updating

Title: InpDiffusion: Image Inpainting Localization via Conditional Diffusion Models

Title: HOGSA: Bimanual Hand-Object Interaction Understanding with 3D Gaussian Splatting Based Data Augmentation

Title: Large Language Models for Video Surveillance Applications

Title: Synthetic Fungi Datasets: A Time-Aligned Approach

Title: Conditional Mutual Information Based Diffusion Posterior Sampling for Solving Inverse Problems

Title: Pointmap-Conditioned Diffusion for Consistent Novel View Synthesis

Title: SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild

Title: Fuzzy Granule Density-Based Outlier Detection with Multi-Scale Granular Balls

Title: STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

Title: TransPixar: Advancing Text-to-Video Generation with Transparency

Title: Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation

Title: CAT: Content-Adaptive Image Tokenization

Title: Geometry Restoration and Dewarping of Camera-Captured Document Images

Title: ProTracker: Probabilistic Integration for Robust and Accurate Point Tracking

Title: Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation

Title: Gaussian Masked Autoencoders