2025-02-28

Title: On the Interpolation Effect of Score Smoothing

Title: Evaluating the Suitability of Different Intraoral Scan Resolutions for Deep Learning-Based Tooth Segmentation

Title: Retrieval Augmented Anomaly Detection (RAAD): Nimble Model Adjustment Without Retraining

Title: Improving Representation Learning of Complex Critical Care Data with ICU-BERT

Title: cMIM: A Contrastive Mutual Information Framework for Unified Generative and Discriminative Representation Learning

Title: Adaptive Score Alignment Learning for Continual Perceptual Quality Assessment of 360-Degree Videos in Virtual Reality

Title: SubZero: Composing Subject, Style, and Action via Zero-Shot Personalization

Title: BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance

Title: You Only Click Once: Single Point Weakly Supervised 3D Instance Segmentation for Autonomous Driving

Title: SAP-DIFF: Semantic Adversarial Patch Generation for Black-Box Face Recognition Models via Diffusion Models

Title: Finding Local Diffusion Schrödinger Bridge using Kolmogorov-Arnold Network

Title: MFSR: Multi-fractal Feature for Super-resolution Reconstruction with Fine Details Recovery

Title: UIFace: Unleashing Inherent Model Capabilities to Enhance Intra-Class Diversity in Synthetic Face Recognition

Title: Analyzing CLIP's Performance Limitations in Multi-Object Scenarios: A Controlled High-Resolution Study

Title: Knowledge Bridger: Towards Training-free Missing Multi-modality Completion

Title: ProAPO: Progressively Automatic Prompt Optimization for Visual Classification

Title: One-for-More: Continual Diffusion Model for Anomaly Detection

Title: C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation

Title: GenPC: Zero-shot Point Cloud Completion via 3D Generative Priors

Title: Shifting the Paradigm: A Diffeomorphism Between Time Series Data Manifolds for Achieving Shift-Invariancy in Deep Learning

Title: Identity-preserving Distillation Sampling by Fixed-Point Iterator

Title: Can Large Language Models Unveil the Mysteries? An Exploration of Their Ability to Unlock Information in Complex Scenarios

Title: Enhanced Contrastive Learning with Multi-view Longitudinal Data for Chest X-ray Report Generation

Title: A Generative Model Enhanced Multi-Agent Reinforcement Learning Method for Electric Vehicle Charging Navigation

Title: VDT-Auto: End-to-end Autonomous Driving with VLM-Guided Diffusion Transformers

Title: FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute

Title: Adaptive H&E-IHC information fusion staining framework based on feature extra

Title: Similarity-Distance-Magnitude Universal Verification

Title: Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

Title: Attention Distillation: A Unified Approach to Visual Characteristics Transfer

Title: Do computer vision foundation models learn the low-level characteristics of the human visual system?

Title: Mobius: Text to Seamless Looping Video Generation via Latent Shift

Title: FlexVAR: Flexible Visual Autoregressive Modeling without Residual Prediction

Title: Multi-Scale Neighborhood Occupancy Masked Autoencoder for Self-Supervised Learning in LiDAR Point Clouds

Title: UniTok: A Unified Tokenizer for Visual Generation and Understanding

Title: ARTalk: Speech-Driven 3D Head Animation via Autoregressive Model

Title: Ready-to-React: Online Reaction Policy for Two-Character Interaction Generation

Title: Constrained Generative Modeling with Manually Bridged Diffusion Models

Title: Multi-Turn Code Generation Through Single-Step Rewards

Title: Why Are Web AI Agents More Vulnerable Than Standalone LLMs? A Security Analysis

Title: Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation

Title: InterMimic: Towards Universal Whole-Body Control for Physics-Based Human-Object Interactions