2025-02-28

Title: On the Interpolation Effect of Score Smoothing

Title: TRIX: A More Expressive Model for Zero-shot Domain Transfer in Knowledge Graphs

Title: Mixtraining: A Better Trade-Off Between Compute and Performance

Title: Retrieval Augmented Anomaly Detection (RAAD): Nimble Model Adjustment Without Retraining

Title: Distill Not Only Data but Also Rewards: Can Smaller Language Models Surpass Larger Ones?

Title: Stay Focused: Problem Drift in Multi-Agent Debate

Title: Tell me why: Visual foundation models as self-explainable classifiers

Title: NeoBERT: A Next-Generation BERT

Title: Improving Representation Learning of Complex Critical Care Data with ICU-BERT

Title: Is Your Paper Being Reviewed by an LLM? A New Benchmark Dataset and Approach for Detecting AI Text in Peer Review

Title: 3D Nephrographic Image Synthesis in CT Urography with the Diffusion Model and Swin Transformer

Title: cMIM: A Contrastive Mutual Information Framework for Unified Generative and Discriminative Representation Learning

Title: SubZero: Composing Subject, Style, and Action via Zero-Shot Personalization

Title: BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance

Title: You Only Click Once: Single Point Weakly Supervised 3D Instance Segmentation for Autonomous Driving

Title: Spatial-Spectral Diffusion Contrastive Representation Network for Hyperspectral Image Classification

Title: Language-Informed Hyperspectral Image Synthesis for Imbalanced-Small Sample Classification via Semi-Supervised Conditional Diffusion Model

Title: SAP-DIFF: Semantic Adversarial Patch Generation for Black-Box Face Recognition Models via Diffusion Models

Title: Recent Advances on Generalizable Diffusion-generated Image Detection

Title: Learning Mask Invariant Mutual Information for Masked Image Modeling

Title: Few-Shot Multilingual Open-Domain QA from 5 Examples

Title: Finding Local Diffusion Schrödinger Bridge using Kolmogorov-Arnold Network

Title: EdiText: Controllable Coarse-to-Fine Text Editing with Diffusion Language Models

Title: In-Context Learning with Hypothesis-Class Guidance

Title: Mixtera: A Data Plane for Foundation Model Training

Title: MFSR: Multi-fractal Feature for Super-resolution Reconstruction with Fine Details Recovery

Title: UIFace: Unleashing Inherent Model Capabilities to Enhance Intra-Class Diversity in Synthetic Face Recognition

Title: Implicit Search via Discrete Diffusion: A Study on Chess

Title: Analyzing CLIP's Performance Limitations in Multi-Object Scenarios: A Controlled High-Resolution Study

Title: CLIP Under the Microscope: A Fine-Grained Analysis of Multi-Object Representation

Title: One-for-More: Continual Diffusion Model for Anomaly Detection

Title: MIND: Towards Immersive Psychological Healing with Multi-agent Inner Dialogue

Title: C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation

Title: High-Fidelity Relightable Monocular Portrait Animation with Lighting-Controllable Video Diffusion Model

Title: GenPC: Zero-shot Point Cloud Completion via 3D Generative Priors

Title: Image Referenced Sketch Colorization Based on Animation Creation Workflow

Title: SeisMoLLM: Advancing Seismic Monitoring via Cross-modal Transfer with Pre-trained Large Language Model

Title: A Generative Model Enhanced Multi-Agent Reinforcement Learning Method for Electric Vehicle Charging Navigation

Title: VDT-Auto: End-to-end Autonomous Driving with VLM-Guided Diffusion Transformers

Title: Self-Training Elicits Concise Reasoning in Large Language Models

Title: FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute

Title: Your contrastive learning problem is secretly a distribution alignment problem

Title: Adaptive H&E-IHC information fusion staining framework based on feature extra

Title: Learning to Generalize without Bias for Open-Vocabulary Action Recognition

Title: Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

Title: Avat3r: Large Animatable Gaussian Reconstruction Model for High-fidelity 3D Head Avatars

Title: Attention Distillation: A Unified Approach to Visual Characteristics Transfer

Title: From Retrieval to Generation: Comparing Different Approaches

Title: Do computer vision foundation models learn the low-level characteristics of the human visual system?

Title: Vector-Quantized Vision Foundation Models for Object-Centric Learning

Title: Explainable, Multi-modal Wound Infection Classification from Images Augmented with Generated Captions

Title: Mobius: Text to Seamless Looping Video Generation via Latent Shift

Title: FlexVAR: Flexible Visual Autoregressive Modeling without Residual Prediction

Title: Multi-Scale Neighborhood Occupancy Masked Autoencoder for Self-Supervised Learning in LiDAR Point Clouds

Title: ARTalk: Speech-Driven 3D Head Animation via Autoregressive Model

Title: Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization

Title: Ready-to-React: Online Reaction Policy for Two-Character Interaction Generation

Title: Constrained Generative Modeling with Manually Bridged Diffusion Models

Title: Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation

Title: InterMimic: Towards Universal Whole-Body Control for Physics-Based Human-Object Interactions