2025-03-27

Title: Vanishing Depth: A Depth Adapter with Positional Depth Encoding for Generalized Image Encoders

Title: Self-Supervised Learning of Motion Concepts by Optimizing Counterfactuals

Title: Experience Replay Addresses Loss of Plasticity in Continual Learning

Title: Deep Learning Approaches for Blood Disease Diagnosis Across Hematopoietic Lineages

Title: Can Multi-modal (reasoning) LLMs work as deepfake detectors?

Title: Generative Linguistics, Large Language Models, and the Social Nature of Scientific Success

Title: Extendable Long-Horizon Planning via Hierarchical Multiscale Diffusion

Title: "Is There Anything Else?": Examining Administrator Influence on Linguistic Features from the Cookie Theft Picture Description Cognitive Test

Title: AIGC-assisted Federated Learning for Edge Intelligence: Architecture Design, Research Challenges and Future Directions

Title: Guiding Human-Object Interactions with Rich Geometry and Relations

Title: Offline Reinforcement Learning with Discrete Diffusion Skills

Title: Cross-Modal Prototype Allocation: Unsupervised Slide Representation Learning via Patch-Text Contrast in Computational Pathology

Title: Maya: Optimizing Deep Learning Training Workloads using Emulated Virtual Accelerators

Title: GAPO: Learning Preferential Prompt through Generative Adversarial Policy Optimization

Title: Beyond Words: Advancing Long-Text Image Generation via Multimodal Autoregressive Models

Title: Synthetic-to-Real Self-supervised Robust Depth Estimation via Learning with Motion and Structure Priors

Title: Video Motion Graphs

Title: DINeMo: Learning Neural Mesh Models with no 3D Annotations

Title: Unconditional Priors Matter! Improving Conditional Generation of Fine-Tuned Diffusion Models

Title: LogicQA: Logical Anomaly Detection with Vision Language Model Generated Questions

Title: Mamba-3D as Masked Autoencoders for Accurate and Data-Efficient Analysis of Medical Ultrasound Videos

Title: EGVD: Event-Guided Video Diffusion Model for Physically Realistic Large-Motion Frame Interpolation

Title: Faster Parameter-Efficient Tuning with Token Redundancy Reduction

Title: RelTriple: Learning Plausible Indoor Layouts by Integrating Relationship Triples into the Diffusion Process

Title: Traversing Distortion-Perception Tradeoff using a Single Score-Based Generative Model

Title: Wan: Open and Advanced Large-Scale Video Generative Models

Title: VideoGEM: Training-free Action Grounding in Videos

Title: Consistency Trajectory Matching for One-Step Generative Super-Resolution

Title: CNN+Transformer Based Anomaly Traffic Detection in UAV Networks for Emergency Rescue

Title: FastFT: Accelerating Reinforced Feature Transformation via Advanced Exploration Strategies

Title: ITA-MDT: Image-Timestep-Adaptive Masked Diffusion Transformer Framework for Image-Based Virtual Try-On

Title: Latent Beam Diffusion Models for Decoding Image Sequences

Title: Dissecting and Mitigating Diffusion Bias via Mechanistic Interpretability

Title: Contrastive Learning Guided Latent Diffusion Model for Image-to-Image Translation

Title: VPO: Aligning Text-to-Video Generation Models with Prompt Optimization

Title: Small Object Detection: A Comprehensive Survey on Challenges, Techniques and Real-World Applications

Title: MAR-3D: Progressive Masked Auto-regressor for High-Resolution 3D Generation

Title: GAIA-2: A Controllable Multi-View Generative World Model for Autonomous Driving

Title: TD-BFR: Truncated Diffusion Model for Efficient Blind Face Restoration

Title: TerraTorch: The Geospatial Foundation Models Toolkit

Title: Diffusion Counterfactuals for Image Regressors

Title: β-GNN: A Robust Ensemble Approach Against Graph Structure Perturbation

Title: MMGen: Unified Multi-modal Image Generation and Understanding in One Go

Title: Imitating Radiological Scrolling: A Global-Local Attention Model for 3D Chest CT Volumes Multi-Label Anomaly Classification

Title: ARMO: Autoregressive Rigging for Multi-Category Objects

Title: Flip Learning: Weakly Supervised Erase to Segment Nodules in Breast Ultrasound

Title: From Annotation to Adaptation: Metrics, Synthetic Data, and Aspect Extraction for Aspect-Based Sentiment Analysis with Large Language Models

Title: Learning Straight Flows by Learning Curved Interpolants

Title: A weakly-supervised deep learning model for fast localisation and delineation of the skeleton, internal organs, and spinal canal on Whole-Body Diffusion-Weighted MRI (WB-DWI)

Title: Dynamic Motion Blending for Versatile Motion Editing

Title: RecTable: Fast Modeling Tabular Data with Rectified Flow

Title: High Quality Diffusion Distillation on a Single GPU with Relative and Absolute Position Matching

Title: UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines

Title: Reliable algorithm selection for machine learning-guided design

Title: Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields

Title: FB-4D: Spatial-Temporal Coherent Dynamic 3D Content Generation with Feature Banks

Title: Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency