2025-03-20

Title: Synthetic Data Generation of Body Motion Data by Neural Gas Network for Emotion Recognition

Title: Cafe-Talk: Generating 3D Talking Face Animation with Multimodal Coarse- and Fine-grained Control

Title: PANDORA: Diffusion Policy Learning for Dexterous Robotic Piano Playing

Title: Matching Skeleton-based Activity Representations with Heterogeneous Signals for HAR

Title: Sampling Decisions

Title: SuperPC: A Single Diffusion Model for Point Cloud Completion, Upsampling, Denoising, and Colorization

Title: Potential Score Matching: Debiasing Molecular Structure Sampling with Potential Energy Guidance

Title: Robust Weight Imprinting: Insights from Neural Collapse and Proxy-Based Aggregation

Title: Anomaly-Flow: A Multi-domain Federated Generative Adversarial Network for Distributed Denial-of-Service Detection

Title: Retrieval-Augmented Simulacra: Generative Agents for Up-to-date and Knowledge-Adaptive Simulations

Title: Dynamic Accumulated Attention Map for Interpreting Evolution of Decision-Making in Vision Transformer

Title: A Simple Combination of Diffusion Models for Better Quality Trade-Offs in Image Denoising

Title: ShapeShift: Towards Text-to-Shape Arrangement Synthesis with Content-Aware Geometric Constraints

Title: Bayesian Modeling of Zero-Shot Classifications for Urban Flood Detection

Title: Pruning-Based TinyML Optimization of Machine Learning Models for Anomaly Detection in Electric Vehicle Charging Infrastructure

Title: MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models

Title: Decompositional Neural Scene Reconstruction with Generative Diffusion Prior

Title: SemanticFlow: A Self-Supervised Framework for Joint Scene Flow Prediction and Instance Segmentation in Dynamic Environments

Title: LogLLaMA: Transformer-based log anomaly detection with LLaMA

Title: 1000 Layer Networks for Self-Supervised RL: Scaling Depth Can Enable New Goal-Reaching Capabilities

Title: Temporal-Consistent Video Restoration with Pre-trained Diffusion Models

Title: Efficient Personalization of Quantized Diffusion Model without Backpropagation

Title: When Domain Generalization meets Generalized Category Discovery: An Adaptive Task-Arithmetic Driven Approach

Title: Robust Distribution Alignment for Industrial Anomaly Detection under Distribution Shift

Title: Derm1M: A Million-scale Vision-Language Dataset Aligned with Clinical Ontology Knowledge for Dermatology

Title: GenM$^3$: Generative Pretrained Multi-path Motion Model for Text Conditional Human Motion Generation

Title: Shushing! Let's Imagine an Authentic Speech from the Silent Video

Title: MMAIF: Multi-task and Multi-degradation All-in-One for Image Fusion with Language Guidance

Title: Generating Multimodal Driving Scenes via Next-Scene Prediction

Title: Ultrasound Image-to-Video Synthesis via Latent Dynamic Diffusion Models

Title: Language-based Image Colorization: A Benchmark and Beyond

Title: Taming Flow Matching with Unbalanced Optimal Transport into Fast Pansharpening

Title: Exploiting Diffusion Prior for Real-World Image Dehazing with Unpaired Training

Title: Multivariate Gaussian Topic Modelling: A novel approach to discover topics with greater semantic coherence

Title: Single-Step Bidirectional Unpaired Image Translation Using Implicit Bridge Consistency Distillation

Title: Conjuring Positive Pairs for Efficient Unification of Representation Learning and Image Synthesis

Title: Towards Understanding the Safety Boundaries of DeepSeek Models: Evaluation and Findings

Title: Diffusion-Based Forecasting for Uncertainty-Aware Model Predictive Control

Title: When the Future Becomes the Past: Taming Temporal Correspondence for Self-supervised Video Representation Learning

Title: DeCaFlow: A Deconfounding Causal Generative Model

Title: Object-Centric Pretraining via Target Encoder Bootstrapping

Title: PointSFDA: Source-free Domain Adaptation for Point Cloud Completion

Title: Detect-and-Guide: Self-regulation of Diffusion Models for Safe Text-to-Image Generation via Guideline Token Optimization

Title: DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation

Title: A Foundation Model for Patient Behavior Monitoring and Suicide Detection

Title: BigO(Bench) -- Can LLMs Generate Code with Controlled Time and Space Complexity?

Title: LEGION: Learning to Ground and Explain for Synthetic Image Detection

Title: Visual Persona: Foundation Model for Full-Body Human Customization

Title: Learn Your Scales: Towards Scale-Consistent Generative Novel View Synthesis

Title: Automated Processing of eXplainable Artificial Intelligence Outputs in Deep Learning Models for Fault Diagnostics of Large Infrastructures

Title: LIFT: Latent Implicit Functions for Task- and Data-Agnostic Encoding

Title: MotionStreamer: Streaming Motion Generation via Diffusion-based Autoregressive Model in Causal Latent Space

Title: Di$\mathtt{[M]}$O: Distilling Masked Diffusion Models into One-step Generator

Title: From 1,000,000 Users to Every User: Scaling Up Personalized Preference for User-level Alignment

Title: FP4DiT: Towards Effective Floating Point Quantization for Diffusion Transformers

Title: EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining

Title: Cube: A Roblox View of 3D Intelligence

Title: Value Profiles for Encoding Human Variation

Title: TULIP: Towards Unified Language-Image Pretraining