2025-03-20

Title: Synthetic Data Generation of Body Motion Data by Neural Gas Network for Emotion Recognition

Title: Cafe-Talk: Generating 3D Talking Face Animation with Multimodal Coarse- and Fine-grained Control

Title: ReBot: Scaling Robot Learning with Real-to-Sim-to-Real Robotic Video Synthesis

Title: SAUCE: Selective Concept Unlearning in Vision-Language Models with Sparse Autoencoders

Title: PANDORA: Diffusion Policy Learning for Dexterous Robotic Piano Playing

Title: Sampling Decisions

Title: Fire and Smoke Datasets in 20 Years: An In-depth Review

Title: Redefining non-IID Data in Federated Learning for Computer Vision Tasks: Migrating from Labels to Embeddings for Task-Specific Data Distributions

Title: SuperPC: A Single Diffusion Model for Point Cloud Completion, Upsampling, Denoising, and Colorization

Title: SpecReX: Explainable AI for Raman Spectroscopy

Title: Potential Score Matching: Debiasing Molecular Structure Sampling with Potential Energy Guidance

Title: Robust Weight Imprinting: Insights from Neural Collapse and Proxy-Based Aggregation

Title: Command R7B Arabic: A Small, Enterprise Focused, Multilingual, and Culturally Aware Arabic LLM

Title: Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives

Title: Transparent Attested DNS for Confidential Computing Services

Title: Unique Hard Attention: A Tale of Two Sides

Title: Anomaly-Flow: A Multi-domain Federated Generative Adversarial Network for Distributed Denial-of-Service Detection

Title: Retrieval-Augmented Simulacra: Generative Agents for Up-to-date and Knowledge-Adaptive Simulations

Title: Dynamic Accumulated Attention Map for Interpreting Evolution of Decision-Making in Vision Transformer

Title: A Simple Combination of Diffusion Models for Better Quality Trade-Offs in Image Denoising

Title: Sepsyn-OLCP: An Online Learning-based Framework for Early Sepsis Prediction with Uncertainty Quantification using Conformal Prediction

Title: These Magic Moments: Differentiable Uncertainty Quantification of Radiance Field Models

Title: Generating Medically-Informed Explanations for Depression Detection using LLMs

Title: DPImageBench: A Unified Benchmark for Differentially Private Image Synthesis

Title: HaploVL: A Single-Transformer Baseline for Multi-Modal Understanding

Title: SplatVoxel: History-Aware Novel View Streaming without Temporal Training

Title: ShapeShift: Towards Text-to-Shape Arrangement Synthesis with Content-Aware Geometric Constraints

Title: Uncertainty Distillation: Teaching Language Models to Express Semantic Confidence

Title: LipShiFT: A Certifiably Robust Shift-based Vision Transformer

Title: Revisiting Image Fusion for Multi-Illuminant White-Balance Correction

Title: RAT: Boosting Misclassification Detection Ability without Extra Data

Title: SEEK: Self-adaptive Explainable Kernel For Nonstationary Gaussian Processes

Title: MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models

Title: Decompositional Neural Scene Reconstruction with Generative Diffusion Prior

Title: On the Robustness Tradeoff in Fine-Tuning

Title: SemanticFlow: A Self-Supervised Framework for Joint Scene Flow Prediction and Instance Segmentation in Dynamic Environments

Title: LogLLaMA: Transformer-based log anomaly detection with LLaMA

Title: Unlocking the Capabilities of Vision-Language Models for Generalizable and Explainable Deepfake Detection

Title: Global Renewables Watch: A Temporal Dataset of Solar and Wind Energy Derived from Satellite Imagery

Title: Fine-Grained Open-Vocabulary Object Detection with Fined-Grained Prompts: Task, Dataset and Benchmark

Title: Temporal-Consistent Video Restoration with Pre-trained Diffusion Models

Title: Efficient Personalization of Quantized Diffusion Model without Backpropagation

Title: Robust Support Vector Machines for Imbalanced and Noisy Data via Benders Decomposition

Title: Exploring the Limits of KV Cache Compression in Visual Autoregressive Transformers

Title: MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer

Title: Mitigating Object Hallucinations in MLLMs via Multi-Frequency Perturbations

Title: Deep Contrastive Unlearning for Language Models

Title: Spot the Fake: Large Multimodal Model-Based Synthetic Image Detection with Artifact Explanation

Title: Robust Distribution Alignment for Industrial Anomaly Detection under Distribution Shift

Title: Deep Polycuboid Fitting for Compact 3D Representation of Indoor Scenes

Title: MASS: Mathematical Data Selection via Skill Graphs for Pretraining Large Language Models

Title: GenM$^3$: Generative Pretrained Multi-path Motion Model for Text Conditional Human Motion Generation

Title: A Semantic and Clean-label Backdoor Attack against Graph Convolutional Networks

Title: pFedFair: Towards Optimal Group Fairness-Accuracy Trade-off in Heterogeneous Federated Learning

Title: Covering Cracks in Content Moderation: Delexicalized Distant Supervision for Illicit Drug Jargon Detection

Title: Shushing! Let's Imagine an Authentic Speech from the Silent Video

Title: Prada: Black-Box LLM Adaptation with Private Data on Resource-Constrained Devices

Title: FAVOR-Bench: A Comprehensive Benchmark for Fine-Grained Video Motion Understanding

Title: VisNumBench: Evaluating Number Sense of Multimodal Large Language Models

Title: UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation

Title: MMAIF: Multi-task and Multi-degradation All-in-One for Image Fusion with Language Guidance

Title: Generating Multimodal Driving Scenes via Next-Scene Prediction

Title: ChatStitch: Visualizing Through Structures via Surround-View Unsupervised Deep Image Stitching with Collaborative LLM-Agents

Title: USAM-Net: A U-Net-based Network for Improved Stereo Correspondence and Scene Depth Estimation using Features from a Pre-trained Image Segmentation network

Title: Depth-Aware Range Image-Based Model for Point Cloud Segmentation

Title: Reducing Annotation Burden: Exploiting Image Knowledge for Few-Shot Medical Video Object Segmentation via Spatiotemporal Consistency Relearning

Title: Ultrasound Image-to-Video Synthesis via Latent Dynamic Diffusion Models

Title: Language-based Image Colorization: A Benchmark and Beyond

Title: Taming Flow Matching with Unbalanced Optimal Transport into Fast Pansharpening

Title: One-Shot Medical Video Object Segmentation via Temporal Contrastive Memory Networks

Title: Semi-KAN: KAN Provides an Effective Representation for Semi-Supervised Learning in Medical Image Segmentation

Title: Inspecting the Representation Manifold of Differentially-Private Text

Title: Right Answer, Wrong Score: Uncovering the Inconsistencies of LLM Evaluation in Multiple-Choice Question Answering

Title: LLM Alignment for the Arabs: A Homogenous Culture or Diverse Ones?

Title: Semantic Segmentation of Transparent and Opaque Drinking Glasses with the Help of Zero-shot Learning

Title: OFL: Opportunistic Federated Learning for Resource-Heterogeneous and Privacy-Aware Devices

Title: Manifold Learning for Hyperspectral Images

Title: Exploiting Diffusion Prior for Real-World Image Dehazing with Unpaired Training

Title: Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene

Title: Bridging the Gap: Fusing CNNs and Transformers to Decode the Elegance of Handwritten Arabic Script

Title: Forensics-Bench: A Comprehensive Forgery Detection Benchmark Suite for Large Vision Language Models

Title: Multivariate Gaussian Topic Modelling: A novel approach to discover topics with greater semantic coherence

Title: SPADE: Systematic Prompt Framework for Automated Dialogue Expansion in Machine-Generated Text Detection

Title: ELTEX: A Framework for Domain-Driven Synthetic Data Generation

Title: Single-Step Bidirectional Unpaired Image Translation Using Implicit Bridge Consistency Distillation

Title: Conjuring Positive Pairs for Efficient Unification of Representation Learning and Image Synthesis

Title: A Comprehensive Quantification of Inconsistencies in Memory Dumps

Title: An Investigation of Beam Density on LiDAR Object Detection Performance

Title: Towards Understanding the Safety Boundaries of DeepSeek Models: Evaluation and Findings

Title: Diffusion-Based Forecasting for Uncertainty-Aware Model Predictive Control

Title: Distilling 3D distinctive local descriptors for 6D pose estimation

Title: VIPER: Visual Perception and Explainable Reasoning for Sequential Decision-Making

Title: FedLWS: Federated Learning with Adaptive Layer-wise Weight Shrinking

Title: DeCaFlow: A Deconfounding Causal Generative Model

Title: Exploring Model Editing for LLM-based Aspect-Based Sentiment Classification

Title: Text-Derived Relational Graph-Enhanced Network for Skeleton-Based Action Segmentation

Title: Increasing the Robustness of the Fine-tuned Multilingual Machine-Generated Text Detectors

Title: EmoGRACE: Aspect-based emotion analysis for social media data

Title: Machine learning surrogate models of many-body dispersion interactions in polymer melts

Title: Preference Construction: A Bayesian Interactive Preference Elicitation Framework Based on Monte Carlo Tree Search

Title: ARC: Anchored Representation Clouds for High-Resolution INR Classification

Title: UltraFlwr -- An Efficient Federated Medical and Surgical Object Detection Framework

Title: Global Group Fairness in Federated Learning via Function Tracking

Title: Comparing Llama3 and DeepSeekR1 on Biomedical Text Classification Tasks

Title: Benchmarking Large Language Models for Handwritten Text Recognition

Title: Detect-and-Guide: Self-regulation of Diffusion Models for Safe Text-to-Image Generation via Guideline Token Optimization

Title: DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation

Title: Kolmogorov-Arnold Network for Transistor Compact Modeling

Title: CoE: Chain-of-Explanation via Automatic Visual Concept Circuit Description and Polysemanticity Quantification

Title: Exploring Large Language Models for Word Games:Who is the Spy?

Title: Your Signal, Their Data: An Empirical Privacy Analysis of Wireless-scanning SDKs in Android

Title: BigO(Bench) -- Can LLMs Generate Code with Controlled Time and Space Complexity?

Title: ImputeGAP: A Comprehensive Library for Time Series Imputation

Title: DEPT: Deep Extreme Point Tracing for Ultrasound Image Segmentation

Title: LEGION: Learning to Ground and Explain for Synthetic Image Detection

Title: Learning to quantify graph nodes

Title: MAMM-Refine: A Recipe for Improving Faithfulness in Generation with Multi-Agent Collaboration

Title: TF-TI2I: Training-Free Text-and-Image-to-Image Generation via Multi-Modal Implicit-Context Learning in Text-to-Image Models

Title: EdgeRegNet: Edge Feature-based Multimodal Registration Network between Images and LiDAR Point Clouds

Title: PAPI-Reg: Patch-to-Pixel Solution for Efficient Cross-Modal Registration between LiDAR Point Cloud and Camera Image

Title: TROVE: A Challenge for Fine-Grained Text Provenance via Source Sentence Tracing and Relationship Classification

Title: Test-Time Backdoor Detection for Object Detection Models

Title: DCA: Dividing and Conquering Amnesia in Incremental Object Detection

Title: Inside-Out: Hidden Factual Knowledge in LLMs

Title: SUM Parts: Benchmarking Part-Level Semantic Segmentation of Urban Meshes

Title: TruthLens:A Training-Free Paradigm for DeepFake Detection

Title: SPILL: Domain-Adaptive Intent Clustering based on Selection and Pooling with Large Language Models

Title: SemEval-2025 Task 1: AdMIRe -- Advancing Multimodal Idiomaticity Representation

Title: FedBEns: One-Shot Federated Learning based on Bayesian Ensemble

Title: EfficientLLaVA:Generalizable Auto-Pruning for Large Vision-language Models

Title: Real-world validation of a multimodal LLM-powered pipeline for High-Accuracy Clinical Trial Patient Matching leveraging EHR data

Title: Improving Adversarial Transferability on Vision Transformers via Forward Propagation Refinement

Title: Visual Persona: Foundation Model for Full-Body Human Customization

Title: Learn Your Scales: Towards Scale-Consistent Generative Novel View Synthesis

Title: LIFT: Latent Implicit Functions for Task- and Data-Agnostic Encoding

Title: Visual Position Prompt for MLLM based Visual Grounding

Title: V2X-DG: Domain Generalization for Vehicle-to-Everything Cooperative Perception

Title: MotionStreamer: Streaming Motion Generation via Diffusion-based Autoregressive Model in Causal Latent Space

Title: Evaluating Bias in Retrieval-Augmented Medical Question-Answering Systems

Title: Di$\mathtt{[M]}$O: Distilling Masked Diffusion Models into One-step Generator

Title: From 1,000,000 Users to Every User: Scaling Up Personalized Preference for User-level Alignment

Title: FP4DiT: Towards Effective Floating Point Quantization for Diffusion Transformers

Title: Dynamic Bi-Elman Attention Networks (DBEAN): Dual-Directional Context-Aware Representation Learning for Enhanced Text Classification

Title: Cube: A Roblox View of 3D Intelligence

Title: SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks

Title: Value Profiles for Encoding Human Variation

Title: TULIP: Towards Unified Language-Image Pretraining