2025-03-18

Title: Explainable Sentiment Analysis with DeepSeek-R1: Performance, Efficiency, and Few-Shot Learning

Title: Automating Mathematical Proof Generation Using Large Language Model Agents and Knowledge Graphs

Title: A Survey of Direct Preference Optimization

Title: Fine-Tuning Diffusion Generative Models via Rich Preference Optimization

Title: BACE-RUL: A Bi-directional Adversarial Network with Covariate Encoding for Machine Remaining Useful Life Prediction

Title: Industrial-Grade Sensor Simulation via Gaussian Splatting: A Modular Framework for Scalable Editing and Full-Stack Validation

Title: UBMF: Uncertainty-Aware Bayesian Meta-Learning Framework for Fault Diagnosis with Imbalanced Industrial Data

Title: StyleMorpheus: A Style-Based 3D-Aware Morphable Face Model

Title: Bridging the LLM Accessibility Divide? Performance, Fairness, and Cost of Closed versus Open LLMs for Automated Essay Scoring

Title: How Can Time Series Analysis Benefit From Multiple Modalities? A Survey and Outlook

Title: Trust Under Siege: Label Spoofing Attacks against Machine Learning for Android Malware Detection

Title: Test-Time Training Provably Improves Transformers as In-context Learners

Title: Towards a Unified Copernicus Foundation Model for Earth Vision

Title: Spatio-temporal Fourier Transformer (StFT) for Long-term Dynamics Prediction

Title: Upcycling Text-to-Image Diffusion Models for Multi-Task Capabilities

Title: REGEN: A Dataset and Benchmarks with Natural Language Critiques and Narratives

Title: Generating a Biometrically Unique and Realistic Iris Database

Title: Att-Adapter: A Robust and Precise Domain-Specific Multi-Attributes T2I Diffusion Adapter via Conditional Variational Autoencoder

Title: Your Text Encoder Can Be An Object-Level Watermarking Controller

Title: Winning the MIDST Challenge: New Membership Inference Attacks on Diffusion Models for Tabular Data Synthesis

Title: QDM: Quadtree-Based Region-Adaptive Sparse Diffusion Models for Efficient Image Super-Resolution

Title: Compose Your Aesthetics: Empowering Text-to-Image Models with the Principles of Art

Title: Leveraging Motion Information for Better Self-Supervised Video Correspondence Learning

Title: Unsupervised Graph Anomaly Detection via Multi-Hypersphere Heterophilic Graph Learning

Title: TACO: Taming Diffusion for in-the-wild Video Amodal Completion

Title: Tailor: An Integrated Text-Driven CG-Ready Human and Garment Generation System

Title: A Comprehensive Survey on Knowledge Distillation

Title: Temporally Consistent Mitral Annulus Measurements from Sparse Annotations in Echocardiographic Videos

Title: A Speech-to-Video Synthesis Approach Using Spatio-Temporal Diffusion for Vocal Tract MRI

Title: Robust Isolation Forest using Soft Sparse Random Projection and Valley Emphasis Method

Title: DiffGAP: A Lightweight Diffusion Module in Contrastive Space for Bridging Cross-Model Gap

Title: Probabilistic Graph Circuits: Deep Generative Models for Tractable Probabilistic Inference over Graphs

Title: SEAL: Semantic Aware Image Watermarking

Title: STAY Diffusion: Styled Layout Diffusion Model for Diverse Layout-to-Image Generation

Title: Cross-Modal Diffusion for Biomechanical Dynamical Systems Through Local Manifold Alignment

Title: Minuscule Cell Detection in AS-OCT Images with Progressive Field-of-View Focusing

Title: Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection

Title: Toward Foundation Models for Online Complex Event Detection in CPS-IoT: A Case Study

Title: Integrating Chain-of-Thought and Retrieval Augmented Generation Enhances Rare Disease Diagnosis from Clinical Notes

Title: The Lucie-7B LLM and the Lucie Training Dataset: Open resources for multilingual language generation

Title: ProbDiffFlow: An Efficient Learning-Free Framework for Probabilistic Single-Image Optical Flow Estimation

Title: Localized Concept Erasure for Text-to-Image Diffusion Models Using Training-Free Gated Low-Rank Adaptation

Title: Pathology Image Restoration via Mixture of Prompts

Title: MExD: An Expert-Infused Diffusion Model for Whole-Slide Image Classification

Title: SAM2-ELNet: Label Enhancement and Automatic Annotation for Remote Sensing Segmentation

Title: Diffusion-based Synthetic Data Generation for Visible-Infrared Person Re-Identification

Title: KDSelector: A Knowledge-Enhanced and Data-Efficient Model Selector Learning Framework for Time Series Anomaly Detection

Title: Cross-Modal Consistency Learning for Sign Language Recognition

Title: Segment Any-Quality Images with Generative Latent Space Enhancement

Title: Multi Activity Sequence Alignment via Implicit Clustering

Title: Towards Suturing World Models: Learning Predictive Models for Robotic Surgical Tasks

Title: Time-EAPCR-T: A Universal Deep Learning Approach for Anomaly Detection in Industrial Equipment

Title: Debiasing Diffusion Model: Enhancing Fairness through Latent Representation Learning in Stable Diffusion Model

Title: Diffusion on Graph: Augmentation of Graph Structure for Node Classification

Title: GAN-Based Single-Stage Defense for Traffic Sign Classification Under Adversarial Patch Attack

Title: BalancedDPO: Adaptive Multi-Metric Alignment

Title: Personalize Anything for Free with Diffusion Transformer

Title: SynLlama: Generating Synthesizable Molecules and Their Analogs with Large Language Models

Title: LATINO-PRO: LAtent consisTency INverse sOlver with PRompt Optimization

Title: FW-Merging: Scaling Model Merging with Frank-Wolfe Optimization

Title: UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing

Title: AnyCalib: On-Manifold Learning for Model-Agnostic Single-View Camera Calibration

Title: GenStereo: Towards Open-World Generation of Stereo Images and Unsupervised Matching

Title: In-Context Linear Regression Demystified: Training Dynamics and Mechanistic Interpretability of Multi-Head Softmax Attention

Title: VasTSD: Learning 3D Vascular Tree-state Space Diffusion Model for Angiography Synthesis

Title: RAG-RL: Advancing Retrieval-Augmented Generation via RL and Curriculum Learning

Title: A Survey on Human Interaction Motion Generation

Title: TransDiff: Diffusion-Based Method for Manipulating Transparent Objects Using a Single RGB-D Image

Title: SAM2 for Image and Video Segmentation: A Comprehensive Survey

Title: A Reinforcement Learning-Driven Transformer GAN for Molecular Generation

Title: From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Calibration

Title: DreamLayer: Simultaneous Multi-Layer Generation via Diffusion Mode

Title: Towards Scalable Foundation Model for Multi-modal and Hyperspectral Geospatial Data

Title: UniReg: Foundation Model for Controllable Medical Image Registration

Title: An interpretable approach to automating the assessment of biofouling in video footage

Title: UCF-Crime-DVS: A Novel Event-Based Dataset for Video Anomaly Detection with Spiking Neural Networks

Title: MFP-CLIP: Exploring the Efficacy of Multi-Form Prompts for Zero-Shot Industrial Anomaly Detection

Title: AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction

Title: Efficient Action-Constrained Reinforcement Learning via Acceptance-Rejection Method and Augmented MDPs

Title: Frame-wise Conditioning Adaptation for Fine-Tuning Diffusion Models in Text-to-Video Prediction

Title: HIS-GPT: Towards 3D Human-In-Scene Multimodal Understanding

Title: Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait

Title: Training Video Foundation Models with NVIDIA NeMo

Title: Optimal Denoising in Score-Based Generative Models: The Role of Data Regularity

Title: Prospects for Mitigating Spectral Variability in Tropical Species Classification Using Self-Supervised Learning

Title: A Multi-Stage Framework with Taxonomy-Guided Reasoning for Occupation Classification Using Large Language Models

Title: TFDM: Time-Variant Frequency-Based Point Cloud Diffusion with Mamba

Title: Overview of the NTCIR-18 Automatic Evaluation of LLMs (AEOLLM) Task

Title: InsightDrive: Insight Scene Representation for End-to-End Autonomous Driving

Title: Bitcoin Battle: Burning Bitcoin for Geopolitical Fun and Profit

Title: MaskSDM with Shapley values to improve flexibility, robustness, and explainability in species distribution modeling

Title: Do Vision Models Develop Human-Like Progressive Difficulty Understanding?

Title: Rewards Are Enough for Fast Photo-Realistic Text-to-image Generation

Title: Towards Better Sample Efficiency in Multi-Agent Reinforcement Learning via Exploration

Title: REPA: Russian Error Types Annotation for Evaluating Text Generation and Judgment Capabilities

Title: DTGBrepGen: A Novel B-rep Generative Model through Decoupling Topology and Geometry

Title: 3D Human Interaction Generation: A Survey

Title: ChainHOI: Joint-based Kinematic Chain Modeling for Human-Object Interaction Generation

Title: Patient-specific radiomic feature selection with reconstructed healthy persona of knee MR images

Title: Language-guided Open-world Video Anomaly Detection

Title: DeGauss: Dynamic-Static Decomposition with Gaussian Splatting for Distractor-free 3D Reconstruction

Title: Triad: Empowering LMM-based Anomaly Detection with Vision Expert-guided Visual Tokenizer and Manufacturing Process

Title: Deep Learning Advancements in Anomaly Detection: A Comprehensive Survey

Title: MedLoRD: A Medical Low-Resource Diffusion Model for High-Resolution 3D CT Image Synthesis

Title: HoloGest: Decoupled Diffusion and Motion Priors for Generating Holisticly Expressive Co-speech Gestures

Title: FlexWorld: Progressively Expanding 3D Scenes for Flexiable-View Synthesis

Title: Graph Generative Models Evaluation with Masked Autoencoder

Title: Generative Gaussian Splatting: Generating 3D Scenes with Video Diffusion Priors

Title: MagicDistillation: Weak-to-Strong Video Distillation for Large-Scale Portrait Few-Step Synthesis

Title: Edit Transfer: Learning Image Editing via Vision In-Context Relations

Title: One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation

Title: SyncDiff: Diffusion-based Talking Head Synthesis with Bottlenecked Temporal Visual Prior for Improved Synchronization

Title: Infinite Mobility: Scalable High-Fidelity Synthesis of Articulated Objects via Procedural Generation

Title: Measuring In-Context Computation Complexity via Hidden State Prediction

Title: BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing

Title: Amodal3R: Amodal 3D Reconstruction from Occluded 2D Images