2025-06-10

Title: Facial Foundational Model Advances Early Warning of Coronary Artery Disease from Live Videos with DigitalShadow

Title: dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching

Title: Reward Is Enough: LLMs Are In-Context Reinforcement Learners

Title: From Transformers to Large Language Models: A systematic review of AI applications in the energy sector towards Agentic Digital Twins

Title: Beyond the Norm: A Survey of Synthetic Data Generation for Rare Events

Title: Unintended Harms of Value-Aligned LLMs: Psychological and Empirical Insights

Title: TimeWak: Temporal Chained-Hashing Watermark for Time Series Data

Title: A Systematic Review of Poisoning Attacks Against Large Language Models

Title: Textile Analysis for Recycling Automation using Transfer Learning and Zero-Shot Foundation Models

Title: A Deep Learning Approach for Facial Attribute Manipulation and Reconstruction in Surveillance and Reconnaissance

Title: EV-LayerSegNet: Self-supervised Motion Segmentation using Event Cameras

Title: TrustConnect: An In-Vehicle Anomaly Detection Framework through Topology-Based Trust Rating

Title: Non-Intrusive Load Monitoring Based on Image Load Signatures and Continual Learning

Title: Learning Robust Heterogeneous Graph Representations via Contrastive-Reconstruction under Sparse Semantics

Title: Breaking Data Silos: Towards Open and Scalable Mobility Foundation Models via Generative Continual Learning

Title: MarginSel : Max-Margin Demonstration Selection for LLMs

Title: Dynamic and Parametric Retrieval-Augmented Generation

Title: Active Contour Models Driven by Hyperbolic Mean Curvature Flow for Image Segmentation

Title: LADSG: Label-Anonymized Distillation and Similar Gradient Substitution for Label Privacy in Vertical Federated Learning

Title: Training-Free Identity Preservation in Stylized Image Generation Using Diffusion Models

Title: Label-semantics Aware Generative Approach for Domain-Agnostic Multilabel Classification

Title: IMPA-HGAE:Intra-Meta-Path Augmented Heterogeneous Graph Autoencoder

Title: Path Integral Optimiser: Global Optimisation via Neural Schrödinger-Föllmer Diffusion

Title: Beyond Classification: Towards Speech Emotion Reasoning with Multitask AudioLLMs

Title: Hi-LSplat: Hierarchical 3D Language Gaussian Splatting

Title: Controllable Coupled Image Generation via Diffusion Models

Title: EndoARSS: Adapting Spatially-Aware Foundation Model for Efficient Activity Recognition and Semantic Segmentation in Endoscopic Surgery

Title: Harnessing Vision-Language Models for Time Series Anomaly Detection

Title: Position Prediction Self-Supervised Learning for Multimodal Satellite Imagery Semantic Segmentation

Title: Face recognition on point cloud with cgan-top for denoising

Title: Can In-Context Reinforcement Learning Recover From Reward Poisoning Attacks?

Title: LaTtE-Flow: Layerwise Timestep-Expert Flow-based Transformer

Title: Towards Physics-informed Diffusion for Anomaly Detection in Trajectories

Title: ModelForge: Using GenAI to Improve the Development of Security Protocols

Title: UNO: Unified Self-Supervised Monocular Odometry for Platform-Agnostic Deployment

Title: Mixture Experts with Test-Time Self-Supervised Aggregation for Tabular Imbalanced Regression

Title: Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Title: FairPFN: A Tabular Foundation Model for Causal Fairness

Title: A Layered Self-Supervised Knowledge Distillation Framework for Efficient Multimodal Learning on the Edge

Title: E-BATS: Efficient Backpropagation-Free Test-Time Adaptation for Speech Foundation Models

Title: Filling the Missings: Spatiotemporal Data Imputation by Conditional Diffusion

Title: Hi-VAE: Efficient Video Autoencoding with Global and Detailed Motion

Title: Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models

Title: GGBall: Graph Generative Model on Poincaré Ball

Title: TV-LiVE: Training-Free, Text-Guided Video Editing via Layer Informed Vitality Exploitation

Title: SDE-SQL: Enhancing Text-to-SQL Generation in Large Language Models via Self-Driven Exploration with SQL Probes

Title: Promoting Ensemble Diversity with Interactive Bayesian Distributional Robustness for Fine-tuning Foundation Models

Title: A Stable Whitening Optimizer for Efficient Neural Network Training

Title: Question Answering under Temporal Conflict: Evaluating and Organizing Evolving Knowledge with LLMs

Title: From Generation to Generalization: Emergent Few-Shot Learning in Video Diffusion Models

Title: Multi-Step Guided Diffusion for Image Restoration on Edge Devices: Toward Lightweight Perception in Embodied AI

Title: Pre-trained Large Language Models Learn Hidden Markov Models In-context

Title: Paged Attention Meets FlexAttention: Unlocking Long-Context Efficiency in Deployed Inference

Title: Generative Modeling of Networked Time-Series via Transformer Architectures

Title: DEF: Diffusion-augmented Ensemble Forecasting

Title: Reward Model Interpretability via Optimal and Pessimal Tokens

Title: Generative Models at the Frontier of Compression: A Survey on Generative Face Video Coding

Title: Enhanced Consistency Bi-directional GAN(CBiGAN) for Malware Anomaly Detection

Title: DINO-CoDT: Multi-class Collaborative Detection and Tracking with Vision Foundation Models

Title: Anomaly Detection and Early Warning Mechanism for Intelligent Monitoring Systems in Multi-Cloud Environments Based on LLM

Title: LG-ANNA-Embedding technical report

Title: Federated In-Context Learning: Iterative Refinement for Improved Answer Quality

Title: PhysiInter: Integrating Physical Mapping for High-Fidelity Human Interaction Generation

Title: ProteinZero: Self-Improving Protein Generation via Online Reinforcement Learning

Title: Circumventing Backdoor Space via Weight Symmetry

Title: Drive Any Mesh: 4D Latent Diffusion for Mesh Deformation from Video

Title: Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency

Title: DeRAGEC: Denoising Named Entity Candidates with Synthetic Rationale for ASR Error Correction

Title: APTOS-2024 challenge report: Generation of synthetic 3D OCT images from fundus photographs

Title: Cross-channel Perception Learning for H&E-to-IHC Virtual Staining

Title: LLM-driven Indoor Scene Layout Generation via Scaled Human-aligned Data Synthesis and Multi-Stage Preference Optimization

Title: Super Encoding Network: Recursive Association of Multi-Modal Encoders for Video Understanding

Title: MIRA: Medical Time Series Foundation Model for Real-World Health Data

Title: MalGEN: A Generative Agent Framework for Modeling Malicious Software in Cybersecurity

Title: Explore the vulnerability of black-box models via diffusion models

Title: SurgBench: A Unified Large-Scale Benchmark for Surgical Video Analysis

Title: ProSplat: Improved Feed-Forward 3D Gaussian Splatting for Wide-Baseline Sparse Views

Title: NOVA3D: Normal Aligned Video Diffusion Model for Single Image to 3D Generation

Title: Evaluating Robustness in Latent Diffusion Models via Embedding Level Augmentation

Title: Consistent Video Editing as Flow-Driven Image-to-Video Generation

Title: AssetDropper: Asset Extraction via Diffusion Models with Reward-Driven Optimization

Title: Flow-Anything: Learning Real-World Optical Flow Estimation from Large-Scale Single-view Images

Title: Difference Inversion: Interpolate and Isolate the Difference with Token Consistency for Image Analogy Generation

Title: Comparing Credit Risk Estimates in the Gen-AI Era

Title: Language-Vision Planner and Executor for Text-to-Visual Reasoning

Title: Re-ranking Reasoning Context with Tree Search Makes Large Vision-Language Models Stronger

Title: Self-Cascaded Diffusion Models for Arbitrary-Scale Image Super-Resolution

Title: WebUIBench: A Comprehensive Benchmark for Evaluating Multimodal Large Language Models in WebUI-to-Code

Title: Accelerating Diffusion Models in Offline RL via Reward-Aware Consistency Trajectory Distillation

Title: R3D2: Realistic 3D Asset Insertion via Diffusion for Autonomous Driving Simulation

Title: Diffusion models under low-noise regime

Title: Jarzynski Reweighting and Sampling Dynamics for Training Energy-Based Models: Theoretical Analysis of Different Transition Kernels

Title: VIVAT: Virtuous Improving VAE Training through Artifact Mitigation

Title: Diffusion Counterfactual Generation with Semantic Abduction

Title: EgoM2P: Egocentric Multimodal Multitask Pretraining

Title: Video Unlearning via Low-Rank Refusal Vector

Title: FunDiff: Diffusion Models over Function Spaces for Physics-Informed Generative Modeling

Title: Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces

Title: CausalPFN: Amortized Causal Effect Estimation via In-Context Learning

Title: A Generative Physics-Informed Reinforcement Learning-Based Approach for Construction of Representative Drive Cycle

Title: Mimicking or Reasoning: Rethinking Multi-Modal In-Context Learning in Vision-Language Models

Title: Cost-Optimal Active AI Model Evaluation

Title: Neural Tangent Kernel Analysis to Probe Convergence in Physics-informed Neural Solvers: PIKANs vs. PINNs

Title: CXR-LT 2024: A MICCAI challenge on long-tailed, multi-label, and zero-shot disease classification from chest X-ray

Title: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers

Title: Generative Modeling of Weights: Generalization or Memorization?

Title: MADFormer: Mixed Autoregressive and Diffusion Transformers for Continuous Image Generation

Title: Dynamic View Synthesis as an Inverse Problem

Title: Dreamland: Controllable World Creation with Simulator and Generative Models

Title: Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion

Title: StableMTL: Repurposing Latent Diffusion Models for Multi-Task Learning from Partially Annotated Synthetic Datasets