2025-05-08

Title: Hierarchical Multi-Label Generation with Probabilistic Level-Constraint

Title: ALFRED: Ask a Large-language model For Reliable ECG Diagnosis

Title: When Reasoning Beats Scale: A 1.5B Reasoning Model Outranks 13B LLMs as Discriminator

Title: Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning

Title: Information Filtering Networks: Theoretical Foundations, Generative Methodologies, and Real-World Applications

Title: Program Semantic Inequivalence Game with Large Language Models

Title: Machine Learning: a Lecture Note

Title: Deep Learning Framework for Infrastructure Maintenance: Crack Detection and High-Resolution Imaging of Infrastructure Surfaces

Title: Call for Action: towards the next generation of symbolic regression benchmark

Title: Diffusion Models are Secretly Exchangeable: Parallelizing DDPMs via Autospeculation

Title: MAISY: Motion-Aware Image SYnthesis for MedicalImage Motion Correction

Title: Learning from Similarity Proportion Loss for Classifying Skeletal Muscle Recovery Stages

Title: DiffPattern-Flex: Efficient Layout Pattern Generation via Discrete Diffusion

Title: DOTA: Deformable Optimized Transformer Architecture for End-to-End Text Recognition with Retrieval-Augmented Generation

Title: S3D: Sketch-Driven 3D Model Generation

Title: A Large Language Model for Feasible and Diverse Population Synthesis

Title: Technology prediction of a 3D model using Neural Network

Title: Bridging Geometry-Coherent Text-to-3D Generation with Multi-View Diffusion Priors and Gaussian Splatting

Title: Non-stationary Diffusion For Probabilistic Time Series Forecasting

Title: MoDE: Mixture of Diffusion Experts for Any Occluded Face Recognition

Title: Riemannian Denoising Diffusion Probabilistic Models

Title: CountDiffusion: Text-to-Image Synthesis with Training-Free Counting-Guidance Diffusion

Title: WDMamba: When Wavelet Degradation Prior Meets Vision Mamba for Image Dehazing

Title: DATA: Multi-Disentanglement based Contrastive Learning for Open-World Semi-Supervised Deepfake Attribution

Title: Localized Diffusion Models for High Dimensional Distributions Generation

Title: RLMiniStyler: Light-weight RL Style Agent for Arbitrary Sequential Neural Style Generation

Title: CAD-Llama: Leveraging Large Language Models for Computer-Aided Design Parametric 3D Model Generation

Title: Efficient Flow Matching using Latent Variables

Title: Defining and Quantifying Creative Behavior in Popular Image Generators

Title: HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation

Title: Text2CT: Towards 3D CT Volume Generation from Free-text Descriptions Using Diffusion Model

Title: Person Recognition at Altitude and Range: Fusion of Face, Body Shape and Gait

Title: On Path to Multimodal Generalist: General-Level and General-Bench