2025-04-22

Title: Ising Models with Hidden Markov Structure: Applications to Probabilistic Inference in Machine Learning

Title: Generative System Dynamics in Recurrent Neural Networks

Title: Entropy Rectifying Guidance for Diffusion and Flow Models

Title: Fashion-RAG: Multimodal Fashion Image Editing via Retrieval-Augmented Generation

Title: LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models

Title: A synthetic dataset of French electric load curves with temperature conditioning

Title: Point-Driven Interactive Text and Image Layer Editing Using Diffusion Models

Title: PEFT A2Z: Parameter-Efficient Fine-Tuning Survey for Large Language and Vision Models

Title: BMRL: Bi-Modal Guided Multi-Perspective Representation Learning for Zero-Shot Deepfake Attribution

Title: HFBRI-MAE: Handcrafted Feature Based Rotation-Invariant Masked Autoencoder for 3D Point Cloud Analysis

Title: Rethinking Target Label Conditioning in Adversarial Attacks: A 2D Tensor-Guided Generative Approach

Title: Locate 3D: Real-World Object Localization via Self-Supervised Learning in 3D

Title: Enhancing Multimodal In-Context Learning for Image Classification through Coreset Optimization

Title: Learning Joint ID-Textual Representation for ID-Preserving Image Synthesis

Title: DConAD: A Differencing-based Contrastive Representation Learning Framework for Time Series Anomaly Detection

Title: Decomposition-based multi-scale transformer framework for time series anomaly detection

Title: Real-IAD D3: A Real-World 2D/Pseudo-3D/3D Dataset for Industrial Anomaly Detection

Title: Exploring Modality Guidance to Enhance VFM-based Feature Fusion for UDA in 3D Semantic Segmentation

Title: Single Document Image Highlight Removal via A Large-Scale Real-World Dataset and A Location-Aware Network

Title: A Pre-Training and Adaptive Fine-Tuning Framework for Graph Anomaly Detection

Title: Visual Consensus Prompting for Co-Salient Object Detection

Title: Cross-attention for State-based model RWKV-7

Title: Generative emulation of chaotic dynamics with coherent prior

Title: Text-Audio-Visual-conditioned Diffusion Model for Video Saliency Prediction

Title: From Missing Pieces to Masterpieces: Image Completion with Context-Adaptive Diffusion

Title: Learning and Generating Diverse Residential Load Patterns Using GAN with Weakly-Supervised Training and Weight Selection

Title: Balancing Privacy and Action Performance: A Penalty-Driven Approach to Image Anonymization

Title: Exploring Generalizable Pre-training for Real-world Change Detection via Geometric Estimation

Title: Visual Prompting for One-shot Controllable Video Editing without Inversion

Title: Manipulating Multimodal Agents via Cross-Modal Prompt Injection

Title: Integrating Single-Cell Foundation Models with Graph Neural Networks for Drug Response Prediction

Title: Accelerating LLM Inference with Flexible N:M Sparsity via A Fully Digital Compute-in-Memory Accelerator

Title: Hydra: An Agentic Reasoning Approach for Enhancing Adversarial Robustness and Mitigating Hallucinations in Vision-Language Models

Title: SphereDiff: Tuning-free Omnidirectional Panoramic Image and Video Generation via Spherical Latent Representation

Title: Causal Disentanglement for Robust Long-tail Medical Image Generation

Title: LGD: Leveraging Generative Descriptions for Zero-Shot Referring Image Segmentation

Title: sEEG-based Encoding for Sentence Retrieval: A Contrastive Learning Approach to Brain-Language Alignment

Title: Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis

Title: Vision-Centric Representation-Efficient Fine-Tuning for Robust Universal Foreground Segmentation

Title: DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning

Title: SUDO: Enhancing Text-to-Image Diffusion Models with Self-Supervised Direct Preference Optimization

Title: FlowLoss: Dynamic Flow-Conditioned Loss Strategy for Video Diffusion Models

Title: VGNC: Reducing the Overfitting of Sparse-view 3DGS via Validation-guided Gaussian Number Control

Title: REDEditing: Relationship-Driven Precise Backdoor Poisoning on Text-to-Image Diffusion Models

Title: Using street view imagery and deep generative modeling for estimating the health of urban forests

Title: Generative Auto-Bidding with Value-Guided Explorations

Title: Harnessing Generative LLMs for Enhanced Financial Event Entity Extraction Performance

Title: Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens

Title: Evaluating Temporal Plasticity in Foundation Time Series Models for Incremental Fine-tuning

Title: Can We Ignore Labels In Out of Distribution Detection?

Title: Advancing Video Anomaly Detection: A Bi-Directional Hybrid Framework for Enhanced Single- and Multi-Task Approaches

Title: Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions

Title: Novel Concept-Oriented Synthetic Data approach for Training Generative AI-Driven Crystal Grain Analysis Using Diffusion Model

Title: When Cloud Removal Meets Diffusion Model in Remote Sensing

Title: Enhanced Data-driven Topology Design Methodology with Multi-level Mesh and Correlation-based Mutation for Stress-related Multi-objective Optimization

Title: Edge-boosted graph learning for functional brain connectivity analysis

Title: Verifying Robust Unlearning: Probing Residual Knowledge in Unlearned Models

Title: A Basic Evaluation of Neural Networks Trained with the Error Diffusion Learning Algorithm

Title: What Lurks Within? Concept Auditing for Shared Diffusion Models at Scale

Title: Protecting Your Voice: Temporal-aware Robust Watermarking

Title: Memory-Augmented Dual-Decoder Networks for Multi-Class Unsupervised Anomaly Detection

Title: Latent Bayesian Optimization via Autoregressive Normalizing Flows

Title: Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation

Title: GenCLIP: Generalizing CLIP Prompts for Zero-shot Anomaly Detection

Title: TWIG: Two-Step Image Generation using Segmentation Masks in Diffusion Models

Title: PIV-FlowDiffuser: Transfer-learning-based denoising diffusion models for PIV

Title: Cyc3D: Fine-grained Controllable 3D Generation via Cycle Consistency Regularization

Title: RealisDance-DiT: Simple yet Strong Baseline towards Controllable Character Animation in the Wild

Title: Insert Anything: Image Insertion via In-Context Editing in DiT

Title: LLMs as Data Annotators: How Close Are We to Human Performance

Title: Gaussian Shading++: Rethinking the Realistic Deployment Challenge of Performance-Lossless Image Watermark for Diffusion Models

Title: DyST-XL: Dynamic Layout Planning and Content Control for Compositional Text-to-Video Generation

Title: SOLIDO: A Robust Watermarking Method for Speech Synthesis via Low-Rank Adaptation

Title: Structure-guided Diffusion Transformer for Low-Light Image Enhancement

Title: VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation

Title: Fast-Slow Co-advancing Optimizer: Toward Harmonious Adversarial Training of GAN

Title: Landmark-Free Preoperative-to-Intraoperative Registration in Laparoscopic Liver Resection

Title: Acquire and then Adapt: Squeezing out Text-to-Image Model for Image Restoration

Title: The Synthetic Imputation Approach: Generating Optimal Synthetic Texts For Underrepresented Categories In Supervised Classification Tasks

Title: DSPO: Direct Semantic Preference Optimization for Real-World Image Super-Resolution

Title: FaceCraft4D: Animated 3D Facial Avatar Generation from a Single Image

Title: Tiger200K: Manually Curated High Visual Quality Video Dataset from UGC Platform

Title: Automated Measurement of Eczema Severity with Self-Supervised Learning

Title: M$^2$AD: Multi-Sensor Multi-System Anomaly Detection through Global Scoring and Calibrated Thresholding

Title: Evaluating Judges as Evaluators: The JETTS Benchmark of LLM-as-Judges as Test-Time Scaling Evaluators

Title: Bringing Diversity from Diffusion Models to Semantic-Guided Face Asset Generation

Title: Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction

Title: Diffusion Bridge Models for 3D Medical Image Translation

Title: StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians