2025-08-07

Title: PLA: Prompt Learning Attack against Text-to-Image Generative Models

Title: From Waveforms to Pixels: A Survey on Audio-Visual Segmentation

Title: TIR-Diffusion: Diffusion-based Thermal Infrared Image Denoising via Latent and Wavelet Domain Optimization

Title: CX-Mind: A Pioneering Multimodal Large Language Model for Interleaved Reasoning in Chest X-ray via Curriculum-Guided Reinforcement Learning

Title: StorySync: Training-Free Subject Consistency in Text-to-Image Generation via Region Harmonization

Title: LLM-Prior: A Framework for Knowledge-Driven Prior Elicitation and Aggregation

Title: Provably Near-Optimal Distributionally Robust Reinforcement Learning in Online Settings

Title: SoilNet: A Multimodal Multitask Model for Hierarchical Classification of Soil Horizons

Title: HPSv3: Towards Wide-Spectrum Human Preference Score

Title: VAE-DNN: Energy-Efficient Trainable-by-Parts Surrogate Model For Parametric Partial Differential Equations

Title: Active Learning and Transfer Learning for Anomaly Detection in Time-Series Data

Title: Point-Based Shape Representation Generation with a Correspondence-Preserving Diffusion Model

Title: Markov Chain Estimation with In-Context Learning

Title: RAVID: Retrieval-Augmented Visual Detection: A Knowledge-Driven Approach for AI-Generated Image Identification

Title: Data and AI governance: Promoting equity, ethics, and fairness in large language models

Title: Transferring Expert Cognitive Models to Social Robots via Agentic Concept Bottleneck Models

Title: CAD-Judge: Toward Efficient Morphological Grading and Verification for Text-to-CAD Generation

Title: $\text{S}^2$Q-VDiT: Accurate Quantized Video Diffusion Transformer with Salient Data and Sparse Token Distillation

Title: FeDaL: Federated Dataset Learning for Time Series Foundation Models

Title: Uni-DocDiff: A Unified Document Restoration Model Based on Diffusion

Title: GM-PRM: A Generative Multimodal Process Reward Model for Multimodal Mathematical Reasoning

Title: Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework

Title: Model Inversion Attacks on Vision-Language Models: Do They Leak What They Learn?

Title: Conditional Latent Diffusion Models for Zero-Shot Instance Segmentation

Title: IDCNet: Guided Video Diffusion for Metric-Consistent RGBD Scene Generation with Precise Camera Control

Title: ICM-Fusion: In-Context Meta-Optimized LoRA Fusion for Multi-Task Adaptation

Title: AD-FM: Multimodal LLMs for Anomaly Detection via Multi-Stage Reasoning and Fine-Grained Reward Optimization

Title: DP-DocLDM: Differentially Private Document Image Generation using Latent Diffusion Models

Title: LayerT2V: Interactive Multi-Object Trajectory Layering for Video Generation

Title: Intention Enhanced Diffusion Model for Multimodal Pedestrian Trajectory Prediction

Title: DocVCE: Diffusion-based Visual Counterfactual Explanations for Document Image Classification

Title: PIS3R: Very Large Parallax Image Stitching via Deep 3D Reconstruction

Title: WSS-CL: Weight Saliency Soft-Guided Contrastive Learning for Efficient Machine Unlearning Image Classification

Title: A Foundation Model for DAS Signal Recognition and Visual Prompt Tuning of the Pre-trained Model for Downstream Tasks

Title: TempFlow-GRPO: When Timing Matters for GRPO in Flow Models

Title: From Split to Share: Private Inference with Distributed Feature Sharing

Title: Chain of Questions: Guiding Multimodal Curiosity in Language Models

Title: VisionTS++: Cross-Modal Time Series Foundation Model with Continual Pre-trained Visual Backbones

Title: Why are LLMs' abilities emergent?

Title: Benchmarking Foundation Models for Mitotic Figure Classification

Title: Automated Generation of Curriculum-Aligned Multiple-Choice Questions for Malaysian Secondary Mathematics Using Generative AI

Title: Cloud Model Characteristic Function Auto-Encoder: Integrating Cloud Model Theory with MMD Regularization for Enhanced Generative Modeling

Title: Automatic LLM Red Teaming

Title: Small transformer architectures for task switching

Title: 4DVD: Cascaded Dense-view Video Diffusion Model for High-quality 4D Content Generation

Title: FedHiP: Heterogeneity-Invariant Personalized Federated Learning Through Closed-Form Solutions

Title: Zero-Residual Concept Erasure via Progressive Alignment in Text-to-Image Model

Title: Emotion Detection Using Conditional Generative Adversarial Networks (cGAN): A Deep Learning Approach

Title: QuantVSR: Low-Bit Post-Training Quantization for Real-World Video Super-Resolution

Title: CALE: Concept-Aligned Embeddings for Both Within-Lemma and Inter-Lemma Sense Differentiation

Title: Two-Way Garment Transfer: Unified Diffusion Framework for Dressing and Undressing Synthesis

Title: One Model For All: Partial Diffusion for Unified Try-On and Try-Off in Any Pose

Title: Drone Detection with Event Cameras

Title: TAlignDiff: Automatic Tooth Alignment assisted by Diffusion-based Transformation Learning

Title: Analyzing and Mitigating Object Hallucination: A Training Bias Perspective

Title: DDTracking: A Deep Generative Framework for Diffusion MRI Tractography with Streamline Local-Global Spatiotemporal Modeling

Title: GraphProp: Training the Graph Foundation Models using Graph Properties

Title: Multitask Learning with Stochastic Interpolants

Title: CaPulse: Detecting Anomalies by Tuning in to the Causal Rhythms of Time Series

Title: EncQA: Benchmarking Vision-Language Models on Visual Encodings for Charts

Title: HierarchicalPrune: Position-Aware Compression for Large-Scale Diffusion Models