2025-08-12

Title: Retrieval augmented generation based dynamic prompting for few-shot biomedical named entity recognition using large language models

Title: DiTalker: A Unified DiT-based Framework for High-Quality and Speaking Styles Controllable Portrait Animation

Title: MILD: Multi-Layer Diffusion Strategy for Complex and Precise Multi-IP Aware Human Erasing

Title: Factor Augmented Supervised Learning with Text Embeddings

Title: Slice or the Whole Pie? Utility Control for AI Models

Title: Semi-Supervised Supply Chain Fraud Detection with Unsupervised Pre-Filtering

Title: GFlowNets for Learning Better Drug-Drug Interaction Representations

Title: Generative Artificial Intelligence Extracts Structure-Function Relationships from Plants for New Materials

Title: Local Diffusion Models and Phases of Data Distributions

Title: CycleDiff: Cycle Diffusion Models for Unpaired Image-to-image Translation

Title: Segmented Confidence Sequences and Multi-Scale Adaptive Confidence Segments for Anomaly Detection in Nonstationary Time Series

Title: Privacy-Preserving Tabular Synthetic Data Generation Using TabularARGN

Title: Towards Robust Red-Green Watermarking for Autoregressive Image Generators

Title: In-Context Reinforcement Learning via Communicative World Models

Title: Testing the Limits of Machine Translation from One Book

Title: Restage4D: Reanimating Deformable 3D Reconstruction from a Single Video

Title: Mode-Aware Non-Linear Tucker Autoencoder for Tensor-based Unsupervised Learning

Title: Offline-to-Online Reinforcement Learning with Classifier-Free Diffusion Generation

Title: Towards Effective Prompt Stealing Attack against Text-to-Image Diffusion Models

Title: MultiRef: Controllable Image Generation with Multiple Visual References

Title: MMReID-Bench: Unleashing the Power of MLLMs for Effective and Versatile Person Re-identification

Title: CannyEdit: Selective Canny Control and Dual-Prompt Guidance for Training-Free Image Editing

Title: Structure-Preserving Digital Twins via Conditional Neural Whitney Forms

Title: WeatherDiffusion: Weather-Guided Diffusion Model for Forward and Inverse Rendering

Title: S2-UniSeg: Fast Universal Agglomerative Pooling for Scalable Segment Anything without Supervision

Title: Spatio-Temporal Conditional Diffusion Models for Forecasting Future Multiple Sclerosis Lesion Masks Conditioned on Treatments

Title: HiMat: DiT-based Ultra-High Resolution SVBRDF Generation

Title: Vec2Summ: Text Summarization via Probabilistic Sentence Embeddings

Title: TerraMAE: Learning Spatial-Spectral Representations from Hyperspectral Earth Observation Data via Adaptive Masked Autoencoders

Title: Trustworthy Medical Imaging with Large Language Models: A Study of Hallucinations Across Modalities

Title: A Stage-Aware Mixture of Experts Framework for Neurodegenerative Disease Progression Modelling

Title: 3DGS-VBench: A Comprehensive Video Quality Evaluation Benchmark for 3DGS Compression

Title: Towards High-Order Mean Flow Generative Models: Feasibility, Expressivity, and Provably Efficient Criteria

Title: Perceptual Evaluation of GANs and Diffusion Models for Generating X-rays

Title: Intention-Aware Diffusion Model for Pedestrian Trajectory Prediction

Title: SketchAnimator: Animate Sketch via Motion Customization of Text-to-Video Diffusion Models

Title: CoopDiff: Anticipating 3D Human-object Interactions via Contact-consistent Decoupled Diffusion

Title: Large-scale Multi-sequence Pretraining for Generalizable MRI Analysis in Versatile Clinical Applications

Title: Gradient Surgery for Safe LLM Fine-Tuning

Title: What One Cannot, Two Can: Two-Layer Transformers Provably Represent Induction Heads on Any-Order Markov Chains

Title: Neural Bridge Processes

Title: HaDM-ST: Histology-Assisted Differential Modeling for Spatial Transcriptomics Generation

Title: Causal Negative Sampling via Diffusion Model for Out-of-Distribution Recommendation

Title: Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers

Title: Prompt Tuning for Few-Shot Continual Learning Named Entity Recognition

Title: When Is Prior Knowledge Helpful? Exploring the Evaluation and Selection of Unsupervised Pretext Tasks from a Neuro-Symbolic Perspective

Title: Finite-Time Convergence Analysis of ODE-based Generative Models for Stochastic Interpolants

Title: SODiff: Semantic-Oriented Diffusion Model for JPEG Compression Artifacts Removal

Title: DIP-GS: Deep Image Prior For Gaussian Splatting Sparse View Recovery

Title: Think Before You Talk: Enhancing Meaningful Dialogue Generation in Full-Duplex Speech Language Models with Planning-Inspired Text Guidance

Title: Tight Bounds for Schrödinger Potential Estimation in Unpaired Image-to-Image Translation Problems

Title: ForensicsSAM: Toward Robust and Unified Image Forgery Detection and Localization Resisting to Adversarial Attack

Title: CLUE: Leveraging Low-Rank Adaptation to Capture Latent Uncovered Evidence for Image Forgery Localization

Title: Levarging Learning Bias for Noisy Anomaly Detection

Title: From Field to Drone: Domain Drift Tolerant Automated Multi-Species and Damage Plant Semantic Segmentation for Herbicide Trials

Title: Exploring Multimodal Diffusion Transformers for Enhanced Prompt-based Image Editing

Title: Enhanced Generative Structure Prior for Chinese Text Image Super-resolution

Title: Splat4D: Diffusion-Enhanced 4D Gaussian Splatting for Temporally and Spatially Consistent Content Creation

Title: Towards Theoretical Understanding of Transformer Test-Time Computing: Investigation on In-Context Linear Regression

Title: Exploiting Layer Normalization Fine-tuning in Visual Transformer Foundation Models for Classification

Title: When and how can inexact generative models still sample from the data manifold?

Title: Keyword-Centric Prompting for One-Shot Event Detection with Self-Generated Rationale Enhancements

Title: LaVieID: Local Autoregressive Diffusion Transformers for Identity-Preserving Video Creation

Title: X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning

Title: Enhancing Egocentric Object Detection in Static Environments using Graph-based Spatial Anomaly Detection and Correction

Title: Efficient Approximate Posterior Sampling with Annealed Langevin Monte Carlo

Title: LaRender: Training-Free Occlusion Control in Image Generation via Latent Rendering

Title: GLiClass: Generalist Lightweight Model for Sequence Classification Tasks

Title: AIS-LLM: A Unified Framework for Maritime Trajectory Prediction, Anomaly Detection, and Collision Risk Assessment with Explainable Forecasting

Title: DiffVC-OSD: One-Step Diffusion-based Perceptual Neural Video Compression Framework

Title: Make Your MoVe: Make Your 3D Contents by Adapting Multi-View Diffusion Models to External Editing

Title: Enhancing Small-Scale Dataset Expansion with Triplet-Connection-based Sample Re-Weighting

Title: Grouped Speculative Decoding for Autoregressive Image Generation

Title: Correspondence as Video: Test-Time Adaption on SAM2 for Reference Segmentation in the Wild

Title: Sparse Probabilistic Graph Circuits

Title: Dream4D: Lifting Camera-Controlled I2V towards Spatiotemporally Consistent 4D Generation

Title: DiTVR: Zero-Shot Diffusion Transformer for Video Restoration

Title: Architectural Co-Design for Zero-Shot Anomaly Detection: Decoupling Representation and Dynamically Fusing Features in CLIP

Title: CATP: Contextually Adaptive Token Pruning for Efficient and Enhanced Multimodal In-Context Learning

Title: Not Yet AlphaFold for the Mind: Evaluating Centaur as a Synthetic Participant

Title: Stand-In: A Lightweight and Plug-and-Play Identity Control for Video Generation

Title: Diffusing the Blind Spot: Uterine MRI Synthesis with Diffusion Models

Title: Generative Video Matting

Title: Safeguarding Generative AI Applications in Preclinical Imaging through Hybrid Anomaly Detection

Title: Score Augmentation for Diffusion Models

Title: Understanding Syntactic Generalization in Structure-inducing Language Models

Title: Prompt-Guided Relational Reasoning for Social Behavior Understanding with Vision Foundation Models

Title: Robust Anomaly Detection in O-RAN: Leveraging LLMs against Data Manipulation Attacks

Title: IPBA: Imperceptible Perturbation Backdoor Attack in Federated Self-Supervised Learning

Title: S^2VG: 3D Stereoscopic and Spatial Video Generation via Denoising Frame Matrix

Title: Matrix-3D: Omnidirectional Explorable 3D World Generation

Title: Assessing LLM Text Detection in Educational Contexts: Does Human Contribution Affect Detection?

Title: TBAC-UniImage: Unified Understanding and Generation by Ladder-Side Diffusion Tuning

Title: Hyperspectral Imaging

Title: Iterative refinement, not training objective, makes HuBERT behave differently from wav2vec 2.0

Title: FantasyStyle: Controllable Stylized Distillation for 3D Gaussian Splatting

Title: Can LLMs Detect Their Confabulations? Estimating Reliability in Uncertainty-Aware Language Models

Title: Data-Efficient Biomedical In-Context Learning: A Diversity-Enhanced Submodular Perspective

Title: ReconDreamer-RL: Enhancing Reinforcement Learning via Diffusion-based Scene Reconstruction

Title: CD-TVD: Contrastive Diffusion for 3D Super-Resolution with Scarce High-Resolution Time-Varying Data

Title: RedDino: A foundation model for red blood cell analysis

Title: Reinforcement Learning in Vision: A Survey

Title: SAGOnline: Segment Any Gaussians Online

Title: OMGSR: You Only Need One Mid-timestep Guidance for Real-World Image Super-Resolution

Title: Exploring Safety Alignment Evaluation of LLMs in Chinese Mental Health Dialogues via LLM-as-Judge

Title: Cut2Next: Generating Next Shot via In-Context Tuning

Title: StableAvatar: Infinite-Length Audio-Driven Avatar Video Generation