2025-07-23

Title: PAT++: a cautionary tale about generative visual augmentation for Object Re-identification

Title: ReDi: Rectified Discrete Flow

Title: Foundation Models and Transformers for Anomaly Detection: A Survey

Title: HyDRA: A Hybrid-Driven Reasoning Architecture for Verifiable Knowledge Graphs

Title: Enhancing Hindi NER in Low Context: A Comparative study of Transformer-based models with vs. without Retrieval Augmentation

Title: Learning without training: The implicit dynamics of in-context learning

Title: AutoMeet: a proof-of-concept study of genAI to automate meetings in automotive engineering

Title: Deep Researcher with Test-Time Diffusion

Title: Efficient Compositional Multi-tasking for On-device Large Language Models

Title: Improving Personalized Image Generation through Social Context Feedback

Title: Stop-band Energy Constraint for Orthogonal Tunable Wavelet Units in Convolutional Neural Networks for Computer Vision problems

Title: PUSA V1.0: Surpassing Wan-I2V with $500 Training Cost by Vectorized Timestep Adaptation

Title: DP2Guard: A Lightweight and Byzantine-Robust Privacy-Preserving Federated Learning Scheme for Industrial IoT

Title: Learning Patient-Specific Spatial Biomarker Dynamics via Operator Learning for Alzheimer's Disease Progression

Title: LSSGen: Leveraging Latent Space Scaling in Flow and Diffusion for Efficient Text to Image Generation

Title: EBaReT: Expert-guided Bag Reward Transformer for Auto Bidding

Title: METER: Multi-modal Evidence-based Thinking and Explainable Reasoning -- Algorithm and Benchmark

Title: Advancing Visual Large Language Model for Multi-granular Versatile Perception

Title: Towards Compute-Optimal Many-Shot In-Context Learning

Title: Edge-case Synthesis for Fisheye Object Detection: A Data-centric Perspective

Title: Dens3R: A Foundation Model for 3D Geometry Prediction

Title: Towards Resilient Safety-driven Unlearning for Diffusion Models against Downstream Fine-tuning

Title: M-SpecGene: Generalized Foundation Model for RGBT Multispectral Vision

Title: DREAM: Scalable Red Teaming for Text-to-Image Generative Systems via Distribution Modeling

Title: Re:Form -- Reducing Human Priors in Scalable Formal Software Verification with RL in LLMs: A Preliminary Study on Dafny

Title: One Polyp Identifies All: One-Shot Polyp Segmentation with SAM via Cascaded Priors and Iterative Prompt Evolution

Title: Navigating Large-Pose Challenge for High-Fidelity Face Reenactment with Video Diffusion Model

Title: Are Foundation Models All You Need for Zero-shot Face Presentation Attack Detection?

Title: Sparse-View 3D Reconstruction: Recent Advances and Open Challenges

Title: Robust Noisy Pseudo-label Learning for Semi-supervised Medical Image Segmentation Using Diffusion Model

Title: VGGT-Long: Chunk it, Loop it, Align it -- Pushing VGGT's Limits on Kilometer-scale Long RGB Sequences

Title: The Ever-Evolving Science Exam

Title: EarthCrafter: Scalable 3D Earth Generation via Dual-Sparse Latent Diffusion

Title: Scaling Linear Attention with Sparse State Expansion

Title: Dyna3DGR: 4D Cardiac Motion Tracking with Dynamic 3D Gaussian Representation

Title: CTSL: Codebook-based Temporal-Spatial Learning for Accurate Non-Contrast Cardiac Risk Prediction Using Cine MRIs

Title: Meta-Learning for Cold-Start Personalization in Prompt-Tuned LLMs

Title: PICACO: Pluralistic In-Context Value Alignment of LLMs via Total Correlation Optimization

Title: FISHER: A Foundation Model for Multi-Modal Industrial Signal Comprehensive Representation

Title: Enhancing Remote Sensing Vision-Language Models Through MLLM and LLM-Based High-Quality Image-Text Dataset Generation

Title: HarmonPaint: Harmonized Training-Free Diffusion Inpainting

Title: CMP: A Composable Meta Prompt for SAM-Based Cross-Domain Few-Shot Segmentation

Title: Agentar-Fin-R1: Enhancing Financial Intelligence through Domain Expertise, Training Efficiency, and Advanced Reasoning