2025-06-02

Title: Meaning Is Not A Metric: Using LLMs to make cultural context legible at scale

Title: Zero-Trust Foundation Models: A New Paradigm for Secure and Collaborative Artificial Intelligence for Internet of Things

Title: My Answer Is NOT 'Fair': Mitigating Social Bias in Vision-Language Models via Fair and Biased Residuals

Title: MultiPhishGuard: An LLM-based Multi-Agent System for Phishing Email Detection

Title: Watermarking Without Standards Is Not AI Governance

Title: Ratas framework: A comprehensive genai-based approach to rubric-based marking of real-world textual exams

Title: LegalSearchLM: Rethinking Legal Case Retrieval as Legal Elements Generation

Title: GeneBreaker: Jailbreak Attacks against DNA Language Models with Pathogenicity Guidance

Title: Read Your Own Mind: Reasoning Helps Surface Self-Confidence Signals in LLMs

Title: Mamba Integrated with Physics Principles Masters Long-term Chaotic System Forecasting

Title: MaCP: Minimal yet Mighty Adaptation via Hierarchical Cosine Projection

Title: ADG: Ambient Diffusion-Guided Dataset Recovery for Corruption-Robust Offline Reinforcement Learning

Title: KGMark: A Diffusion Watermark for Knowledge Graphs

Title: BioCLIP 2: Emergent Properties from Scaling Hierarchical Contrastive Learning

Title: Test-Time Training Done Right

Title: Generating Fit Check Videos with a Handheld Camera

Title: Cora: Correspondence-aware image editing using few step diffusion

Title: One Task Vector is not Enough: A Large-Scale Study for In-Context Learning

Title: Simplifying Bayesian Optimization Via In-Context Direct Optimum Sampling

Title: Multi-Group Proportional Representation for Text-to-Image Models

Title: DINO-R1: Incentivizing Reasoning Capability in Vision Foundation Models

Title: From Images to Signals: Are Large Vision Models Useful for Time Series Analysis?

Title: Measure gradients, not activations! Enhancing neuronal activity in deep reinforcement learning

Title: ComposeAnything: Composite Object Priors for Text-to-Image Generation

Title: Proxy-FDA: Proxy-based Feature Distribution Alignment for Fine-tuning Vision Foundation Models without Forgetting

Title: Weakly-Supervised Affordance Grounding Guided by Part-Level Semantic Priors

Title: Federated Foundation Model for GI Endoscopy Images

Title: S4-Driver: Scalable Self-Supervised Driving Multimodal Large Language Modelwith Spatio-Temporal Visual Representation

Title: The Butterfly Effect in Pathology: Exploring Security in Pathology Foundation Models

Title: CrossICL: Cross-Task In-Context Learning via Unsupervised Demonstration Transfer

Title: Autoregressive regularized score-based diffusion models for multi-scenarios fluid flow prediction

Title: Towards a Generalizable Bimanual Foundation Policy via Flow-based Video Prediction

Title: Pretraining Deformable Image Registration Networks with Random Images

Title: Invariant Link Selector for Spatial-Temporal Out-of-Distribution Problem

Title: CLaSp: In-Context Layer Skip for Self-Speculative Decoding

Title: STORK: Improving the Fidelity of Mid-NFE Sampling for Diffusion and Flow Matching Models

Title: Are Any-to-Any Models More Consistent Across Modality Transfers Than Specialists?

Title: Benchmarking Foundation Models for Zero-Shot Biometric Tasks

Title: Unleashing High-Quality Image Generation in Diffusion Sampling Using Second-Order Levenberg-Marquardt-Langevin

Title: From Hallucinations to Jailbreaks: Rethinking the Vulnerability of Large Foundation Models

Title: LTM3D: Bridging Token Spaces for Conditional 3D Generation with Auto-Regressive Diffusion Framework

Title: Harnessing Foundation Models for Robust and Generalizable 6-DOF Bronchoscopy Localization

Title: Interactive Video Generation via Domain Adaptation

Title: Out of Sight, Not Out of Context? Egocentric Spatial Reasoning in VLMs Across Disjoint Frames

Title: MUSE: Model-Agnostic Tabular Watermarking via Multi-Sample Selection

Title: Large Language Models are Locally Linear Mappings

Title: Category-aware EEG image generation based on wavelet transform and contrast semantic loss

Title: InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing

Title: KairosAD: A SAM-Based Model for Industrial Anomaly Detection on Embedded Devices

Title: Exploring Multimodal Challenges in Toxic Chinese Detection: Taxonomy, Benchmark, and Findings

Title: Multilingual Gloss-free Sign Language Translation: Towards Building a Sign Language Foundation Model

Title: Interpreting Large Text-to-Image Diffusion Models with Dictionary Learning

Title: Anomaly Detection and Improvement of Clusters using Enhanced K-Means Algorithm

Title: Adversarial Preference Learning for Robust LLM Alignment

Title: IRBridge: Solving Image Restoration Bridge with Pre-trained Generative Diffusion Models

Title: EasyText: Controllable Diffusion Transformer for Multilingual Text Rendering

Title: Bridging 3D Anomaly Localization and Repair via High-Quality Continuous Geometric Representation

Title: Graph Flow Matching: Enhancing Image Generation with Neighbor-Aware Flow Fields

Title: Object Centric Concept Bottlenecks

Title: un$^2$CLIP: Improving CLIP's Visual Detail Capturing Ability via Inverting unCLIP

Title: UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation

Title: Stress-testing Machine Generated Text Detection: Shifting Language Models Writing Style to Fool Detectors

Title: Geospatial Foundation Models to Enable Progress on Sustainable Development Goals

Title: Transformers Are Universally Consistent

Title: HLSAD: Hodge Laplacian-based Simplicial Anomaly Detection

Title: AutoChemSchematic AI: A Closed-Loop, Physics-Aware Agentic Framework for Auto-Generating Chemical Process and Instrumentation Diagrams

Title: Eye of Judgement: Dissecting the Evaluation of Russian-speaking LLMs with POLLUX

Title: Category-Level 6D Object Pose Estimation in Agricultural Settings Using a Lattice-Deformation Framework and Diffusion-Augmented Synthetic Data

Title: A Cross Branch Fusion-Based Contrastive Learning Framework for Point Cloud Self-supervised Learning

Title: MSDA: Combining Pseudo-labeling and Self-Supervision for Unsupervised Domain Adaptation in ASR

Title: Conformal Prediction for Zero-Shot Models

Title: PDE-Transformer: Efficient and Versatile Transformers for Physics Simulations

Title: HELM: Hyperbolic Large Language Models via Mixture-of-Curvature Experts

Title: AFLoRA: Adaptive Federated Fine-Tuning of Large Language Models with Resource-Aware Low-Rank Adaption

Title: Diffusion-Based Symbolic Regression

Title: EVA-MILP: Towards Standardized Evaluation of MILP Instance Generation

Title: QGAN-based data augmentation for hybrid quantum-classical neural networks

Title: Inference Acceleration of Autoregressive Normalizing Flows by Selective Jacobi Decoding

Title: Guiding Generative Storytelling with Knowledge Graphs

Title: Accelerated Sampling from Masked Diffusion Models via Entropy Bounded Unmasking

Title: ViStoryBench: Comprehensive Benchmark Suite for Story Visualization

Title: TalkingHeadBench: A Multi-Modal Benchmark & Analysis of Talking-Head DeepFake Detection

Title: GenSpace: Benchmarking Spatially-Aware Image Generation

Title: MiniMax-Remover: Taming Bad Noise Helps Video Object Removal

Title: The Road to Generalizable Neuro-Symbolic Learning Should be Paved with Foundation Models

Title: ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RL

Title: AdaHuman: Animatable Detailed 3D Human Generation with Compositional Multiview Diffusion