2025-06-09

Title: AI-Driven Dynamic Firewall Optimization Using Reinforcement Learning for Anomaly Detection and Prevention

Title: Scalable Generation of Spatial Transcriptomics from Histology Images via Whole-Slide Flow Matching

Title: Seed Selection for Human-Oriented Image Reconstruction via Guided Diffusion

Title: Text2Stereo: Repurposing Stable Diffusion for Stereo Generation with Consistency Rewards

Title: Speaking images. A novel framework for the automated self-description of artworks

Title: An Independent Discriminant Network Towards Identification of Counterfeit Images and Videos

Title: Can Vision Transformers with ResNet's Global Features Fairly Authenticate Demographic Faces?

Title: LLMs Can Also Do Well! Breaking Barriers in Semantic Role Labeling via Large Language Models

Title: Attacking Attention of Foundation Models Disrupts Downstream Tasks

Title: Poisoning Behavioral-based Worker Selection in Mobile Crowdsensing using Generative Adversarial Networks

Title: A VLM-based Method for Visual Anomaly Detection in Robotic Scientific Laboratories

Title: PCEvolve: Private Contrastive Evolution for Synthetic Dataset Generation via Few-Shot Private Data and Generative APIs

Title: Dream to Generalize: Zero-Shot Model-Based Reinforcement Learning for Unseen Visual Distractions

Title: Self-supervised One-Stage Learning for RF-based Multi-Person Pose Estimation

Title: Mixture-of-Experts Meets In-Context Reinforcement Learning

Title: Diffusion with a Linguistic Compass: Steering the Generation of Clinically Plausible Future sMRI Representations for Early MCI Conversion Prediction

Title: Towards Reliable Identification of Diffusion-based Image Manipulations

Title: Conformal Prediction Beyond the Seen: A Missing Mass Perspective for Uncertainty Quantification in Generative Models

Title: The Generative Leap: Sharp Sample Complexity for Efficiently Learning Gaussian Multi-Index Models

Title: FocusDiff: Advancing Fine-Grained Text-Image Alignment for Autoregressive Visual Generation through RL

Title: MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning

Title: SocialDF: Benchmark Dataset and Detection Model for Mitigating Harmful Deepfake Content on Social Media Platforms

Title: FRAME: Pre-Training Video Feature Representations via Anticipation and Memory

Title: EX-4D: EXtreme Viewpoint 4D Video Synthesis via Depth Watertight Mesh

Title: VoxelSplat: Dynamic Gaussian Splatting as an Effective Loss for Occupancy and Flow Prediction

Title: PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers

Title: When can in-context learning generalize out of task distribution?

Title: TabFlex: Scaling Tabular Learning to Millions with Linear Attention

Title: FaCTR: Factorized Channel-Temporal Representation Transformers for Efficient Time Series Forecasting

Title: UniRes: Universal Image Restoration for Complex Degradations

Title: GP-MoLFormer-Sim: Test Time Molecular Optimization through Contextual Similarity Guidance

Title: Projectable Models: One-Shot Generation of Small Specialized Transformers from Large Ones

Title: Learning to Weight Parameters for Data Attribution

Title: RNE: a plug-and-play framework for diffusion density estimation and inference-time control

Title: Contextually Guided Transformers via Low-Rank Adaptation

Title: Peer-Ranked Precision: Creating a Foundational Dataset for Fine-Tuning Vision Models from DataSeeds' Annotated Imagery

Title: Learning Design-Score Manifold to Guide Diffusion Models for Offline Optimization

Title: Multi-Modal Multi-Task Federated Foundation Models for Next-Generation Extended Reality Systems: Towards Privacy-Preserving Distributed Intelligence in AR/VR/MR

Title: Latent Diffusion Model Based Denoising Receiver for 6G Semantic Communication: From Stochastic Differential Theory to Application

Title: Come Together, But Not Right Now: A Progressive Strategy to Boost Low-Rank Adaptation

Title: When Better Features Mean Greater Risks: The Performance-Privacy Trade-Off in Contrastive Learning

Title: BiTrajDiff: Bidirectional Trajectory Generation with Diffusion Models for Offline Reinforcement Learning

Title: LLIA -- Enabling Low-Latency Interactive Avatars: Real-Time Audio-Driven Portrait Video Generation with Diffusion Models

Title: Heartcare Suite: Multi-dimensional Understanding of ECG with Raw Multi-lead Signal Modeling

Title: FontAdapter: Instant Font Adaptation in Visual Text Generation

Title: Cross-lingual Collapse: How Language-Centric Foundation Models Shape Reasoning in Large Language Models

Title: ChronoTailor: Harnessing Attention Guidance for Fine-Grained Video Virtual Try-On

Title: CryoFastAR: Fast Cryo-EM Ab Initio Reconstruction Made Easy

Title: Stealix: Model Stealing via Prompt Evolution

Title: Domain-RAG: Retrieval-Guided Compositional Image Generation for Cross-Domain Few-Shot Object Detection

Title: Route-and-Reason: Scaling Large Language Model Reasoning with Reinforced Model Router

Title: FADE: Frequency-Aware Diffusion Model Factorization for Video Editing

Title: Exponential Family Variational Flow Matching for Tabular Data Generation

Title: AQUATIC-Diff: Additive Quantization for Truly Tiny Compressed Diffusion Models

Title: LTG at SemEval-2025 Task 10: Optimizing Context for Classification of Narrative Roles

Title: LightGTS: A Lightweight General Time Series Forecasting Model

Title: Bootstrapping World Models from Dynamics Models in Multimodal Foundation Models

Title: Restereo: Diffusion stereo video generation and restoration

Title: Sample-Specific Noise Injection For Diffusion-Based Adversarial Purification

Title: Large Language Models are Demonstration Pre-Selectors for Themselves

Title: HAVIR: HierArchical Vision to Image Reconstruction using CLIP-Guided Versatile Diffusion

Title: Do-PFN: In-Context Learning for Causal Effect Estimation

Title: Tensor-to-Tensor Models with Fast Iterated Sum Features

Title: Diffusion-Based Hierarchical Graph Neural Networks for Simulating Nonlinear Solid Mechanics

Title: Full Conformal Adaptation of Medical Vision-Language Models

Title: Feedback Guidance of Diffusion Models

Title: Text-to-LoRA: Instant Transformer Adaption

Title: Bridging the Gap: In-Context Learning for Modeling Human Disagreement

Title: Table-r1: Self-supervised and Reinforcement Learning for Program-based Table Reasoning in Small Language Models

Title: ENMA: Tokenwise Autoregression for Generative Neural PDE Operators

Title: Antithetic Noise in Diffusion Models

Title: PuzzleWorld: A Benchmark for Multimodal, Open-Ended Reasoning in Puzzlehunts

Title: Model-Driven Graph Contrastive Learning

Title: GenIR: Generative Visual Feedback for Mental Image Retrieval

Title: Challenging Vision-Language Models with Surgical Data: A New Dataset and Broad Benchmarking Study

Title: Cartridges: Lightweight and general-purpose long context representations via self-study

Title: STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis

Title: TerraFM: A Scalable Foundation Model for Unified Multisensor Earth Observation