2025-06-09

Title: Zero-Trust Mobility-Aware Authentication Framework for Secure Vehicular Fog Computing Networks

Title: AI-Driven Dynamic Firewall Optimization Using Reinforcement Learning for Anomaly Detection and Prevention

Title: Can ChatGPT Perform Image Splicing Detection? A Preliminary Study

Title: CarboNeXT and CarboFormer: Dual Semantic Segmentation Architectures for Detecting and Quantifying Carbon Dioxide Emissions Using Optical Gas Imaging

Title: Scalable Generation of Spatial Transcriptomics from Histology Images via Whole-Slide Flow Matching

Title: Seed Selection for Human-Oriented Image Reconstruction via Guided Diffusion

Title: Text2Stereo: Repurposing Stable Diffusion for Stereo Generation with Consistency Rewards

Title: Speaking images. A novel framework for the automated self-description of artworks

Title: State Estimation and Control of Dynamic Systems from High-Dimensional Image Data

Title: A Red Teaming Roadmap Towards System-Level Safety

Title: An Independent Discriminant Network Towards Identification of Counterfeit Images and Videos

Title: A Compendium of Autonomous Navigation using Object Detection and Tracking in Unmanned Aerial Vehicles

Title: EvidenceOutcomes: a Dataset of Clinical Trial Publications with Clinically Meaningful Outcomes

Title: Heterogeneous Secure Transmissions in IRS-Assisted NOMA Communications: CO-GNN Approach

Title: How stealthy is stealthy? Studying the Efficacy of Black-Box Adversarial Attacks in the Real World

Title: Can Vision Transformers with ResNet's Global Features Fairly Authenticate Demographic Faces?

Title: Q-Ponder: A Unified Training Pipeline for Reasoning-based Visual Quality Assessment

Title: LLMs Can Also Do Well! Breaking Barriers in Semantic Role Labeling via Large Language Models

Title: Beyond RAG: Reinforced Reasoning Augmented Generation for Clinical Notes

Title: Advancing Decoding Strategies: Enhancements in Locally Typical Sampling for LLMs

Title: Understanding Gender Bias in AI-Generated Product Descriptions

Title: Are Large Language Models Good Temporal Graph Learners?

Title: Attacking Attention of Foundation Models Disrupts Downstream Tasks

Title: TriPSS: A Tri-Modal Keyframe Extraction Framework Using Perceptual, Structural, and Semantic Representations

Title: Talk2SAM: Text-Guided Semantic Enhancement for Complex-Shaped Object Segmentation

Title: Attention-based transformer models for image captioning across languages: An in-depth survey and evaluation

Title: Auto Review: Second Stage Error Detection for Highly Accurate Information Extraction from Phone Conversations

Title: Robust Anti-Backdoor Instruction Tuning in LVLMs

Title: Sylva: Tailoring Personalized Adversarial Defense in Pre-trained Models via Collaborative Fine-tuning

Title: Poisoning Behavioral-based Worker Selection in Mobile Crowdsensing using Generative Adversarial Networks

Title: PCEvolve: Private Contrastive Evolution for Synthetic Dataset Generation via Few-Shot Private Data and Generative APIs

Title: Differentially Private Federated $k$-Means Clustering with Server-Side Data

Title: Object-level Self-Distillation for Vision Pretraining

Title: Homogeneous Keys, Heterogeneous Values: Exploiting Local KV Cache Asymmetry for Long-Context LLMs

Title: QA-HFL: Quality-Aware Hierarchical Federated Learning for Resource-Constrained Mobile Devices with Heterogeneous Image Quality

Title: Can Vision Language Models Infer Human Gaze Direction? A Controlled Study

Title: SmoothRot: Combining Channel-Wise Scaling and Rotation for Quantization-Friendly LLMs

Title: SAVVY: Spatial Awareness via Audio-Visual LLMs through Seeing and Hearing

Title: FERRET: Private Deep Learning Faster And Better Than DPSGD

Title: Better STEP, a format and dataset for boundary representation

Title: Self-Predictive Dynamics for Generalization of Vision-based Reinforcement Learning

Title: Dream to Generalize: Zero-Shot Model-Based Reinforcement Learning for Unseen Visual Distractions

Title: TRIDENT -- A Three-Tier Privacy-Preserving Propaganda Detection Model in Mobile Networks using Transformers, Adversarial Learning, and Differential Privacy

Title: SIV-Bench: A Video Benchmark for Social Interaction Understanding and Reasoning

Title: Mixture-of-Experts Meets In-Context Reinforcement Learning

Title: Diffusion with a Linguistic Compass: Steering the Generation of Clinically Plausible Future sMRI Representations for Early MCI Conversion Prediction

Title: Coordinated Robustness Evaluation Framework for Vision-Language Models

Title: Explainer-guided Targeted Adversarial Attacks against Binary Code Similarity Detection Models

Title: Robustness Evaluation for Video Models with Reinforcement Learning

Title: PCDVQ: Enhancing Vector Quantization for Large Language Models via Polar Coordinate Decoupling

Title: Efficient Robust Conformal Prediction via Lipschitz-Bounded Networks

Title: An Unsupervised Framework for Dynamic Health Indicator Construction and Its Application in Rolling Bearing Prognostics

Title: U-NetMN and SegNetMN: Modified U-Net and SegNet models for bimodal SAR image segmentation

Title: Causal Policy Learning in Reinforcement Learning: Backdoor-Adjusted Soft Actor-Critic

Title: Sentinel: SOTA model to protect against prompt injections

Title: MLLM-CL: Continual Learning for Multimodal Large Language Models

Title: Zeroth-Order Optimization Finds Flat Minima

Title: Towards Reliable Identification of Diffusion-based Image Manipulations

Title: F2T2-HiT: A U-Shaped FFT Transformer and Hierarchical Transformer for Reflection Removal

Title: Conformal Prediction Beyond the Seen: A Missing Mass Perspective for Uncertainty Quantification in Generative Models

Title: The Generative Leap: Sharp Sample Complexity for Efficiently Learning Gaussian Multi-Index Models

Title: FocusDiff: Advancing Fine-Grained Text-Image Alignment for Autoregressive Visual Generation through RL

Title: StealthInk: A Multi-bit and Stealthy Watermark for Large Language Models

Title: MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning

Title: Spectral Graph Neural Networks are Incomplete on Graphs with a Simple Spectrum

Title: Personalized Interpretability -- Interactive Alignment of Prototypical Parts Networks

Title: SocialDF: Benchmark Dataset and Detection Model for Mitigating Harmful Deepfake Content on Social Media Platforms

Title: Agentomics-ML: Autonomous Machine Learning Experimentation Agent for Genomic and Transcriptomic Data

Title: FRAME: Pre-Training Video Feature Representations via Anticipation and Memory

Title: Layered Motion Fusion: Lifting Motion Segmentation to 3D in Egocentric Videos

Title: When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding

Title: EX-4D: EXtreme Viewpoint 4D Video Synthesis via Depth Watertight Mesh

Title: On-the-fly Reconstruction for Large-Scale Novel View Synthesis from Unposed Images

Title: Improving LLMs with a knowledge from databases

Title: Ravan: Multi-Head Low-Rank Adaptation for Federated Fine-Tuning

Title: PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers

Title: When can in-context learning generalize out of task distribution?

Title: Conformal Prediction Adaptive to Unknown Subpopulation Shifts

Title: TabFlex: Scaling Tabular Learning to Millions with Linear Attention

Title: CoFrNets: Interpretable Neural Architecture Inspired by Continued Fractions

Title: UTSA-NLP at ArchEHR-QA 2025: Improving EHR Question Answering via Self-Consistency Prompting

Title: SoK: Are Watermarks in LLMs Ready for Deployment?

Title: Zero-shot protein stability prediction by inverse folding models: a free energy interpretation

Title: FaCTR: Factorized Channel-Temporal Representation Transformers for Efficient Time Series Forecasting

Title: SynthesizeMe! Inducing Persona-Guided Prompts for Personalized Reward Models in LLMs

Title: UniRes: Universal Image Restoration for Complex Degradations

Title: Network Hexagons Under Attack: Secure Crowdsourcing of Geo-Referenced Data

Title: OPeRA: A Dataset of Observation, Persona, Rationale, and Action for Evaluating LLMs on Human Online Shopping Behavior Simulation

Title: Mitigating Confounding in Speech-Based Dementia Detection through Weight Masking

Title: Breaking Anonymity at Scale: Re-identifying the Trajectories of 100K Real Users in Japan

Title: When Maximum Entropy Misleads Policy Optimization

Title: LFA applied to CNNs: Efficient Singular Value Decomposition of Convolutional Mappings by Local Fourier Analysis

Title: GP-MoLFormer-Sim: Test Time Molecular Optimization through Contextual Similarity Guidance

Title: Leveraging Self-Attention for Input-Dependent Soft Prompting in LLMs

Title: FedShield-LLM: A Secure and Scalable Federated Fine-Tuned Large Language Model

Title: Projectable Models: One-Shot Generation of Small Specialized Transformers from Large Ones

Title: Learning to Weight Parameters for Data Attribution

Title: Hallucinate, Ground, Repeat: A Framework for Generalized Visual Relationship Detection

Title: TissUnet: Improved Extracranial Tissue and Cranium Segmentation for Children through Adulthood

Title: BAQ: Efficient Bit Allocation Quantization for Large Language Models

Title: DriveAction: A Benchmark for Exploring Human-like Driving Decisions in VLA Models

Title: RNE: a plug-and-play framework for diffusion density estimation and inference-time control

Title: Contextually Guided Transformers via Low-Rank Adaptation

Title: Peer-Ranked Precision: Creating a Foundational Dataset for Fine-Tuning Vision Models from DataSeeds' Annotated Imagery

Title: Zero-Shot Event Causality Identification via Multi-source Evidence Fuzzy Aggregation with Large Language Models

Title: Numerical Investigation of Sequence Modeling Theory using Controllable Memory Functions

Title: Learning Design-Score Manifold to Guide Diffusion Models for Offline Optimization

Title: Multi-Modal Multi-Task Federated Foundation Models for Next-Generation Extended Reality Systems: Towards Privacy-Preserving Distributed Intelligence in AR/VR/MR

Title: Pts3D-LLM: Studying the Impact of Token Structure for 3D Scene Understanding With Large Language Models

Title: When to use Graphs in RAG: A Comprehensive Analysis for Graph Retrieval-Augmented Generation

Title: SafeGenBench: A Benchmark Framework for Security Vulnerability Detection in LLM-Generated Code

Title: Being Strong Progressively! Enhancing Knowledge Distillation of Large Language Models through a Curriculum Learning Framework

Title: RKEFino1: A Regulation Knowledge-Enhanced Large Language Model

Title: Hybrid Stabilization Protocol for Cross-Chain Digital Assets Using Adaptor Signatures and AI-Driven Arbitrage

Title: Token Transforming: A Unified and Training-Free Token Compression Framework for Vision Transformer Acceleration

Title: Latent Diffusion Model Based Denoising Receiver for 6G Semantic Communication: From Stochastic Differential Theory to Application

Title: A symmetric LWE-based Multi-Recipient Cryptosystem

Title: Come Together, But Not Right Now: A Progressive Strategy to Boost Low-Rank Adaptation

Title: Ensemble Elastic DQN: A novel multi-step ensemble approach to address overestimation in deep value-based reinforcement learning

Title: You Only Estimate Once: Unified, One-stage, Real-Time Category-level Articulated Object 6D Pose Estimation for Robotic Grasping

Title: Any-Class Presence Likelihood for Robust Multi-Label Classification with Abundant Negative Data

Title: Large Language Models are Good Relational Learners

Title: There's Waldo: PCB Tamper Forensic Analysis using Explainable AI on Impedance Signatures

Title: Do LLMs Really Forget? Evaluating Unlearning with Knowledge Correlation and Confidence Awareness

Title: Generalized Incremental Learning under Concept Drift across Evolving Data Streams

Title: To Protect the LLM Agent Against the Prompt Injection Attack with Polymorphic Prompt

Title: FIST: A Structured Threat Modeling Framework for Fraud Incidents

Title: When Better Features Mean Greater Risks: The Performance-Privacy Trade-Off in Contrastive Learning

Title: LLM-Symbolic Integration for Robust Temporal Tabular Reasoning

Title: Efficient Online RFT with Plug-and-Play LLM Judges: Unlocking State-of-the-Art Performance

Title: Writing-RL: Advancing Long-form Writing via Adaptive Curriculum Reinforcement Learning

Title: BiTrajDiff: Bidirectional Trajectory Generation with Diffusion Models for Offline Reinforcement Learning

Title: Exploring Microstructural Dynamics in Cryptocurrency Limit Order Books: Better Inputs Matter More Than Stacking Another Hidden Layer

Title: BioMol-MQA: A Multi-Modal Question Answering Dataset For LLM Reasoning Over Bio-Molecular Interactions

Title: dots.llm1 Technical Report

Title: AANet: Virtual Screening under Structural Uncertainty via Alignment and Aggregation

Title: Evaluating Neuron Explanations: A Unified Framework with Sanity Checks

Title: Robust sensor fusion against on-vehicle sensor staleness

Title: EASG-Bench: Video Q&A Benchmark with Egocentric Action Scene Graphs

Title: Discrete Minds in a Continuous World: Do Language Models Know Time Passes?

Title: EqCollide: Equivariant and Collision-Aware Deformable Objects Neural Simulator

Title: Option Pricing Using Ensemble Learning

Title: LLIA -- Enabling Low-Latency Interactive Avatars: Real-Time Audio-Driven Portrait Video Generation with Diffusion Models

Title: DeformCL: Learning Deformable Centerline Representation for Vessel Extraction in 3D Medical Image

Title: FuseUNet: A Multi-Scale Feature Fusion Method for U-like Networks

Title: Learning Along the Arrow of Time: Hyperbolic Geometry for Backward-Compatible Representation Learning

Title: Heartcare Suite: Multi-dimensional Understanding of ECG with Raw Multi-lead Signal Modeling

Title: FontAdapter: Instant Font Adaptation in Visual Text Generation

Title: $\text{C}^{2}\text{BNVAE}$: Dual-Conditional Deep Generation of Network Traffic Data for Network Intrusion Detection System Balancing

Title: Cross-lingual Collapse: How Language-Centric Foundation Models Shape Reasoning in Large Language Models

Title: Cross-View Multi-Modal Segmentation @ Ego-Exo4D Challenges 2025

Title: ChronoTailor: Harnessing Attention Guidance for Fine-Grained Video Virtual Try-On

Title: CryoFastAR: Fast Cryo-EM Ab Initio Reconstruction Made Easy

Title: Stealix: Model Stealing via Prompt Evolution

Title: BestServe: Serving Strategies with Optimal Goodput in Collocation and Disaggregation Architectures

Title: Domain-RAG: Retrieval-Guided Compositional Image Generation for Cross-Domain Few-Shot Object Detection

Title: Interpretable Clustering Ensemble

Title: NILMFormer: Non-Intrusive Load Monitoring that Accounts for Non-Stationarity

Title: Query Nearby: Offset-Adjusted Mask2Former enhances small-organ segmentation

Title: Differentially Private Explanations for Clusters

Title: Route-and-Reason: Scaling Large Language Model Reasoning with Reinforced Model Router

Title: A Driving Regime-Embedded Deep Learning Framework for Modeling Intra-Driver Heterogeneity in Multi-Scale Car-Following Dynamics

Title: Rethinking Semi-supervised Segmentation Beyond Accuracy: Reliability and Robustness

Title: Generating Grounded Responses to Counter Misinformation via Learning Efficient Fine-Grained Critiques

Title: LengClaro2023: A Dataset of Administrative Texts in Spanish with Plain Language adaptations

Title: MoA: Heterogeneous Mixture of Adapters for Parameter-Efficient Fine-Tuning of Large Language Models

Title: FADE: Frequency-Aware Diffusion Model Factorization for Video Editing

Title: DynamicMind: A Tri-Mode Thinking System for Large Language Models

Title: Quantifying Adversarial Uncertainty in Evidential Deep Learning using Conflict Resolution

Title: Exponential Family Variational Flow Matching for Tabular Data Generation

Title: Comparative Analysis of Modern Machine Learning Models for Retail Sales Forecasting

Title: Additive decomposition of one-dimensional signals using Transformers

Title: IntentionESC: An Intention-Centered Framework for Enhancing Emotional Support in Dialogue Systems

Title: Elementary Math Word Problem Generation using Large Language Models

Title: MOGO: Residual Quantized Hierarchical Causal Transformer for High-Quality and Real-Time 3D Human Motion Generation

Title: AQUATIC-Diff: Additive Quantization for Truly Tiny Compressed Diffusion Models

Title: Gradual Transition from Bellman Optimality Operator to Bellman Operator in Online Reinforcement Learning

Title: Let's Put Ourselves in Sally's Shoes: Shoes-of-Others Prefixing Improves Theory of Mind in Large Language Models

Title: On Measuring Long-Range Interactions in Graph Neural Networks

Title: LTG at SemEval-2025 Task 10: Optimizing Context for Classification of Narrative Roles

Title: Mitigating Catastrophic Forgetting with Adaptive Transformer Block Expansion in Federated Fine-Tuning

Title: Tau-Eval: A Unified Evaluation Framework for Useful and Private Text Anonymization

Title: AMPED: Adaptive Multi-objective Projection for balancing Exploration and skill Diversification

Title: MCA-Bench: A Multimodal Benchmark for Evaluating CAPTCHA Robustness Against VLM-based Attacks

Title: A Culturally-Rich Romanian NLP Dataset from "Who Wants to Be a Millionaire?" Videos

Title: LaDEEP: A Deep Learning-based Surrogate Model for Large Deformation of Elastic-Plastic Solids

Title: What Really is a Member? Discrediting Membership Inference via Poisoning

Title: Enhancing Orthopox Image Classification Using Hybrid Machine Learning and Deep Learning Models

Title: Token Signature: Predicting Chain-of-Thought Gains with Token Decoding Feature in Large Language Models

Title: Unlocking Recursive Thinking of LLMs: Alignment via Refinement

Title: AgentSwift: Efficient LLM Agent Design via Value-guided Hierarchical Search

Title: When to Trust Context: Self-Reflective Debates for Context Reliability

Title: Unisoma: A Unified Transformer-based Solver for Multi-Solid Systems

Title: Restereo: Diffusion stereo video generation and restoration

Title: O-MaMa @ EgoExo4D Correspondence Challenge: Learning Object Mask Matching between Egocentric and Exocentric Views

Title: Sample-Specific Noise Injection For Diffusion-Based Adversarial Purification

Title: Large Language Models are Demonstration Pre-Selectors for Themselves

Title: MATP-BENCH: Can MLLM Be a Good Automated Theorem Prover for Multimodal Problems?

Title: HAVIR: HierArchical Vision to Image Reconstruction using CLIP-Guided Versatile Diffusion

Title: Do-PFN: In-Context Learning for Causal Effect Estimation

Title: Diffusion-Based Hierarchical Graph Neural Networks for Simulating Nonlinear Solid Mechanics

Title: Hey, That's My Data! Label-Only Dataset Inference in Large Language Models

Title: Simple Yet Effective: Extracting Private Data Across Clients in Federated Fine-Tuning of Large Language Models

Title: Zero-Shot Detection of LLM-Generated Code via Approximated Task Conditioning

Title: System-Aware Unlearning Algorithms: Use Lesser, Forget Faster

Title: Feedback Guidance of Diffusion Models

Title: Reinforcing Code Generation: Improving Text-to-SQL with Execution-Based Learning

Title: Flexible Operator Fusion for Fast Sparse Transformer with Diverse Masking on GPU

Title: VideoChat-A1: Thinking with Long Videos by Chain-of-Shot Reasoning

Title: Text-to-LoRA: Instant Transformer Adaption

Title: Synthetic Tabular Data: Methods, Attacks and Defenses

Title: Towards Lifecycle Unlearning Commitment Management: Measuring Sample-level Unlearning Completeness

Title: Bridging the Gap: In-Context Learning for Modeling Human Disagreement

Title: SATversary: Adversarial Attacks on Satellite Fingerprinting

Title: PrivTru: A Privacy-by-Design Data Trustee Minimizing Information Leakage

Title: CCLSTM: Coupled Convolutional Long-Short Term Memory Network for Occupancy Flow Forecasting

Title: Let's CONFER: A Dataset for Evaluating Natural Language Inference Models on CONditional InFERence and Presupposition

Title: Joint-GCG: Unified Gradient-Based Poisoning Attacks on Retrieval-Augmented Generation Systems

Title: A Novel Large-scale Crop Dataset and Dual-stream Transformer Method for Fine-grained Hierarchical Crop Classification from Integrated Hyperspectral EnMAP Data and Multispectral Sentinel-2 Time Series

Title: ENMA: Tokenwise Autoregression for Generative Neural PDE Operators

Title: Obfuscation-Resilient Binary Code Similarity Analysis using Dominance Enhanced Semantic Graph

Title: The Lock-in Hypothesis: Stagnation by Algorithm

Title: Technical Report for Egocentric Mistake Detection for the HoloAssist Challenge

Title: Does It Run and Is That Enough? Revisiting Text-to-Chart Generation with a Multi-Agent Approach

Title: SatelliteFormula: Multi-Modal Symbolic Regression from Remote Sensing Imagery for Physics Discovery

Title: Detecting Voice Phishing with Precision: Fine-Tuning Small Language Models

Title: Antithetic Noise in Diffusion Models

Title: Transformative or Conservative? Conservation laws for ResNets and Transformers

Title: How to craft a deep reinforcement learning policy for wind farm flow control

Title: Building Models of Neurological Language

Title: Model-Driven Graph Contrastive Learning

Title: Can Theoretical Physics Research Benefit from Language Agents?

Title: STSBench: A Spatio-temporal Scenario Benchmark for Multi-modal Large Language Models in Autonomous Driving

Title: GenIR: Generative Visual Feedback for Mental Image Retrieval

Title: PROVSYN: Synthesizing Provenance Graphs for Data Augmentation in Intrusion Detection Systems

Title: Bridging External and Parametric Knowledge: Mitigating Hallucination of LLMs with Shared-Private Semantic Synergy in Dual-Stream Knowledge

Title: Visual Graph Arena: Evaluating Visual Conceptualization of Vision and Multimodal Large Language Models

Title: Cartridges: Lightweight and general-purpose long context representations via self-study

Title: AdvSumm: Adversarial Training for Bias Mitigation in Text Summarization

Title: STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis

Title: Distillation Robustifies Unlearning

Title: CoMemo: LVLMs Need Image Context with Image Memory

Title: Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias

Title: TerraFM: A Scalable Foundation Model for Unified Multisensor Earth Observation