2025-06-06

Title: A Comprehensive Survey on the Risks and Limitations of Concept-based Models

Title: Improving Out-of-Distribution Detection with Markov Logic Networks

Title: Triple Attention Transformer Architecture for Time-Dependent Concrete Creep Prediction

Title: SafeSteer: Interpretable Safety Steering with Refusal-Evasion in LLMs

Title: Dynamic Epsilon Scheduling: A Multi-Factor Adaptive Perturbation Budget for Adversarial Training

Title: RSVP: Reasoning Segmentation via Visual Prompting and Multi-modal Chain-of-Thought

Title: Evaluating MLLMs with Multimodal Multi-image Reasoning Benchmark

Title: SF$^2$Bench: Evaluating Data-Driven Models for Compound Flood Forecasting in South Florida

Title: DrSR: LLM based Scientific Equation Discovery with Dual Reasoning from Data and Experience

Title: Backbone Augmented Training for Adaptations

Title: Relational reasoning and inductive bias in transformers trained on a transitive inference task

Title: AUTOCT: Automating Interpretable Clinical Trial Prediction with LLM Agents

Title: Deep learning for predicting hauling fleet production capacity under uncertainties in open pit mines using real and simulated data

Title: RedRFT: A Light-Weight Benchmark for Reinforcement Fine-Tuning-Based Red Teaming

Title: GEM: Empowering LLM for both Embedding Generation and Language Understanding

Title: You Only Train Once

Title: HuGeDiff: 3D Human Generation via Diffusion with Gaussian Splatting

Title: ReXVQA: A Large-scale Visual Question Answering Benchmark for Generalist Chest X-ray Understanding

Title: A Risk-Aware Reinforcement Learning Reward for Financial Trading

Title: WorldPrediction: A Benchmark for High-level World Modeling and Long-horizon Procedural Planning

Title: Effects of Speaker Count, Duration, and Accent Diversity on Zero-Shot Accent Robustness in Low-Resource ASR

Title: Fine-Tuning Video Transformers for Word-Level Bangla Sign Language: A Comparative Analysis for Classification Tasks

Title: Mechanistic Decomposition of Sentence Representations

Title: Visualizing and Controlling Cortical Responses Using Voxel-Weighted Activation Maximization

Title: The Hashed Fractal Key Recovery (HFKR) Problem: From Symbolic Path Inversion to Post-Quantum Cryptographic Keys

Title: MELABenchv1: Benchmarking Large Language Models against Smaller Fine-Tuned Models for Low-Resource Maltese NLP

Title: Through the Stealth Lens: Rethinking Attacks and Defenses in RAG

Title: Is Perturbation-Based Image Protection Disruptive to Image Editing?

Title: Normalize Filters! Classical Wisdom for Deep Vision

Title: MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale

Title: Unpacking Let Alone: Human-Scale Models Generalize to a Rare Construction in Form but not Meaning

Title: HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation

Title: Leveraging Coordinate Momentum in SignSGD and Muon: Memory-Optimized Zero-Order

Title: RETRO SYNFLOW: Discrete Flow Matching for Accurate and Diverse Single-Step Retrosynthesis

Title: Selective Matching Losses -- Not All Scores Are Created Equal

Title: Learning to Diagnose Privately: DP-Powered LLMs for Radiology Report Classification

Title: Neurosymbolic Artificial Intelligence for Robust Network Intrusion Detection: From Scratch to Transfer Learning

Title: Zero-Shot Open-Schema Entity Structure Discovery

Title: Behavioural vs. Representational Systematicity in End-to-End Models: An Opinionated Survey

Title: Watermarking Degrades Alignment in Language Models: Analysis and Mitigation

Title: Aligning Large Language Models with Implicit Preferences from User-Generated Content

Title: Classifying Dental Care Providers Through Machine Learning with Features Ranking

Title: SQLens: An End-to-End Framework for Error Detection and Correction in Text-to-SQL

Title: Towards Large-Scale Pose-Invariant Face Recognition Using Face Defrontalization

Title: FALO: Fast and Accurate LiDAR 3D Object Detection on Resource-Constrained Devices

Title: AuthGuard: Generalizable Deepfake Detection via Language Guidance

Title: Pruning Everything, Everywhere, All at Once

Title: DRE: An Effective Dual-Refined Method for Integrating Small and Large Language Models in Open-Domain Dialogue Evaluation

Title: Please Translate Again: Two Simple Experiments on Whether Human-Like Reasoning Helps Translation

Title: Perturbative Gradient Training: A novel training paradigm for bridging the gap between deep neural networks and physical reservoir computing

Title: EECD-Net: Energy-Efficient Crack Detection with Spiking Neural Networks and Gated Attention

Title: HALoS: Hierarchical Asynchronous Local SGD over Slow Networks for Geo-Distributed Large Language Model Training

Title: Neural MJD: Neural Non-Stationary Merton Jump Diffusion for Time Series Prediction

Title: Communication Efficient Adaptive Model-Driven Quantum Federated Learning

Title: Unsupervised Machine Learning for Scientific Discovery: Workflow and Best Practices

Title: BESA: Boosting Encoder Stealing Attack with Perturbation Recovery

Title: Perceptual Decoupling for Scalable Multi-modal Reasoning via Reward-Optimized Captioning

Title: Clustering and Median Aggregation Improve Differentially Private Inference

Title: StatsMerging: Statistics-Guided Model Merging via Task-Specific Teacher Distillation

Title: Demonstrations of Integrity Attacks in Multi-Agent Systems

Title: Reasoning or Overthinking: Evaluating Large Language Models on Financial Sentiment Analysis

Title: Are LLMs Reliable Translators of Logical Reasoning Across Lexically Diversified Contexts?

Title: Selecting Demonstrations for Many-Shot In-Context Learning via Gradient Matching

Title: SUCEA: Reasoning-Intensive Retrieval for Adversarial Fact-checking through Claim Decomposition and Editing

Title: LESS: Large Language Model Enhanced Semi-Supervised Learning for Speech Foundational Models

Title: Follow-Your-Creation: Empowering 4D Creation through Video Inpainting

Title: Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification

Title: Scaling Laws for Robust Comparison of Open Foundation Language-Vision Models and Datasets

Title: A MISMATCHED Benchmark for Scientific Natural Language Inference

Title: SmartAvatar: Text- and Image-Guided Human Avatar Generation with VLM AI Agents

Title: Ignoring Directionality Leads to Compromised Graph Neural Network Explanations

Title: Exploring bidirectional bounds for minimax-training of Energy-based models

Title: Revisiting Test-Time Scaling: A Survey and a Diversity-Aware Method for Efficient Reasoning

Title: Perfecting Depth: Uncertainty-Aware Enhancement of Metric Depth

Title: Subjective Perspectives within Learned Representations Predict High-Impact Innovation

Title: Deep Learning Reforms Image Matching: A Survey and Outlook

Title: Static Word Embeddings for Sentence Semantic Representation

Title: Advancing Tool-Augmented Large Language Models via Meta-Verification and Reflection Learning

Title: Composing Agents to Minimize Worst-case Risk

Title: Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations

Title: Incentivizing Collaborative Breach Detection

Title: ViCocktail: Automated Multi-Modal Data Collection for Vietnamese Audio-Visual Speech Recognition

Title: Text-Aware Real-World Image Super-Resolution via Diffusion Model with Joint Segmentation Decoders

Title: TaDA: Training-free recipe for Decoding with Adaptive KV Cache Compression and Mean-centering

Title: Authenticated Private Set Intersection: A Merkle Tree-Based Approach for Enhancing Data Integrity

Title: FPSAttention: Training-Aware FP8 and Sparsity Co-Design for Fast Video Diffusion

Title: Feature-Based Lie Group Transformer for Real-World Applications

Title: FedAPM: Federated Learning via ADMM with Partial Model Personalization

Title: Interpretable Few-Shot Image Classification via Prototypical Concept-Guided Mixture of LoRA Experts

Title: Gen-n-Val: Agentic Image Data Generation and Validation

Title: The cost of ensembling: is it always worth combining?

Title: Normative Conflicts and Shallow AI Alignment

Title: Urania: Differentially Private Insights into AI Use

Title: MARS: Radio Map Super-resolution and Reconstruction Method under Sparse Channel Measurements

Title: MMRefine: Unveiling the Obstacles to Robust Refinement in Multimodal Large Language Models

Title: Recycling the Web: A Method to Enhance Pre-training Data Quality and Quantity for Language Models

Title: Towards Better Generalization via Distributional Input Projection Network

Title: Cracking the Code: Enhancing Implicit Hate Speech Detection through Coding Classification

Title: Influence Functions for Edge Edits in Non-Convex Graph Neural Networks

Title: Enhanced Drought Analysis in Bangladesh: A Machine Learning Approach for Severity Classification Using Satellite Data

Title: Explicit Density Approximation for Neural Implicit Samplers Using a Bernstein-Based Convex Divergence

Title: HoliSafe: Holistic Safety Benchmarking and Modeling with Safety Meta Token for Vision-Language Model

Title: UNO: Unlearning via Orthogonalization in Generative models

Title: Robust Few-Shot Vision-Language Model Adaptation

Title: Towards Holistic Visual Quality Assessment of AI-Generated Videos: A LLM-Based Multi-Dimensional Evaluation Model

Title: Learning dissection trajectories from expert surgical videos via imitation learning with equivariant diffusion

Title: Lifelong Evolution: Collaborative Learning between Large and Small Language Models for Continuous Emergent Fake News Detection

Title: SRD: Reinforcement-Learned Semantic Perturbation for Backdoor Defense in VLMs

Title: Multi-Layer GRPO: Enhancing Reasoning and Self-Correction in Large Language Models

Title: Truth in the Few: High-Value Data Selection for Efficient Multi-Modal Reasoning

Title: Log-Linear Attention

Title: HypeVPR: Exploring Hyperbolic Space for Perspective to Equirectangular Visual Place Recognition

Title: OpenGT: A Comprehensive Benchmark For Graph Transformers

Title: Fine-Grained Interpretation of Political Opinions in Large Language Models

Title: MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning Benchmark

Title: Kernel $k$-Medoids as General Vector Quantization

Title: Towards LLM-Centric Multimodal Fusion: A Survey on Integration Strategies and Techniques

Title: MULTISS: un protocole de stockage confidentiel {à} long terme sur plusieurs r{é}seaux QKD

Title: SupeRANSAC: One RANSAC to Rule Them All

Title: Adaptive Preconditioners Trigger Loss Spikes in Adam

Title: Dissecting Logical Reasoning in LLMs: A Fine-Grained Evaluation and Supervision Study

Title: Design of intelligent proofreading system for English translation based on CNN and BERT

Title: Spike-TBR: a Noise Resilient Neuromorphic Event Representation

Title: LogicPuzzleRL: Cultivating Robust Mathematical Reasoning in LLMs via Reinforcement Learning

Title: Evaluating Vision-Language and Large Language Models for Automated Student Assessment in Indonesian Classrooms

Title: Fool the Stoplight: Realistic Adversarial Patch Attacks on Traffic Light Detectors

Title: DualX-VSR: Dual Axial Spatial$\times$Temporal Transformer for Real-World Video Super-Resolution without Motion Compensation

Title: Joint Evaluation of Answer and Reasoning Consistency for Hallucination Detection in Large Reasoning Models

Title: OpenMaskDINO3D : Reasoning 3D Segmentation via Large Language Model

Title: On Automating Security Policies with Contemporary LLMs

Title: Multiple-Choice Question Generation Using Large Language Models: Methodology and Educator Insights

Title: A Private Smart Wallet with Probabilistic Compliance

Title: Prompting LLMs: Length Control for Isometric Machine Translation

Title: Sparse Autoencoders, Again?

Title: Geological Field Restoration through the Lens of Image Inpainting

Title: There Was Never a Bottleneck in Concept Bottleneck Models

Title: Invisible Backdoor Triggers in Image Editing Model via Deep Watermarking

Title: Evaluating the Effectiveness of Linguistic Knowledge in Pretrained Language Models: A Case Study of Universal Dependencies

Title: Learning to Plan via Supervised Contrastive Learning and Strategic Interpolation: A Chess Case Study

Title: ICPC-Eval: Probing the Frontiers of LLM Reasoning with Competitive Programming Contests

Title: From Objects to Anywhere: A Holistic Benchmark for Multi-level Visual Grounding in 3D Scenes

Title: Verbose ListOps (VLO): Beyond Long Context -- Unmasking LLM's Reasoning Blind Spots

Title: Generating Synthetic Stereo Datasets using 3D Gaussian Splatting and Expert Knowledge Transfer

Title: Dissecting Long Reasoning Models: An Empirical Study

Title: Simulating LLM-to-LLM Tutoring for Multilingual Math Feedback

Title: Predicting ICU In-Hospital Mortality Using Adaptive Transformer Layer Fusion

Title: ConECT Dataset: Overcoming Data Scarcity in Context-Aware E-Commerce MT

Title: CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx

Title: Time-Lapse Video-Based Embryo Grading via Complementary Spatial-Temporal Pattern Mining

Title: Robustness as Architecture: Designing IQA Models to Withstand Adversarial Perturbations

Title: APVR: Hour-Level Long Video Understanding with Adaptive Pivot Visual Information Retrieval

Title: FEAT: Full-Dimensional Efficient Attention Transformer for Medical Video Generation

Title: PoCGen: Generating Proof-of-Concept Exploits for Vulnerabilities in Npm Packages

Title: Hiding in Plain Sight: Query Obfuscation via Random Multilingual Searches

Title: From Struggle (06-2024) to Mastery (02-2025) LLMs Conquer Advanced Algorithm Exams and Pave the Way for Editorial Generation

Title: Bringing SAM to new heights: Leveraging elevation data for tree crown segmentation from drone imagery

Title: Evaluating the Impact of Privacy-Preserving Federated Learning on CAN Intrusion Detection

Title: Agentic AI for Intent-Based Industrial Automation

Title: TextVidBench: A Benchmark for Long Video Scene Text Understanding

Title: FPTQuant: Function-Preserving Transforms for LLM Quantization

Title: Multi-scale Image Super Resolution with a Single Auto-Regressive Model

Title: PATS: Proficiency-Aware Temporal Sampling for Multi-View Sports Skill Assessment

Title: SCOP: Evaluating the Comprehension Process of Large Language Models from a Cognitive View

Title: Attack Effect Model based Malicious Behavior Detection

Title: Point Cloud Segmentation of Agricultural Vehicles using 3D Gaussian Splatting

Title: ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development

Title: Identifying and Understanding Cross-Class Features in Adversarial Training

Title: Automatic Robustness Stress Testing of LLMs as Mathematical Problem Solvers

Title: FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing

Title: TALL -- A Trainable Architecture for Enhancing LLM Performance in Low-Resource Languages

Title: NIMO: a Nonlinear Interpretable MOdel

Title: A Survey on Vietnamese Document Analysis and Recognition: Challenges and Future Directions

Title: Does It Make Sense to Speak of Introspection in Large Language Models?

Title: RIVAL: Reinforcement Learning with Iterative and Adversarial Optimization for Machine Translation

Title: Just a Scratch: Enhancing LLM Capabilities for Self-harm Detection through Intent Differentiation and Emoji Interpretation

Title: Parking, Perception, and Retail: Street-Level Determinants of Community Vitality in Harbin

Title: SeedEdit 3.0: Fast and High-Quality Generative Image Editing

Title: Interpretable Multimodal Framework for Human-Centered Street Assessment: Integrating Visual-Language Models for Perceptual Urban Diagnostics

Title: FG 2025 TrustFAA: the First Workshop on Towards Trustworthy Facial Affect Analysis: Advancing Insights of Fairness, Explainability, and Safety (TrustFAA)

Title: Astraea: A GPU-Oriented Token-wise Acceleration Framework for Video Diffusion Transformers

Title: Privacy Amplification Through Synthetic Data: Insights from Linear Regression

Title: DIMCIM: A Quantitative Evaluation Framework for Default-mode Diversity and Generalization in Text-to-Image Generative Models

Title: Practical Manipulation Model for Robust Deepfake Detection

Title: The NTNU System at the S&I Challenge 2025 SLA Open Track

Title: Membership Inference Attacks on Sequence Models

Title: DiCoRe: Enhancing Zero-shot Event Detection via Divergent-Convergent LLM Reasoning

Title: Information Locality as an Inductive Bias for Neural Language Models

Title: Federated Isolation Forest for Efficient Anomaly Detection on Edge IoT Systems

Title: Do Large Language Models Judge Error Severity Like Humans?

Title: Knowledgeable-r1: Policy Optimization for Knowledge Exploration in Retrieval-Augmented Generation

Title: Dissecting Bias in LLMs: A Mechanistic Interpretability Perspective

Title: ECoRAG: Evidentiality-guided Compression for Long Context RAG

Title: Through-the-Wall Radar Human Activity Recognition WITHOUT Using Neural Networks

Title: Track Any Anomalous Object: A Granular Video Anomaly Detection Pipeline

Title: Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Title: Associative Memory and Generative Diffusion in the Zero-noise Limit

Title: TreeRPO: Tree Relative Policy Optimization

Title: Single GPU Task Adaptation of Pathology Foundation Models for Whole Slide Image Analysis

Title: Counterfactual reasoning: an analysis of in-context emergence

Title: Locality Preserving Markovian Transition for Instance Retrieval

Title: Quantifying Cross-Modality Memorization in Vision-Language Models

Title: Transformers Meet In-Context Learning: A Universal Approximation Theory

Title: OGGSplat: Open Gaussian Growing for Generalizable Reconstruction with Expanded Field-of-View

Title: RELIC: Evaluating Compositional Instruction Following via Language Recognition

Title: Follow-Your-Motion: Video Motion Transfer via Efficient Spatial-Temporal Decoupled Finetuning

Title: The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Title: Learning Theory of Decentralized Robust Kernel-Based Learning Algorithm

Title: DSG-World: Learning a 3D Gaussian World Model from Dual State Videos

Title: SAM-aware Test-time Adaptation for Universal Medical Image Segmentation

Title: Improving Low-Resource Morphological Inflection via Self-Supervised Objectives

Title: Diagonal Batching Unlocks Parallelism in Recurrent Memory Transformers for Long Contexts

Title: Progressive Tempering Sampler with Diffusion

Title: MesaNet: Sequence Modeling by Locally Optimal Test-Time Training

Title: Evaluating Sparse Autoencoders: From Shallow Design to Matching Pursuit

Title: Aligning Latent Spaces with Flow Priors

Title: SECNEURON: Reliable and Flexible Abuse Control in Local LLMs via Hybrid Neuron Encryption

Title: On the Convergence of Gradient Descent on Learning Transformers with Residual Connections

Title: Spatiotemporal Contrastive Learning for Cross-View Video Localization in Unstructured Off-road Terrains

Title: Conservative classifiers do consistently well with improving agents: characterizing statistical and online learning

Title: LeanPO: Lean Preference Optimization for Likelihood Alignment in Video-LLMs

Title: Can Foundation Models Generalise the Presentation Attack Detection Capabilities on ID Cards?

Title: Tight analyses of first-order methods with error feedback

Title: How to Unlock Time Series Editing? Diffusion-Driven Approach with Multi-Grained Control

Title: Micro-Act: Mitigate Knowledge Conflict in Question Answering via Actionable Self-Reasoning

Title: Fast-DataShapley: Neural Modeling for Training Data Valuation

Title: Rectified Point Flow: Generic Point Cloud Pose Estimation

Title: RaySt3R: Predicting Novel Depth Maps for Zero-Shot Object Completion

Title: Stable Vision Concept Transformers for Medical Diagnosis

Title: EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?

Title: AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model

Title: Big Bird: Privacy Budget Management for W3C's Privacy-Preserving Attribution API

Title: A Smooth Sea Never Made a Skilled $\texttt{SAILOR}$: Robust Imitation via Learning to Search

Title: Sample Complexity and Representation Ability of Test-time Scaling Paradigms

Title: Power Law Guided Dynamic Sifting for Efficient Attention

Title: SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training

Title: Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos

Title: ProRefine: Inference-time Prompt Refinement with Textual Feedback

Title: Learning normalized image densities via dual score matching

Title: Constrained Entropic Unlearning: A Primal-Dual Framework for Large Language Models

Title: Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay

Title: LSM-2: Learning from Incomplete Wearable Sensor Data

Title: Seeing the Invisible: Machine learning-Based QPI Kernel Extraction via Latent Alignment

Title: Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting

Title: MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning

Title: Search Arena: Analyzing Search-Augmented LLMs

Title: VideoMolmo: Spatio-Temporal Grounding Meets Pointing

Title: Exploring Diffusion Transformer Designs via Grafting

Title: Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning

Title: Refer to Anything with Vision-Language Prompts

Title: SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs

Title: Inference-Time Hyper-Scaling with KV Cache Compression

Title: Why LLM Safety Guardrails Collapse After Fine-tuning: A Similarity Analysis Between Alignment and Fine-tuning Datasets

Title: Contrastive Flow Matching