2025-05-19

Title: Robust Emotion Recognition via Bi-Level Self-Supervised Continual Learning

Title: Bias and Generalizability of Foundation Models across Datasets in Breast Mammography

Title: Relative Drawing Identification Complexity is Invariant to Modality in Vision-Language Models

Title: Aquarius: A Family of Industry-Level Video Generation Models for Marketing Scenarios

Title: Efficient Malicious UAV Detection Using Autoencoder-TSMamba Integration

Title: Super-Resolution Generative Adversarial Networks based Video Enhancement

Title: ARFC-WAHNet: Adaptive Receptive Field Convolution and Wavelet-Attentive Hierarchical Network for Infrared Small Target Detection

Title: Two Minds Better Than One: Collaborative Reward Modeling for LLM Alignment

Title: Enhancing IoT Cyber Attack Detection in the Presence of Highly Imbalanced Data

Title: Continuity and Isolation Lead to Doubts or Dilemmas in Large Language Models

Title: MONAQ: Multi-Objective Neural Architecture Querying for Time-Series Analysis on Resource-Constrained Devices

Title: Agent Name Service (ANS): A Universal Directory for Secure AI Agent Discovery and Interoperability

Title: MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly

Title: How many measurements are enough? Bayesian recovery in inverse problems with general distributions

Title: Mitigate Language Priors in Large Vision-Language Models by Cross-Images Contrastive Decoding

Title: FRET: Feature Redundancy Elimination for Test Time Adaptation

Title: A Conformal Predictive Measure for Assessing Catastrophic Forgetting

Title: Clustering Rooftop PV Systems via Probabilistic Embeddings

Title: SafeTrans: LLM-assisted Transpilation from C to Rust

Title: GNN-Suite: a Graph Neural Network Benchmarking Framework for Biomedical Informatics

Title: A Modular Approach for Clinical SLMs Driven by Synthetic Data with Pre-Instruction Tuning, Model Merging, and Clinical-Tasks Alignment

Title: AI-enhanced semantic feature norms for 786 concepts

Title: Tracr-Injection: Distilling Algorithms into Pre-trained Language Models

Title: Automating Security Audit Using Large Language Model based Agent: An Exploration Experiment

Title: Model Performance-Guided Evaluation Data Selection for Effective Prompt Optimization

Title: IMAGE-ALCHEMY: Advancing subject fidelity in personalised text-to-image generation

Title: Mapping Semantic Segmentation to Point Clouds Using Structure from Motion for Forest Analysis

Title: Random Client Selection on Contrastive Federated Learning for Tabular Data

Title: Deep Symbolic Optimization: Reinforcement Learning for Symbolic Mathematics

Title: Benchmarking performance, explainability, and evaluation strategies of vision-language models for surgery: Challenges and opportunities

Title: Unifying Segment Anything in Microscopy with Multimodal Large Language Model

Title: Ranked Voting based Self-Consistency of Large Language Models

Title: Context-Aware Probabilistic Modeling with LLM for Multimodal Time Series Forecasting

Title: A Systematic Analysis of Base Model Choice for Reward Modeling

Title: Completely Weakly Supervised Class-Incremental Learning for Semantic Segmentation

Title: SynRailObs: A Synthetic Dataset for Obstacle Detection in Railway Scenarios

Title: Neural-Inspired Advances in Integral Cryptanalysis

Title: Finetune-RAG: Fine-Tuning Language Models to Resist Hallucination in Retrieval-Augmented Generation

Title: Relation Extraction Across Entire Books to Reconstruct Community Networks: The AffilKG Datasets

Title: Attention-Based Reward Shaping for Sparse and Delayed Rewards

Title: MoCLIP: Motion-Aware Fine-Tuning and Distillation of CLIP for Human Motion Generation

Title: RAN Tester UE: An Automated Declarative UE Centric Security Testing Platform

Title: Enhancing Secrecy Energy Efficiency in RIS-Aided Aerial Mobile Edge Computing Networks: A Deep Reinforcement Learning Approach

Title: Distilled Circuits: A Mechanistic Study of Internal Restructuring in Knowledge Distillation

Title: From Embeddings to Accuracy: Comparing Foundation Models for Radiographic Classification

Title: Enhancing Low-Resource Minority Language Translation with LLMs and Retrieval-Augmented Generation for Cultural Nuances

Title: Multimodal Event Detection: Current Approaches and Defining the New Playground through LLMs and VLMs

Title: LARGO: Latent Adversarial Reflection through Gradient Optimization for Jailbreaking LLMs

Title: RefPose: Leveraging Reference Geometric Correspondences for Accurate 6D Pose Estimation of Unseen Objects

Title: AutoRAN: Weak-to-Strong Jailbreaking of Large Reasoning Models

Title: On DeepSeekMoE: Statistical Benefits of Shared Experts and Normalized Sigmoid Gating

Title: Improving the Data-efficiency of Reinforcement Learning by Warm-starting with LLM

Title: Have Multimodal Large Language Models (MLLMs) Really Learned to Tell the Time on Analog Clocks?

Title: Improve Rule Retrieval and Reasoning with Self-Induction and Relevance ReEstimate

Title: Optimal Allocation of Privacy Budget on Hierarchical Data Release

Title: MultiLink: Multi-class Structure Recovery via Agglomerative Clustering and Model Selection

Title: A Light and Smart Wearable Platform with Multimodal Foundation Model for Enhanced Spatial Reasoning in People with Blindness and Low Vision

Title: Approximation and Generalization Abilities of Score-based Neural Network Generative Models for Sub-Gaussian Distributions

Title: Prior-Guided Diffusion Planning for Offline Reinforcement Learning

Title: PoseBench3D: A Cross-Dataset Analysis Framework for 3D Human Pose Estimation

Title: Multi-Objective Preference Optimization: Improving Human Alignment of Generative Models

Title: CTP: A hybrid CNN-Transformer-PINN model for ocean front forecasting

Title: On the Security Risks of ML-based Malware Detection Systems: A Survey

Title: VISTA: Enhancing Vision-Text Alignment in MLLMs via Cross-Modal Mutual Information Maximization

Title: A Survey on the Safety and Security Threats of Computer-Using Agents: JARVIS or Ultron?

Title: A Dataset for Spatiotemporal-Sensitive POI Question Answering

Title: Physics-informed Temporal Alignment for Auto-regressive PDE Foundation Models

Title: M4-SAR: A Multi-Resolution, Multi-Polarization, Multi-Scene, Multi-Source Dataset and Benchmark for Optical-SAR Fusion Object Detection

Title: Connecting the Dots: A Chain-of-Collaboration Prompting Framework for LLM Agents

Title: Accurate KV Cache Quantization with Outlier Tokens Tracing

Title: GenKnowSub: Improving Modularity and Reusability of LLMs through General Knowledge Subtraction

Title: Privacy-Aware Lifelong Learning

Title: Nosy Layers, Noisy Fixes: Tackling DRAs in Federated Learning Systems using Explainable AI

Title: Semantic Aware Linear Transfer by Recycling Pre-trained Language Models for Cross-lingual Transfer

Title: The Way We Prompt: Conceptual Blending, Neural Dynamics, and Prompt-Induced Transitions in LLMs

Title: Shackled Dancing: A Bit-Locked Diffusion Algorithm for Lossless and Controllable Image Steganography

Title: SubGCache: Accelerating Graph-based RAG with Subgraph-level KV Cache

Title: Relational Graph Transformer

Title: Survey of End-to-End Multi-Speaker Automatic Speech Recognition for Monaural Audio

Title: Group-in-Group Policy Optimization for LLM Agent Training

Title: GenoArmory: A Unified Evaluation Framework for Adversarial Attacks on Genomic Foundation Models

Title: ReaCritic: Large Reasoning Transformer-based DRL Critic-model Scaling For Heterogeneous Networks

Title: Visual Anomaly Detection under Complex View-Illumination Interplay: A Large-Scale Benchmark

Title: DDAE++: Enhancing Diffusion Models Towards Unified Generative and Discriminative Learning

Title: Illusion or Algorithm? Investigating Memorization, Emergence, and Symbolic Processing in In-Context Learning

Title: Reconstructing Syllable Sequences in Abugida Scripts with Incomplete Inputs

Title: Review-Instruct: A Review-Driven Multi-Turn Conversations Generation Method for Large Language Models

Title: Towards Robust and Controllable Text-to-Motion via Masked Autoregressive Diffusion

Title: WildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild?

Title: GoLeash: Mitigating Golang Software Supply Chain Attacks with Runtime Policy Enforcement

Title: Logo-LLM: Local and Global Modeling with Large Language Models for Time Series Forecasting

Title: Rethinking the Mean Teacher Strategy from the Perspective of Self-paced Learning

Title: Informed, but Not Always Improved: Challenging the Benefit of Background Knowledge in GNNs

Title: OntoURL: A Benchmark for Evaluating Large Language Models on Symbolic Ontological Understanding, Reasoning and Learning

Title: CleanPatrick: A Benchmark for Image Data Cleaning

Title: Deep Latent Variable Model based Vertical Federated Learning with Flexible Alignment and Labeling Scenarios

Title: Efficient Attention via Pre-Scoring: Prioritizing Informative Keys in Transformers

Title: NeuralSurv: Deep Survival Analysis with Bayesian Uncertainty Quantification

Title: Side Channel Analysis in Homomorphic Encryption

Title: Assessing the Performance of Analog Training for Transfer Learning

Title: Towards Self-Improvement of Diffusion Models via Group Preference Optimization

Title: Pseudo-Label Quality Decoupling and Correction for Semi-Supervised Instance Segmentation

Title: Addition is almost all you need: Compressing neural networks with double binary factorization

Title: ShiQ: Bringing back Bellman to LLMs

Title: Blockchain-Enabled Decentralized Privacy-Preserving Group Purchasing for Energy Plans

Title: Towards Better Evaluation for Generated Patent Claims

Title: Verifiably Forgotten? Gradient Differences Still Enable Data Reconstruction in Federated Unlearning

Title: Hybrid-Emba3D: Geometry-Aware and Cross-Path Feature Hybrid Enhanced State Space Model for Point Cloud Classification

Title: MAVOS-DD: Multilingual Audio-Video Open-Set Deepfake Detection Benchmark

Title: Deepfake Forensic Analysis: Source Dataset Attribution and Legal Implications of Synthetic Media Manipulation

Title: FairSHAP: Preprocessing for Fairness Through Attribution-Based Data Augmentation

Title: Dual-Balancing for Physics-Informed Neural Networks

Title: FedDuA: Doubly Adaptive Federated Learning

Title: What's Inside Your Diffusion Model? A Score-Based Riemannian Metric to Explore the Data Manifold

Title: PhiNet v2: A Mask-Free Brain-Inspired Vision Foundation Model from Video

Title: One Image is Worth a Thousand Words: A Usability Preservable Text-Image Collaborative Erasing Framework

Title: Fairness-aware Anomaly Detection via Fair Projection

Title: Towards Robust Spiking Neural Networks: Mitigating Heterogeneous Training Vulnerability via Dominant Eigencomponent Projection

Title: Covariance Density Neural Networks

Title: Scaling Reasoning can Improve Factuality in Large Language Models

Title: Human-Aligned Bench: Fine-Grained Assessment of Reasoning Ability in MLLMs vs. Humans

Title: Learning Dense Hand Contact Estimation from Imbalanced Data

Title: Bi-directional Recurrence Improves Transformer in Partially Observable Markov Decision Processes

Title: MPMA: Preference Manipulation Attack Against Model Context Protocol

Title: Attention on the Sphere

Title: SoLoPO: Unlocking Long-Context Capabilities in LLMs via Short-to-Long Preference Optimization

Title: CheX-DS: Improving Chest X-ray Image Classification with Ensemble Learning Based on DenseNet and Swin Transformer

Title: Gaussian Weight Sampling for Scalable, Efficient and Stable Pseudo-Quantization Training

Title: Low-Resource Language Processing: An OCR-Driven Summarization and Translation Pipeline

Title: CompAlign: Improving Compositional Text-to-Image Generation with a Complex Benchmark and Fine-Grained Feedback

Title: Imputation-free and Alignment-free: Incomplete Multi-view Clustering Driven by Consensus Semantic Learning

Title: FALCON: False-Negative Aware Learning of Contrastive Negatives in Vision-Language Pretraining

Title: DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling

Title: NoPE: The Counting Power of Transformers with No Positional Encodings

Title: Minimizing False-Positive Attributions in Explanations of Non-Linear Models

Title: HAPO: Training Language Models to Reason Concisely via History-Aware Policy Optimization

Title: Learning traffic flows: Graph Neural Networks for Metamodelling Traffic Assignment

Title: AW-GATCN: Adaptive Weighted Graph Attention Convolutional Network for Event Camera Data Joint Denoising and Object Recognition

Title: Massive-STEPS: Massive Semantic Trajectories for Understanding POI Check-ins -- Dataset and Benchmarks

Title: Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generation of Diffusion Models

Title: Delta Attention: Fast and Accurate Sparse Attention Inference by Delta Correction

Title: DRAGON: A Large-Scale Dataset of Realistic Images Generated by Diffusion Models

Title: Equal is Not Always Fair: A New Perspective on Hyperspectral Representation Non-Uniformity

Title: Semantic Caching of Contextual Summaries for Efficient Question-Answering with Language Models

Title: Search and Refine During Think: Autonomous Retrieval-Augmented Reasoning of LLMs

Title: Temporal fine-tuning for early risk detection

Title: Bidirectional Information Flow (BIF) -- A Sample Efficient Hierarchical Gaussian Process for Bayesian Optimization

Title: Probing Subphonemes in Morphology Models

Title: Heterogeneity-Aware Client Sampling: A Unified Solution for Consistent Federated Learning

Title: Effective Probabilistic Time Series Forecasting with Fourier Adaptive Noise-Separated Diffusion

Title: Diffusion Learning with Partial Agent Participation and Local Updates

Title: CROC: Evaluating and Training T2I Metrics with Pseudo- and Human-Labeled Contrastive Robustness Checks

Title: Anomaly Detection for Non-stationary Time Series using Recurrent Wavelet Probabilistic Neural Network

Title: MARRS: Masked Autoregressive Unit-based Reaction Synthesis

Title: XtraGPT: LLMs for Human-AI Collaboration on Controllable Academic Paper Revision

Title: Benchmarking Critical Questions Generation: A Challenging Reasoning Task for Large Language Models

Title: Dynamic Base model Shift for Delta Compression

Title: Context parroting: A simple but tough-to-beat baseline for foundation models in scientific machine learning

Title: LegoSLM: Connecting LLM with Speech Encoder using CTC Posteriors

Title: GuideBench: Benchmarking Domain-Oriented Guideline Following for LLM Agents

Title: MutualNeRF: Improve the Performance of NeRF under Limited Samples with Mutual Information Theory

Title: IISE PG&E Energy Analytics Challenge 2025: Hourly-Binned Regression Models Beat Transformers in Load Forecasting

Title: Finding Counterfactual Evidences for Node Classification

Title: Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner

Title: EmotionHallucer: Evaluating Emotion Hallucinations in Multimodal Large Language Models

Title: Visual Planning: Let's Think Only with Images

Title: Is Grokking a Computational Glass Relaxation?

Title: Uncertainty quantification with approximate variational learning for wearable photoplethysmography prediction tasks

Title: CARES: Comprehensive Evaluation of Safety and Adversarial Robustness in Medical LLMs

Title: MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems

Title: When Thinking Fails: The Pitfalls of Reasoning for Instruction-Following in LLMs

Title: Improving Object Detection Performance through YOLOv8: A Comprehensive Training and Evaluation Study

Title: MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production

Title: GODBench: A Benchmark for Multimodal Large Language Models in Video Comment Art

Title: SurgPose: Generalisable Surgical Instrument Pose Estimation using Zero-Shot Learning and Stereo Vision

Title: Is Compression Really Linear with Code Intelligence?

Title: A Generative Framework for Causal Estimation via Importance-Weighted Diffusion Distillation

Title: LLMs unlock new paths to monetizing exploits

Title: HumaniBench: A Human-Centric Framework for Large Multimodal Models Evaluation

Title: ProxyPrompt: Securing System Prompts against Prompt Extraction Attacks

Title: Disentangling Reasoning and Knowledge in Medical Large Language Models

Title: PSDiffusion: Harmonized Multi-Layer Image Generation via Layout and Appearance Alignment

Title: No Gold Standard, No Problem: Reference-Free Evaluation of Taxonomies

Title: HelpSteer3-Preference: Open Human-Annotated Preference Data across Diverse Tasks and Languages

Title: Improving Assembly Code Performance with Large Language Models via Reinforcement Learning

Title: Unsupervised Detection of Distribution Shift in Inverse Problems using Diffusion Models

Title: msf-CNN: Patch-based Multi-Stage Fusion with Convolutional Neural Networks for TinyML

Title: Modeling cognitive processes of natural reading with transformer-based Language Models

Title: QVGen: Pushing the Limit of Quantized Video Generative Models