2025-09-22

Title: Pre-Forgettable Models: Prompt Learning as a Native Mechanism for Unlearning

Title: Exploring the Capabilities of LLM Encoders for Image-Text Retrieval in Chest X-rays

Title: ViSpec: Accelerating Vision-Language Models with Vision-Aware Speculative Decoding

Title: M-PACE: Mother Child Framework for Multimodal Compliance

Title: ProFusion: 3D Reconstruction of Protein Complex Structures from Multi-view AFM Images

Title: Multi-Modal Interpretability for Enhanced Localization in Vision-Language Models

Title: Walk and Read Less: Improving the Efficiency of Vision-and-Language Navigation via Tuning-Free Multimodal Token Pruning

Title: Comparative Analysis of Tokenization Algorithms for Low-Resource Language Dzongkha

Title: RespoDiff: Dual-Module Bottleneck Transformation for Responsible & Faithful T2I Generation

Title: Generative AI Meets Wireless Sensing: Towards Wireless Foundation Model

Title: IEFS-GMB: Gradient Memory Bank-Guided Feature Selection Based on Information Entropy for EEG Classification of Neurological Disorders

Title: Toxicity Red-Teaming: Benchmarking LLM Safety in Singapore's Low-Resource Languages

Title: A Weak Supervision Approach for Monitoring Recreational Drug Use Effects in Social Media

Title: Autoguided Online Data Curation for Diffusion Model Training

Title: Modeling Transformers as complex networks to analyze learning dynamics

Title: PRISM: Phase-enhanced Radial-based Image Signature Mapping framework for fingerprinting AI-generated images

Title: Large Vision Models Can Solve Mental Rotation Problems

Title: Which Direction to Choose? An Analysis on the Representation Power of Self-Supervised ViTs in Downstream Tasks

Title: Fleming-R1: Toward Expert-Level Medical Reasoning via Reinforcement Learning

Title: Kuramoto Orientation Diffusion Models

Title: CoDoL: Conditional Domain Prompt Learning for Out-of-Distribution Generalization

Title: Emulating Human-like Adaptive Vision for Efficient and Flexible Machine Visual Perception

Title: PolBiX: Detecting LLMs' Political Bias in Fact-Checking through X-phemisms

Title: Quantifying Self-Awareness of Knowledge in Large Language Models

Title: LowDiff: Efficient Diffusion Sampling with Low-Resolution Condition

Title: Real, Fake, or Manipulated? Detecting Machine-Influenced Text

Title: Predicting Language Models' Success at Zero-Shot Probabilistic Prediction

Title: MaskAttn-SDXL: Controllable Region-Level Text-To-Image Generation

Title: Beyond Spurious Signals: Debiasing Multimodal Large Language Models via Counterfactual Inference and Adaptive Expert Routing

Title: Stochastic Sample Approximations of (Local) Moduli of Continuity

Title: Adversarial generalization of unfolding (model-based) networks

Title: RaceGAN: A Framework for Preserving Individuality while Converting Racial Information for Image-to-Image Translation

Title: VMDNet: Time Series Forecasting with Leakage-Free Samplewise Variational Mode Decomposition and Multibranch Decoding

Title: Quantifying Uncertainty in Natural Language Explanations of Large Language Models for Question Answering

Title: Causal Fingerprints of AI Generative Models

Title: NeuroRAD-FM: A Foundation Model for Neuro-Oncology with Distributionally Robust Training

Title: Deep learning and abstractive summarisation for radiological reports: an empirical study for adapting the PEGASUS models' family with scarce data

Title: Random Matrix Theory-guided sparse PCA for single-cell RNA-seq data

Title: Synergizing Static Analysis with Large Language Models for Vulnerability Discovery and beyond

Title: ORCA: Agentic Reasoning For Hallucination and Adversarial Robustness in Vision-Language Models

Title: PILOT: Steering Synthetic Data Generation with Psychological & Linguistic Output Targeting

Title: Hierarchical Self-Attention: Generalizing Neural Attention Mechanics to Multi-Scale Problems

Title: IMPQ: Interaction-Aware Layerwise Mixed Precision Quantization for LLMs

Title: CAGE: Continuity-Aware edGE Network Unlocks Robust Floorplan Reconstruction

Title: Temporal Reasoning with Large Language Models Augmented by Evolving Knowledge Graphs

Title: Efficient Multimodal Dataset Distillation via Generative Models

Title: Evaluating Multimodal Large Language Models on Spoken Sarcasm Understanding

Title: Red Teaming Multimodal Language Models: Evaluating Harm Across Prompt Modalities and Models

Title: Solar Forecasting with Causality: A Graph-Transformer Approach to Spatiotemporal Dependencies

Title: Comparing Computational Pathology Foundation Models using Representational Similarity Analysis

Title: SmolRGPT: Efficient Spatial Reasoning for Warehouse Environments with 600M Parameters

Title: Lynx: Towards High-Fidelity Personalized Video Generation

Title: Backdoor Mitigation via Invertible Pruning Masks

Title: Mental Accounts for Actions: EWA-Inspired Attention in Decision Transformers

Title: Adversarially Robust Assembly Language Model for Packed Executables Detection

Title: KoopCast: Trajectory Forecasting via Koopman Operators

Title: How do Language Models Generate Slang: A Systematic Comparison between Human and Machine-Generated Slang Usages

Title: SAMPO:Scale-wise Autoregression with Motion PrOmpt for generative world models

Title: Enhancing Sa2VA for Referent Video Object Segmentation: 2nd Solution for 7th LSVOS RVOS Track

Title: A method for improving multilingual quality and diversity of instruction fine-tuning datasets

Title: DNA-DetectLLM: Unveiling AI-Generated Text via a DNA-Inspired Mutation-Repair Paradigm

Title: PolyJuice Makes It Real: Black-Box, Universal Red Teaming for Synthetic Image Detectors

Title: Diffusion-Based Cross-Modal Feature Extraction for Multi-Label Classification

Title: Hybrid Deep Learning-Federated Learning Powered Intrusion Detection System for IoT/5G Advanced Edge Computing Network

Title: Exploring Polyglot Harmony: On Multilingual Data Allocation for Large Language Models Pretraining

Title: Reward Hacking Mitigation using Verifiable Composite Rewards

Title: From Development to Deployment of AI-assisted Telehealth and Screening for Vision- and Hearing-threatening diseases in resource-constrained settings: Field Observations, Challenges and Way Forward

Title: Small LLMs with Expert Blocks Are Good Enough for Hyperparamter Tuning

Title: DC-Mamba: Bi-temporal deformable alignment and scale-sparse enhancement for remote sensing change detection

Title: BTL-UI: Blink-Think-Link Reasoning Model for GUI Agent

Title: LiteLong: Resource-Efficient Long-Context Data Synthesis for LLMs

Title: Cuckoo Attack: Stealthy and Persistent Attacks Against AI-IDE

Title: Relevance to Utility: Process-Supervised Rewrite for RAG

Title: Multimodal Learning for Fake News Detection in Short Videos Using Linguistically Verified Data and Heterogeneous Modality Fusion

Title: DivLogicEval: A Framework for Benchmarking Logical Reasoning Evaluation in Large Language Models

Title: Latent Zoning Network: A Unified Principle for Generative Modeling, Representation Learning, and Classification

Title: Personalized Prediction By Learning Halfspace Reference Classes Under Well-Behaved Distribution

Title: EyePCR: A Comprehensive Benchmark for Fine-Grained Perception, Knowledge Comprehension and Clinical Reasoning in Ophthalmic Surgery

Title: TennisTV: Do Multimodal Large Language Models Understand Tennis Rallies?

Title: Enhancing WSI-Based Survival Analysis with Report-Auxiliary Self-Distillation

Title: SciEvent: Benchmarking Multi-domain Scientific Event Extraction

Title: Concept Unlearning in Large Language Models via Self-Constructed Knowledge Triplets

Title: PCSR: Pseudo-label Consistency-Guided Sample Refinement for Noisy Correspondence Learning

Title: Sparse-Autoencoder-Guided Internal Representation Unlearning for Large Language Models

Title: pFedSAM: Personalized Federated Learning of Segment Anything Model for Medical Image Segmentation

Title: Information Geometry of Variational Bayes

Title: UNIV: Unified Foundation Model for Infrared and Visible Modalities

Title: Future-Proofing Cloud Security Against Quantum Attacks: Risk, Transition, and Mitigation Strategies

Title: Layer-wise Minimal Pair Probing Reveals Contextual Grammatical-Conceptual Hierarchy in Speech Representations

Title: VOX-KRIKRI: Unifying Speech and Language through Continuous Fusion

Title: Inference Offloading for Cost-Sensitive Binary Classification at the Edge

Title: KITE: Kernelized and Information Theoretic Exemplars for In-Context Learning

Title: Layout Stroke Imitation: A Layout Guided Handwriting Stroke Generation for Style Imitation with Diffusion Model

Title: SCENEFORGE: Enhancing 3D-text alignment with Structured Scene Compositions

Title: Inference Attacks on Encrypted Online Voting via Traffic Analysis

Title: Toward Medical Deepfake Detection: A Comprehensive Dataset and Novel Method

Title: Once Upon a Time: Interactive Learning for Storytelling with Small Language Models

Title: REFER: Mitigating Bias in Opinion Summarisation via Frequency Framed Prompting

Title: Flying Drones to Locate Cyber-Attackers in LoRaWAN Metropolitan Networks

Title: EigenTrack: Spectral Activation Feature Tracking for Hallucination and Out-of-Distribution Detection in LLMs and VLMs

Title: GUI-ReWalk: Massive Data Generation for GUI Agent via Stochastic Exploration and Intent-Aware Reasoning

Title: Can LLMs Judge Debates? Evaluating Non-Linear Reasoning via Argumentation Theory Semantics

Title: TrueMoE: Dual-Routing Mixture of Discriminative Experts for Synthetic Image Detection

Title: FloorSAM: SAM-Guided Floorplan Reconstruction with Semantic-Geometric Fusion

Title: MCOD: The First Challenging Benchmark for Multispectral Camouflaged Object Detection

Title: An Adversarial Robust Behavior Sequence Anomaly Detection Approach Based on Critical Behavior Unit Learning

Title: On Optimal Steering to Achieve Exact Fairness

Title: UniGist: Towards General and Hardware-aligned Sequence-level Long Context Compression

Title: Learning to Optimize Capacity Planning in Semiconductor Manufacturing

Title: Overview of PlantCLEF 2024: multi-species plant identification in vegetation plot images

Title: Vision-Language Models as Differentiable Semantic and Spatial Rewards for Text-to-3D Generation

Title: Enriched Feature Representation and Motion Prediction Module for MOSEv2 Track of 7th LSVOS Challenge: 3rd Place Solution

Title: Ideal Registration? Segmentation is All You Need

Title: TASAM: Terrain-and-Aware Segment Anything Model for Temporal-Scale Remote Sensing Segmentation

Title: Monte Carlo Tree Diffusion with Multiple Experts for Protein Design

Title: CIDER: A Causal Cure for Brand-Obsessed Text-to-Image Models

Title: Best-of-L: Cross-Lingual Reward Modeling for Mathematical Reasoning

Title: SolarCrossFormer: Improving day-ahead Solar Irradiance Forecasting by Integrating Satellite Imagery and Ground Sensors

Title: Multi-Physics: A Comprehensive Benchmark for Multimodal LLMs Reasoning on Chinese Multi-Subject Physics Problems

Title: FedHK-MVFC: Federated Heat Kernel Multi-View Clustering

Title: ToFU: Transforming How Federated Learning Systems Forget User Data

Title: SAGE: Semantic-Aware Shared Sampling for Efficient Diffusion

Title: LC-SLab -- An Object-based Deep Learning Framework for Large-scale Land Cover Classification from Satellite Imagery and Sparse In-situ Labels

Title: ENSAM: an efficient foundation model for interactive segmentation of 3D medical images

Title: Self-Supervised Cross-Modal Learning for Image-to-Point Cloud Registration

Title: RangeSAM: Leveraging Visual Foundation Models for Range-View repesented LiDAR segmentation

Title: Distribution-Aligned Decoding for Efficient LLM Task Adaptation

Title: Global Regulation and Excitation via Attention Tuning for Stereo Matching

Title: The Psychology of Falsehood: A Human-Centric Survey of Misinformation Detection

Title: Re-FRAME the Meeting Summarization SCOPE: Fact-Based Summarization and Personalization via Questions

Title: Deep Feedback Models

Title: Foundation Models as World Models: A Foundational Study in Text-Based GridWorlds

Title: Beyond the Score: Uncertainty-Calibrated LLMs for Automated Essay Assessment

Title: Enhancing Generative Auto-bidding with Offline Reward Evaluation and Policy Search

Title: Improving Monte Carlo Tree Search for Symbolic Regression

Title: The Alignment Bottleneck

Title: Bayesian Physics Informed Neural Networks for Reliable Transformer Prognostics

Title: UniTac2Pose: A Unified Approach Learned in Simulation for Category-level Visuotactile In-hand Pose Estimation

Title: PAN: Pillars-Attention-Based Network for 3D Object Detection

Title: Targeted Fine-Tuning of DNN-Based Receivers via Influence Functions

Title: Adversarial Graph Fusion for Incomplete Multi-view Semi-supervised Learning with Tensorial Imputation

Title: Localmax dynamics for attention in transformers and its asymptotic behavior

Title: A multi-temporal multi-spectral attention-augmented deep convolution neural network with contrastive learning for crop yield prediction

Title: BEFT: Bias-Efficient Fine-Tuning of Language Models

Title: Shedding Light on Depth: Explainability Assessment in Monocular Depth Estimation

Title: Uncertainty-Based Smooth Policy Regularisation for Reinforcement Learning with Few Demonstrations

Title: Towards Robust Visual Continual Learning with Multi-Prototype Supervision

Title: DistillMatch: Leveraging Knowledge Distillation from Vision Foundation Model for Multimodal Image Matching

Title: Session-Level Spoken Language Assessment with a Multimodal Foundation Model via Multi-Target Learning

Title: Think, Verbalize, then Speak: Bridging Complex Thoughts and Comprehensible Speech

Title: A High-performance Real-time Container File Monitoring Approach Based on Virtual Machine Introspection

Title: GLip: A Global-Local Integrated Progressive Framework for Robust Visual Speech Recognition

Title: ConCap: Practical Network Traffic Generation for Flow-based Intrusion Detection Systems

Title: Language-Instructed Reasoning for Group Activity Detection via Multimodal Large Language Model

Title: SABER: Uncovering Vulnerabilities in Safety Alignment via Cross-Layer Residual Connection

Title: Communications to Circulations: 3D Wind Field Retrieval and Real-Time Prediction Using 5G GNSS Signals and Deep Learning

Title: Rethinking Molecule Synthesizability with Chain-of-Reaction

Title: See&Trek: Training-Free Spatial Prompting for Multimodal Large Language Model

Title: Randomized Smoothing Meets Vision-Language Models

Title: Blind-Spot Guided Diffusion for Self-supervised Real-World Denoising

Title: SegDINO3D: 3D Instance Segmentation Empowered by Both Image-Level and Object-Level 2D Features

Title: Personalized Federated Learning with Heat-Kernel Enhanced Tensorized Multi-View Clustering

Title: It Depends: Resolving Referential Ambiguity in Minimal Contexts with Commonsense Knowledge

Title: CodeRAG: Finding Relevant and Necessary Knowledge for Retrieval-Augmented Repository-Level Code Completion

Title: DiffusionNFT: Online Diffusion Reinforcement with Forward Process

Title: RadarGaussianDet3D: An Efficient and Effective Gaussian-based 3D Detector with 4D Automotive Radars

Title: Network-Based Detection of Autism Spectrum Disorder Using Sustainable and Non-invasive Salivary Biomarkers

Title: BaseReward: A Strong Baseline for Multimodal Reward Model

Title: Dynamic Classifier-Free Diffusion Guidance via Online Feedback

Title: Spatio-temporal, multi-field deep learning of shock propagation in meso-structured media

Title: AcT2I: Evaluating and Improving Action Depiction in Text-to-Image Models

Title: Pointing to a Llama and Call it a Camel: On the Sycophancy of Multimodal Large Language Models

Title: Automated Cyber Defense with Generalizable Graph-based Reinforcement Learning Agents

Title: Robust Vision-Language Models via Tensor Decomposition: A Defense Against Adversarial Attacks

Title: UniMRSeg: Unified Modality-Relax Segmentation via Hierarchical Self-Supervised Compensation

Title: Fast OTSU Thresholding Using Bisection Method

Title: CultureScope: A Dimensional Lens for Probing Cultural Understanding in LLMs

Title: MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

Title: RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation

Title: Inverting Trojans in LLMs