2025-07-24

Title: TD-Interpreter: Enhancing the Understanding of Timing Diagrams with Visual-Language Learning

Title: Post-Disaster Affected Area Segmentation with a Vision Transformer (ViT)-based EVAP Model using Sentinel-2 and Formosat-5 Imagery

Title: Toward a Real-Time Framework for Accurate Monocular 3D Human Pose Estimation with Geometric Priors

Title: Coarse-to-fine crack cue for robust crack detection

Title: SynthCTI: LLM-Driven Synthetic CTI Generation to enhance MITRE Technique Mapping

Title: Look Before You Fuse: 2D-Guided Cross-Modal Alignment for Robust 3D Detection

Title: Pixels, Patterns, but No Poetry: To See The World like Humans

Title: Reinforcement Learning in hyperbolic space for multi-step reasoning

Title: Diffusion-Modeled Reinforcement Learning for Carbon and Risk-Aware Microgrid Optimization

Title: Building a robust OAuth token based API Security: A High level Overview

Title: CompLeak: Deep Learning Model Compression Exacerbates Privacy Leakage

Title: HIPPO-Video: Simulating Watch Histories with Large Language Models for Personalized Video Highlighting

Title: ReMeREC: Relation-aware and Multi-entity Referring Expression Comprehension

Title: CausalStep: A Benchmark for Explicit Stepwise Causal Reasoning in Videos

Title: Finding Dori: Memorization in Text-to-Image Diffusion Models Is Less Local Than Assumed

Title: SplitMeanFlow: Interval Splitting Consistency in Few-Step Generative Modeling

Title: Sparser2Sparse: Single-shot Sparser-to-Sparse Learning for Spatial Transcriptomics Imputation with Natural Image Co-learning

Title: Revisiting Pre-trained Language Models for Vulnerability Detection

Title: SiLQ: Simple Large Language Model Quantization-Aware Training

Title: AURA: A Multi-Modal Medical Agent for Understanding, Reasoning & Annotation

Title: AI-based Clinical Decision Support for Primary Care: A Real-World Study

Title: Harnessing RLHF for Robust Unanswerability Recognition and Trustworthy Response Generation in LLMs

Title: Evaluating Ensemble and Deep Learning Models for Static Malware Detection with Dimensionality Reduction Using the EMBER Dataset

Title: Leveraging Synthetic Data for Question Answering with Multilingual LLMs in the Agricultural Domain

Title: Obscured but Not Erased: Evaluating Nationality Bias in LLMs via Name-Based Bias Benchmarks

Title: PyG 2.0: Scalable Learning on Real World Graphs

Title: From Cracks to Crooks: YouTube as a Vector for Malware Distribution

Title: Divisive Decisions: Improving Salience-Based Training for Generalization in Binary Classification Tasks

Title: Should Bias Always be Eliminated? A Principled Framework to Use Data Bias for OOD Generation

Title: Bringing Balance to Hand Shape Classification: Mitigating Data Imbalance Through Generative Models

Title: Multi-Label Classification with Generative AI Models in Healthcare: A Case Study of Suicidality and Risk Factors

Title: Towards Trustworthy AI: Secure Deepfake Detection using CNNs and Zero-Knowledge Proofs

Title: Can External Validation Tools Improve Annotation Quality for LLM-as-a-Judge?

Title: Causal Graph Fuzzy LLMs: A First Introduction and Applications in Time Series Forecasting

Title: GATEBLEED: Exploiting On-Core Accelerator Power Gating for High Performance & Stealthy Attacks on AI

Title: Transformer Based Building Boundary Reconstruction using Attraction Field Maps

Title: Controllable Hybrid Captioner for Improved Long-form Video Understanding

Title: Toward Scalable Video Narration: A Training-free Approach Using Multimodal Large Language Models

Title: Pragmatic Policy Development via Interpretable Behavior Cloning

Title: SoK: Securing the Final Frontier for Cybersecurity in Space-Based Infrastructure

Title: Risk In Context: Benchmarking Privacy Leakage of Foundation Models in Synthetic Tabular Data Generation

Title: Advancing Robustness in Deep Reinforcement Learning with an Ensemble Defense Approach

Title: Sensor Drift Compensation in Electronic-Nose-Based Gas Recognition Using Knowledge Distillation

Title: Analysis of Post-Quantum Cryptography in User Equipment in 5G and Beyond

Title: SDGOCC: Semantic and Depth-Guided Bird's-Eye View Transformation for 3D Multimodal Occupancy Prediction

Title: FedVLM: Scalable Personalized Vision-Language Models through Federated Learning

Title: IONext: Unlocking the Next Era of Inertial Odometry

Title: Reinforcement Learning Fine-Tunes a Sparse Subnetwork in Large Language Models

Title: Probabilistic Graphical Models: A Concise Tutorial

Title: Robust Five-Class and binary Diabetic Retinopathy Classification Using Transfer Learning and Data Augmentation

Title: Enabling Self-Improving Agents to Learn at Test Time With Human-In-The-Loop Guidance

Title: SADA: Stability-guided Adaptive Diffusion Acceleration

Title: CogDual: Enhancing Dual Cognition of LLMs via Reinforcement Learning with Implicit Rule-Based Rewards

Title: ScSAM: Debiasing Morphology and Distributional Variability in Subcellular Semantic Segmentation

Title: DOOMGAN:High-Fidelity Dynamic Identity Obfuscation Ocular Generative Morphing

Title: Tabular Diffusion based Actionable Counterfactual Explanations for Network Intrusion Detection

Title: SKA-Bench: A Fine-Grained Benchmark for Evaluating Structured Knowledge Understanding of LLMs

Title: A Privacy-Preserving Data Collection Method for Diversified Statistical Analysis

Title: Hierarchical Fusion and Joint Aggregation: A Multi-Level Feature Representation Method for AIGC Image Quality Assessment

Title: Vec2Face+ for Face Dataset Generation

Title: Threshold-Protected Searchable Sharing: Privacy Preserving Aggregated-ANN Search for Collaborative RAG

Title: DesignLab: Designing Slides Through Iterative Detection and Correction

Title: Filter-And-Refine: A MLLM Based Cascade System for Industrial-Scale Video Content Moderation

Title: VBCD: A Voxel-Based Framework for Personalized Dental Crown Design

Title: The Pluralistic Moral Gap: Understanding Judgment and Value Differences between Humans and Large Language Models

Title: PIG-Nav: Key Insights for Pretrained Image Goal Navigation Models

Title: Dataset Distillation as Data Compression: A Rate-Utility Perspective

Title: P3SL: Personalized Privacy-Preserving Split Learning on Heterogeneous Edge Devices

Title: CLARIFID: Improving Radiology Report Generation by Reinforcing Clinically Accurate Impressions and Enforcing Detailed Findings

Title: MaskedCLIP: Bridging the Masked and CLIP Space for Semi-Supervised Medical Vision-Language Pre-training

Title: Perceptual Classifiers: Detecting Generative Images using Perceptual Features

Title: Eco-Friendly AI: Unleashing Data Power for Green Federated Learning

Title: DistrAttention: An Efficient and Flexible Self-Attention Mechanism on Modern GPUs

Title: Rethinking VAE: From Continuous to Discrete Representations Without Probabilistic Assumptions

Title: Tab-MIA: A Benchmark Dataset for Membership Inference Attacks on Tabular Data in LLMs

Title: PolarAnything: Diffusion-based Polarimetric Image Synthesis

Title: Leveraging Knowledge Graphs and LLM Reasoning to Identify Operational Bottlenecks for Warehouse Planning Assistance

Title: Fully Automated SAM for Single-source Domain Generalization in Medical Image Segmentation

Title: Decentralized Federated Learning of Probabilistic Generative Classifiers

Title: Triple X: A LLM-Based Multilingual Speech Recognition System for the INTERSPEECH2025 MLC-SLM Challenge

Title: PointLAMA: Latent Attention meets Mamba for Efficient Point Cloud Pretraining

Title: R-Stitch: Dynamic Trajectory Stitching for Efficient Reasoning

Title: CasP: Improving Semi-Dense Feature Matching Pipeline Leveraging Cascaded Correspondence Priors for Guidance

Title: An Empirical Study on Virtual Reality Software Security Weaknesses

Title: CartoonAlive: Towards Expressive Live2D Modeling from Single Portraits

Title: PARTE: Part-Guided Texturing for 3D Human Reconstruction from a Single Image

Title: Temporal Point-Supervised Signal Reconstruction: A Human-Annotation-Free Framework for Weak Moving Target Detection

Title: TransLPRNet: Lite Vision-Language Network for Single/Dual-line Chinese License Plate Recognition

Title: Swin-TUNA : A Novel PEFT Approach for Accurate Food Image Segmentation

Title: TOC-UCO: a comprehensive repository of tabular ordinal classification datasets

Title: Exploring Active Learning for Semiconductor Defect Segmentation

Title: DynaSearcher: Dynamic Knowledge Graph Augmented Search Agent via Multi-Reward Reinforcement Learning

Title: Exploring Spatial Diversity for Region-based Active Learning

Title: ViRN: Variational Inference and Distribution Trilateration for Long-Tailed Continual Representation Learning

Title: Continual Generalized Category Discovery: Learning and Forgetting from a Bayesian Perspective

Title: A Zero-overhead Flow for Security Closure

Title: EndoGen: Conditional Autoregressive Endoscopic Video Generation

Title: HiProbe-VAD: Video Anomaly Detection via Hidden States Probing in Tuning-Free Multimodal LLMs

Title: Physics-based Human Pose Estimation from a Single Moving RGB Camera

Title: Content-based 3D Image Retrieval and a ColBERT-inspired Re-ranking for Tumor Flagging and Staging

Title: A Comprehensive Evaluation on Quantization Techniques for Large Language Models

Title: CAPRI-CT: Causal Analysis and Predictive Reasoning for Image Quality Optimization in Computed Tomography

Title: Each to Their Own: Exploring the Optimal Embedding in RAG

Title: C3RL: Rethinking the Combination of Channel-independence and Channel-mixing from Representation Learning

Title: VLM-Guided Visual Place Recognition for Planet-Scale Geo-Localization

Title: Dynamic Scoring with Enhanced Semantics for Training-Free Human-Object Interaction Detection

Title: ERMV: Editing 4D Robotic Multi-view images to enhance embodied agents

Title: BGM-HAN: A Hierarchical Attention Network for Accurate and Fair Decision Assessment on Semi-Structured Profiles

Title: MultiNRC: A Challenging and Native Multilingual Reasoning Evaluation Benchmark for LLMs

Title: Unsupervised anomaly detection using Bayesian flow networks: application to brain FDG PET in the context of Alzheimer's disease

Title: Active Attack Resilience in 5G: A New Take on Authentication and Key Agreement

Title: DNT: a Deeply Normalized Transformer that can be trained by Momentum SGD

Title: Illicit object detection in X-ray imaging using deep learning techniques: A comparative evaluation

Title: Accelerating Parallel Diffusion Model Serving with Residual Compression

Title: HOTA: Hamiltonian framework for Optimal Transport Advection

Title: URPO: A Unified Reward & Policy Optimization Framework for Large Language Models

Title: Frequency Estimation of Correlated Multi-attribute Data under Local Differential Privacy

Title: Enabling Cyber Security Education through Digital Twins and Generative AI

Title: Generalized Advantage Estimation for Distributional Policy Gradients

Title: Multi-modal Multi-task Pre-training for Improved Point Cloud Understanding

Title: Federated Majorize-Minimization: Beyond Parameter Aggregation

Title: An h-space Based Adversarial Attack for Protection Against Few-shot Personalization

Title: Boosting Ray Search Procedure of Hard-label Attacks with Transfer-based Priors

Title: Synthetic Voice Data for Automatic Speech Recognition in African Languages

Title: Enhancing Quantum Federated Learning with Fisher Information-Based Optimization

Title: Dual-branch Prompting for Multimodal Machine Translation

Title: PRIX: Learning to Plan from Raw Pixels for End-to-End Autonomous Driving

Title: InvRGB+L: Inverse Rendering of Complex Scenes with Unified Color and LiDAR Reflectance Modeling

Title: Vision Transformer attention alignment with human visual perception in aesthetic object evaluation

Title: Reusing Attention for One-stage Lane Topology Understanding

Title: A Hybrid Early-Exit Algorithm for Large Language Models Based on Space Alignment Decoding (SPADE)

Title: Quantifying the ROI of Cyber Threat Intelligence: A Data-Driven Approach

Title: Who Attacks, and Why? Using LLMs to Identify Negative Campaigning in 18M Tweets across 19 Countries

Title: XStacking: Explanation-Guided Stacked Ensemble Learning

Title: CNS-Bench: Benchmarking Image Classifier Robustness Under Continuous Nuisance Shifts

Title: Rethinking HSM and TPM Security in the Cloud: Real-World Attacks and Next-Gen Defenses

Title: Attention (as Discrete-Time Markov) Chains

Title: See the Forest and the Trees: A Synergistic Reasoning Framework for Knowledge-Based Visual Question Answering

Title: Monocular Semantic Scene Completion via Masked Recurrent Networks

Title: Talk2Event: Grounded Understanding of Dynamic Scenes from Event Cameras

Title: Perspective-Invariant 3D Object Detection

Title: How Should We Meta-Learn Reinforcement Learning Algorithms?

Title: Generalized Dual Discriminator GANs

Title: Towards Effective Open-set Graph Class-incremental Learning

Title: Joint Asymmetric Loss for Learning with Noisy Labels

Title: Towards Greater Leverage: Scaling Laws for Efficient Mixture-of-Experts Language Models

Title: HydraOpt: Navigating the Efficiency-Performance Trade-off of Adapter Merging

Title: From Feedback to Checklists: Grounded Evaluation of AI-Generated Clinical Notes

Title: AI Telephone Surveying: Automating Quantitative Data Collection with an AI Interviewer

Title: BetterCheck: Towards Safeguarding VLMs for Automotive Perception Systems

Title: On the Interaction of Compressibility and Adversarial Robustness

Title: Megrez2 Technical Report

Title: Flow Matching Meets Biology and Life Science: A Survey

Title: Yume: An Interactive World Generation Model

Title: Ultra3D: Efficient and High-Fidelity 3D Generation with Part Attention

Title: Rubrics as Rewards: Reinforcement Learning Beyond Verifiable Domains

Title: Pretraining on the Test Set Is No Longer All You Need: A Debate-Driven Approach to QA Benchmarks

Title: Large Learning Rates Simultaneously Achieve Robustness to Spurious Correlations and Compressibility