2025-08-18

Title: A2HCoder: An LLM-Driven Coding Agent for Hierarchical Algorithm-to-HDL Translation

Title: PersonaTwin: A Multi-Tier Prompt Conditioning Framework for Generating and Evaluating Personalized Digital Twins

Title: Privacy Enhancement for Gaze Data Using a Noise-Infused Autoencoder

Title: A Survey on Video Temporal Grounding with Multimodal Large Language Model

Title: gpt-oss-120b & gpt-oss-20b Model Card

Title: Modeling and Detecting Company Risks from News: A Case Study in Bloomberg News

Title: VSF: Simple, Efficient, and Effective Negative Guidance in Few-Step Image Generation Models By \underline{V}alue \underline{S}ign \underline{F}lip

Title: ViPE: Video Pose Engine for 3D Geometric Perception

Title: Vision-Only Gaussian Splatting for Collaborative Semantic Occupancy Prediction

Title: Personalized Face Super-Resolution with Identity Decoupling and Fitting

Title: NIRMAL Pooling: An Adaptive Max Pooling Approach with Non-linear Activation for Enhanced Image Classification

Title: Analysis of the Compaction Behavior of Textile Reinforcements in Low-Resolution In-Situ CT Scans via Machine-Learning and Descriptor-Based Methods

Title: IPG: Incremental Patch Generation for Generalized Adversarial Patch Training

Title: MedAtlas: Evaluating LLMs for Multi-Round, Multi-Task Medical Reasoning Across Diverse Imaging Modalities and Clinical Text

Title: Apriel-Nemotron-15B-Thinker

Title: From Promise to Practical Reality: Transforming Diffusion MRI Analysis with Fast Deep Learning Enhancement

Title: Empowering Multimodal LLMs with External Tools: A Comprehensive Survey

Title: CSNR and JMIM Based Spectral Band Selection for Reducing Metamerism in Urban Driving

Title: Retro-Expert: Collaborative Reasoning for Interpretable Retrosynthesis

Title: Rule2Text: A Framework for Generating and Evaluating Natural Language Explanations of Knowledge Graph Rules

Title: BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining

Title: MCP-Guard: A Defense Framework for Model Context Protocol Integrity in Large Language Model Applications

Title: Match & Choose: Model Selection Framework for Fine-tuning Text-to-Image Diffusion Models

Title: Improving Text Style Transfer using Masked Diffusion Language Models with Inference-time Scaling

Title: SproutBench: A Benchmark for Safe and Ethical Large Language Models for Youth

Title: CURE: Critical-Token-Guided Re-concatenation for Entropy-collapse Prevention

Title: Beyond the Rosetta Stone: Unification Forces in Generalization Dynamics

Title: Can Multi-modal (reasoning) LLMs detect document manipulation?

Title: Hell or High Water: Evaluating Agentic Recovery from External Failures

Title: MedSAMix: A Training-Free Model Merging Approach for Medical Image Segmentation

Title: SHLIME: Foiling adversarial attacks fooling SHAP and LIME

Title: BIPOLAR: Polarization-based granular framework for LLM bias evaluation

Title: Data-Driven Abdominal Phenotypes of Type 2 Diabetes in Lean, Overweight, and Obese Cohorts

Title: Approaching the Source of Symbol Grounding with Confluent Reductions of Abstract Meaning Representation Directed Graphs

Title: Abundance-Aware Set Transformer for Microbiome Sample Embedding

Title: Relative Advantage Debiasing for Watch-Time Prediction in Short-Video Recommendation

Title: Compressive Meta-Learning

Title: HierOctFusion: Multi-scale Octree-based 3D Shape Generation via Part-Whole-Hierarchy Message Passing

Title: UWB-PostureGuard: A Privacy-Preserving RF Sensing System for Continuous Ergonomic Sitting Posture Monitoring

Title: Towards Reliable Multi-Agent Systems for Marketing Applications via Reflection, Memory, and Planning

Title: MoNaCo: More Natural and Complex Questions for Reasoning Across Dozens of Documents

Title: Residual-based Efficient Bidirectional Diffusion Model for Image Dehazing and Haze Generation

Title: Towards the Next-generation Bayesian Network Classifiers

Title: LEARN: A Story-Driven Layout-to-Image Generation Framework for STEM Instruction

Title: Mitigating Modality Quantity and Quality Imbalance in Multimodal Online Federated Learning

Title: MobQA: A Benchmark Dataset for Semantic Understanding of Human Mobility Data through Question Answering

Title: Semi-supervised Image Dehazing via Expectation-Maximization and Bidirectional Brownian Bridge Diffusion Models

Title: Overcoming Low-Resource Barriers in Tulu: Neural Models and Corpus Creation for OffensiveLanguage Identification

Title: VFM-Guided Semi-Supervised Detection Transformer for Source-Free Object Detection in Remote Sensing Images

Title: A Semi-supervised Generative Model for Incomplete Multi-view Data Integration with Missing Labels

Title: Versatile Video Tokenization with Generative 2D Gaussian Splatting

Title: Personalized Distractor Generation via MCTS-Guided Reasoning Reconstruction

Title: CHARM3R: Towards Unseen Camera Height Robust Monocular 3D Detector

Title: Quantum-Boosted High-Fidelity Deep Learning

Title: Generating Dialogues from Egocentric Instructional Videos for Task Assistance: Dataset, Method and Benchmark

Title: UAV-VL-R1: Generalizing Vision-Language Models via Supervised Fine-Tuning and Multi-Stage GRPO for UAV Visual Reasoning

Title: E-CaTCH: Event-Centric Cross-Modal Attention with Temporal Consistency and Class-Imbalance Handling for Misinformation Detection

Title: A Coarse-to-Fine Human Pose Estimation Method based on Two-stage Distillation and Progressive Graph Neural Network

Title: Air Quality PM2.5 Index Prediction Model Based on CNN-LSTM

Title: A CLIP-based Uncertainty Modal Modeling (UMM) Framework for Pedestrian Re-Identification in Autonomous Driving

Title: Enhancing Interactive Voting-Based Map Matching: Improving Efficiency and Robustness for Heterogeneous GPS Trajectories

Title: Cross-Granularity Hypergraph Retrieval-Augmented Generation for Multi-hop Question Answering

Title: Graph Neural Diffusion via Generalized Opinion Dynamics

Title: FantasyTalking2: Timestep-Layer Adaptive Preference Optimization for Audio-Driven Portrait Animation

Title: Generalized Decoupled Learning for Enhancing Open-Vocabulary Dense Perception

Title: Group Fairness Meets the Black Box: Enabling Fair Algorithms on Closed LLMs via Post-Processing

Title: UNVEILING: What Makes Linguistics Olympiad Puzzles Tricky for LLMs?

Title: Vision-Language Models display a strong gender bias

Title: Domain-aware Category-level Geometry Learning Segmentation for 3D Point Clouds

Title: Probing the Representational Power of Sparse Autoencoders in Vision Models

Title: Boosting the Robustness-Accuracy Trade-off of SNNs by Robust Temporal Self-Ensemble

Title: LETToT: Label-Free Evaluation of Large Language Models On Tourism Using Expert Tree-of-Thought

Title: ToxiFrench: Benchmarking and Enhancing Language Models via CoT Fine-Tuning for French Toxicity Detection

Title: Unifying Scale-Aware Depth Prediction and Perceptual Priors for Monocular Endoscope Pose Estimation and Tissue Reconstruction

Title: TimeMachine: Fine-Grained Facial Age Editing with Identity Preservation

Title: AI in Mental Health: Emotional and Sentiment Analysis of Large Language Models' Responses to Depression, Anxiety, and Stress Queries

Title: Hyperspectral vs. RGB for Pedestrian Segmentation in Urban Driving Scenes: A Comparative Study

Title: SGSimEval: A Comprehensive Multifaceted and Similarity-Enhanced Benchmark for Automatic Survey Generation Systems

Title: LLM Compression: How Far Can We Go in Balancing Size and Performance?

Title: Delving into Dynamic Scene Cue-Consistency for Robust 3D Multi-Object Tracking

Title: Salty Seagull: A VSAT Honeynet to Follow the Bread Crumb of Attacks in Ship Networks

Title: Noise Matters: Optimizing Matching Noise for Diffusion Classifiers

Title: GANDiff FR: Hybrid GAN Diffusion Synthesis for Causal Bias Attribution in Face Recognition

Title: RegimeNAS: Regime-Aware Differentiable Architecture Search With Theoretical Guarantees for Financial Trading

Title: Index-Aligned Query Distillation for Transformer-based Incremental Object Detection

Title: Semantically Guided Adversarial Testing of Vision Models Using Language Models

Title: SpecDetect: Simple, Fast, and Training-Free Detection of LLM-Generated Text via Spectral Analysis

Title: NeMo: A Neuron-Level Modularizing-While-Training Approach for Decomposing DNN Models

Title: HOID-R1: Reinforcement Learning for Open-World Human-Object Interaction Detection Reasoning with Multimodal Large Language Model

Title: Leveraging the RETFound foundation model for optic disc segmentation in retinal images

Title: ETTRL: Balancing Exploration and Exploitation in LLM Test-Time Reinforcement Learning Via Entropy Mechanism

Title: PTSM: Physiology-aware and Task-invariant Spatio-temporal Modeling for Cross-Subject EEG Decoding

Title: Feedback Indicators: The Alignment between Llama and a Teacher in Language Learning

Title: Does the Skeleton-Recall Loss Really Work?

Title: When Punctuation Matters: A Large-Scale Comparison of Prompt Robustness Methods for LLMs

Title: Retrieval-augmented reasoning with lean language models

Title: Model Interpretability and Rationale Extraction by Input Mask Optimization

Title: Rationalizing Transformer Predictions via End-To-End Differentiable Self-Training

Title: On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting

Title: RMFAT: Recurrent Multi-scale Feature Atmospheric Turbulence Mitigator

Title: SelfAdapt: Unsupervised Domain Adaptation of Cell Segmentation Models

Title: Survey-to-Behavior: Downstream Alignment of Human Values in LLMs via Survey Questions

Title: Training-free Dimensionality Reduction via Feature Truncation: Enhancing Efficiency in Privacy-preserving Multi-Biometric Systems

Title: Generative Co-Design of Antibody Sequences and Structures via Black-Box Guidance in a Shared Latent Space

Title: ImagiDrive: A Unified Imagination-and-Planning Framework for Autonomous Driving

Title: HumorPlanSearch: Structured Planning and HuCoT for Contextual AI Humor

Title: Remove360: Benchmarking Residuals After Object Removal in 3D Gaussian Splatting

Title: Robust Convolution Neural ODEs via Contractivity-promoting regularization

Title: MM-R1: Unleashing the Power of Unified Multimodal Large Language Models for Personalized Image Generation

Title: Online Anti-sexist Speech: Identifying Resistance to Gender Bias in Political Discourse

Title: Multi-Sensory Cognitive Computing for Learning Population-level Brain Connectivity

Title: Inside Knowledge: Graph-based Path Generation with Explainable Data Augmentation and Curriculum Learning for Visual Indoor Navigation

Title: Reference Points in LLM Sentiment Analysis: The Role of Structured Context

Title: Data-Driven Deepfake Image Detection Method -- The 2024 Global Deepfake Image Detection Challenge

Title: CoFi: A Fast Coarse-to-Fine Few-Shot Pipeline for Glomerular Basement Membrane Segmentation

Title: RMSL: Weakly-Supervised Insider Threat Detection with Robust Multi-sphere Learning

Title: TACR-YOLO: A Real-time Detection Framework for Abnormal Human Behaviors Enhanced with Coordinate and Task-Aware Representations

Title: OpenConstruction: A Systematic Synthesis of Open Visual Datasets for Data-Centric Artificial Intelligence in Construction Monitoring

Title: CineTrans: Learning to Generate Videos with Cinematic Transitions via Masked Diffusion Models

Title: Automated Building Heritage Assessment Using Street-Level Imagery

Title: KV-Auditor: Auditing Local Differential Privacy for Correlated Key-Value Estimation

Title: Hierarchical Graph Feature Enhancement with Adaptive Frequency Modulation for Visual Recognition

Title: Handwritten Text Recognition of Historical Manuscripts Using Transformer-Based Models

Title: AIM: Amending Inherent Interpretability via Self-Supervised Masking

Title: Predicting and Explaining Traffic Crash Severity Through Crash Feature Selection

Title: Towards Faithful Class-level Self-explainability in Graph Neural Networks by Subgraph Dependencies

Title: A Real-time Concrete Crack Detection and Segmentation Model Based on YOLOv11

Title: Physics-Informed Diffusion Models for Unsupervised Anomaly Detection in Multivariate Time Series

Title: DFed-SST: Building Semantic- and Structure-aware Topologies for Decentralized Federated Graph Learning

Title: Multi-State Tracker: Enhancing Efficient Object Tracking via Multi-State Specialization and Interaction

Title: An Efficient Medical Image Classification Method Based on a Lightweight Improved ConvNeXt-Tiny Architecture

Title: Speciesism in AI: Evaluating Discrimination Against Animals in Large Language Models

Title: Reinforcing Video Reasoning Segmentation to Think Before It Segments

Title: Copyright Protection for Large Language Models: A Survey of Methods, Challenges, and Trends

Title: Training-Free Anomaly Generation via Dual-Attention Enhancement in Diffusion Model

Title: Pushing the Limits of Frequency Analysis in Leakage Abuse Attacks

Title: AgentMental: An Interactive Multi-Agent Framework for Explainable and Adaptive Mental Health Assessment

Title: TrajSV: A Trajectory-based Model for Sports Video Representations and Applications

Title: Activate Me!: Designing Efficient Activation Functions for Privacy-Preserving Machine Learning with Fully Homomorphic Encryption

Title: Aware First, Think Less: Dynamic Boundary Self-Awareness Drives Extreme Reasoning Efficiency in Large Language Models

Title: CryptoScope: Utilizing Large Language Models for Automated Cryptographic Logic Vulnerability Detection

Title: CoreEditor: Consistent 3D Editing via Correspondence-constrained Diffusion

Title: Dataset Creation for Visual Entailment using Generative AI

Title: TinyTim: A Family of Language Models for Divergent Generation

Title: Controlling Multimodal LLMs via Reward-guided Decoding

Title: Optimal CO2 storage management considering safety constraints in multi-stakeholder multi-site CCS projects: a game theoretic perspective

Title: LoRAtorio: An intrinsic approach to LoRA Skill Composition

Title: Is ChatGPT-5 Ready for Mammogram VQA?