2025-06-30

Title: VAT-KG: Knowledge-Intensive Multimodal Knowledge Graph Dataset for Retrieval-Augmented Generation

Title: Debunk and Infer: Multimodal Fake News Detection via Diffusion-Generated Evidence and LLM Reasoning

Title: GraphLAMA: Enabling Efficient Adaptation of Graph Language Models with Limited Annotations

Title: Reasoning Isn't Enough: Examining Truth-Bias and Sycophancy in LLMs

Title: FloorPlan-DeepSeek (FPDS): A multimodal approach to floorplan generation using vector-based next room prediction

Title: FormosanBench: Benchmarking Low-Resource Austronesian Languages in the Era of Large Language Models

Title: Team QUST at SemEval-2025 Task 10: Evaluating Large Language Models in Multiclass Multi-label Classification of News Entity Framing

Title: A Multi-Agent Probabilistic Inference Framework Inspired by Kairanban-Style CoT System with IdoBata Conversation for Debiasing

Title: BioPars: A Pretrained Biomedical Large Language Model for Persian Biomedical Text Mining

Title: Assessing RAG and HyDE on 1B vs. 4B-Parameter Gemma LLMs for Personal Assistants Integretion

Title: Hybrid-NL2SVA: Integrating RAG and Finetuning for LLM-based NL2SVA

Title: Towards Understanding the Cognitive Habits of Large Reasoning Models

Title: Aligning MLLM Benchmark With Human Preferences via Structural Equation Modeling

Title: Instruction Learning Paradigms: A Dual Perspective on White-box and Black-box LLMs

Title: Digital Gatekeepers: Exploring Large Language Model's Role in Immigration Decisions

Title: STRuCT-LLM: Unifying Tabular and Graph Reasoning with Reinforcement Learning for Semantic Parsing

Title: Language-Aware Prompt Tuning for Parameter-Efficient Seamless Language Expansion in Multilingual ASR

Title: HealthQA-BR: A System-Wide Benchmark Reveals Critical Knowledge Gaps in Large Language Models

Title: From General Reasoning to Domain Expertise: Uncovering the Limits of Generalization in Large Language Models

Title: VIDEE: Visual and Interactive Decomposition, Execution, and Evaluation of Text Analytics with Intelligent Agents

Title: Hope Speech Detection in code-mixed Roman Urdu tweets: A Positive Turn in Natural Language Processing

Title: Empirical Evidence for Alignment Faking in Small LLMs and Prompt-Based Mitigation Techniques

Title: Evaluation of LLM-based Strategies for the Extraction of Food Product Information from Online Shops

Title: Can Vision Language Models Understand Mimed Actions?

Title: Is DeepSeek a New Voice Among LLMs in Public Opinion Simulation?

Title: Understanding Verbatim Memorization in LLMs Through Circuit Discovery

Title: A General Method for Detecting Information Generated by Large Language Models

Title: Representation Consistency for Accurate and Coherent LLM Answer Aggregation

Title: FinEval-KR: A Financial Domain Evaluation Framework for Large Language Models' Knowledge and Reasoning

Title: SignBart -- New approach with the skeleton sequence for Isolated Sign language Recognition

Title: Gazal-R1: Achieving State-of-the-Art Medical Reasoning with Parameter-Efficient Two-Stage Training

Title: Evaluating Multimodal Large Language Models on Educational Textbook Question Answering

Title: Overview of the ClinIQLink 2025 Shared Task on Medical Question-Answering

Title: Structured Attention Matters to Multimodal LLMs in Document Understanding

Title: BiMark: Unbiased Multilayer Watermarking for Large Language Models

Title: Operationalizing Automated Essay Scoring: A Human-Aware Approach

Title: Large Language Models as symbolic DNA of cultural dynamics

Title: CORE-KG: An LLM-Driven Knowledge Graph Construction Framework for Human Smuggling Networks

Title: From Thinking to Output: Chain-of-Thought and Text Generation Characteristics in Reasoning Language Models

Title: Does Multimodality Lead to Better Time Series Forecasting?

Title: ChildGuard: A Specialized Dataset for Combatting Child-Targeted Hate Speech

Title: LastingBench: Defend Benchmarks Against Knowledge Leakage

Title: Refine Medical Diagnosis Using Generation Augmented Retrieval and Clinical Practice Guidelines

Title: TIM: A Large-Scale Dataset and large Timeline Intelligence Model for Open-domain Timeline Summarization

Title: TrajTok: Technical Report for 2025 Waymo Open Sim Agents Challenge

Title: How Large Language Models play humans in online conversations: a simulated study of the 2016 US politics on Reddit

Title: The Open Proof Corpus: A Large-Scale Study of LLM-Generated Mathematical Proofs

Title: Performance of diverse evaluation metrics in NLP-based assessment and text generation of consumer complaints

Title: Doc2SAR: A Synergistic Framework for High-Fidelity Extraction of Structure-Activity Relationships from Scientific Documents

Title: APO: Enhancing Reasoning Ability of MLLMs via Asymmetric Policy Optimization

Title: TanDiT: Tangent-Plane Diffusion Transformer for High-Quality 360° Panorama Generation

Title: CyGym: A Simulation-Based Game-Theoretic Analysis Framework for Cybersecurity

Title: Unimodal Strategies in Density-Based Clustering

Title: FOCUS: Internal MLLM Representations for Efficient Fine-Grained Visual Question Answering

Title: CAST: Cross-Attentive Spatio-Temporal feature fusion for Deepfake detection

Title: Identifying Speaker Information in Feed-Forward Layers of Self-Supervised Speech Transformers

Title: $\textrm{ODE}_t \left(\textrm{ODE}_l \right)$: Shortcutting the Time and Length in Diffusion and Flow Models for Faster Sampling

Title: Elucidating and Endowing the Diffusion Training Paradigm for General Image Restoration

Title: Exploring Image Generation via Mutually Exclusive Probability Spaces and Local Correlation Hypothesis

Title: Equitable Federated Learning with NCA

Title: Federated Item Response Theory Models

Title: (Fact) Check Your Bias

Title: M3PO: Massively Multi-Task Model-Based Policy Optimization

Title: Evaluating List Construction and Temporal Understanding capabilities of Large Language Models

Title: Multi-task parallelism for robust pre-training of graph foundation models on multi-source, multi-fidelity atomistic modeling data

Title: Offensive Language Detection on Social Media Using XLNet

Title: Towards Transparent AI: A Survey on Explainable Large Language Models

Title: CAT-SG: A Large Dynamic Scene Graph Dataset for Fine-Grained Understanding of Cataract Surgery

Title: Exploring the Structure of AI-Induced Language Change in Scientific English

Title: Few-Shot Segmentation of Historical Maps via Linear Probing of Vision Foundation Models

Title: TaleForge: Interactive Multimodal System for Personalized Story Creation

Title: PrefPaint: Enhancing Image Inpainting through Expert Human Feedback

Title: ProSAM: Enhancing the Robustness of SAM-based Visual Reference Segmentation with Probabilistic Prompts

Title: PARSI: Persian Authorship Recognition via Stylometric Integration

Title: 3D-Telepathy: Reconstructing 3D Objects from EEG Signals

Title: LinguaSynth: Heterogeneous Linguistic Signals for News Classification

Title: The Consistency Hypothesis in Uncertainty Quantification for Large Language Models

Title: End-to-End RGB-IR Joint Image Compression With Channel-wise Cross-modality Entropy Model

Title: LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs

Title: DeepTalk: Towards Seamless and Smart Speech Interaction with Adaptive Modality-Specific MoE

Title: Dual-Perspective United Transformer for Object Segmentation in Optical Remote Sensing Images

Title: Grounding-Aware Token Pruning: Recovering from Drastic Performance Drops in Visual Grounding Caused by Pruning

Title: On the Feasibility of Poisoning Text-to-Image AI Models via Adversarial Mislabeling

Title: WildSpeech-Bench: Benchmarking Audio LLMs in Natural Speech Conversation

Title: A Dual-Layered Evaluation of Geopolitical and Cultural Bias in LLMs

Title: GRASP-PsONet: Gradient-based Removal of Spurious Patterns for PsOriasis Severity Classification

Title: Integrating Multi-Modal Sensors: A Review of Fusion Techniques for Intelligent Vehicles

Title: DIVE: Deep-search Iterative Video Exploration A Technical Report for the CVRR Challenge at CVPR 2025

Title: Exploring Task-Solving Paradigm for Generalized Cross-Domain Face Anti-Spoofing via Reinforcement Fine-Tuning

Title: One Video to Steal Them All: 3D-Printing IP Theft through Optical Side-Channels

Title: TOAST: Task-Oriented Adaptive Semantic Transmission over Dynamic Wireless Environments

Title: RAUM-Net: Regional Attention and Uncertainty-aware Mamba Network

Title: Consumer Beware! Exploring Data Brokers' CCPA Compliance

Title: SepFormer: Coarse-to-fine Separator Regression Network for Table Structure Recognition

Title: SPAZER: Spatial-Semantic Progressive Reasoning Agent for Zero-shot 3D Visual Grounding

Title: HQCM-EBTC: A Hybrid Quantum-Classical Model for Explainable Brain Tumor Classification

Title: GuiderNet: A Meta-Learning Framework for Optimizing Quantum Circuit Geometry and Mitigating Barren Plateaus

Title: SDRNET: Stacked Deep Residual Network for Accurate Semantic Segmentation of Fine-Resolution Remotely Sensed Images

Title: Physics-informed network paradigm with data generation and background noise removal for diverse distributed acoustic sensing applications

Title: Optimal Return-to-Go Guided Decision Transformer for Auto-Bidding in Advertisement

Title: Exploring Semantic Masked Autoencoder for Self-supervised Point Cloud Understanding

Title: PapersPlease: A Benchmark for Evaluating Motivational Values of Large Language Models Based on ERG Theory

Title: More Vulnerable than You Think: On the Stability of Tool-Integrated LLM Agents

Title: Advancing Jailbreak Strategies: A Hybrid Approach to Exploiting LLM Vulnerabilities and Bypassing Modern Defenses

Title: Don't Trust Generative Agents to Mimic Communication on Social Networks Unless You Benchmarked their Empirical Realism

Title: TASeg: Text-aware RGB-T Semantic Segmentation based on Fine-tuning Vision Foundation Models

Title: SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model

Title: R1-Track: Direct Application of MLLMs to Visual Object Tracking via Reinforcement Learning

Title: Analyzing and Fine-Tuning Whisper Models for Multilingual Pilot Speech Transcription in the Cockpit

Title: RoboEnvision: A Long-Horizon Video Generation Model for Multi-Task Robot Manipulation

Title: Cross-modal Ship Re-Identification via Optical and SAR Imagery: A Novel Dataset and Method

Title: Partial CLIP is Enough: Chimera-Seg for Zero-shot Semantic Segmentation

Title: Hyper-modal Imputation Diffusion Embedding with Dual-Distillation for Federated Multimodal Knowledge Graph Completion

Title: Can Peter Pan Survive MT? A Stylometric Study of LLMs, NMTs, and HTs in Children's Literature Translation

Title: Few-Shot Identity Adaptation for 3D Talking Heads via Global Gaussian Field

Title: GPAS: Accelerating Convergence of LLM Pretraining via Gradient-Preserving Activation Scaling

Title: Decoding Machine Translationese in English-Chinese News: LLMs vs. NMTs

Title: Lost at the Beginning of Reasoning

Title: MirrorMe: Towards Realtime and High Fidelity Audio-Driven Halfbody Animation

Title: Transformers are Graph Neural Networks

Title: Tied Prototype Model for Few-Shot Medical Image Segmentation

Title: Pedestrian Intention and Trajectory Prediction in Unstructured Traffic Using IDD-PeD

Title: Low-Rank Implicit Neural Representation via Schatten-p Quasi-Norm and Jacobian Regularization

Title: Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs

Title: RetFiner: A Vision-Language Refinement Scheme for Retinal Foundation Models

Title: Training Language Model to Critique for Better Refinement

Title: Frequency-Semantic Enhanced Variational Autoencoder for Zero-Shot Skeleton-based Action Recognition

Title: Reliability Analysis of Smart Contract Execution Architectures: A Comparative Simulation Study

Title: Exploring Modularity of Agentic Systems for Drug Discovery

Title: dreaMLearning: Data Compression Assisted Machine Learning

Title: Robust and Accurate Multi-view 2D/3D Image Registration with Differentiable X-ray Rendering and Dual Cross-view Constraints

Title: EFRame: Deeper Reasoning via Exploration-Filtering-Replay Reinforcement Learning Framework

Title: Boosting Classification with Quantum-Inspired Augmentations

Title: Projected Compression: Trainable Projection for Efficient Transformer Compression

Title: Rethinking Visual Token Reduction in LVLMs under Cross-modal Misalignment

Title: OutDreamer: Video Outpainting with a Diffusion Transformer

Title: Weakly-Supervised Domain Adaptation with Proportion-Constrained Pseudo-Labeling

Title: Unfolding Generative Flows with Koopman Operators: Fast and Interpretable Sampling

Title: Detection of Personal Data in Structured Datasets Using a Large Language Model

Title: Evaluating Scoring Bias in LLM-as-a-Judge

Title: Under the Hood of BlotchyQuasar: DLL-Based RAT Campaigns Against Latin America

Title: Less Greedy Equivalence Search

Title: MatChA: Cross-Algorithm Matching with Feature Augmentation

Title: A Framework for Multi-source Privacy Preserving Epidemic Analysis

Title: Closing the Performance Gap in Biometric Cryptosystems: A Deeper Analysis on Unlinkable Fuzzy Vaults

Title: From Ground to Air: Noise Robustness in Vision Transformers and CNNs for Event-Based Vehicle Classification with Potential UAV Applications

Title: Why Are Parsing Actions for Understanding Message Hierarchies Not Random?

Title: Sheaf-Based Decentralized Multimodal Learning for Next-Generation Wireless Communication Systems

Title: Exploiting Vision Language Model for Training-Free 3D Point Cloud OOD Detection via Graph Score Propagation

Title: Probabilistic Optimality for Inference-time Scaling

Title: Can Video Large Multimodal Models Think Like Doubters-or Double-Down: A Study on Defeasible Video Entailment

Title: Towards Distributed Neural Architectures

Title: Multi-View Contrastive Learning for Robust Domain Adaptation in Medical Time Series Analysis

Title: Test-Time Consistency in Vision Language Models

Title: QuickSilver -- Speeding up LLM Inference through Dynamic Token Halting, KV Skipping, Contextual Token Fusion, and Adaptive Matryoshka Quantization

Title: Exploration from a Primal-Dual Lens: Value-Incentivized Actor-Critic Methods for Sample-Efficient Online RL

Title: Refining Czech GEC: Insights from a Multi-Experiment Approach

Title: HyperCLOVA X THINK Technical Report

Title: ARMOR: Robust Reinforcement Learning-based Control for UAVs under Physical Attacks

Title: CLoVE: Personalized Federated Learning through Clustering of Loss Vector Embeddings

Title: Shape-for-Motion: Precise and Consistent Video Editing with 3D Proxy