2025-07-08

Title: Loki's Dance of Illusions: A Comprehensive Survey of Hallucination in Large Language Models

Title: Learning to Generate Vectorized Maps at Intersections with Multiple Roadside Cameras

Title: Advancing Talking Head Generation: A Comprehensive Survey of Multi-Modal Methodologies, Datasets, Evaluation Metrics, and Loss Functions

Title: Controllable diffusion-based generation for multi-channel biological data

Title: Harnessing Near-Infrared Spectroscopy and Machine Learning for Traceable Classification of Hanwoo and Holstein Beef

Title: Enhancing Sports Strategy with Video Analytics and Data Mining: Assessing the effectiveness of Multimodal LLMs in tennis video analysis

Title: Enhancing Sports Strategy with Video Analytics and Data Mining: Automated Video-Based Analytics Framework for Tennis Doubles

Title: Scaling Transformers for Time Series Forecasting: Do Pretrained Large Models Outperform Small-Scale Alternatives?

Title: Hyperbolic Kernel Graph Neural Networks for Neurocognitive Decline Analysis from Multimodal Brain Imaging

Title: Efficient Certified Reasoning for Binarized Neural Networks

Title: Echo State Transformer: When chaos brings memory

Title: ChatGPT is not A Man but Das Man: Representativeness and Structural Consistency of Silicon Samples Generated by Large Language Models

Title: Domain Knowledge in Artificial Intelligence: Using Conceptual Modeling to Increase Machine Learning Accuracy and Explainability

Title: Modeling Urban Food Insecurity with Google Street View Images

Title: Large Language Model Agent for Modular Task Execution in Drug Discovery

Title: A Unified Speech LLM for Diarization and Speech Recognition in Multilingual Conversations

Title: Mitigating Hidden Confounding by Progressive Confounder Imputation via Large Language Models

Title: MolProphecy: Bridging Medicinal Chemists' Knowledge and Molecular Pre-Trained Models via a Multi-Modal Framework

Title: Theory of Mind in Action: The Instruction Inference Task

Title: FoGE: Fock Space inspired encoding for graph prompting

Title: A Large Language Model-Empowered Agent for Reliable and Robust Structural Analysis

Title: Frequency-Aligned Knowledge Distillation for Lightweight Spatiotemporal Forecasting

Title: Towards a Comparative Framework for Compositional AI Models

Title: GameTileNet: A Semantic Dataset for Low-Resolution Game Art in Procedural Content Generation

Title: Beyond Parallelism: Synergistic Computational Graph Effects in Multi-Head Attention

Title: Iterative Zoom-In: Temporal Interval Exploration for Long Video Understanding

Title: The Application of Large Language Models on Major Depressive Disorder Support Based on African Natural Products

Title: RADIANT: Retrieval AugmenteD entIty-context AligNmenT -- Introducing RAG-ability and Entity-Context Divergence

Title: Evaluating AI Counseling in Japanese: Counselor, Client, and Evaluator Roles Assessed by Motivational Interviewing Criteria

Title: Bittensor Protocol: The Bitcoin in Decentralized Artificial Intelligence? A Critical and Empirical Analysis

Title: Advanced Financial Reasoning at Scale: A Comprehensive Evaluation of Large Language Models on CFA Level III

Title: A Representation Engineering Perspective on the Effectiveness of Multi-Turn Jailbreaks

Title: CS-VLM: Compressed Sensing Attention for Efficient Vision-Language Representation Learning

Title: Real-World En Call Center Transcripts Dataset with PII Redaction

Title: A Novel Active Learning Approach to Label One Million Unknown Malware Variants

Title: RAG-R1: Incentivize the Search and Reasoning Capabilities of LLMs through Multi-query Parallelism

Title: VR-YOLO: Enhancing PCB Defect Detection with Viewpoint Robustness Based on YOLO

Title: Less Data, More Security: Advancing Cybersecurity LLMs Specialization via Resource-Efficient Domain-Adaptive Continuous Pre-training with Minimal Tokens

Title: Concept-based Adversarial Attack: a Probabilistic Perspective

Title: PB-LLMs: Privacy- and Bias-aware NLP Models using Named-Entity Recognition

Title: YOLO-Based Pipeline Monitoring in Challenging Visual Environments

Title: Unveiling Privacy Policy Complexity: An Exploratory Study Using Graph Mining, Machine Learning, and Natural Language Processing

Title: Reinforcement Learning for Automated Cybersecurity Penetration Testing

Title: Aim High, Stay Private: Differentially Private Synthetic Data Enables Public Release of Behavioral Health Information with High Utility

Title: Farm-Level, In-Season Crop Identification for India

Title: InvisibleInk: High-Utility and Low-Cost Text Generation with Differential Privacy

Title: Introducing Answered with Evidence -- a framework for evaluating whether LLM responses to biomedical questions are founded in evidence

Title: Are AI-Generated Fixes Secure? Analyzing LLM and Agent Patches on SWE-bench

Title: Iterative Misclassification Error Training (IMET): An Optimized Neural Network Training Technique for Image Classification

Title: We Need Knowledge Distillation for Solving Math Word Problems

Title: Truth, Trust, and Trouble: Medical AI on the Edge

Title: From Answers to Rationales: Self-Aligning Multimodal Reasoning with Answer-Oriented Chain-of-Thought

Title: Gated Recursive Fusion: A Stateful Approach to Scalable Multimodal Transformers

Title: GAF-Guard: An Agentic Framework for Risk Management and Governance in Large Language Models

Title: A Comparative Study of Competency Question Elicitation Methods from Ontology Requirements

Title: 'For Argument's Sake, Show Me How to Harm Myself!': Jailbreaking LLMs in Suicide and Self-Harm Contexts

Title: Physics Augmented Machine Learning Discovery of Composition-Dependent Constitutive Laws for 3D Printed Digital Materials

Title: Enabling Robust, Real-Time Verification of Vision-Based Navigation through View Synthesis

Title: MedGround-R1: Advancing Medical Image Grounding via Spatial-Semantic Rewarded Group Relative Policy Optimization

Title: FreqCross: A Multi-Modal Frequency-Spatial Fusion Network for Robust Detection of Stable Diffusion 3.5 Generated Images

Title: Text-Guided Multi-Instance Learning for Scoliosis Screening via Gait Video Analysis

Title: What to Do Next? Memorizing skills from Egocentric Instructional Video

Title: A Weakly Supervised Transformer to Support Rare Disease Diagnosis from Electronic Health Records: Methods and Applications in Rare Pulmonary Disease

Title: Evaluating Hierarchical Clinical Document Classification Using Reasoning-Based LLMs

Title: Breaking Physical and Linguistic Borders: Multilingual Federated Prompt Tuning for Low-Resource Languages

Title: CLUES: Collaborative High-Quality Data Selection for LLMs via Training Dynamics

Title: Topological Signatures vs. Gradient Histograms: A Comparative Study for Medical Image Classification

Title: PDFMathTranslate: Scientific Document Translation Preserving Layouts

Title: Intrinsic Fingerprint of LLMs: Continue Training is NOT All You Need to Steal A Model!

Title: Beyond Overcorrection: Evaluating Diversity in T2I Models with DIVBENCH

Title: OpenTable-R1: A Reinforcement Learning Augmented Tool Agent for Open-Domain Table Question Answering

Title: Look-Back: Implicit Visual Re-focusing in MLLM Reasoning

Title: A Multi-Resolution Dynamic Game Framework for Cross-Echelon Decision-Making in Cyber Warfare

Title: Generalized Adaptive Transfer Network: Enhancing Transfer Learning in Reinforcement Learning Across Domains

Title: The Book of Life approach: Enabling richness and scale for life course research

Title: Preserving Privacy, Increasing Accessibility, and Reducing Cost: An On-Device Artificial Intelligence Model for Medical Transcription and Note Generation

Title: Rethinking Data Protection in the (Generative) Artificial Intelligence Era

Title: Optimas: Optimizing Compound AI Systems with Globally Aligned Local Rewards

Title: Dynamic Long Short-Term Memory Based Memory Storage For Long Horizon LLM Interaction

Title: Counterfactual Tuning for Temporal Sensitivity Enhancement in Large Language Model-based Recommendation

Title: Monitoring of Static Fairness

Title: Improving LLM Reasoning for Vulnerability Detection via Group Relative Policy Optimization

Title: From 2:4 to 8:16 sparsity patterns in LLMs for Outliers and Weights with Variance Correction

Title: LATTE: Latent Trajectory Embedding for Diffusion-Generated Image Detection

Title: Automated Grading of Students' Handwritten Graphs: A Comparison of Meta-Learning and Vision-Large Language Models

Title: BERT4Traj: Transformer Based Trajectory Reconstruction for Sparse Mobility Data

Title: LLM-Driven Auto Configuration for Transient IoT Device Collaboration

Title: Cycle-Consistent Helmholtz Machine: Goal-Seeded Simulation via Inverted Inference

Title: Large Language Models for Automating Clinical Data Standardization: HL7 FHIR Use Case

Title: Mitigating Goal Misgeneralization with Minimax Regret

Title: ARF-RLHF: Adaptive Reward-Following for RLHF through Emotion-Driven Self-Supervision and Trace-Biased Dynamic Optimization

Title: RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents

Title: BLaST: High Performance Inference and Pretraining using BLock Sparse Transformers

Title: How Overconfidence in Initial Choices and Underconfidence Under Criticism Modulate Change of Mind in Large Language Models

Title: ReliableMath: Benchmark of Reliable Mathematical Reasoning on Large Language Models

Title: Holographic Projection and Cyber Attack Surface: A Physical Analogy for Digital Security

Title: From Measurement to Mitigation: Exploring the Transferability of Debiasing Approaches to Gender Bias in Maltese Language Models

Title: Set Valued Predictions For Robust Domain Generalization

Title: HGCA: Hybrid GPU-CPU Attention for Long Context LLM Inference

Title: PiCME: Pipeline for Contrastive Modality Evaluation and Encoding in the MIMIC Dataset

Title: Adversarial Manipulation of Reasoning Models using Internal Representations

Title: Adopting a human developmental visual diet yields robust, shape-based AI vision

Title: Latent Thermodynamic Flows: Unified Representation Learning and Generative Modeling of Temperature-Dependent Behaviors from Limited Data

Title: How Much Content Do LLMs Generate That Induces Cognitive Bias in Users?

Title: DistZO2: High-Throughput and Memory-Efficient Zeroth-Order Fine-tuning LLMs with Distributed Parallel Computing

Title: Development of an Improved Capsule-Yolo Network for Automatic Tomato Plant Disease Early Detection and Diagnosis

Title: Neural Inhibition Improves Dynamic Routing and Mixture of Experts

Title: On Jailbreaking Quantized Language Models Through Fault Injection Attacks

Title: A Vision-Based Closed-Form Solution for Measuring the Rotation Rate of an Object by Tracking One Point

Title: KinyaColBERT: A Lexically Grounded Retrieval Model for Low-Resource Retrieval-Augmented Generation

Title: RefineX: Learning to Refine Pre-training Data at Scale from Expert-Guided Programs

Title: LACONIC: A 3D Layout Adapter for Controllable Image Creation

Title: Novel Blockchain-based Protocols for Electronic Voting and Auctions

Title: Investigating Redundancy in Multimodal Large Language Models with Multiple Vision Encoders

Title: Dual-frequency Selected Knowledge Distillation with Statistical-based Sample Rectification for PolSAR Image Classification

Title: ConceptMix++: Leveling the Playing Field in Text-to-Image Benchmarking via Iterative Prompt Optimization

Title: Securing Transformer-based AI Execution via Unified TEE and Crypto-protected Accelerators

Title: Conformal Information Pursuit for Interactively Guiding Large Language Models

Title: NOVO: Unlearning-Compliant Vision Transformers

Title: MolVision: Molecular Property Prediction with Vision Language Models

Title: Global Variational Inference Enhanced Robust Domain Adaptation

Title: MGAA: Multi-Granular Adaptive Allocation for Low-Rank Compression of LLMs

Title: CPKD: Clinical Prior Knowledge-Constrained Diffusion Models for Surgical Phase Recognition in Endoscopic Submucosal Dissection

Title: LRM-1B: Towards Large Routing Model

Title: Leveraging Out-of-Distribution Unlabeled Images: Semi-Supervised Semantic Segmentation with an Open-Vocabulary Model

Title: Bridging Domain Generalization to Multimodal Domain Generalization via Unified Representations

Title: MGSfM: Multi-Camera Geometry Driven Global Structure-from-Motion

Title: ReTimeCausal: EM-Augmented Additive Noise Models for Interpretable Causal Discovery in Irregular Time Series

Title: GRAFT: A Graph-based Flow-aware Agentic Framework for Document-level Machine Translation

Title: MPX: Mixed Precision Training for JAX

Title: Personalized Image Generation from an Author Writing Style

Title: Structure-Aware Compound-Protein Affinity Prediction via Graph Neural Network with Group Lasso Regularization

Title: Source-Free Domain Adaptation via Multi-view Contrastive Learning

Title: A Note on Single-Cut Full-Open Protocols

Title: Mirror in the Model: Ad Banner Image Generation via Reflective Multi-LLM and Multi-modal Agents

Title: Read Quietly, Think Aloud: Decoupling Comprehension and Reasoning in LLMs

Title: Task-Specific Generative Dataset Distillation with Difficulty-Guided Sampling

Title: De-Fake: Style based Anomaly Deepfake Detection

Title: Degrees of Freedom for Linear Attention: Distilling Softmax Attention with Optimal Feature Efficiency

Title: SHNU Multilingual Conversational Speech Recognition System for INTERSPEECH 2025 MLC-SLM Challenge

Title: Securing Mixed Rust with Hardware Capabilities

Title: Accelerating Private Heavy Hitter Detection on Continual Observation Streams

Title: Action Robust Reinforcement Learning via Optimal Adversary Aware Policy Optimization

Title: WETBench: A Benchmark for Detecting Task-Specific Machine-Generated Text on Wikipedia

Title: MRC-DETR: An Adaptive Multi-Residual Coupled Transformer for Bare Board PCB Defect Detection

Title: Breaking the Bulkhead: Demystifying Cross-Namespace Reference Vulnerabilities in Kubernetes Operators

Title: Masked Temporal Interpolation Diffusion for Procedure Planning in Instructional Videos

Title: Pose-Star: Anatomy-Aware Editing for Open-World Fashion Images

Title: Graph Repairs with Large Language Models: An Empirical Study

Title: A Hybrid Game-Theory and Deep Learning Framework for Predicting Tourist Arrivals via Big Data Analytics and Opinion Leader Detection

Title: SMCLM: Semantically Meaningful Causal Language Modeling for Autoregressive Paraphrase Generation

Title: Rectifying Adversarial Sample with Low Entropy Prior for Test-Time Defense

Title: Multi-Level Fusion Graph Neural Network for Molecule Property Prediction

Title: Improving Social Determinants of Health Documentation in French EHRs Using Large Language Models

Title: Unlearning the Noisy Correspondence Makes CLIP More Robust

Title: Radar Tracker: Moving Instance Tracking in Sparse and Noisy Radar Point Clouds

Title: Evaluating the Evaluators: Trust in Adversarial Robustness Tests

Title: Helping CLIP See Both the Forest and the Trees: A Decomposition and Description Approach

Title: Radar Velocity Transformer: Single-scan Moving Object Segmentation in Noisy Radar Point Clouds

Title: Beyond Weaponization: NLP Security for Medium and Lower-Resourced Languages in Their Own Right

Title: Molecular Machine Learning Using Euler Characteristic Transforms

Title: Four Shades of Life Sciences: A Dataset for Disinformation Detection in the Life Sciences

Title: Reinforcement Learning-based Feature Generation Algorithm for Scientific Data

Title: Decoupled Relative Learning Rate Schedules

Title: Multimodal Alignment with Cross-Attentive GRUs for Fine-Grained Video Understanding

Title: CLOT: Closed Loop Optimal Transport for Unsupervised Action Segmentation

Title: Foundation versus Domain-specific Models: Performance Comparison, Fusion, and Explainability in Face Recognition

Title: H2HTalk: Evaluating Large Language Models as Emotional Companion

Title: Communication Efficient, Differentially Private Distributed Optimization using Correlation-Aware Sketching

Title: An Advanced Deep Learning Framework for Ischemic and Hemorrhagic Brain Stroke Diagnosis Using Computed Tomography (CT) Images

Title: 2.5D Object Detection for Intelligent Roadside Infrastructure

Title: Causal-SAM-LLM: Large Language Models as Causal Reasoners for Robust Medical Segmentation

Title: Kinetic Langevin Diffusion for Crystalline Materials Generation

Title: VLAI: A RoBERTa-Based Model for Automated Vulnerability Severity Classification

Title: EMERGE: A Benchmark for Updating Knowledge Graphs with Emerging Textual Knowledge

Title: Blackbox Dataset Inference for LLM

Title: SecureT2I: No More Unauthorized Manipulation on AI Generated Images from Prompts

Title: When There Is No Decoder: Removing Watermarks from Stable Diffusion Models in a No-box Setting

Title: When Network Architecture Meets Physics: Deep Operator Learning for Coupled Multiphysics

Title: Re-Emergent Misalignment: How Narrow Fine-Tuning Erodes Safety Alignment in LLMs

Title: TRACE: Training and Inference-Time Interpretability Analysis for Language Models

Title: Recon, Answer, Verify: Agents in Search of Truth

Title: TACOS: Open Tagging and Comparative Scoring for Instruction Fine-Tuning Data Selection

Title: STRUCTSENSE: A Task-Agnostic Agentic Framework for Structured Information Extraction with Human-In-The-Loop Evaluation and Benchmarking

Title: Plugging Attention into Power Grids: Towards Transparent Forecasting

Title: Willchain: Decentralized, Privacy-Preserving, Self-Executing, Digital Wills

Title: SAMed-2: Selective Memory Enhanced Medical Segment Anything Model

Title: Sign Spotting Disambiguation using Large Language Models

Title: Can LLMs Play Ô Ăn Quan Game? A Study of Multi-Step Planning and Decision Making

Title: Predicting Business Angel Early-Stage Decision Making Using AI

Title: MemOS: A Memory OS for AI System

Title: FAROS: Fair Graph Generation via Attribute Switching Mechanisms

Title: Outdoor Monocular SLAM with Global Scale-Consistent 3D Gaussian Pointmaps

Title: Flow-Anchored Consistency Models

Title: ChestGPT: Integrating Large Language Models and Vision Transformers for Disease Detection and Localization in Chest X-Rays

Title: StreamDiT: Real-Time Streaming Text-to-Video Generation

Title: Skewed Score: A statistical framework to assess autograders

Title: RVISmith: Fuzzing Compilers for RVV Intrinsics

Title: Alpay Algebra IV: Symbiotic Semantics and the Fixed-Point Convergence of Observer Embeddings

Title: FastDINOv2: Frequency Based Curriculum Learning Improves Robustness and Training Speed

Title: Zero Memory Overhead Approach for Protecting Vision Transformer Parameters

Title: IMPACT: Importance-Aware Activation Space Reconstruction

Title: Query-Based Adaptive Aggregation for Multi-Dataset Joint Training Toward Universal Visual Place Recognition

Title: Regularizing Log-Linear Cost Models for Inpatient Stays by Merging ICD-10 Codes

Title: Interpretable Diffusion Models with B-cos Networks

Title: KEA Explain: Explanations of Hallucinations using Graph Kernel Analysis

Title: OrbitAll: A Unified Quantum Mechanical Representation Deep Learning Framework for All Molecular Systems

Title: Transformer with Koopman-Enhanced Graph Convolutional Network for Spatiotemporal Dynamics Forecasting

Title: Enhanced accuracy through ensembling of randomly initialized auto-regressive models for time-dependent PDEs

Title: OrthoRank: Token Selection via Sink Token Orthogonality for Efficient LLM inference

Title: Enhancing Adaptive Behavioral Interventions with LLM Inference from Participant-Described States

Title: Demystifying ChatGPT: How It Masters Genre Recognition

Title: Hierarchical Semantic-Visual Fusion of Visible and Near-infrared Images for Long-range Haze Removal

Title: GenAI-Powered Inference

Title: Transformer Model for Alzheimer's Disease Progression Prediction Using Longitudinal Visit Sequences

Title: Bridging Vision and Language: Optimal Transport-Driven Radiology Report Generation via LLMs

Title: Return of the Latent Space COWBOYS: Re-thinking the use of VAEs for Bayesian Optimisation of Structured Spaces

Title: Learning Disentangled Stain and Structural Representations for Semi-Supervised Histopathology Segmentation

Title: DNF-Intrinsic: Deterministic Noise-Free Diffusion for Indoor Inverse Rendering

Title: Losing our Tail -- Again: On (Un)Natural Selection And Multilingual Large Language Models

Title: VISC: mmWave Radar Scene Flow Estimation using Pervasive Visual-Inertial Supervision

Title: A Modular Unsupervised Framework for Attribute Recognition from Unstructured Text

Title: Evaluating Adversarial Protections for Diffusion Personalization: A Comprehensive Study

Title: Robust Low-light Scene Restoration via Illumination Transition

Title: CoT-Segmenter: Enhancing OOD Detection in Dense Road Scenes via Chain-of-Thought Reasoning

Title: MalVol-25: A Diverse, Labelled and Detailed Volatile Memory Dataset for Malware Detection and Response Testing and Validation

Title: NRSeg: Noise-Resilient Learning for BEV Semantic Segmentation via Driving World Models

Title: Seamlessly Integrating Tree-Based Positional Embeddings into Transformer Models for Source Code Representation

Title: Group-wise Scaling and Orthogonal Decomposition for Domain-Invariant Feature Extraction in Face Anti-Spoofing

Title: Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents

Title: Nunchi-Bench: Benchmarking Language Models on Cultural Reasoning with a Focus on Korean Superstition

Title: Habitat Classification from Ground-Level Imagery Using Deep Neural Networks

Title: Exploring Kolmogorov-Arnold Network Expansions in Vision Transformers for Mitigating Catastrophic Forgetting in Continual Learning

Title: LLMThinkBench: Towards Basic Math Reasoning and Overthinking in Large Language Models

Title: Benchmarking Stochastic Approximation Algorithms for Fairness-Constrained Training of Deep Neural Networks

Title: PresentAgent: Multimodal Agent for Presentation Video Generation

Title: T-SYNTH: A Knowledge-Based Dataset of Synthetic Breast Images

Title: Breaking Imitation Bottlenecks: Reinforced Diffusion Powers Diverse Trajectory Generation

Title: Predictive Modeling of Effluent Temperature in SAT Systems Using Ambient Meteorological Data: Implications for Infiltration Management

Title: Generate, Refine, and Encode: Leveraging Synthesized Novel Samples for On-the-Fly Fine-Grained Category Discovery

Title: Rethinking and Exploring String-Based Malware Family Classification in the Era of LLMs and RAG

Title: Attributing Data for Sharpness-Aware Minimization

Title: Consistent and Invariant Generalization Learning for Short-video Misinformation Detection

Title: Beyond Independent Passages: Adaptive Passage Combination Retrieval for Retrieval Augmented Open-Domain Question Answering

Title: Accurate and Efficient World Modeling with Masked Latent Transformers

Title: S-Leak: Leakage-Abuse Attack Against Efficient Conjunctive SSE via s-term Leakage

Title: Conversation Forests: The Key to Fine Tuning Large Language Models for Multi-Turn Medical Conversations is Branching

Title: Hierarchical Testing with Rabbit Optimization for Industrial Cyber-Physical Systems

Title: Human-Centered Interactive Anonymization for Privacy-Preserving Machine Learning: A Case for Human-Guided k-Anonymity

Title: Addressing The Devastating Effects Of Single-Task Data Poisoning In Exemplar-Free Continual Learning

Title: Integrated Gaussian Processes for Robust and Adaptive Multi-Object Tracking

Title: PromptSR: Cascade Prompting for Lightweight Image Super-Resolution

Title: When Data-Free Knowledge Distillation Meets Non-Transferable Teacher: Escaping Out-of-Distribution Trap is All You Need

Title: Towards Accurate and Efficient 3D Object Detection for Autonomous Driving: A Mixture of Experts Computing System on Edge

Title: Graph Neural Networks as a Substitute for Transformers in Single-Cell Transcriptomics

Title: BlowPrint: Blow-Based Multi-Factor Biometrics for Smartphone User Authentication

Title: BYOKG-RAG: Multi-Strategy Graph Retrieval for Knowledge Graph Question Answering

Title: Token Level Hallucination Detection via Variance in Language Models

Title: Dissecting Clinical Reasoning in Language Models: A Comparative Study of Prompts and Model Adaptation Strategies

Title: Large Language Models for Zero-Shot Multicultural Name Recognition

Title: Unlocking Compositional Control: Self-Supervision for LVLM-Based Image Generation

Title: LVLM-Composer's Explicit Planning for Image Generation

Title: Uncertainty Quantification in the Tsetlin Machine

Title: skfolio: Portfolio Optimization in Python

Title: SymbolicThought: Integrating Language Models and Symbolic Reasoning for Consistent and Interpretable Human Relationship Understanding

Title: ML-Enhanced AES Anomaly Detection for Real-Time Embedded Security

Title: An explicit formulation of the learned noise predictor $ε_θ({\bf x}_t, t)$ via the forward-process noise $ε_{t}$ in denoising diffusion probabilistic models (DDPMs)

Title: Quick Bypass Mechanism of Zero-Shot Diffusion-Based Image Restoration

Title: Can Large Language Models Automate the Refinement of Cellular Network Specifications?

Title: DreamPoster: A Unified Framework for Image-Conditioned Generative Poster Design

Title: Model Collapse Is Not a Bug but a Feature in Machine Unlearning for LLMs

Title: Context Tuning for In-Context Optimization

Title: Fairness Evaluation of Large Language Models in Academic Library Reference Services

Title: Zero-Shot Cyclic Peptide Design with Composable Geometric Conditions

Title: Hijacking JARVIS: Benchmarking Mobile GUI Agents against Unprivileged Third Parties

Title: Scaling Context Requires Rethinking Attention

Title: Domain Generalizable Portrait Style Transfer

Title: Just Enough Shifts: Mitigating Over-Refusal in Aligned Language Models with Targeted Representation Fine-Tuning

Title: MoReMouse: Monocular Reconstruction of Laboratory Mouse

Title: An Explainable Transformer Model for Alzheimer's Disease Detection Using Retinal Imaging

Title: ZERO: Multi-modal Prompt-based Visual Grounding

Title: VOLTRON: Detecting Unknown Malware Using Graph-Based Zero-Shot Learning

Title: SeqTex: Generate Mesh Textures in Video Sequence

Title: M$^3$-Med: A Benchmark for Multi-lingual, Multi-modal, and Multi-hop Reasoning in Medical Instructional Video Understanding

Title: MPQ-DMv2: Flexible Residual Mixed Precision Quantization for Low-Bit Diffusion Models with Temporal Distillation

Title: QF: Quick Feedforward AI Model Training without Gradient Back Propagation

Title: Exploring Remote Physiological Signal Measurement under Dynamic Lighting Conditions at Night: Dataset, Experiment, and Analysis

Title: Heterogeneous Federated Learning with Prototype Alignment and Upscaling

Title: TinyProto: Communication-Efficient Federated Learning with Sparse Prototypes in Resource-Constrained Environments

Title: No Language Data Left Behind: A Comparative Study of CJK Language Datasets in the Hugging Face Ecosystem

Title: Computed Tomography Visual Question Answering with Cross-modal Feature Graphing

Title: Large Language Models' Varying Accuracy in Recognizing Risk-Promoting and Health-Supporting Sentiments in Public Health Discourse: The Cases of HPV Vaccination and Heated Tobacco Products

Title: Attention Slipping: A Mechanistic Understanding of Jailbreak Attacks and Defenses in LLMs

Title: Adaptive Malware Detection using Sequential Feature Selection: A Dueling Double Deep Q-Network (D3QN) Framework for Intelligent Classification

Title: Multi-Modal Semantic Parsing for the Interpretation of Tombstone Inscriptions

Title: Transferring Visual Explainability of Self-Explaining Models through Task Arithmetic

Title: Tractable Representation Learning with Probabilistic Circuits

Title: Comprehensive Information Bottleneck for Unveiling Universal Attribution to Interpret Vision Transformers

Title: Does Learning Mathematical Problem-Solving Generalize to Broader Reasoning?

Title: RegistrationMamba: A Mamba-based Registration Framework Integrating Multi-Expert Feature Learning for Cross-Modal Remote Sensing Images

Title: Sat2City: 3D City Generation from A Single Satellite Image with Cascaded Latent Diffusion

Title: MVNet: Hyperspectral Remote Sensing Image Classification Based on Hybrid Mamba-Transformer Vision Backbone Architecture

Title: Multimedia Verification Through Multi-Agent Deep Research Multimodal Large Language Models

Title: THM@SimpleText 2025 -- Task 1.1: Revisiting Text Simplification based on Complex Terms for Non-Experts

Title: MOMENTS: A Comprehensive Multimodal Benchmark for Theory of Mind

Title: RAT: Bridging RNN Efficiency and Attention Accuracy in Language Modeling

Title: Enhancing Phishing Detection in Financial Systems through NLP

Title: Tail-aware Adversarial Attacks: A Distributional Approach to Efficient LLM Jailbreaking

Title: DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge

Title: CoT-lized Diffusion: Let's Reinforce T2I Generation Step-by-step

Title: ESSA: Evolutionary Strategies for Scalable Alignment

Title: GradOT: Training-free Gradient-preserving Offsite-tuning for Large Language Models

Title: UniAud: A Unified Auditing Framework for High Auditing Power and Utility with One Training Run

Title: Arbiter PUF: Uniqueness and Reliability Analysis Using Hybrid CMOS-Stanford Memristor Model

Title: Model Inversion Attacks on Llama 3: Extracting PII from Large Language Models

Title: Source Attribution in Retrieval-Augmented Generation

Title: Dealing with Uncertainty in Contextual Anomaly Detection

Title: Machine Learning-Based Prediction of Metal-Organic Framework Materials: A Comparative Analysis of Multiple Models

Title: README: Robust Error-Aware Digital Signature Framework via Deep Watermarking Model

Title: LINE: Public-key encryption

Title: U-ViLAR: Uncertainty-Aware Visual Localization for Autonomous Driving via Differentiable Association and Registration

Title: Unveiling the Potential of Diffusion Large Language Model in Controllable Generation

Title: MVL-Loc: Leveraging Vision-Language Model for Generalizable Multi-Scene Camera Relocalization

Title: DOTResize: Reducing LLM Width via Discrete Optimal Transport-based Neuron Merging

Title: A Data-Driven Novelty Score for Diverse In-Vehicle Data Recording

Title: DP-Fusion: Token-Level Differentially Private Inference for Large Language Models

Title: MambaVideo for Discrete Video Tokenization with Channel-Split Quantization

Title: Evaluating LLMs on Real-World Forecasting Against Human Superforecasters

Title: Nile-Chat: Egyptian Language Models for Arabic and Latin Scripts

Title: S$^2$Edit: Text-Guided Image Editing with Precise Semantic and Spatial Control

Title: CVFusion: Cross-View Fusion of 4D Radar and Camera for 3D Object Detection

Title: Photon Splatting: A Physics-Guided Neural Surrogate for Real-Time Wireless Channel Prediction

Title: QR-LoRA: Efficient and Disentangled Fine-tuning via QR Decomposition for Customized Generation

Title: PRIME: Large Language Model Personalization with Cognitive Memory and Thought Processes

Title: any4: Learned 4-bit Numeric Representation for LLMs

Title: Information-Guided Diffusion Sampling for Dataset Distillation

Title: Multimodal LLM Integrated Semantic Communications for 6G Immersive Experiences

Title: Knowledge-Aware Self-Correction in Language Models via Structured Memory Graphs

Title: Learning Robust Stereo Matching in the Wild with Selective Mixture-of-Experts

Title: LTMSformer: A Local Trend-Aware Attention and Motion State Encoding Transformer for Multi-Agent Trajectory Prediction

Title: MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding

Title: Put Teacher in Student's Shoes: Cross-Distillation for Ultra-compact Model Compression Framework

Title: UGG-ReID: Uncertainty-Guided Graph Model for Multi-Modal Object Re-Identification

Title: R1-RE: Cross-Domain Relationship Extraction with RLVR

Title: VectorLLM: Human-like Extraction of Structured Building Contours via Multimodal LLMs

Title: Hybrid Adversarial Spectral Loss Conditional Generative Adversarial Networks for Signal Data Augmentation in Ultra-precision Machining Surface Roughness Prediction

Title: What's Making That Sound Right Now? Video-centric Audio-Visual Localization

Title: DANCE: Resource-Efficient Neural Architecture Search with Data-Aware and Continuous Adaptation

Title: ChangeBridge: Spatiotemporal Image Generation with Multimodal Controls for Remote Sensing

Title: Colorectal Cancer Tumor Grade Segmentation in Digital Histopathology Images: From Giga to Mini Challenge

Title: TeethGenerator: A two-stage framework for paired pre- and post-orthodontic 3D dental data generation

Title: Structure-Guided Diffusion Models for High-Fidelity Portrait Shadow Removal

Title: Interpretable Reward Modeling with Active Concept Bottlenecks

Title: Performance Evaluation of General Purpose Large Language Models for Basic Linear Algebra Subprograms Code Generation

Title: A Visual Leap in CLIP Compositionality Reasoning through Generation of Counterfactual Sets

Title: XiYan-SQL: A Novel Multi-Generator Framework For Text-to-SQL

Title: Tempo-R0: A Video-MLLM for Temporal Video Grounding through Efficient Temporal Sensing Reinforcement Learning

Title: Identity-Preserving Text-to-Video Generation Guided by Simple yet Effective Spatial-Temporal Decoupled Representations

Title: Why We Feel What We Feel: Joint Detection of Emotions and Their Opinion Triggers in E-commerce

Title: Spooky Action at a Distance: Normalization Layers Enable Side-Channel Spatial Communication

Title: Geometric-Guided Few-Shot Dental Landmark Detection with Human-Centric Foundation Model

Title: LOOM-Scope: a comprehensive and efficient LOng-cOntext Model evaluation framework

Title: Losing Control: Data Poisoning Attack on Guided Diffusion via ControlNet

Title: "This Suits You the Best": Query Focused Comparative Explainable Summarization

Title: An analysis of vision-language models for fabric retrieval

Title: MatDecompSDF: High-Fidelity 3D Shape and PBR Material Decomposition from Multi-View Images

Title: MCFormer: A Multi-Cost-Volume Network and Comprehensive Benchmark for Particle Image Velocimetry

Title: LLMs as Architects and Critics for Multi-Source Opinion Summarization

Title: Large Language Models for Network Intrusion Detection Systems: Foundations, Implementations, and Future Directions

Title: CoSteer: Collaborative Decoding-Time Personalization via Local Delta Steering

Title: Robustifying 3D Perception through Least-Squares Multi-Agent Graphs Object Tracking

Title: GraphBrep: Learning B-Rep in Graph Structure for Efficient CAD Generation

Title: ABench-Physics: Benchmarking Physical Reasoning in LLMs via High-Difficulty and Dynamic Physics Problems

Title: From Imitation to Innovation: The Emergence of AI Unique Artistic Styles and the Challenge of Copyright Protection

Title: Efficient Unlearning with Privacy Guarantees

Title: FIDESlib: A Fully-Fledged Open-Source FHE Library for Efficient CKKS on GPUs

Title: FedPall: Prototype-based Adversarial and Collaborative Learning for Federated Learning with Feature Drift

Title: Reason to Rote: Rethinking Memorization in Reasoning

Title: SeqGrowGraph: Learning Lane Topology as a Chain of Graph Expansions

Title: Discrete Diffusion Trajectory Alignment via Stepwise Decomposition

Title: RIPE: Reinforcement Learning on Unlabeled Image Pairs for Robust Keypoint Extraction

Title: Spec-TOD: A Specialized Instruction-Tuned LLM Framework for Efficient Task-Oriented Dialogue Systems

Title: Efficient SAR Vessel Detection for FPGA-Based On-Satellite Sensing

Title: Dialogue-Based Multi-Dimensional Relationship Extraction from Novels

Title: Grahak-Nyay: Consumer Grievance Redressal through Large Language Models

Title: Semantically Consistent Discrete Diffusion for 3D Biological Graph Modeling

Title: NTSFormer: A Self-Teaching Graph Transformer for Multimodal Cold-Start Node Classification

Title: HGNet: High-Order Spatial Awareness Hypergraph and Multi-Scale Context Attention Network for Colorectal Polyp Detection

Title: Beyond Training-time Poisoning: Component-level and Post-training Backdoors in Deep Reinforcement Learning

Title: Emergent Semantics Beyond Token Embeddings: Transformer LMs with Frozen Visual Unicode Representations

Title: O_FT@EvalLLM2025: A Comparative Study of Data Choices and Training Strategies for Adapting Language Models to a Domain

Title: BackFed: An Efficient & Standardized Benchmark Suite for Backdoor Attacks in Federated Learning

Title: HV-MMBench: Benchmarking MLLMs for Human-Centric Video Understanding

Title: Leveraging Self-Supervised Features for Efficient Flooded Region Identification in UAV Aerial Images

Title: Object-centric Denoising Diffusion Models for Physical Reasoning

Title: RainShift: A Benchmark for Precipitation Downscaling Across Geographies

Title: LIFT: Automating Symbolic Execution Optimization with Large Language Models for AI Networks

Title: ReLoop: "Seeing Twice and Thinking Backwards" via Closed-loop Training to Mitigate Hallucinations in Multimodal understanding

Title: Taming the Tri-Space Tension: ARC-Guided Hallucination Modeling and Control for Text-to-Image Generation

Title: DC-AR: Efficient Masked Autoregressive Image Generation with Deep Compression Hybrid Tokenizer

Title: ArtifactsBench: Bridging the Visual-Interactive Gap in LLM Code Generation Evaluation

Title: Boosting Temporal Sentence Grounding via Causal Inference

Title: InterGSEdit: Interactive 3D Gaussian Splatting Editing with 3D Geometry-Consistent Attention Prior

Title: Can Video LLMs Refuse to Answer? Alignment for Answerability in Video Large Language Models

Title: Parameterized Diffusion Optimization enabled Autoregressive Ordinal Regression for Diabetic Retinopathy Grading

Title: Classification of autoimmune diseases from Peripheral blood TCR repertoires by multimodal multi-instance learning

Title: TLB-VFI: Temporal-Aware Latent Brownian Bridge Diffusion for Video Frame Interpolation

Title: Robust Incomplete-Modality Alignment for Ophthalmic Disease Grading and Diagnosis via Labeled Optimal Transport

Title: Co-DETECT: Collaborative Discovery of Edge Cases in Text Classification

Title: Verified Language Processing with Hybrid Explainability: A Technical Report

Title: Meta-Learning Transformers to Improve In-Context Generalization

Title: Beyond Scaling Curves: Internal Dynamics of Neural Networks Through the NTK Lens

Title: AI-Driven Cytomorphology Image Synthesis for Medical Diagnostics

Title: Replacing thinking with tool usage enables reasoning in small language models

Title: ICAS: Detecting Training Data from Autoregressive Image Generative Models

Title: Exploring Semantic Clustering and Similarity Search for Heterogeneous Traffic Scenario Graph

Title: MoDiT: Learning Highly Consistent 3D Motion Coefficients with Diffusion Transformer for Talking Head Generation

Title: The Hidden Threat in Plain Text: Attacking RAG Data Loaders

Title: DICE: Discrete inverse continuity equation for learning population dynamics

Title: VOTE: Vision-Language-Action Optimization with Trajectory Ensemble Voting

Title: An Evaluation of Large Language Models on Text Summarization Tasks Using Prompt Engineering Techniques

Title: Extreme Learning Machine Based System for DDoS Attacks Detections on IoMT Devices

Title: Deep Learning to Automate Parameter Extraction and Model Fitting of Two-Dimensional Transistors

Title: Interpretable Mnemonic Generation for Kanji Learning via Expectation-Maximization

Title: VERITAS: Verification and Explanation of Realness in Images for Transparency in AI Systems

Title: AI Generated Text Detection Using Instruction Fine-tuned Large Language and Transformer-Based Models

Title: InfoSteer: Steering Information Utility in Language Model Post-Training

Title: 4DSloMo: 4D Reconstruction for High Speed Scene with Asynchronous Capture

Title: Critiques of World Models

Title: OpenS2S: Advancing Open-Source End-to-End Empathetic Large Speech Language Model

Title: From Fragments to Facts: A Curriculum-Driven DPO Approach for Generating Hindi News Veracity Explanations

Title: $φ$-Adapt: A Physics-Informed Adaptation Learning Approach to 2D Quantum Material Discovery

Title: Satellite-based Rabi rice paddy field mapping in India: a case study on Telangana state

Title: Pre-Trained Policy Discriminators are General Reward Models

Title: All in One: Visual-Description-Guided Unified Point Cloud Segmentation

Title: Hunting in the Dark: Metrics for Early Stage Traffic Discovery

Title: CTA: Cross-Task Alignment for Better Test Time Training

Title: Cascade: Token-Sharded Private LLM Inference

Title: Self-Supervised Real-Time Tracking of Military Vehicles in Low-FPS UAV Footage

Title: Response Attack: Exploiting Contextual Priming to Jailbreak Large Language Models

Title: From Marginal to Joint Predictions: Evaluating Scene-Consistent Trajectory Prediction Approaches for Automated Driving

Title: Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning

Title: Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions

Title: Spatio-Temporal LLM: Reasoning about Environments and Actions

Title: Beyond Simple Edits: X-Planner for Complex Instruction-Based Image Editing

Title: Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations