2026-01-01

Title: Enriching Historical Records: An OCR and AI-Driven Approach for Database Integration

Title: CAT: A Metric-Driven Framework for Analyzing the Consistency-Accuracy Relation of LLMs under Controlled Input Variations

Title: STED and Consistency Scoring: A Framework for Evaluating LLM Structured Output Reliability

Title: PharmaShip: An Entity-Centric, Reading-Order-Supervised Benchmark for Chinese Pharmaceutical Shipping Documents

Title: Noise-Driven Persona Formation in Reflexive Neural Language Generation

Title: HarmTransform: Transforming Explicit Harmful Queries into Stealthy via Multi-Agent Debate

Title: Emergent World Beliefs: Exploring Transformers in Stochastic Games

Title: Break Out the Silverware -- Semantic Understanding of Stored Household Items

Title: A Comprehensive Study of Deep Learning Model Fixing Approaches

Title: A Review of Diffusion-based Simulation-Based Inference: Foundations and Applications in Non-Ideal Data Scenarios

Title: Coordinate Matrix Machine: A Human-level Concept Learning to Classify Very Similar Documents

Title: Geometric Scaling of Bayesian Inference in LLMs

Title: HINTS: Extraction of Human Insights from Time-Series Without External Sources

Title: Audited Skill-Graph Self-Improvement for Agentic LLMs via Verifiable Rewards, Experience Synthesis, and Continual Memory

Title: Exploring Cumulative Effects in Survival Data Using Deep Learning Networks

Title: Entropy-Aware Speculative Decoding Toward Improved LLM Reasoning

Title: Secure and Governed API Gateway Architectures for Multi-Cluster Cloud Environments

Title: SyncGait: Robust Long-Distance Authentication for Drone Delivery via Implicit Gait Behaviors

Title: Prompt-Induced Over-Generation as Denial-of-Service: A Black-Box Attack-Side Benchmark

Title: Application-Specific Power Side-Channel Attacks and Countermeasures: A Survey

Title: Leveraging Synthetic Priors for Monocular Depth Estimation in Specular Surgical Environments

Title: TabMixNN: A Unified Deep Learning Framework for Structural Mixed Effects Modeling on Tabular Data

Title: Zero-Trust Agentic Federated Learning for Secure IIoT Defense Systems

Title: Improved Bounds for Private and Robust Alignment

Title: Exploiting the Prior of Generative Time Series Imputation

Title: Explaining News Bias Detection: A Comparative SHAP Analysis of Transformer Model Decision Mechanisms

Title: Retrieval Augmented Question Answering: When Should LLMs Admit Ignorance?

Title: Adversarial Lens: Exploiting Attention Layers to Generate Adversarial Examples for Evaluation

Title: Integrating Domain Knowledge for Financial QA: A Multi-Retriever RAG Approach with LLMs

Title: Security Without Detection: Economic Denial as a Primitive for Edge and IoT Defense

Title: Trellis: Learning to Compress Key-Value Memory in Attention Models

Title: Flow Matching Neural Processes

Title: Lifelong Domain Adaptive 3D Human Pose Estimation

Title: Probing the Limits of Compressive Memory: A Study of Infini-Attention in Small-Scale Pretraining

Title: Max-Entropy Reinforcement Learning with Flow Matching and A Case Study on LQR

Title: MRI-to-CT Synthesis With Cranial Suture Segmentations Using A Variational Autoencoder Framework

Title: Efficient Deep Learning for Short-Term Solar Irradiance Time Series Forecasting: A Benchmark Study in Ho Chi Minh City

Title: Scaling Remote Sensing Foundation Models: Data Domain Tradeoffs at the Peta-Scale

Title: Constraint Breeds Generalization: Temporal Dynamics as an Inductive Bias

Title: MGML: A Plug-and-Play Meta-Guided Multi-Modal Learning Framework for Incomplete Multimodal Brain Tumor Segmentation

Title: Kinematic-Based Assessment of Surgical Actions in Microanastomosis

Title: Improved Balanced Classification with Theoretically Grounded Loss Functions

Title: DivQAT: Enhancing Robustness of Quantized Convolutional Neural Networks against Model Extraction Attacks

Title: U-Net-Like Spiking Neural Networks for Single Image Dehazing

Title: T2VAttack: Adversarial Attack on Text-to-Video Diffusion Models

Title: Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling

Title: Physics-informed Graph Neural Networks for Operational Flood Modeling

Title: CEC-Zero: Zero-Supervision Character Error Correction with Self-Generated Rewards

Title: Assured Autonomy: How Operations Research Powers and Orchestrates Generative AI Systems

Title: DriveExplorer: Images-Only Decoupled 4D Reconstruction with Progressive Restoration for Driving View Extrapolation

Title: MeLeMaD: Adaptive Malware Detection via Chunk-wise Feature Selection and Meta-Learning

Title: Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process

Title: GCA-ResUNet: Medical Image Segmentation Using Grouped Coordinate Attention

Title: RepetitionCurse: Measuring and Understanding Router Imbalance in Mixture-of-Experts LLMs under DoS Stress

Title: Bridging Structure and Appearance: Topological Features for Robust Self-Supervised Segmentation

Title: WISE: Web Information Satire and Fakeness Evaluation

Title: Bridging the Perception-Cognition Gap: Re-engineering SAM2 with Hilbert-Mamba for Robust VLM-based Medical Diagnosis

Title: iCLP: Large Language Model Reasoning with Implicit Cognition Latent Planning

Title: On Exact Editing of Flow-Based Diffusion Models

Title: RSAgent: Learning to Reason and Act for Text-Guided Segmentation via Multi-Turn Tool Invocations

Title: PipeFlow: Pipelined Processing and Motion-Aware Frame Selection for Long-Form Video Editing

Title: Reinforced Diffusion: Learning to Push the Limits of Anisotropic Diffusion for Image Denoising

Title: Jailbreaking Attacks vs. Content Safety Filters: How Far Are We in the LLM Safety Arms Race?

Title: Beyond Hallucinations: A Composite Score for Measuring Reliability in Open-Source Large Language Models

Title: How and Why LLMs Generalize: A Fine-Grained Analysis of LLM Reasoning from Cognitive Behaviors to Low-Level Patterns

Title: Neighbor-aware Instance Refining with Noisy Labels for Cross-Modal Retrieval

Title: Pathology Context Recalibration Network for Ocular Disease Recognition

Title: Time-varying Mixing Matrix Design for Energy-efficient Decentralized Federated Learning

Title: Balanced Hierarchical Contrastive Learning with Decoupled Queries for Fine-grained Object Detection in Remote Sensing Images

Title: Multi-Scenario Highway Lane-Change Intention Prediction: A Temporal Physics-Informed Multi-Modal Framework

Title: RainFusion2.0: Temporal-Spatial Awareness and Hardware-Efficient Block-wise Sparse Attention

Title: FedLiTeCAN: A Federated Lightweight Transformer for Fast and Robust CAN Bus Intrusion Detection

Title: HY-MT1.5 Technical Report

Title: Training a Huggingface Model on AWS Sagemaker (Without Tears)

Title: Autoregressivity in the Latent Space of a GP-VAE Language Model: An Empirical Ablation Study

Title: Guided Diffusion-based Generation of Adversarial Objects for Real-World Monocular Depth Estimation Attacks

Title: GeoBench: Rethinking Multimodal Geometric Problem-Solving via Hierarchical Evaluation

Title: Enhancing LLM-Based Neural Network Generation: Few-Shot Prompting and Efficient Validation for Automated Architecture Design

Title: OptRot: Mitigating Weight Outliers via Data-Free Rotations for Post-Training Quantization

Title: GARDO: Reinforcing Diffusion Models without Reward Hacking

Title: Colorful Pinball: Density-Weighted Quantile Regression for Conditional Guarantee of Conformal Prediction

Title: Activation Steering for Masked Diffusion Language Models

Title: Taming Preference Mode Collapse via Directional Decoupling Alignment in Diffusion Reinforcement Learning

Title: Large Emotional World Model

Title: Training Report of TeleChat3-MoE

Title: Towards Open-Vocabulary Industrial Defect Understanding with a Large-Scale Multimodal Dataset

Title: Bayesian Self-Distillation for Image Classification

Title: DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models

Title: Deep Global Clustering for Hyperspectral Image Segmentation: Concepts, Applications, and Open Challenges

Title: Guiding a Diffusion Transformer with the Internal Dynamics of Itself

Title: MedKGI: Iterative Differential Diagnosis with Medical Knowledge Graphs and Information-Guided Inquiring

Title: CorGi: Contribution-Guided Block-Wise Interval Caching for Training-Free Acceleration of Diffusion Transformers

Title: Micro-Macro Tensor Neural Surrogates for Uncertainty Quantification in Collisional Plasma

Title: Medical Image Classification on Imbalanced Data Using ProGAN and SMA-Optimized ResNet: Application to COVID-19

Title: ARM: A Learnable, Plug-and-Play Module for CLIP-based Open-vocabulary Semantic Segmentation

Title: Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes

Title: MotivNet: Evolving Meta-Sapiens into an Emotionally Intelligent Foundation Model

Title: LAILA: A Large Trait-Based Dataset for Arabic Automated Essay Scoring

Title: MambaSeg: Harnessing Mamba for Accurate and Efficient Image-Event Semantic Segmentation

Title: How Would Oblivious Memory Boost Graph Analytics on Trusted Processors?

Title: Physically-Grounded Manifold Projection with Foundation Priors for Metal Artifact Reduction in Dental CBCT

Title: Joint Selection for Large-Scale Pre-Training Data via Policy Gradient-based Mask Learning

Title: Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation

Title: One-shot synthesis of rare gastrointestinal lesions improves diagnostic accuracy and clinical training

Title: Automated Analysis of Sustainability Reports: Using Large Language Models for the Extraction and Prediction of EU Taxonomy-Compliant KPIs

Title: Virtual-Eyes: Quantitative Validation of a Lung CT Quality-Control Pipeline for Foundation-Model Cancer Risk Prediction

Title: QianfanHuijin Technical Report: A Novel Multi-Stage Training Paradigm for Finance Industrial LLMs

Title: UniAct: Unified Motion Generation and Action Streaming for Humanoid Robots

Title: Robust Egocentric Referring Video Object Segmentation via Dual-Modal Causal Intervention

Title: Empower Low-Altitude Economy: A Reliability-Aware Dynamic Weighting Allocation for Multi-modal UAV Beam Prediction

Title: World model inspired sarcasm reasoning with large language model agents

Title: SenseNova-MARS: Empowering Multimodal Agentic Reasoning and Search via Reinforcement Learning

Title: Spatial-aware Vision Language Model for Autonomous Driving

Title: DermaVQA-DAS: Dermatology Assessment Schema (DAS) & Datasets for Closed-Ended Question Answering & Segmentation in Patient-Generated Dermatology Images

Title: FedSecureFormer: A Fast, Federated and Secure Transformer Framework for Lightweight Intrusion Detection in Connected and Autonomous Vehicles

Title: Skim-Aware Contrastive Learning for Efficient Document Representation

Title: Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems

Title: RedunCut: Measurement-Driven Sampling and Accuracy Performance Modeling for Low-Cost Live Video Analytics

Title: SourceBroken: A large-scale analysis on the (un)reliability of SourceRank in the PyPI ecosystem

Title: Lifting Vision: Ground to Aerial Localization with Reasoning Guided Planning

Title: DyStream: Streaming Dyadic Talking Heads Generation via Flow Matching-based Autoregressive Model

Title: AI-Driven Evaluation of Surgical Skill via Action Recognition

Title: Language Model Agents Under Attack: A Cross Model-Benchmark of Profit-Seeking Behaviors in Customer Service

Title: GateChain: A Blockchain Based Application for Country Entry Exit Registry Management

Title: Exploring Compositionality in Vision Transformers using Wavelet Representations

Title: Generative forecasting with joint probability models

Title: Document Data Matching for Blockchain-Supported Real Estate

Title: IELTS Writing Revision Platform with Automated Essay Scoring and Adaptive Feedback

Title: F2IDiff: Real-world Image Super-resolution using Feature to Image Diffusion Foundation Model

Title: HOLOGRAPH: Active Causal Discovery via Sheaf-Theoretic Alignment of Large Language Model Priors

Title: Training-Free Color-Aware Adversarial Diffusion Sanitization for Diffusion Stegomalware Defense at Security Gateways

Title: Can Small Training Runs Reliably Guide Data Curation? Rethinking Proxy-Model Practice

Title: Paragraph Segmentation Revisited: Towards a Standard Task for Structuring Speech

Title: Using Large Language Models To Translate Machine Results To Human Results

Title: Correctness of Extended RSA Public Key Cryptosystem

Title: More Than Bits: Multi-Envelope Double Binary Factorization for Extreme Quantization

Title: OCP-LS: An Efficient Algorithm for Visual Localization

Title: From Perception to Punchline: Empowering VLM with the Art of In-the-wild Meme

Title: Safe in the Future, Dangerous in the Past: Dissecting Temporal and Linguistic Vulnerabilities in LLMs

Title: RGBT-Ground Benchmark: Visual Grounding Beyond RGB in Complex Real-World Scenarios

Title: HaluNet: Multi-Granular Uncertainty Modeling for Efficient Hallucination Detection in LLM Question Answering

Title: CPR: Causal Physiological Representation Learning for Robust ECG Analysis under Distribution Shifts

Title: SynRAG: A Large Language Model Framework for Executable Query Generation in Heterogeneous SIEM System

Title: Understanding and Steering the Cognitive Behaviors of Reasoning Models at Test-Time

Title: Improving Few-Shot Change Detection Visual Question Answering via Decision-Ambiguity-guided Reinforcement Fine-Tuning

Title: SliceLens: Fine-Grained and Grounded Error Slice Discovery for Multi-Instance Vision Tasks

Title: 3D Semantic Segmentation for Post-Disaster Assessment

Title: Secure Digital Semantic Communications: Fundamentals, Challenges, and Opportunities

Title: Collaborative Low-Rank Adaptation for Pre-Trained Vision Transformers

Title: Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space

Title: Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Title: LLHA-Net: A Hierarchical Attention Network for Two-View Correspondence Learning

Title: AutoFed: Manual-Free Federated Traffic Prediction via Personalized Prompt

Title: A Scalable Framework for logP Prediction: From Terabyte-Scale Data Integration to Interpretable Ensemble Modeling

Title: Practical Traceable Over-Threshold Multi-Party Private Set Intersection

Title: Do Large Language Models Know What They Are Capable Of?

Title: Renormalization Group Guided Tensor Network Structure Search

Title: HeteroHBA: A Generative Structure-Manipulating Backdoor Attack on Heterogeneous Graphs

Title: CellSecInspector: Safeguarding Cellular Networks via Automated Security Analysis on Specifications

Title: MUSIC: MUlti-Step Instruction Contrast for Multi-Turn Reward Models

Title: Mobility-Assisted Decentralized Federated Learning: Convergence Analysis and A Data-Driven Approach

Title: Evolving, Not Training: Zero-Shot Reasoning Segmentation via Evolutionary Prompting

Title: FPGA Co-Design for Efficient N:M Sparse and Quantized Model Inference

Title: BIOME-Bench: A Benchmark for Biomolecular Interaction Inference and Multi-Omics Pathway Mechanism Elucidation from Scientific Literature

Title: UniC-Lift: Unified 3D Instance Segmentation via Contrastive Learning

Title: Uncertainty-aware Semi-supervised Ensemble Teacher Framework for Multilingual Depression Detection

Title: Compute-Accuracy Pareto Frontiers for Open-Source Reasoning Large Language Models

Title: Gradient Descent as Implicit EM in Distance-Based Neural Models

Title: Projection-based Adversarial Attack using Physics-in-the-Loop Optimization for Monocular Depth Estimation

Title: Unregularized Linear Convergence in Zero-Sum Game from Preference Feedback

Title: Triangulation as an Acceptance Rule for Multilingual Mechanistic Interpretability

Title: AODDiff: Probabilistic Reconstruction of Aerosol Optical Depth via Diffusion-based Bayesian Inference

Title: PrivacyBench: A Conversational Benchmark for Evaluating Privacy in Personalized AI

Title: VLN-MME: Diagnosing MLLMs as Language-guided Visual Navigation agents

Title: OFL-SAM2: Prompt SAM2 with Online Few-shot Learner for Efficient Medical Image Segmentation

Title: Encyclo-K: Evaluating LLMs with Dynamically Composed Knowledge Statements

Title: SoK: Web3 RegTech for Cryptocurrency VASP AML/CFT Compliance

Title: MTSP-LDP: A Framework for Multi-Task Streaming Data Publication under Local Differential Privacy

Title: FinMMDocR: Benchmarking Financial Multimodal Reasoning with Scenario Awareness, Document Understanding, and Multi-Step Computation

Title: Frequent subgraph-based persistent homology for graph classification

Title: Towards Provably Secure Generative AI: Reliable Consensus Sampling

Title: Adaptive Dependency-aware Prompt Optimization Framework for Multi-Step LLM Pipeline

Title: HaineiFRDM: Explore Diffusion to Restore Defects in Fast-Movement Films

Title: CPJ: Explainable Agricultural Pest Diagnosis via Caption-Prompt-Judge with LLM-Judged Refinement

Title: ProDM: Synthetic Reality-driven Property-aware Progressive Diffusion Model for Coronary Calcium Motion Correction in Non-gated Chest CT

Title: VIPER: Process-aware Evaluation for Generative Video Reasoning

Title: MSACL: Multi-Step Actor-Critic Learning with Lyapunov Certificates for Exponentially Stabilizing Control

Title: Semi-overlapping Multi-bandit Best Arm Identification for Sequential Support Network Learning

Title: ShowUI-π: Flow-based Generative Models as GUI Dexterous Hands

Title: Evaluating the Impact of Compression Techniques on the Robustness of CNNs under Natural Corruptions

Title: DarkEQA: Benchmarking Vision-Language Models for Embodied Question Answering in Low-Light Indoor Environments

Title: Efficiently Estimating Data Efficiency for Language Model Fine-tuning

Title: Classifying long legal documents using short random chunks

Title: Bi-C2R: Bidirectional Continual Compatible Representation for Re-indexing Free Lifelong Person Re-identification

Title: FoundationSLAM: Unleashing the Power of Depth Foundation Models for End-to-End Dense Visual SLAM

Title: Diffusion Language Models are Provably Optimal Parallel Samplers

Title: MAMA-Memeia! Multi-Aspect Multi-Agent Collaboration for Depressive Symptoms Identification in Memes

Title: ResponseRank: Data-Efficient Reward Modeling through Preference Strength Learning

Title: Modeling Language as a Sequence of Thoughts

Title: Generative Classifiers Avoid Shortcut Solutions

Title: AdaGReS: Adaptive Greedy Context Selection via Redundancy-Aware Scoring for Token-Budgeted RAG

Title: Many Minds from One Model: Bayesian Transformers for Population Intelligence

Title: From Inpainting to Editing: A Self-Bootstrapping Framework for Context-Rich Visual Dubbing

Title: FineTec: Fine-Grained Action Recognition Under Temporal Corruption via Skeleton Decomposition and Sequence Completion

Title: GaMO: Geometry-aware Multi-view Diffusion Outpainting for Sparse-View 3D Reconstruction

Title: SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time