2025-12-15

Title: Scalable Data Synthesis for Computer Use Agents with Step-Level Filtering

Title: Multimodal Fusion of Regional Brain Experts for Interpretable Alzheimer's Disease Diagnosis

Title: ASR Under the Stethoscope: Evaluating Biases in Clinical Speech Recognition across Indian Languages

Title: TECM*: A Data-Driven Assessment to Reinforcement Learning Methods and Application to Heparin Treatment Strategy for Surgical Sepsis

Title: MolSculpt: Sculpting 3D Molecular Geometries from Chemical Syntax

Title: MedBioRAG: Semantic Search and Retrieval-Augmented Generation with Large Language Models for Medical and Biological QA

Title: SCOUT: A Defense Against Data Poisoning Attacks in Fine-Tuned Language Models

Title: KBQA-R1: Reinforcing Large Language Models for Knowledge Base Question Answering

Title: Leveraging Text Guidance for Enhancing Demographic Fairness in Gender Classification

Title: Weakly Supervised Tuberculosis Localization in Chest X-rays through Knowledge Distillation

Title: VDAWorld: World Modelling via VLM-Directed Abstraction and Simulation

Title: E-CHUM: Event-based Cameras for Human Detection and Urban Monitoring

Title: Investigating ECG Diagnosis with Ambiguous Labels using Partial Label Learning

Title: VGent: Visual Grounding via Modular Design for Disentangling Reasoning and Prediction

Title: Information-driven Fusion of Pathology Foundation Models for Enhanced Disease Characterization

Title: Explanation Bias is a Product: Revealing the Hidden Lexical and Position Preferences in Post-Hoc Feature Attribution

Title: Limits and Gains of Test-Time Scaling in Vision-Language Reasoning

Title: FIBER: A Multilingual Evaluation Resource for Factual Inference Bias

Title: An LLVM-Based Optimization Pipeline for SPDZ

Title: In-Context Multi-Objective Optimization

Title: Learning from a Generative Oracle: Domain Adaptation for Restoration

Title: Cybersecurity policy adoption in South Africa: Does public trust matter?

Title: Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching

Title: Fairness-Regularized Online Optimization with Switching Costs

Title: Network and Compiler Optimizations for Efficient Linear Algebra Kernels in Private Transformer Inference

Title: Learning complete and explainable visual representations from itemized text supervision

Title: Automated Penetration Testing with LLM Agents and Classical Planning

Title: Autoencoder-based Semi-Supervised Dimensionality Reduction and Clustering for Scientific Ensembles

Title: MiniScope: A Least Privilege Framework for Authorizing Tool Calling Agents

Title: Multi-task Learning with Extended Temporal Shift Module for Temporal Action Localization

Title: Beyond Memorization: Gradient Projection Enables Selective Learning in Diffusion Models

Title: CADKnitter: Compositional CAD Generation from Text and Geometry Guidance

Title: AutoRefiner: Improving Autoregressive Video Diffusion Models via Reflective Refinement Over the Stochastic Sampling Path

Title: SmokeBench: Evaluating Multimodal Large Language Models for Wildfire Smoke Detection

Title: Adaptive Soft Rolling KV Freeze with Entropy-Guided Recovery: Sublinear Memory Growth for Efficient LLM Inference

Title: VFMF: World Modeling by Forecasting Vision Foundation Model Features

Title: REST: Diffusion-based Real-time End-to-end Streaming Talking Head Generation via ID-Context Caching and Asynchronous Streaming Distillation

Title: WildCap: Facial Appearance Capture in the Wild via Hybrid Inverse Rendering

Title: PersonaLive! Expressive Portrait Image Animation for Live Streaming

Title: A Simple Generalisation of the Implicit Dynamics of In-Context Learning

Title: Do We Need Reformer for Vision? An Experimental Comparison with Vision Transformers

Title: Leveraging LLMs for Title and Abstract Screening for Systematic Review: A Cost-Effective Dynamic Few-Shot Learning Approach

Title: A Scalable Multi-GPU Framework for Encrypted Large-Model Inference

Title: Vision-Based Learning for Cyberattack Detection in Blockchain Smart Contracts and Transactions

Title: FilmWeaver: Weaving Consistent Multi-Shot Videos with Cache-Guided Autoregressive Diffusion

Title: When Actions Teach You to Think: Reasoning-Action Synergy via Reinforcement Learning in Conversational Agents

Title: AdaSD: Adaptive Speculative Decoding for Efficient Language Model Inference

Title: CIP: A Plug-and-Play Causal Prompting Framework for Mitigating Hallucinations under Long-Context Noise

Title: RcAE: Recursive Reconstruction Framework for Unsupervised Industrial Anomaly Detection

Title: SRLR: Symbolic Regression based Logic Recovery to Counter Programmable Logic Controller Attacks

Title: Unifying Dynamic Tool Creation and Cross-Task Experience Sharing through Cognitive Memory Architecture

Title: QGEC : Quantum Golay Code Error Correction

Title: Benchmarking the Generality of Vision-Language-Action Models

Title: Visualisation for the CIS benchmark scanning results

Title: SATMapTR: Satellite Image Enhanced Online HD Map Construction

Title: KeyframeFace: From Text to Expressive Facial Keyframes

Title: MLLM Machine Unlearning via Visual Knowledge Distillation

Title: Spectral entropy prior-guided deep feature fusion architecture for magnetic core loss

Title: FreqDINO: Frequency-Guided Adaptation for Generalized Boundary-Aware Ultrasound Image Segmentation

Title: UFVideo: Towards Unified Fine-Grained Video Cooperative Understanding with Large Language Models

Title: Symmetry-Aware Steering of Equivariant Diffusion Policies: Benefits and Limits

Title: Surveillance Video-Based Traffic Accident Detection Using Transformer Architecture

Title: CAT: Can Trust be Predicted with Context-Awareness in Dynamic Heterogeneous Networks?

Title: A Multi-Mode Structured Light 3D Imaging System with Multi-Source Information Fusion for Underwater Pipeline Detection

Title: Prior-Enhanced Gaussian Splatting for Dynamic Scene Reconstruction from Casual Video

Title: Attacking and Securing Community Detection: A Game-Theoretic Framework

Title: Reliable Detection of Minute Targets in High-Resolution Aerial Imagery across Temporal Shifts

Title: qa-FLoRA: Data-free query-adaptive Fusion of LoRAs for LLMs

Title: Assisted Refinement Network Based on Channel Information Interaction for Camouflaged and Salient Object Detection

Title: Out-of-Distribution Segmentation via Wasserstein-Based Evidential Uncertainty

Title: Mining Legal Arguments to Study Judicial Formalism

Title: Mitigating the Safety Alignment Tax with Null-Space Constrained Policy Optimization

Title: Bhargava Cube--Inspired Quadratic Regularization for Structured Neural Embeddings

Title: Minimal Clips, Maximum Salience: Long Video Summarization via Key Moment Extraction

Title: Collaborative Reconstruction and Repair for Multi-class Industrial Anomaly Detection

Title: JoyAvatar: Real-time and Infinite Audio-Driven Avatar Generation with Autoregressive Diffusion

Title: Proving DNSSEC Correctness: A Formal Approach to Secure Domain Name Resolution

Title: CLINIC: Evaluating Multilingual Trustworthiness in Language Models for Healthcare

Title: Hyperbolic Gaussian Blurring Mean Shift: A Statistical Mode-Seeking Framework for Clustering in Curved Spaces

Title: Boosting Skeleton-based Zero-Shot Action Recognition with Training-Free Test-Time Adaptation

Title: Exploring MLLM-Diffusion Information Transfer with MetaCanvas

Title: DOS: Distilling Observable Softmaps of Zipfian Prototypes for Self-Supervised Point Representation

Title: Rethinking Expert Trajectory Utilization in LLM Post-training

Title: CADMorph: Geometry-Driven Parametric CAD Editing via a Plan-Generate-Verify Loop

Title: Capacitive Touchscreens at Risk: Recovering Handwritten Trajectory on Smartphone via Electromagnetic Emanations

Title: Mistake Notebook Learning: Selective Batch-Wise Context Optimization for In-Context Learning

Title: VLM2GeoVec: Toward Universal Multimodal Embeddings for Remote Sensing

Title: Building Patient Journeys in Hebrew: A Language Model for Clinical Timeline Extraction

Title: TSkel-Mamba: Temporal Dynamic Modeling via State Space Model for Human Skeleton-based Action Recognition

Title: On Geometric Understanding and Learned Data Priors in VGGT

Title: Does Less Hallucination Mean Less Creativity? An Empirical Investigation in LLMs

Title: Reconstruction as a Bridge for Event-Based Visual Question Answering

Title: NeuralOGCM: Differentiable Ocean Modeling with Learnable Physics

Title: xGR: Efficient Generative Recommendation Serving at Scale

Title: HFS: Holistic Query-Aware Frame Selection for Efficient Video Reasoning

Title: A Multi-Criteria Automated MLOps Pipeline for Cost-Effective Cloud-Based Classifier Retraining in Response to Data Distribution Shifts

Title: Infinity and Beyond: Compositional Alignment in VAR and Diffusion T2I Models

Title: Optimizing the Training Diet: Data Mixture Search for Robust Time Series Forecasting

Title: Elastic-Net Multiple Kernel Learning: Combining Multiple Data Sources for Prediction

Title: SSL-MedSAM2: A Semi-supervised Medical Image Segmentation Framework Powered by Few-shot Learning of SAM2

Title: 3DTeethSAM: Taming SAM2 for 3D Teeth Segmentation

Title: DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry

Title: Multi-temporal Calving Front Segmentation

Title: Visualizing token importance for black-box language models

Title: Brain-Semantoks: Learning Semantic Tokens of Brain Dynamics with a Self-Distilled Foundation Model

Title: Atomic Action Slicing: Planner-Aligned Options for Generalist VLA Agents

Title: Granite: Granular Runtime Enforcement for GitHub Actions Permissions

Title: Bounding Hallucinations: Information-Theoretic Guarantees for RAG Systems via Merlin-Arthur Protocols

Title: A Fast Interpretable Fuzzy Tree Learner

Title: Automating Historical Insight Extraction from Large-Scale Newspaper Archives via Neural Topic Modeling

Title: FactorPortrait: Controllable Portrait Animation via Disentangled Expression, Pose, and Viewpoint

Title: Kinetic Mining in Context: Few-Shot Action Synthesis via Text-to-Motion Distillation

Title: Cross-modal Context-aware Learning for Visual Prompt Guided Multimodal Image Understanding in Remote Sensing

Title: Depth-Copy-Paste: Multimodal and Depth-Aware Compositing for Robust Face Detection

Title: Leveraging FPGAs for Homomorphic Matrix-Vector Multiplication in Oblivious Message Retrieval

Title: Text images processing system using artificial intelligence models

Title: SoK: Demystifying the multiverse of MPC protocols

Title: EditMGT: Unleashing Potentials of Masked Generative Transformers in Image Editing

Title: Speculative Decoding Speed-of-Light: Optimal Lower Bounds via Branching Random Walks

Title: Referring Change Detection in Remote Sensing Imagery

Title: Weak-to-Strong Generalization Enables Fully Automated De Novo Training of Multi-head Mask-RCNN Model for Segmenting Densely Overlapping Cell Nuclei in Multiplex Whole-slice Brain Images

Title: SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder

Title: SpectralKrum: A Spectral-Geometric Defense Against Byzantine Attacks in Federated Learning

Title: Reducing Domain Gap with Diffusion-Based Domain Adaptation for Cell Counting

Title: Smudged Fingerprints: A Systematic Evaluation of the Robustness of AI Image Fingerprints

Title: MatAnyone 2: Scaling Video Matting via a Learned Quality Evaluator

Title: Super Suffixes: Bypassing Text Generation Alignment and Guard Models Simultaneously

Title: Softmax as Linear Attention in the Large-Prompt Regime: a Measure-based Perspective

Title: Uncertainty-Aware Domain Adaptation for Vitiligo Segmentation in Clinical Photographs

Title: Structure From Tracking: Distilling Structure-Preserving Motion for Video Generation

Title: Particulate: Feed-Forward 3D Object Articulation