2025-12-03

Title: Human-Level and Beyond: Benchmarking Large Language Models Against Clinical Pharmacists in Prescription Review

Title: Pharmacophore-based design by learning on voxel grids

Title: Deep Research: A Systematic Survey

Title: Mirror, Mirror on the Wall -- Which is the Best Model of Them All?

Title: Beyond Confidence: Adaptive and Coherent Decoding for Diffusion Language Models

Title: Contextual Gating within the Transformer Stack: Synergistic Feature Modulation for Enhanced Lyrical Classification and Calibration

Title: Leveraging AI multimodal geospatial foundation models for improved near-real-time flood mapping at a global scale

Title: Reversing Large Language Models for Efficient Training and Fine-Tuning

Title: Opening the Black Box: An Explainable, Few-shot AI4E Framework Informed by Physics and Expert Knowledge for Materials Engineering

Title: Superpixel Attack: Enhancing Black-box Adversarial Attack with Image-driven Division Areas

Title: Large Language Model based Smart Contract Auditing with LLMBugScanner

Title: DPWMixer: Dual-Path Wavelet Mixer for Long-Term Time Series Forecasting

Title: FDRMFL: Multi-modal Federated Feature Extraction Model Based on Information Maximization and Contrastive Learning

Title: Deterministic Random Bit Generators Based on Ascon for Embedded Systems

Title: A survey about Hidden Subgroup Problem from a mathematical and cryptographic perspective

Title: Cross-View Topology-Aware Graph Representation Learning

Title: Factor(T,U): Factored Cognition Strengthens Monitoring of Untrusted AI

Title: FineGRAIN: Evaluating Failure Modes of Text-to-Image Models with Vision Language Model Judges

Title: RobustSurg: Tackling domain generalisation for out-of-distribution surgical scene segmentation

Title: Enforcing Orderedness to Improve Feature Consistency

Title: Multifractal Recalibration of Neural Networks for Medical Imaging Segmentation

Title: WhAM: Towards A Translative Model of Sperm Whale Vocalization

Title: InstructLR: A Scalable Approach to Create Instruction Dataset for Under-Resourced Languages

Title: On the Approximation of Phylogenetic Distance Functions by Artificial Neural Networks

Title: See, Hear, and Understand: Benchmarking Audiovisual Human Speech Understanding in Multimodal Large Language Models

Title: Lightweight Latent Reasoning for Narrative Tasks

Title: PhishSnap: Image-Based Phishing Detection Using Perceptual Hashing

Title: DETAIL Matters: Measuring the Impact of Prompt Specificity on Reasoning in Large Language Models

Title: CAIRNS: Balancing Readability and Scientific Accuracy in Climate Adaptation Question Answering

Title: CVE Breadcrumbs: Tracking Vulnerabilities Through Versioned Apache Libraries

Title: The Effect of Enforcing Fairness on Reshaping Explanations in Machine Learning Models

Title: Spatiotemporal Pyramid Flow Matching for Climate Emulation

Title: Progressive Image Restoration via Text-Conditioned Video Generation

Title: HOT Protocol

Title: Enhancing Cross Domain SAR Oil Spill Segmentation via Morphological Region Perturbation and Synthetic Label-to-SAR Generation

Title: Quantum Vanguard: Server Optimized Privacy Fortified Federated Intelligence for Future Vehicles

Title: Training Dynamics of Learning 3D-Rotational Equivariance

Title: When Does Verification Pay Off? A Closer Look at LLMs as Solution Verifiers

Title: COGNITION: From Evaluation to Defense against Multimodal LLM CAPTCHA Solvers

Title: LeechHijack: Covert Computational Resource Exploitation in Intelligent Agent Systems

Title: Unlocking the Power of Boltzmann Machines by Parallelizable Sampler and Efficient Temperature Estimation

Title: Retrieval-Augmented Memory for Online Learning

Title: SpecPV: Improving Self-Speculative Decoding for Long-Context Generation via Partial Verification

Title: Video Diffusion Models Excel at Tracking Similar-Looking Objects Without Supervision

Title: TALO: Pushing 3D Vision Foundation Models Towards Globally Consistent Online Reconstruction

Title: A multi-weight self-matching visual explanation for CNNs on SAR images

Title: FOVA: Offline Federated Reinforcement Learning with Mixed-Quality Data

Title: VACoT: Rethinking Visual Data Augmentation with VLMs

Title: Memory-Augmented Knowledge Fusion with Safety-Aware Decoding for Domain-Adaptive Question Answering

Title: SAGE: Style-Adaptive Generalization for Privacy-Constrained Semantic Segmentation Across Domains

Title: Reproducing and Extending RaDelft 4D Radar with Camera-Assisted Labels

Title: AtomGraph: Tackling Atomicity Violation in Smart Contracts using Multimodal GCNs

Title: TaleFrame: An Interactive Story Generation System with Fine-Grained Control and Large Language Models

Title: ESACT: An End-to-End Sparse Accelerator for Compute-Intensive Transformers via Local Similarity

Title: WISE: Weighted Iterative Society-of-Experts for Robust Multimodal Multi-Agent Debate

Title: MitUNet: Enhancing Floor Plan Recognition using a Hybrid Mix-Transformer and U-Net Architecture

Title: Characterizing Cyber Attacks against Space Infrastructures with Missing Data: Framework and Case Study

Title: Leveraging Large Language Models to Bridge On-chain and Off-chain Transparency in Stablecoins

Title: WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning

Title: LightHCG: a Lightweight yet powerful HSIC Disentanglement based Causal Glaucoma Detection Model framework

Title: Boosting Medical Vision-Language Pretraining via Momentum Self-Distillation under Limited Computing Resources

Title: Basis-Oriented Low-rank Transfer for Few-Shot and Test-Time Adaptation

Title: When Refusals Fail: Unstable Safety Mechanisms in Long-Context LLM Agents

Title: See, Think, Learn: A Self-Taught Multimodal Reasoner

Title: Does Hearing Help Seeing? Investigating Audio-Video Joint Denoising for Video Generation

Title: Vision to Geometry: 3D Spatial Memory for Sequential Embodied MLLM Reasoning and Exploration

Title: TabGRU: An Enhanced Design for Urban Rainfall Intensity Estimation Using Commercial Microwave Links

Title: G-SHARP: Gaussian Surgical Hardware Accelerated Real-time Pipeline

Title: UCAgents: Unidirectional Convergence for Visual Evidence Anchored Multi-Agent Medical Decision-Making

Title: Dual-Robust Cross-Domain Offline Reinforcement Learning Against Dynamics Shifts

Title: Masking Matters: Unlocking the Spatial Reasoning Capabilities of LLMs for 3D Scene-Language Understanding

Title: YingVideo-MV: Music-Driven Multi-Stage Video Generation

Title: Attention-guided reference point shifting for Gaussian-mixture-based partial point set registration

Title: A Large Scale Benchmark for Test Time Adaptation Methods in Medical Image Segmentation

Title: dots.ocr: Multilingual Document Layout Parsing in a Single Vision-Language Model

Title: GeoDiT: A Diffusion-based Vision-Language Model for Geospatial Understanding

Title: Water Quality Estimation Through Machine Learning Multivariate Analysis

Title: Two-Stage Vision Transformer for Image Restoration: Colorization Pretraining + Residual Upsampling

Title: Decentralized Fairness Aware Multi Task Federated Learning for VR Network

Title: SkyMoE: A Vision-Language Foundation Model for Enhancing Geospatial Interpretation with Mixture of Experts

Title: On the Problem of Consistent Anomalies in Zero-Shot Anomaly Detection

Title: WeMMU: Enhanced Bridging of Vision-Language Models and Diffusion Models via Noisy Query Tokens

Title: AVGGT: Rethinking Global Attention for Accelerating VGGT

Title: CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning

Title: What Signals Really Matter for Misinformation Tasks? Evaluating Fake-News Detection and Virality Prediction under Real-World Constraints

Title: OmniPerson: Unified Identity-Preserving Pedestrian Generation

Title: ADORE: Autonomous Domain-Oriented Relevance Engine for E-commerce

Title: DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Title: From Panel to Pixel: Zoom-In Vision-Language Pretraining from Biomedical Scientific Literature

Title: Co-speech Gesture Video Generation via Motion-Based Graph Retrieval

Title: From Imitation to Discrimination: Toward A Generalized Curriculum Advantage Mechanism Enhancing Cross-Domain Reasoning Tasks

Title: GoRL: An Algorithm-Agnostic Framework for Online Reinforcement Learning with Generative Policies

Title: Spoken Conversational Agents with Large Language Models

Title: S3C2 SICP Summit 2025-06: Vulnerability Response Summit

Title: Semigroup action based on skew polynomial evaluation with applications to Cryptography

Title: Modeling and Inverse Identification of Interfacial Heat Conduction in Finite Layer and Semi-Infinite Substrate Systems via a Physics-Guided Neural Framework

Title: PPTBench: Towards Holistic Evaluation of Large Language Models for PowerPoint Layout and Design Understanding

Title: CryptoQA: A Large-scale Question-answering Dataset for AI-assisted Cryptography

Title: Joint Distillation for Fast Likelihood Evaluation and Sampling in Flow-based Models

Title: Leveraging Large-Scale Pretrained Spatial-Spectral Priors for General Zero-Shot Pansharpening

Title: Hear What Matters! Text-conditioned Selective Video-to-Audio Generation

Title: Adaptive Weighted LSSVM for Multi-View Classification

Title: Cybersecurity AI: The World's Top AI Agent for Security Capture-the-Flag (CTF)

Title: Distill, Forget, Repeat: A Framework for Continual Unlearning in Text-to-Image Diffusion Models

Title: Spatially-Grounded Document Retrieval via Patch-to-Region Relevance Propagation

Title: PolarGuide-GSDR: 3D Gaussian Splatting Driven by Polarization Priors and Deferred Reflection for Real-World Reflective Scenes

Title: Input Order Shapes LLM Semantic Alignment in Multi-Document Summarization

Title: Graph VQ-Transformer (GVT): Fast and Accurate Molecular Generation via High-Fidelity Discrete Latents

Title: PGP-DiffSR: Phase-Guided Progressive Pruning for Efficient Diffusion-based Image Super-Resolution

Title: Unsupervised Structural Scene Decomposition via Foreground-Aware Slot Attention with Pseudo-Mask Guidance

Title: ClimaOoD: Improving Anomaly Segmentation via Physically Realistic Synthetic Data

Title: An Empirical Survey of Model Merging Algorithms for Social Bias Mitigation

Title: ALDI-ray: Adapting the ALDI Framework for Security X-ray Object Detection

Title: GeoBridge: A Semantic-Anchored Multi-View Foundation Model Bridging Images and Text for Geo-Localization

Title: Tissue-mask supported inter-subject whole-body image registration in the UK Biobank - A method benchmarking study

Title: CREST: Universal Safety Guardrails Through Cluster-Guided Cross-Lingual Transfer

Title: GeoViS: Geospatially Rewarded Visual Search for Remote Sensing Visual Grounding

Title: Emergent Bayesian Behaviour and Optimal Cue Combination in LLMs

Title: DF-Mamba: Deformable State Space Modeling for 3D Hand Pose Estimation in Interactions

Title: Reasoning-Aware Multimodal Fusion for Hateful Video Detection

Title: AttMetNet: Attention-Enhanced Deep Neural Network for Methane Plume Detection in Sentinel-2 Satellite Imagery

Title: PEFT-Factory: Unified Parameter-Efficient Fine-Tuning of Autoregressive Large Language Models

Title: Towards Unification of Hallucination Detection and Fact Verification for Large Language Models

Title: Rethinking Surgical Smoke: A Smoke-Type-Aware Laparoscopic Video Desmoking Method and Dataset

Title: LumiX: Structured and Coherent Text-to-Intrinsic Generation

Title: FiMMIA: scaling semantic perturbation-based membership inference across modalities

Title: TrackNetV5: Residual-Driven Spatio-Temporal Refinement and Motion Direction Decoupling for Fast Object Tracking

Title: PhyCustom: Towards Realistic Physical Customization in Text-to-Image Generation

Title: TriLex: A Framework for Multilingual Sentiment Analysis in Low-Resource South African Languages

Title: SR-GRPO: Stable Rank as an Intrinsic Geometric Reward for Large Language Model Alignment

Title: A benchmark dataset for evaluating Syndrome Differentiation and Treatment in large language models

Title: Decryption through polynomial ambiguity: noise-enhanced high-memory convolutional codes for post-quantum cryptography

Title: From Navigation to Refinement: Revealing the Two-Stage Nature of Flow-based Diffusion Models through Oracle Velocity

Title: Defense That Attacks: How Robust Models Become Better Attackers

Title: ReVSeg: Incentivizing the Reasoning Chain for Video Segmentation with Reinforcement Learning

Title: promptolution: A Unified, Modular Framework for Prompt Optimization

Title: Cross-Lingual Prompt Steerability: Towards Accurate and Robust LLM Behavior across Languages

Title: Bangla Hate Speech Classification with Fine-tuned Transformer Models

Title: Action Anticipation at a Glimpse: To What Extent Can Multimodal Cues Replace Video?

Title: Are Detectors Fair to Indian IP-AIGC? A Cross-Generator Study

Title: Adaptive Decentralized Federated Learning for Robust Optimization

Title: MICCAI STSR 2025 Challenge: Semi-Supervised Teeth and Pulp Segmentation and CBCT-IOS Registration

Title: Taming Camera-Controlled Video Generation with Verifiable Geometry Reward

Title: OptPO: Optimal Rollout Allocation for Test-time Policy Optimization

Title: Fast-Decoding Diffusion Language Models via Progress-Aware Confidence Schedules

Title: MindGPT-4ov: An Enhanced MLLM via a Multi-Stage Post-Training Paradigm

Title: Polar Perspectives: Evaluating 2-D LiDAR Projections for Robust Place Recognition with Visual Foundation Models

Title: Glance: Accelerating Diffusion Models with 1 Sample

Title: FAIRY2I: Universal Extremely-Low Bit QAT framework via Widely-Linear Representation and Phase-Aware Quantization

Title: MRD: Multi-resolution Retrieval-Detection Fusion for High-Resolution Image Understanding

Title: Belobog: Move Language Fuzzing Framework For Real-World Smart Contracts

Title: AutoNeural: Co-Designing Vision-Language Models for NPU Inference

Title: DiverseAR: Boosting Diversity in Bitwise Autoregressive Image Generation

Title: LoVoRA: Text-guided and Mask-free Video Object Removal and Addition with Learnable Object-aware Localization

Title: Layout Anything: One Transformer for Universal Room Layout Estimation

Title: A Lightweight Real-Time Low-Light Enhancement Network for Embedded Automotive Vision Systems

Title: BEVDilation: LiDAR-Centric Multi-Modal Fusion for 3D Object Detection

Title: Contextual Image Attack: How Visual Context Exposes Multimodal Safety Vulnerabilities

Title: InEx: Hallucination Mitigation via Introspection and Cross-Modal Multi-Agent Collaboration

Title: U4D: Uncertainty-Aware 4D World Modeling from LiDAR Sequences

Title: Fine-Tuned Large Language Models for Logical Translation: Reducing Hallucinations with Lang2Logic

Title: GraphFusion3D: Dynamic Graph Attention Convolution with Adaptive Cross-Modal Transformer for 3D Object Detection

Title: TEXTRIX: Latent Attribute Grid for Native Texture Generation and Beyond

Title: DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images

Title: In-Context Sync-LoRA for Portrait Video Editing

Title: Instant Video Models: Universal Adapters for Stabilizing Image-Based Networks

Title: AutoBrep: Autoregressive B-Rep Generation with Unified Topology and Geometry

Title: Distribution-Calibrated Inference time compute for Thinking LLM-as-a-Judge

Title: Unrolled Networks are Conditional Probability Flows in MRI Reconstruction

Title: TokenPowerBench: Benchmarking the Power Consumption of LLM Inference

Title: The Moral Consistency Pipeline: Continuous Ethical Evaluation for Large Language Models

Title: MAViD: A Multimodal Framework for Audio-Visual Dialogue Understanding and Generation

Title: Video4Spatial: Towards Visuospatial Intelligence with Context-Guided Video Generation

Title: OneThinker: All-in-one Reasoning Model for Image and Video

Title: CAMEO: Correspondence-Attention Alignment for Multi-View Diffusion Models

Title: MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues