2025-06-27

Title: Progressive Size-Adaptive Federated Learning: A Comprehensive Framework for Heterogeneous Multi-Modal Data Systems

Title: E-ABIN: an Explainable module for Anomaly detection in BIological Networks

Title: Diffusion Tree Sampling: Scalable inference-time alignment of diffusion models

Title: On Convolutions, Intrinsic Dimension, and Diffusion Models

Title: Test-time Scaling Techniques in Theoretical Physics -- A Comparison of Methods on the TPBench Dataset

Title: OTSurv: A Novel Multiple Instance Learning Framework for Survival Prediction with Heterogeneity-aware Optimal Transport

Title: A Survey of AI for Materials Science: Foundation Models, LLM Agents, Datasets, and Tools

Title: Multiple Streams of Relation Extraction: Enriching and Recalling in Transformers

Title: Towards Probabilistic Question Answering Over Tabular Data

Title: Characterization and Mitigation of Training Instabilities in Microscaling Formats

Title: StereoDiff: Stereo-Diffusion Synergy for Video Depth Estimation

Title: Perry: A High-level Framework for Accelerating Cyber Deception Experimentation

Title: Stochastic and Non-local Closure Modeling for Nonlinear Dynamical Systems via Latent Score-based Generative Models

Title: AI-Driven MRI-based Brain Tumour Segmentation Benchmarking

Title: Stochastic Parameter Decomposition

Title: Multi-lingual Functional Evaluation for Large Language Models

Title: SIMulator: SIM Tracing on a (Pico-)Budget

Title: The Ideation-Execution Gap: Execution Outcomes of LLM-Generated versus Human Research Ideas

Title: Poster: Enhancing GNN Robustness for Network Intrusion Detection via Agent-based Analysis

Title: Divide, Specialize, and Route: A New Approach to Efficient Ensemble Learning

Title: Universal and Efficient Detection of Adversarial Data through Nonuniform Impact on Network Layers

Title: MultiFinRAG: An Optimized Multimodal Retrieval-Augmented Generation (RAG) Framework for Financial Question Answering

Title: Uncovering Hidden Violent Tendencies in LLMs: A Demographic Analysis via Behavioral Vignettes

Title: Leveraging Vision-Language Models to Select Trustworthy Super-Resolution Samples Generated by Diffusion Models

Title: Leaner Training, Lower Leakage: Revisiting Memorization in LLM Fine-Tuning with LoRA

Title: Empowering Digital Agriculture: A Privacy-Preserving Framework for Data Sharing and Collaborative Research

Title: THIRDEYE: Cue-Aware Monocular Depth Estimation via Brain-Inspired Multi-Stage Fusion

Title: MultiHuman-Testbench: Benchmarking Image Generation for Multiple Humans

Title: Omniwise: Predicting GPU Kernels Performance with LLMs

Title: On the Necessity of Output Distribution Reweighting for Effective Class Unlearning

Title: FaSTA$^*$: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing

Title: ZKPROV: A Zero-Knowledge Approach to Dataset Provenance for Large Language Models

Title: Optimising Language Models for Downstream Tasks: A Post-Training Perspective

Title: FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Title: LLM-guided Chemical Process Optimization with a Multi-Agent Approach

Title: M2SFormer: Multi-Spectral and Multi-Scale Attention with Edge-Aware Difficulty Guidance for Image Forgery Localization

Title: KaLM-Embedding-V2: Superior Training Techniques and Data Inspire A Versatile Embedding Model

Title: CodeGuard: A Generalized and Stealthy Backdoor Watermarking for Generative Code Models

Title: Interpretable Representation Learning for Additive Rule Ensembles

Title: SPA: Towards More Stealth and Persistent Backdoor Attacks in Federated Learning

Title: Model State Arithmetic for Machine Unlearning

Title: Hierarchical Sub-action Tree for Continuous Sign Language Recognition

Title: Antibody Design and Optimization with Multi-scale Equivariant Graph Diffusion Models for Accurate Complex Antigen Binding

Title: Evidence-based diagnostic reasoning with multi-agent copilot for human pathology

Title: DFVEdit: Conditional Delta Flow Vector for Zero-shot Video Editing

Title: From Cradle to Cane: A Two-Pass Framework for High-Fidelity Lifespan Face Aging

Title: PrivacyGo: Privacy-Preserving Ad Measurement with Multidimensional Intersection

Title: Rethink Sparse Signals for Pose-guided Text-to-image Generation

Title: Segment Anything in Pathology Images with Natural Language

Title: Can Gradient Descent Simulate Prompting?

Title: TSDASeg: A Two-Stage Model with Direct Alignment for Interactive Point Cloud Segmentation

Title: SAC: A Framework for Measuring and Inducing Personality Traits in LLMs with Dynamic Intensity Control

Title: DBMovi-GS: Dynamic View Synthesis from Blurry Monocular Video via Sparse-Controlled Gaussian Splatting

Title: Style-Aligned Image Composition for Robust Detection of Abnormal Cells in Cytopathology

Title: Inverse Scene Text Removal

Title: Distilling Normalizing Flows

Title: Detection of Breast Cancer Lumpectomy Margin with SAM-incorporated Forward-Forward Contrastive Learning

Title: The Aging Multiverse: Generating Condition-Aware Facial Aging Tree via Training-Free Diffusion

Title: FedSC: Federated Learning with Semantic-Aware Collaboration

Title: HybridQ: Hybrid Classical-Quantum Generative Adversarial Network for Skin Disease Image Generation

Title: Multimodal Prompt Alignment for Facial Expression Recognition

Title: LASFNet: A Lightweight Attention-Guided Self-Modulation Feature Fusion Network for Multimodal Object Detection

Title: Instella-T2I: Pushing the Limits of 1D Discrete Latent Space Image Generation

Title: Large Language Models Acing Chartered Accountancy

Title: DidSee: Diffusion-Based Depth Completion for Material-Agnostic Robotic Perception and Manipulation

Title: Little By Little: Continual Learning via Self-Activated Sparse Mixture-of-Rank Adaptive Learning

Title: An Information-Theoretic Analysis for Federated Learning under Concept Drift

Title: Boosting Domain Generalized and Adaptive Detection with Diffusion Models: Fitness, Generalization, and Transferability

Title: Improving Diffusion-Based Image Editing Faithfulness via Guidance and Scheduling

Title: Boosting Generative Adversarial Transferability with Self-supervised Vision Transformer Features

Title: MT2-CSD: A New Dataset and Multi-Semantic Knowledge Fusion Method for Conversational Stance Detection

Title: FedDAA: Dynamic Client Clustering for Concept Drift Adaptation in Federated Learning

Title: Class-Agnostic Region-of-Interest Matching in Document Images

Title: SAMURAI: Shape-Aware Multimodal Retrieval for 3D Object Identification

Title: TEMPEST-LoRa: Cross-Technology Covert Communication

Title: Enhancing LLM Tool Use with High-quality Instruction Data from Knowledge Graph

Title: Chain-of-Thought Enhanced Shallow Transformers for Wireless Symbol Detection

Title: FeDa4Fair: Client-Level Federated Datasets for Fairness Evaluation

Title: OracleFusion: Assisting the Decipherment of Oracle Bone Script with Structurally Constrained Semantic Typography

Title: Interpretable Hierarchical Concept Reasoning through Attention-Guided Graph Learning

Title: Learning to Skip the Middle Layers of Transformers

Title: PhishKey: A Novel Centroid-Based Approach for Enhanced Phishing Detection Using Adaptive HTML Component Extraction

Title: Unlasting: Unpaired Single-Cell Multi-Perturbation Estimation by Dual Conditional Diffusion Implicit Bridges

Title: IPFormer-VideoLLM: Enhancing Multi-modal Video Understanding for Multi-shot Scenes

Title: CL-Splats: Continual Learning of Gaussian Splatting with Local Optimization

Title: Progtuning: Progressive Fine-tuning Framework for Transformer-based Language Models

Title: Robust Policy Switching for Antifragile Reinforcement Learning for UAV Deconfliction in Adversarial Environments

Title: Curriculum-Guided Antifragile Reinforcement Learning for Secure UAV Deconfliction under Observation-Space Attacks

Title: Learning to See in the Extremely Dark

Title: Inside Job: Defending Kubernetes Clusters Against Network Misconfigurations

Title: YOLO-FDA: Integrating Hierarchical Attention and Detail Enhancement for Surface Defect Detection

Title: NaLaFormer: Norm-Aware Linear Attention for Transformer Models

Title: DBConformer: Dual-Branch Convolutional Transformer for EEG Decoding

Title: Generative Adversarial Evasion and Out-of-Distribution Detection for UAV Cyber-Attacks

Title: Personalized Federated Learning via Dual-Prompt Optimization and Cross Fusion

Title: Tree-based Semantic Losses: Application to Sparsely-supervised Large Multi-class Hyperspectral Segmentation

Title: Robust Deep Learning for Myocardial Scar Segmentation in Cardiac MRI with Noisy Labels

Title: Geometry and Perception Guided Gaussians for Multiview-consistent 3D Generation from a Single Image

Title: Diverse Mini-Batch Selection in Reinforcement Learning for Efficient Chemical Exploration in de novo Drug Design

Title: Topology-Aware Modeling for Unsupervised Simulation-to-Reality Point Cloud Recognition

Title: Compressed and Smooth Latent Space for Text Diffusion Modeling

Title: Maintaining MTEB: Towards Long Term Usability and Reproducibility of Embedding Benchmarks

Title: Task-Aware KV Compression For Cost-Effective Long Video Understanding

Title: Artificial Delegates Resolve Fairness Issues in Perpetual Voting with Partial Turnout

Title: GroundFlow: A Plug-in Module for Temporal Reasoning on 3D Point Cloud Sequential Grounding

Title: Prompt-Guided Turn-Taking Prediction

Title: Unlocking Constraints: Source-Free Occlusion-Aware Seamless Segmentation

Title: MedPrompt: LLM-CNN Fusion with Weight Routing for Medical Image Segmentation and Classification

Title: BitMark for Infinity: Watermarking Bitwise Autoregressive Image Generative Models

Title: Complexity-aware fine-tuning

Title: Enhancing Automatic Term Extraction with Large Language Models via Syntactic Retrieval

Title: ReME: A Data-Centric Framework for Training-Free Open-Vocabulary Segmentation

Title: Real-Time ESFP: Estimating, Smoothing, Filtering, and Pose-Mapping

Title: DiMPLe -- Disentangled Multi-Modal Prompt Learning: Enhancing Out-Of-Distribution Alignment with Invariant and Spurious Feature Separation

Title: Zero-Shot Learning for Obsolescence Risk Forecasting

Title: Temporal Rate Reduction Clustering for Human Motion Segmentation

Title: Agent-RewardBench: Towards a Unified Benchmark for Reward Modeling across Perception, Planning, and Safety in Real-World Multimodal Agents

Title: DiLoCoX: A Low-Communication Large-Scale Training Framework for Decentralized Cluster

Title: Video Virtual Try-on with Conditional Diffusion Transformer Inpainter

Title: Cat and Mouse -- Can Fake Text Generation Outpace Detector Systems?

Title: HumanOmniV2: From Understanding to Omni-Modal Reasoning with Context

Title: Double-Checker: Enhancing Reasoning of Slow-Thinking LLMs via Self-Critical Fine-Tuning

Title: HieraSurg: Hierarchy-Aware Diffusion Model for Surgical Video Generation

Title: Small Encoders Can Rival Large Decoders in Detecting Groundedness

Title: Detecting Referring Expressions in Visually Grounded Dialogue with Autoregressive Language Models

Title: Balancing Privacy and Utility in Correlated Data: A Study of Bayesian Differential Privacy

Title: Continual Self-Supervised Learning with Masked Autoencoders in Remote Sensing

Title: DrishtiKon: Multi-Granular Visual Grounding for Text-Rich Document Images

Title: Latent Prototype Routing: Achieving Near-Perfect Load Balancing in Mixture-of-Experts

Title: Holistic Surgical Phase Recognition with Hierarchical Input Dependent State Space Models

Title: AGTCNet: A Graph-Temporal Approach for Principled Motor Imagery EEG Classification

Title: DynamicBench: Evaluating Real-Time Report Generation in Large Language Models

Title: PanSt3R: Multi-view Consistent Panoptic Segmentation

Title: Generalizable Neural Electromagnetic Inverse Scattering

Title: Lipschitz Bounds for Persistent Laplacian Eigenvalues under One-Simplex Insertions

Title: SMMILE: An Expert-Driven Benchmark for Multimodal Medical In-Context Learning

Title: ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

Title: Structuralist Approach to AI Literary Criticism: Leveraging Greimas Semiotic Square for Large Language Models

Title: GenFlow: Interactive Modular System for Image Generation

Title: Leveraging LLM-Assisted Query Understanding for Live Retrieval-Augmented Generation

Title: Early Stopping Tabular In-Context Learning

Title: Curve-Aware Gaussian Splatting for 3D Parametric Curve Reconstruction

Title: Scalable Bayesian Low-Rank Adaptation of Large Language Models via Stochastic Variational Subspace Inference

Title: Distributed Cross-Channel Hierarchical Aggregation for Foundation Models

Title: XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation

Title: Flow-Based Single-Step Completion for Efficient and Expressive Policy Learning

Title: HyperSORT: Self-Organising Robust Training with hyper-networks

Title: Domain Knowledge-Enhanced LLMs for Fraud and Concept Drift Detection

Title: Text2Cypher Across Languages: Evaluating Foundational Models Beyond English

Title: Controllable 3D Placement of Objects with Scene-Aware Diffusion Models

Title: A Comprehensive Dataset for Underground Miner Detection in Diverse Scenario

Title: Rethinking Oversaturation in Classifier-Free Guidance via Low Frequency

Title: Aligning Spoken Dialogue Models from User Interactions

Title: TopK Language Models

Title: Logios : An open source Greek Polytonic Optical Character Recognition system

Title: Towards Reliable Detection of Empty Space: Conditional Marked Point Processes for Object Detection

Title: Bridging Offline and Online Reinforcement Learning for LLMs

Title: Potemkin Understanding in Large Language Models

Title: "What's Up, Doc?": Analyzing How Users Seek Health Information in Large-Scale Conversational AI Datasets

Title: Maximal Matching Matters: Preventing Representation Collapse for Robust Cross-Modal Retrieval

Title: DeOcc-1-to-3: 3D De-Occlusion from a Single Image via Self-Supervised Multi-View Diffusion

Title: HalluSegBench: Counterfactual Visual Reasoning for Segmentation Hallucination Evaluation

Title: SAM4D: Segment Anything in Camera and LiDAR Streams

Title: SiM3D: Single-instance Multiview Multimodal and Multisetup 3D Anomaly Detection Benchmark

Title: mTSBench: Benchmarking Multivariate Time Series Anomaly Detection and Model Selection at Scale

Title: Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

Title: Whole-Body Conditioned Egocentric Video Prediction