2025-05-06

Title: Multi-party Collaborative Attention Control for Image Customization

Title: Explainable AI-Driven Detection of Human Monkeypox Using Deep Learning and Vision Transformers: A Comprehensive Analysis

Title: Deconstructing Bias: A Multifaceted Framework for Diagnosing Cultural and Compositional Inequities in Text-to-Image Generative Models

Title: ZS-VCOS: Zero-Shot Outperforms Supervised Video Camouflaged Object Segmentation

Title: Firewall Regulatory Networks for Autonomous Cyber Defense

Title: Enhancing IoT-Botnet Detection using Variational Auto-encoder and Cost-Sensitive Learning: A Deep Learning Approach for Imbalanced Datasets

Title: Global Stress Generation and Spatiotemporal Super-Resolution Physics-Informed Operator under Dynamic Loading for Two-Phase Random Materials

Title: Explainable AI for Correct Root Cause Analysis of Product Quality in Injection Moulding

Title: OpenAVS: Training-Free Open-Vocabulary Audio Visual Segmentation with Foundational Models

Title: COSMOS: Predictable and Cost-Effective Adaptation of LLMs

Title: Towards Film-Making Production Dialogue, Narration, Monologue Adaptive Moving Dubbing Benchmarks

Title: Sparsification Under Siege: Defending Against Poisoning Attacks in Communication-Efficient Federated Learning

Title: Unlearning Sensitive Information in Multimodal LLMs: Benchmark and Attack-Defense Evaluation

Title: MoxE: Mixture of xLSTM Experts with Entropy-Aware Routing for Efficient Language Modeling

Title: Development of an Adapter for Analyzing and Protecting Machine Learning Models from Competitive Activity in the Networks Services

Title: Enhancing the Cloud Security through Topic Modelling

Title: SafeTab-P: Disclosure Avoidance for the 2020 Census Detailed Demographic and Housing Characteristics File A (Detailed DHC-A)

Title: Watermark Overwriting Attack on StegaStamp algorithm

Title: SymPlanner: Deliberate Planning in Language Models with Symbolic Representation

Title: VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations for Synthetic Videos

Title: LLM Watermarking Using Mixtures and Statistical-to-Computational Gaps

Title: Explainable Machine Learning for Cyberattack Identification from Traffic Flows

Title: Machine Learning for Cyber-Attack Identification from Traffic Flows

Title: WorldGenBench: A World-Knowledge-Integrated Benchmark for Reasoning-Driven Text-to-Image Generation

Title: Securing the Future of IVR: AI-Driven Innovation with Agile Security, Data Regulation, and Ethical AI Integration

Title: Rubber Mallet: A Study of High Frequency Localized Bit Flips and Their Impact on Security

Title: Subset Selection for Fine-Tuning: A Utility-Diversity Balanced Approach for Mathematical Domain Adaptation

Title: The DCR Delusion: Measuring the Privacy Risk of Synthetic Data

Title: Automated Parsing of Engineering Drawings for Structured Information Extraction Using a Fine-tuned Document Understanding Transformer

Title: Rethinking RGB-Event Semantic Segmentation with a Novel Bidirectional Motion-enhanced Event Representation

Title: A Sensor Agnostic Domain Generalization Framework for Leveraging Geospatial Foundation Models: Enhancing Semantic Segmentation viaSynergistic Pseudo-Labeling and Generative Learning

Title: On the effectiveness of Large Language Models in the mechanical design domain

Title: AI agents may be worth the hype but not the resources (yet): An initial exploration of machine translation quality and costs in three language pairs in the legal and news domains

Title: PainFormer: a Vision Foundation Model for Automatic Pain Assessment

Title: TEMPURA: Temporal Event Masked Prediction and Understanding for Reasoning in Action

Title: Understanding and Exploiting Plasticity for Non-stationary Network Resource Adaptation

Title: Machine Learning Fairness in House Price Prediction: A Case Study of America's Expanding Metropolises

Title: PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents

Title: Always Tell Me The Odds: Fine-grained Conditional Probability Estimation

Title: Multimodal and Multiview Deep Fusion for Autonomous Marine Navigation

Title: Don't be lazy: CompleteP enables compute-efficient deep transformers

Title: A Domain Adaptation of Large Language Models for Classifying Mechanical Assembly Components

Title: Toward Onboard AI-Enabled Solutions to Space Object Detection for Space Sustainability

Title: Causally Fair Node Classification on Non-IID Graph Data

Title: A Novel WaveInst-based Network for Tree Trunk Structure Extraction and Pattern Analysis in Forest Inventory

Title: A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency

Title: Automated ARAT Scoring Using Multimodal Video Analysis, Multi-View Fusion, and Hierarchical Bayesian Models: A Clinician Study

Title: High-Fidelity Pseudo-label Generation by Large Language Models for Training Robust Radiology Report Classifiers

Title: Component-Based Fairness in Face Attribute Classification with Bayesian Network-informed Meta Learning

Title: Knowledge-Augmented Language Models Interpreting Structured Chest X-Ray Findings

Title: Vision and Intention Boost Large Language Model in Long-Term Action Anticipation

Title: Probabilistic Interactive 3D Segmentation with Hierarchical Neural Processes

Title: PosePilot: Steering Camera Pose for Generative World Models with Self-supervised Depth

Title: Efficient Shapley Value-based Non-Uniform Pruning of Large Language Models

Title: Co$^{3}$Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion

Title: Unified Steganography via Implicit Neural Representation

Title: Same evaluation, more tokens: On the effect of input length for machine translation evaluation using Large Language Models

Title: Multimodal Graph Representation Learning for Robust Surgical Workflow Recognition with Adversarial Feature Disentanglement

Title: Energy-Efficient NTT Sampler for Kyber Benchmarked on FPGA

Title: Context-Aware Online Conformal Anomaly Detection with Prediction-Powered Data Acquisition

Title: Privacy Preserving Machine Learning Model Personalization through Federated Personalized Learning

Title: A Multimodal Framework for Explainable Evaluation of Soft Skills in Educational Environments

Title: Distinguishing AI-Generated and Human-Written Text Through Psycholinguistic Analysis

Title: Not Every Tree Is a Forest: Benchmarking Forest Types from Satellite Remote Sensing

Title: Conformal Prediction for Indoor Positioning with Correctness Coverage Guarantees

Title: Backdoor Attacks Against Patch-based Mixture of Experts

Title: $\textit{New News}$: System-2 Fine-tuning for Robust Integration of New Knowledge

Title: Rogue Cell: Adversarial Attack and Defense in Untrusted O-RAN Setup Exploiting the Traffic Steering xApp

Title: An LSTM-PINN Hybrid Method to the specific problem of population forecasting

Title: Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning

Title: PhytoSynth: Leveraging Multi-modal Generative Models for Crop Disease Data Generation with Novel Benchmarking and Prompt Engineering Approach

Title: CVVNet: A Cross-Vertical-View Network for Gait Recognition

Title: MVHumanNet++: A Large-scale Dataset of Multi-view Daily Dressing Human Captures with Richer Annotations for 3D Human Digitization

Title: Mitigating Group-Level Fairness Disparities in Federated Visual Language Models

Title: Intra-Layer Recurrence in Transformers for Language Modeling

Title: DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic Fusion

Title: PQS-BFL: A Post-Quantum Secure Blockchain-based Federated Learning Framework

Title: Positional Attention for Efficient BERT-Based Named Entity Recognition

Title: An Approach for Handling Missing Attribute Values in Attribute-Based Access Control Policy Mining

Title: Towards Trustworthy Federated Learning with Untrusted Participants

Title: PhysNav-DG: A Novel Adaptive Framework for Robust VLM-Sensor Fusion in Navigation Applications

Title: CMAWRNet: Multiple Adverse Weather Removal via a Unified Quaternion Neural Architecture

Title: Automated Sentiment Classification and Topic Discovery in Large-Scale Social Media Streams

Title: Rethinking Score Distilling Sampling for 3D Editing and Generation

Title: OODTE: A Differential Testing Engine for the ONNX Optimizer

Title: CAMOUFLAGE: Exploiting Misinformation Detection Systems Through LLM-driven Adversarial Claim Transformation

Title: From Players to Champions: A Generalizable Machine Learning Approach for Match Outcome Prediction with Insights from the FIFA World Cup

Title: LookAlike: Consistent Distractor Generation in Math MCQs

Title: BOOM: Benchmarking Out-Of-distribution Molecular Property Predictions of Machine Learning Models

Title: Unemployment Dynamics Forecasting with Machine Learning Regression Models

Title: GauS-SLAM: Dense RGB-D SLAM with Gaussian Surfels

Title: UK Finfluencers: Exploring Content, Reach, and Responsibility

Title: Multi-Scale Graph Learning for Anti-Sparse Downscaling

Title: Segment Any RGB-Thermal Model with Language-aided Distillation

Title: A Comprehensive Analysis for Visual Object Hallucination in Large Vision-Language Models

Title: EnsembleCI: Ensemble Learning for Carbon Intensity Forecasting

Title: Analyzing Cognitive Differences Among Large Language Models through the Lens of Social Worldview

Title: MC3D-AD: A Unified Geometry-aware Reconstruction Model for Multi-category 3D Anomaly Detection

Title: Visual Dominance and Emerging Multimodal Approaches in Distracted Driving Detection: A Review of Machine Learning Techniques

Title: A Survey on Privacy Risks and Protection in Large Language Models

Title: LLM-based Text Simplification and its Effect on User Comprehension and Cognitive Load

Title: Always Skip Attention

Title: Restoring Calibration for Aligned Large Language Models: A Calibration-Aware Fine-Tuning Approach

Title: Triple-identity Authentication: The Future of Secure Access

Title: Efficient Noise Calculation in Deep Learning-based MRI Reconstructions

Title: Towards Safer Pretraining: Analyzing and Filtering Harmful Content in Webscale datasets for Responsible LLMs

Title: CASA: CNN Autoencoder-based Score Attention for Efficient Multivariate Long-term Time-series Forecasting

Title: MLLM-Enhanced Face Forgery Detection: A Vision-Language Fusion Solution

Title: Wide & Deep Learning for Node Classification

Title: Secrets of GFlowNets' Learning Behavior: A Theoretical Study

Title: Point2Primitive: CAD Reconstruction from Point Cloud by Direct Primitive Prediction

Title: Regression s all you need for medical image translation

Title: Transforming faces into video stories -- VideoFace2.0

Title: RTV-Bench: Benchmarking MLLM Continuous Perception, Understanding and Reasoning through Real-Time Video

Title: Hierarchical Compact Clustering Attention (COCA) for Unsupervised Object-Centric Learning

Title: Lightweight Defense Against Adversarial Attacks in Time Series Classification

Title: Learning Local Causal World Models with State Space Models and Attention

Title: Benchmarking Feature Upsampling Methods for Vision Foundation Models using Interactive Segmentation

Title: Open Challenges in Multi-Agent Security: Towards Secure Systems of Interacting AI Agents

Title: LecEval: An Automated Metric for Multimodal Knowledge Acquisition in Multimedia Learning

Title: LLM-OptiRA: LLM-Driven Optimization of Resource Allocation for Non-Convex Problems in Wireless Communications

Title: SkillMimic-V2: Learning Robust and Generalizable Interaction Skills from Sparse and Noisy Demonstrations

Title: Deep Representation Learning for Electronic Design Automation

Title: GRAIL: Graph Edit Distance and Node Alignment Using LLM-Generated Code

Title: Efficient Multivariate Time Series Forecasting via Calibrated Language Models with Privileged Knowledge Distillation

Title: Exploring the Potential of Offline RL for Reasoning in LLMs: A Preliminary Study

Title: QiMeng-Xpiler: Transcompiling Tensor Programs for Deep Learning Systems with a Neural-Symbolic Approach

Title: Local Herb Identification Using Transfer Learning: A CNN-Powered Mobile Application for Nepalese Flora

Title: Spotting the Unexpected (STU): A 3D LiDAR Dataset for Anomaly Segmentation in Autonomous Driving

Title: Small Clips, Big Gains: Learning Long-Range Refocused Temporal Information for Video Super-Resolution

Title: Focus What Matters: Matchability-Based Reweighting for Local Feature Matching

Title: Incorporating Legal Structure in Retrieval-Augmented Generation: A Case Study on Copyright Fair Use

Title: A New HOPE: Domain-agnostic Automatic Evaluation of Text Chunking

Title: Identifying Legal Holdings with LLMs: A Systematic Study of Performance, Scale, and Memorization

Title: Saliency-Guided Training for Fingerprint Presentation Attack Detection

Title: Measuring Hong Kong Massive Multi-Task Language Understanding

Title: ProDisc-VAD: An Efficient System for Weakly-Supervised Anomaly Detection in Video Surveillance Applications

Title: Robust AI-Generated Face Detection with Imbalanced Data

Title: DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customization

Title: DNAZEN: Enhanced Gene Sequence Representations via Mixed Granularities of Coding Units

Title: An Empirical Study of Qwen3 Quantization

Title: Enhanced Outsourced and Secure Inference for Tall Sparse Decision Trees

Title: Risk Assessment and Threat Modeling for safe autonomous driving technology

Title: SEval-Ex: A Statement-Level Framework for Explainable Summarization Evaluation

Title: Improving Physical Object State Representation in Text-to-Image Generative Systems

Title: Federated Causal Inference in Healthcare: Methods, Challenges, and Applications

Title: Performance Analysis and Deployment Considerations of Post-Quantum Cryptography for Consumer Electronics

Title: Quantizing Diffusion Models from a Sampling-Aware Perspective

Title: RISE: Radius of Influence based Subgraph Extraction for 3D Molecular Graph Explanation

Title: Personalisation or Prejudice? Addressing Geographic Bias in Hate Speech Detection using Debias Tuning in Large Language Models

Title: Enhancing AI Face Realism: Cost-Efficient Quality Improvement in Distilled Diffusion Models with a Fully Synthetic Dataset

Title: Parameter-Efficient Transformer Embeddings

Title: Demystifying optimized prompts in language models

Title: Epistemic Wrapping for Uncertainty Quantification

Title: Entropy-Guided Sampling of Flat Modes in Discrete Spaces

Title: Adaptive Scoring and Thresholding with Human Feedback for Robust Out-of-Distribution Detection

Title: Generative Sign-description Prompts with Multi-positive Contrastive Learning for Sign Language Recognition

Title: Optimizing LLMs for Resource-Constrained Environments: A Survey of Model Compression Techniques

Title: Invoke Interfaces Only When Needed: Adaptive Invocation for Large Language Models in Question Answering

Title: VAEmo: Efficient Representation Learning for Visual-Audio Emotion with Knowledge Injection

Title: 6D Pose Estimation on Spoons and Hands

Title: An End-to-End Model For Logits Based Large Language Models Watermarking

Title: Catastrophic Overfitting, Entropy Gap and Participation Ratio: A Noiseless $l^p$ Norm Solution for Fast Adversarial Training

Title: Advancing Email Spam Detection: Leveraging Zero-Shot Learning and Large Language Models

Title: Sharpness-Aware Minimization with Z-Score Gradient Filtering for Neural Networks

Title: EntroLLM: Entropy Encoded Weight Compression for Efficient Large Language Model Inference on Edge Devices

Title: Connecting Thompson Sampling and UCB: Towards More Efficient Trade-offs Between Privacy and Regret

Title: RM-R1: Reward Modeling as Reasoning

Title: MetaScenes: Towards Automated Replica Creation for Real-world 3D Scans

Title: Quantitative Analysis of Performance Drop in DeepSeek Model Quantization

Title: Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL

Title: Moneros Decentralized P2P Exchanges: Functionality, Adoption, and Privacy Risks

Title: Uncertainty-Weighted Image-Event Multimodal Fusion for Video Anomaly Detection

Title: Token Coordinated Prompt Attention is Needed for Visual Prompting

Title: Encrypted Federated Search Using Homomorphic Encryption

Title: T2S: High-resolution Time Series Generation with Text-to-Series Diffusion Models

Title: Towards One-shot Federated Learning: Advances, Challenges, and Future Directions

Title: FairPO: Robust Preference Optimization for Fair Multi-Label Learning

Title: A New Approach to Backtracking Counterfactual Explanations: A Causal Framework for Efficient Model Interpretability

Title: Colombian Waitresses y Jueces canadienses: Gender and Country Biases in Occupation Recommendations from LLMs

Title: Targeted Fuzzing for Unsafe Rust Code: Leveraging Selective Instrumentation

Title: Timing Is Everything: Finding the Optimal Fusion Points in Multimodal Medical Imaging

Title: Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal Interaction

Title: Finger Pose Estimation for Under-screen Fingerprint Sensor

Title: SEFE: Superficial and Essential Forgetting Eliminator for Multimodal Continual Instruction Tuning

Title: Bayesian Robust Aggregation for Federated Learning

Title: Dynamic Graph-based Fingerprinting of In-browser Cryptomining

Title: An Efficient Hybrid Key Exchange Mechanism

Title: Unveiling the Landscape of LLM Deployment in the Wild: An Empirical Study

Title: Exploring Design Choices for Autoregressive Deep Learning Climate Models

Title: FedSDAF: Leveraging Source Domain Awareness for Enhanced Federated Domain Generalization

Title: Attestable builds: compiling verifiable binaries on untrusted systems using trusted execution environments

Title: Text to Image Generation and Editing: A Survey

Title: RobSurv: Vector Quantization-Based Multi-Modal Learning for Robust Cancer Survival Prediction

Title: Marker-Based Extrinsic Calibration Method for Accurate Multi-Camera 3D Reconstruction

Title: Lazy But Effective: Collaborative Personalized Federated Learning with Heterogeneous Data

Title: Robust Duality Learning for Unsupervised Visible-Infrared Person Re-Identfication

Title: Bielik v3 Small: Technical Report

Title: Robustness questions the interpretability of graph neural networks: what to do?

Title: Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

Title: Rethinking Federated Graph Learning: A Data Condensation Perspective

Title: EMORL: Ensemble Multi-Objective Reinforcement Learning for Efficient and Flexible LLM Fine-Tuning

Title: Towards Cross-Modality Modeling for Time Series Analytics: A Survey in the LLM Era

Title: RGBX-DiffusionDet: A Framework for Multi-Modal RGB-X Object Detection Using DiffusionDet

Title: DELTA: Dense Depth from Events and LiDAR using Transformer's Attention

Title: Low-Loss Space in Neural Networks is Continuous and Fully Connected

Title: Automatic Proficiency Assessment in L2 English Learners

Title: Mirror Mean-Field Langevin Dynamics

Title: LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

Title: Detect, Classify, Act: Categorizing Industrial Anomalies with Multi-Modal Large Language Models

Title: Enhancing Chemical Reaction and Retrosynthesis Prediction with Large Language Model and Dual-task Learning

Title: MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation

Title: Sim2Real in endoscopy segmentation with a novel structure aware image translation

Title: SCFormer: Structured Channel-wise Transformer with Cumulative Historical State for Multivariate Time Series Forecasting

Title: A Note on Statistically Accurate Tabular Data Generation Using Large Language Models

Title: A Survey on Progress in LLM Alignment from the Perspective of Reward Design

Title: Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models

Title: Visually-Guided Linguistic Disambiguation for Monocular Depth Scale Recovery

Title: SoK: Stealing Cars Since Remote Keyless Entry Introduction and How to Defend From It

Title: Less is More: Efficient Weight Farcasting with 1-Layer Neural Network

Title: Acoustic Side-Channel Attacks on a Computer Mouse

Title: Knowledge Graphs for Enhancing Large Language Models in Entity Disambiguation

Title: Cooperative Bayesian and variance networks disentangle aleatoric and epistemic uncertainties

Title: Using Knowledge Graphs to harvest datasets for efficient CLIP model training

Title: Platelet enumeration in dense aggregates

Title: Advancing Generalizable Tumor Segmentation with Anomaly-Aware Open-Vocabulary Attention Maps and Frozen Foundation Diffusion Models

Title: Bye-bye, Bluebook? Automating Legal Procedure with Large Language Models

Title: Advances in Automated Fetal Brain MRI Segmentation and Biometry: Insights from the FeTA 2024 Challenge

Title: HSplitLoRA: A Heterogeneous Split Parameter-Efficient Fine-Tuning Framework for Large Language Models

Title: Towards Quantifying the Hessian Structure of Neural Networks

Title: Database-Agnostic Gait Enrollment using SetTransformers

Title: ReplaceMe: Network Simplification via Layer Pruning and Linear Transformations

Title: MUSAR: Exploring Multi-Subject Customization from Single-Subject Dataset via Attention Routing

Title: Towards Dataset Copyright Evasion Attack against Personalized Text-to-Image Diffusion Models

Title: AOR: Anatomical Ontology-Guided Reasoning for Medical Large Multimodal Model in Chest X-Ray Interpretation

Title: No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves

Title: R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

Title: Scenethesis: A Language and Vision Agentic Framework for 3D Scene Generation