2025-04-14

Title: ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use

Title: Metamorphic Testing for Fairness Evaluation in Large Language Models: Identifying Intersectional Bias in LLaMA and GPT

Title: Psychological Health Knowledge-Enhanced LLM-based Social Network Crisis Intervention Text Transfer Recognition Method

Title: Topic mining based on fine-tuning Sentence-BERT and LDA

Title: SEAL: Steerable Reasoning Calibration of Large Language Models for Free

Title: 'Neural howlround' in large language models: a self-reinforcing bias phenomenon, and a dynamic attenuation solution

Title: SafeChat: A Framework for Building Trustworthy Collaborative Assistants and a Case Study of its Usefulness

Title: BiasCause: Evaluate Socially Biased Causal Reasoning of Large Language Models

Title: Linguistic Interpretability of Transformer-based Language Models: a systematic review

Title: More diverse more adaptive: Comprehensive Multi-task Learning for Improved LLM Domain Adaptation in E-commerce

Title: Have we unified image generation and understanding yet? An empirical study of GPT-4o's image generation ability

Title: Self-Bootstrapping for Versatile Test-Time Adaptation

Title: Can Reasoning LLMs Enhance Clinical Document Classification?

Title: Teaching Humans Subtle Differences with DIFFusion

Title: Compositional Flows for 3D Molecule and Synthesis Pathway Co-design

Title: X-DECODE: EXtreme Deblurring with Curriculum Optimization and Domain Equalization

Title: Deep Reinforcement Learning for Day-to-day Dynamic Tolling in Tradable Credit Schemes

Title: Differentially Private Selection using Smooth Sensitivity

Title: ContrastiveGaussian: High-Fidelity 3D Generation with Contrastive Learning and Gaussian Splatting

Title: Multi-view autoencoders for Fake News Detection

Title: Geneshift: Impact of different scenario shift on Jailbreaking LLM

Title: Towards Unconstrained 2D Pose Estimation of the Human Spine

Title: POEM: Precise Object-level Editing via MLLM control

Title: Scaling Laws of Graph Neural Networks for Atomistic Materials Modeling

Title: Benchmarking Suite for Synthetic Aperture Radar Imagery Anomaly Detection (SARIAD) Algorithms

Title: DeepSeek vs. o3-mini: How Well can Reasoning LLMs Evaluate MT and Summarization?

Title: Gen3DEval: Using vLLMs for Automatic Evaluation of Generated 3D Objects

Title: A physics informed neural network approach to simulating ice dynamics governed by the shallow ice approximation

Title: Impact of Language Guidance: A Reproducibility Study

Title: LoRAX: LoRA eXpandable Networks for Continual Synthetic Image Attribution

Title: Beyond Feature Importance: Feature Interactions in Predicting Post-Stroke Rigidity with Graph Explainable AI

Title: Adaptive Bounded Exploration and Intermediate Actions for Data Debiasing

Title: Investigating Vision-Language Model for Point Cloud-based Vehicle Classification

Title: Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora

Title: Learning Object Focused Attention

Title: On the Practice of Deep Hierarchical Ensemble Network for Ad Conversion Rate Prediction

Title: Multi-person Physics-based Pose Estimation for Combat Sports

Title: GenXSS: an AI-Driven Framework for Automated Detection of XSS Attacks in WAFs

Title: TokenMotion: Decoupled Motion Control via Token Disentanglement for Human-centric Video Generation

Title: SAEs $\textit{Can}$ Improve Unlearning: Dynamic Sparse Autoencoder Guardrails for Precision Unlearning in LLMs

Title: The More is not the Merrier: Investigating the Effect of Client Size on Federated Learning

Title: Harnessing the Unseen: The Hidden Influence of Intrinsic Knowledge in Long-Context Language Models

Title: EO-VLM: VLM-Guided Energy Overload Attacks on Vision Models

Title: DrivAer Transformer: A high-precision and fast prediction method for vehicle aerodynamic drag coefficient based on the DrivAerNet++ dataset

Title: VL-UR: Vision-Language-guided Universal Restoration of Images Degraded by Adverse Weather Conditions

Title: DaemonSec: Examining the Role of Machine Learning for Daemon Security in Linux Environments

Title: Out of Style: RAG's Fragility to Linguistic Variation

Title: Millions of States: Designing a Scalable MoE Architecture with RWKV-7 Meta-learner

Title: Knowledge Distillation for Underwater Feature Extraction and Matching via GAN-synthesized Images

Title: Understanding the Impact of Data Domain Extraction on Synthetic Data Privacy

Title: CoProSketch: Controllable and Progressive Sketch Generation with Diffusion Model

Title: Evaluating the Bias in LLMs for Surveying Opinion and Decision Making in Healthcare

Title: To See or Not to See -- Fingerprinting Devices in Adversarial Environments Amid Advanced Machine Learning

Title: VLMT: Vision-Language Multimodal Transformer for Multimodal Multi-hop Question Answering

Title: Palmprint De-Identification Using Diffusion Model for High-Quality and Diverse Synthesis

Title: PNE-SGAN: Probabilistic NDT-Enhanced Semantic Graph Attention Network for LiDAR Loop Closure Detection

Title: ELSA: A Style Aligned Dataset for Emotionally Intelligent Language Generation

Title: DreamFuse: Adaptive Image Fusion with Diffusion Transformer

Title: Generative AI for Film Creation: A Survey of Recent Advances

Title: Large language models could be rote learners

Title: STSeg-Complex Video Object Segmentation: The 1st Solution for 4th PVUW MOSE Challenge

Title: DSM: Building A Diverse Semantic Map for 3D Visual Grounding

Title: SortBench: Benchmarking LLMs based on their ability to sort lists

Title: Practical Secure Aggregation by Combining Cryptography and Trusted Execution Environments

Title: EasyGenNet: An Efficient Framework for Audio-Driven Gesture Video Generation Based on Diffusion Model

Title: Geometric Consistency Refinement for Single Image Novel View Synthesis via Test-Time Adaptation of Diffusion Models

Title: An Adaptive Clustering Scheme for Client Selections in Communication-Efficient Federated Learning

Title: SN-LiDAR: Semantic Neural Fields for Novel Space-time View LiDAR Synthesis

Title: Proofs as Explanations: Short Certificates for Reliable Predictions

Title: Scaling Up On-Device LLMs via Active-Weight Swapping Between DRAM and Flash

Title: Towards Efficient and Robust Moment Retrieval System: A Unified Framework for Multi-Granularity Models and Temporal Reranking

Title: PCA-RAG: Principal Component Analysis for Efficient Retrieval-Augmented Generation

Title: MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft

Title: Beyond Self-Reports: Multi-Observer Agents for Personality Assessment in Large Language Models

Title: A Knowledge-guided Adversarial Defense for Resisting Malicious Visual Manipulation

Title: Adversarial Examples in Environment Perception for Automated Driving (Review)

Title: seeBias: A Comprehensive Tool for Assessing and Visualizing AI Fairness

Title: GeoTexBuild: 3D Building Model Generation from Map Footprints

Title: Customizing Spider Silk: Generative Models with Mechanical Property Conditioning for Protein Engineering

Title: SARFormer -- An Acquisition Parameter Aware Vision Transformer for Synthetic Aperture Radar Data

Title: Muon-Accelerated Attention Distillation for Real-Time Edge Synthesis via Optimized Latent Diffusion

Title: Road Grip Uncertainty Estimation Through Surface State Segmentation

Title: Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data Generation

Title: Toward Realistic Adversarial Attacks in IDS: A Novel Feasibility Metric for Transferability

Title: A Hybrid Fully Convolutional CNN-Transformer Model for Inherently Interpretable Medical Image Classification

Title: An Early Experience with Confidential Computing Architecture for On-Device Model Protection

Title: Embodied Image Captioning: Self-supervised Learning Agents for Spatially Coherent Image Descriptions

Title: Explainability and Continual Learning meet Federated Learning at the Network Edge

Title: Datasets for Lane Detection in Autonomous Driving: A Comprehensive Review

Title: Discriminator-Free Direct Preference Optimization for Video Diffusion

Title: UoB-NLP at SemEval-2025 Task 11: Leveraging Adapters for Multilingual and Cross-Lingual Emotion Detection

Title: Slicing the Gaussian Mixture Wasserstein Distance

Title: Shadow Erosion and Nighttime Adaptability for Camera-Based Automated Driving Applications

Title: Banana Ripeness Level Classification using a Simple CNN Model Trained with Real and Synthetic Datasets

Title: Knowledge Distillation for Multimodal Egocentric Action Recognition Robust to Missing Modalities

Title: Boosting multi-demographic federated learning for chest x-ray analysis using general-purpose self-supervised representations

Title: Playpen: An Environment for Exploring Learning Through Conversational Interaction

Title: ZipIR: Latent Pyramid Diffusion Transformer for High-Resolution Image Restoration

Title: Hands-On: Segmenting Individual Signs from Continuous Sequences

Title: On Background Bias of Post-Hoc Concept Embeddings in Computer Vision DNNs

Title: A Survey of Machine Learning Models and Datasets for the Multi-label Classification of Textual Hate Speech in English

Title: Enhancing knowledge retention for continual learning with domain-specific adapters and features gating

Title: Preserving Privacy Without Compromising Accuracy: Machine Unlearning for Handwritten Text Recognition

Title: A Hybrid Chaos-Based Cryptographic Framework for Post-Quantum Secure Communications

Title: Efficient Mixture of Geographical Species for On Device Wildlife Monitoring

Title: Enterprise-Grade Security for the Model Context Protocol (MCP): Frameworks and Mitigation Strategies

Title: Deep Learning Methods for Detecting Thermal Runaway Events in Battery Production Lines

Title: Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical Imaging

Title: Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization

Title: Title block detection and information extraction for enhanced building drawings search

Title: MBE-ARI: A Multimodal Dataset Mapping Bi-directional Engagement in Animal-Robot Interaction

Title: The Invisible EgoHand: 3D Hand Forecasting through EgoBody Pose Estimation

Title: Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

Title: Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Title: Fast-Slow-Thinking: Complex Task Solving with Large Language Models

Title: TP-RAG: Benchmarking Retrieval-Augmented Large Language Model Agents for Spatiotemporal-Aware Travel Planning

Title: Large Language Models as Span Annotators

Title: Hypergraph Vision Transformers: Images are More than Nodes, More than Edges

Title: Beyond Black-Box Predictions: Identifying Marginal Feature Effects in Tabular Transformer Networks

Title: Generating Fine Details of Entity Interactions

Title: ModernBERT or DeBERTaV3? Examining Architecture and Data Influence on Transformer Encoder Models Performance

Title: EMO-X: Efficient Multi-Person Pose and Shape Estimation in One-Stage

Title: SWAN-GPT: An Efficient and Scalable Approach for Long-Context Language Modeling

Title: Steering CLIP's vision transformer with sparse autoencoders