2025-05-09

Title: How Social is It? A Benchmark for LLMs' Capabilities in Multi-user Multi-turn Social Agent Tasks

Title: MatMMFuse: Multi-Modal Fusion model for Material Property Prediction

Title: Adaptive Token Boundaries: Integrating Human Chunking Mechanisms into Multimodal LLMs

Title: Language translation, and change of accent for speech-to-speech task using diffusion model

Title: Prediction-powered estimators for finite population statistics in highly imbalanced textual data: Public hate crime estimation

Title: ChatGPT for automated grading of short answer questions in mechanical ventilation

Title: FRAME: Feedback-Refined Agent Methodology for Enhancing Medical Research Insights

Title: Scientific Hypothesis Generation and Validation: Methods, Datasets, and Future Directions

Title: Advancing Conversational Diagnostic AI with Multimodal Reasoning

Title: A Comparative Analysis of Ethical and Safety Gaps in LLMs using Relative Danger Coefficient

Title: Integration of Large Language Models and Traditional Deep Learning for Social Determinants of Health Prediction

Title: AI-Generated Fall Data: Assessing LLMs and Diffusion Model for Wearable Fall Detection

Title: Personalized Risks and Regulatory Strategies of Large Language Models in Digital Advertising

Title: Fine-Tuning Large Language Models and Evaluating Retrieval Methods for Improved Question Answering on Building Codes

Title: Reward-SQL: Boosting Text-to-SQL via Stepwise Reasoning and Process-Supervised Rewards

Title: Histo-Miner: Deep Learning based Tissue Features Extraction Pipeline from H&E Whole Slide Images of Cutaneous Squamous Cell Carcinoma

Title: REVEAL: Multi-turn Evaluation of Image-Input Harms for Vision LLM

Title: Lay-Your-Scene: Natural Scene Layout Generation with Diffusion Transformers

Title: False Promises in Medical Imaging AI? Assessing Validity of Outperformance Claims

Title: SOAEsV2-7B/72B: Full-Pipeline Optimization for State-Owned Enterprise LLMs via Continual Pre-Training, Domain-Progressive SFT and Distillation-Enhanced Speculative Decoding

Title: Conformal Prediction with Corrupted Labels: Uncertain Imputation and Robust Re-weighting

Title: SetONet: A Deep Set-based Operator Network for Solving PDEs with permutation invariant variable input sampling

Title: Hyb-KAN ViT: Hybrid Kolmogorov-Arnold Networks Augmented Vision Transformer

Title: When Bad Data Leads to Good Models

Title: A Proposal for Evaluating the Operational Risk for ChatBots based on Large Language Models

Title: Replay to Remember (R2R): An Efficient Uncertainty-driven Unsupervised Continual Learning Framework Using Generative Replay

Title: Convex Relaxation for Robust Vanishing Point Estimation in Manhattan World

Title: DetReIDX: A Stress-Test Dataset for Real-World UAV-Based Person Recognition

Title: Robust ML Auditing using Prior Knowledge

Title: Safeguard-by-Development: A Privacy-Enhanced Development Paradigm for Multi-Agent Collaboration Systems

Title: ORBIT-2: Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling

Title: Red Teaming the Mind of the Machine: A Systematic Evaluation of Prompt Injection and Jailbreak Vulnerabilities in LLMs

Title: Guide your favorite protein sequence generative model

Title: Are Synthetic Corruptions A Reliable Proxy For Real-World Corruptions?

Title: Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers

Title: Osiris: A Lightweight Open-Source Hallucination Detection System

Title: Mix-QSAM: Mixed-Precision Quantization of the Segment Anything Model

Title: Auto-regressive transformation for image alignment

Title: Federated Learning for Cyber Physical Systems: A Comprehensive Survey

Title: Cross-Branch Orthogonality for Improved Generalization in Face Deepfake Detection

Title: FedRE: Robust and Effective Federated Learning with Privacy Preference

Title: Clustering with Communication: A Variational Framework for Single Cell Representation Learning

Title: Memory Under Siege: A Comprehensive Survey of Side-Channel Attacks on Memory

Title: OWT: A Foundational Organ-Wise Tokenization Framework for Medical Imaging

Title: Pro2SAM: Mask Prompt to SAM with Grid Points for Weakly Supervised Object Localization

Title: VaCDA: Variational Contrastive Alignment-based Scalable Human Activity Recognition

Title: SpatialPrompting: Keyframe-driven Zero-Shot Spatial Reasoning with Off-the-Shelf Multimodal Large Language Models

Title: GlyphMastero: A Glyph Encoder for High-Fidelity Scene Text Editing

Title: An Open-Source Dual-Loss Embedding Model for Semantic Retrieval in Higher Education

Title: A Simple Detector with Frame Dynamics is a Strong Tracker

Title: Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

Title: Canny2Palm: Realistic and Controllable Palmprint Generation for Large-scale Pre-training

Title: Fair Uncertainty Quantification for Depression Prediction

Title: FF-PNet: A Pyramid Network Based on Feature and Field for Brain Image Registration

Title: Building-Guided Pseudo-Label Learning for Cross-Modal Building Damage Mapping

Title: Chain-of-Thought Tokens are Computer Program Variables

Title: Graffe: Graph Representation Learning via Diffusion Probabilistic Models

Title: ViCTr: Vital Consistency Transfer for Pathology Aware Image Synthesis

Title: DenseGrounding: Improving Dense Language-Vision Semantics for Ego-Centric 3D Visual Grounding

Title: ReAlign: Bilingual Text-to-Motion Generation via Step-Aware Reward-Guided Alignment

Title: ChainMarks: Securing DNN Watermark with Cryptographic Chain

Title: Federated Deconfounding and Debiasing Learning for Out-of-Distribution Generalization

Title: Graph Neural Network Aided Deep Reinforcement Learning for Resource Allocation in Dynamic Terahertz UAV Networks

Title: Latent Preference Coding: Aligning Large Language Models via Discrete Latent Codes

Title: Rethinking Invariance in In-context Learning

Title: StabStitch++: Unsupervised Online Video Stitching with Spatiotemporal Bidirectional Warps

Title: Automated Thoracolumbar Stump Rib Detection and Analysis in a Large CT Cohort

Title: Adaptive Contextual Embedding for Robust Far-View Borehole Detection

Title: An Agent-Based Modeling Approach to Free-Text Keyboard Dynamics for Continuous Authentication

Title: The Pitfalls of Growing Group Complexity: LLMs and Social Choice-Based Aggregation for Group Recommendations

Title: Scalable Multi-Stage Influence Function for Large Language Models via Eigenvalue-Corrected Kronecker-Factored Parameterization

Title: Generating Reliable Synthetic Clinical Trial Data: The Role of Hyperparameter Optimization and Domain Constraints

Title: Generative Models for Long Time Series: Approximately Equivariant Recurrent Network Structures for an Adjusted Training Scheme

Title: SOAP: Style-Omniscient Animatable Portraits

Title: Split Matching for Inductive Zero-shot Semantic Segmentation

Title: G-FOCUS: Towards a Robust Method for Assessing UI Design Persuasiveness

Title: Dequantified Diffusion Schrödinger Bridge for Density Ratio Estimation

Title: xTrace: A Facial Expressive Behaviour Analysis Tool for Continuous Affect Recognition

Title: UncertainSAM: Fast and Efficient Uncertainty Quantification of the Segment Anything Model

Title: CodeMixBench: Evaluating Large Language Models on Code Generation with Code-Mixed Prompts

Title: WaterDrum: Watermarking for Data-centric Unlearning Metric

Title: Performance Evaluation of Large Language Models in Bangla Consumer Health Query Summarization

Title: Visual Affordances: Enabling Robots to Understand Object Functionality

Title: PIDiff: Image Customization for Personalized Identities with Diffusion Models

Title: ItDPDM: Information-Theoretic Discrete Poisson Diffusion Model

Title: Reliably Bounding False Positives: A Zero-Shot Machine-Generated Text Detection Framework via Multiscaled Conformal Prediction

Title: Beyond Low-rank Decomposition: A Shortcut Approach for Efficient On-Device Learning

Title: DispBench: Benchmarking Disparity Estimation to Synthetic Corruptions

Title: Balancing Client Participation in Federated Learning Using AoI

Title: MDE-Edit: Masked Dual-Editing for Multi-Object Image Editing via Diffusion Models

Title: A Weighted Byzantine Fault Tolerance Consensus Driven Trusted Multiple Large Language Models Network

Title: Unveiling Language-Specific Features in Large Language Models via Sparse Autoencoders

Title: Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach

Title: Automated vision-based assistance tools in bronchoscopy: stenosis severity estimation

Title: Research on Anomaly Detection Methods Based on Diffusion Models

Title: Understanding In-context Learning of Addition via Activation Subspaces

Title: A Benchmark Dataset and a Framework for Urdu Multimodal Named Entity Recognition

Title: FedTDP: A Privacy-Preserving and Unified Framework for Trajectory Data Preparation via Federated Learning

Title: Bandit Max-Min Fair Allocation

Title: Stochastic Variational Propagation: Local, Scalable and Efficient Alternative to Backpropagation

Title: PaniCar: Securing the Perception of Advanced Driving Assistance Systems Against Emergency Vehicle Lighting

Title: Biomed-DPT: Dual Modality Prompt Tuning for Biomedical Vision-Language Models

Title: Revealing Weaknesses in Text Watermarking Through Self-Information Rewrite Attacks

Title: Concept-Based Unsupervised Domain Adaptation

Title: EAM: Enhancing Anything with Diffusion Transformers for Blind Super-Resolution

Title: HQC-NBV: A Hybrid Quantum-Classical View Planning Approach

Title: Diffusion Model Quantization: A Review

Title: GFlowNets for Active Learning Based Resource Allocation in Next Generation Wireless Networks

Title: QualBench: Benchmarking Chinese LLMs with Localized Professional Qualifications for Vertical Domain Evaluation

Title: Does CLIP perceive art the same way we do?

Title: Latte: Transfering LLMs` Latent-level Knowledge for Few-shot Tabular Learning

Title: PADriver: Towards Personalized Autonomous Driving

Title: T-T: Table Transformer for Tagging-based Aspect Sentiment Triplet Extraction

Title: MTL-UE: Learning to Learn Nothing for Multi-Task Learning

Title: QUIC-Exfil: Exploiting QUIC's Server Preferred Address Feature to Perform Data Exfiltration Attacks

Title: Toward Reasonable Parrots: Why Large Language Models Should Argue with Us by Design

Title: Scalable Chain of Thoughts via Elastic Reasoning

Title: Mapping User Trust in Vision Language Models: Research Landscape, Challenges, and Prospects

Title: Feature-Augmented Deep Networks for Multiscale Building Segmentation in High-Resolution UAV and Satellite Imagery

Title: ICon: In-Context Contribution for Automatic Data Selection

Title: SUUM: Timestamp-based Nakamoto-style Blockchains are Vulnerable

Title: Progressive Inertial Poser: Progressive Real-Time Kinematic Chain Estimation for 3D Full-Body Pose from Three IMU Sensors

Title: Joint Super-Resolution and Segmentation for 1-m Impervious Surface Area Mapping in China's Yangtze River Economic Belt

Title: Threshold Modulation for Online Test-Time Adaptation of Spiking Neural Networks

Title: GeomHair: Reconstruction of Hair Strands from Colorless 3D Scans

Title: Denoising Diffusion Probabilistic Models for Coastal Inundation Forecasting

Title: EDmamba: A Simple yet Effective Event Denoising Method with State Space Model

Title: PillarMamba: Learning Local-Global Context for Roadside Point Cloud via Hybrid State Space Model

Title: Frame In, Frame Out: Do LLMs Generate More Biased News Headlines than Humans?

Title: Crosslingual Reasoning through Test-Time Scaling

Title: Hide & Seek: Transformer Symmetries Obscure Sharpness & Riemannian Geometry Finds It

Title: TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation

Title: TransProQA: an LLM-based literary Translation evaluation metric with Professional Question Answering

Title: Ultra-FineWeb: Efficient Data Filtering and Verification for High-Quality LLM Training Data

Title: clem:todd: A Framework for the Systematic Benchmarking of LLM-Based Task-Oriented Dialogue System Realisations

Title: Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding

Title: UKElectionNarratives: A Dataset of Misleading Narratives Surrounding Recent UK General Elections

Title: Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging

Title: ComPO: Preference Alignment via Comparison Oracles

Title: StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant

Title: Generating Physically Stable and Buildable LEGO Designs from Text

Title: Mogao: An Omni Foundation Model for Interleaved Multi-Modal Generation

Title: DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion

Title: 3D Scene Generation: A Survey

Title: SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation