2025-04-07

Title: Optimizing Humor Generation in Large Language Models: Temperature Configurations and Architectural Trade-offs

Title: The Material Contracts Corpus

Title: The Illusionist's Prompt: Exposing the Factual Vulnerabilities of Large Language Models with Linguistic Nuances

Title: OpenFACADES: An Open Framework for Architectural Caption and Attribute Data Enrichment via Street View Imagery

Title: Multi-Agent LLM Judge: automatic personalized LLM judge design for evaluating natural language generation applications

Title: AI Hiring with LLMs: A Context-Aware and Explainable Multi-Agent Framework for Resume Screening

Title: Synthesized Annotation Guidelines are Knowledge-Lite Boosters for Clinical Information Extraction

Title: Scraping the Shadows: Deep Learning Breakthroughs in Dark Web Intelligence

Title: Short-PHD: Detecting Short LLM-generated Text with Topological Data Analysis After Off-topic Content Insertion

Title: TheBlueScrubs-v1, a comprehensive curated medical dataset derived from the internet

Title: Multimodal Reference Visual Grounding

Title: Revisiting Funnel Transformers for Modern LLM Architectures with Comprehensive Ablations in Training and Inference Configurations

Title: Exploring the Capabilities of LLMs for IMU-based Fine-grained Human Activity Understanding

Title: Better Bill GPT: Comparing Large Language Models against Legal Invoice Reviewers

Title: DiaTool-DPO: Multi-Turn Direct Preference Optimization for Tool-Augmented Large Language Models

Title: SemEval-2025 Task 4: Unlearning sensitive content from Large Language Models

Title: Enhancing Traffic Sign Recognition On The Performance Based On Yolov8

Title: Processes Matter: How ML/GAI Approaches Could Support Open Qualitative Coding of Online Discourse Datasets

Title: A Status Quo Investigation of Large Language Models towards Cost-Effective CFD Automation with OpenFOAMGPT: ChatGPT vs. Qwen vs. Deepseek

Title: Scaling Test-time Compute for Low-resource Languages: Multilingual Reasoning in LLMs

Title: Automated Survey Collection with LLM-based Conversational Agents

Title: OnRL-RAG: Real-Time Personalized Mental Health Dialogue System

Title: UAC: Uncertainty-Aware Calibration of Neural Networks for Gesture Detection

Title: A Practical Synthesis of Detecting AI-Generated Textual, Visual, and Audio Content

Title: Comparative Analysis of Deepfake Detection Models: New Approaches and Perspectives

Title: Hide and Seek in Noise Labels: Noise-Robust Collaborative Active Learning with LLM-Powered Assistance

Title: Beyond Accuracy: The Role of Calibration in Self-Improving Large Language Models

Title: How Post-Training Reshapes LLMs: A Mechanistic View on Knowledge, Truthfulness, Refusal, and Confidence

Title: Enhancing Chart-to-Code Generation in Multimodal Large Language Models via Iterative Dual Preference Learning

Title: Noiser: Bounded Input Perturbations for Attributing Large Language Models

Title: Haphazard Inputs as Images in Online Learning

Title: Bias in Large Language Models Across Clinical Applications: A Systematic Review

Title: Morpheus: Benchmarking Physical Reasoning of Video Generative Models with Real Physical Experiments

Title: HyperRAG: Enhancing Quality-Efficiency Tradeoffs in Retrieval-Augmented Generation with Reranker KV-Cache Reuse

Title: Robustly identifying concepts introduced during chat fine-tuning using crosscoders

Title: Graph Attention for Heterogeneous Graphs with Positional Encoding

Title: VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning

Title: Cultural Learning-Based Culture Adaptation of Language Models

Title: Digital Forensics in the Age of Large Language Models

Title: QID: Efficient Query-Informed ViTs in Data-Scarce Regimes for OCR-free Visual Document Understanding

Title: Localized Definitions and Distributed Reasoning: A Proof-of-Concept Mechanistic Interpretability Study via Activation Patching

Title: Multi-Screaming-Channel Attacks: Frequency Diversity for Enhanced Attacks

Title: Hummus: A Dataset of Humorous Multimodal Metaphor Use

Title: Noise-Aware Generalization: Robustness to In-Domain Noise and Out-of-Domain Generalization

Title: Improving Efficiency in Federated Learning with Optimized Homomorphic Encryption

Title: DiSRT-In-Bed: Diffusion-Based Sim-to-Real Transfer Framework for In-Bed Human Mesh Recovery

Title: Comprehensive Relighting: Generalizable and Consistent Monocular Human Relighting and Harmonization

Title: Deep Reinforcement Learning via Object-Centric Attention

Title: VIP: Video Inpainting Pipeline for Real World Human Removal

Title: Sliced Wasserstein Discrepancy in Disentangling Representation and Adaptation Networks for Unsupervised Domain Adaptation

Title: Extending CREAMT: Leveraging Large Language Models for Literary Translation Post-Editing

Title: Attention-Aware Multi-View Pedestrian Tracking

Title: Task as Context Prompting for Accurate Medical Symptom Coding Using Large Language Models

Title: AD-GPT: Large Language Models in Alzheimer's Disease

Title: How I Warped Your Noise: a Temporally-Correlated Noise Prior for Diffusion Models

Title: Integrating Identity-Based Identification against Adaptive Adversaries in Federated Learning

Title: SLACK: Attacking LiDAR-based SLAM with Adversarial Point Injections

Title: Machine Learning-Based Detection and Analysis of Suspicious Activities in Bitcoin Wallet Transactions in the USA

Title: Post-processing for Fair Regression via Explainable SVD

Title: Scaling Open-Vocabulary Action Detection

Title: Single-Pass Document Scanning for Question Answering

Title: Multi-Granularity Vision Fastformer with Fusion Mechanism for Skin Lesion Segmentation

Title: Les Dissonances: Cross-Tool Harvesting and Polluting in Multi-Tool Empowered LLM Agents

Title: NuWa: Deriving Lightweight Task-Specific Vision Transformers for Edge Devices

Title: FontGuard: A Robust Font Watermarking Approach Leveraging Deep Font Knowledge

Title: Hierarchical Modeling for Medical Visual Question Answering with Cross-Attention Fusion

Title: Classic Video Denoising in a Machine Learning World: Robust, Fast, and Controllable

Title: Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models

Title: Why Reasoning Matters? A Survey of Advancements in Multimodal Reasoning (v1)

Title: MORAL: A Multimodal Reinforcement Learning Framework for Decision Making in Autonomous Laboratories

Title: TokenFLEX: Unified VLM Training for Flexible Visual Tokens Inference

Title: Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token Prediction

Title: Beyond Progress Measures: Theoretical Insights into the Mechanism of Grokking

Title: Enhanced Penalty-based Bidirectional Reinforcement Learning Algorithms

Title: Efficient Dynamic Clustering-Based Document Compression for Retrieval-Augmented-Generation

Title: RingMoE: Mixture-of-Modality-Experts Multi-Modal Foundation Models for Universal Remote Sensing Image Interpretation

Title: REJEPA: A Novel Joint-Embedding Predictive Architecture for Efficient Remote Sensing Image Retrieval

Title: PPFPL: Cross-silo Privacy-preserving Federated Prototype Learning Against Data Poisoning Attacks on Non-IID Data

Title: Learning Natural Language Constraints for Safe Reinforcement Learning of Language Agents

Title: On the Connection Between Diffusion Models and Molecular Dynamics

Title: Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation

Title: Explain with Visual Keypoints Like a Real Mentor! A Benchmark for Multimodal Solution Explanation

Title: PIONM: A Generalized Approach to Solving Density-Constrained Mean-Field Games Equilibrium under Modified Boundary Conditions

Title: Structured Knowledge Accumulation: The Principle of Entropic Least Action in Forward-Only Neural Learning

Title: Electromyography-Based Gesture Recognition: Hierarchical Feature Extraction for Enhanced Spatial-Temporal Dynamics

Title: Unlocking Neural Transparency: Jacobian Maps for Explainable AI in Alzheimer's Detection

Title: Crash Time Matters: HybridMamba for Fine-Grained Temporal Localization in Traffic Surveillance Footage

Title: Malware Detection in Docker Containers: An Image is Worth a Thousand Logs

Title: Rotation Invariance in Floor Plan Digitization using Zernike Moments

Title: FaR: Enhancing Multi-Concept Text-to-Image Diffusion via Concept Fusion and Localized Refinement

Title: Stance-Driven Multimodal Controlled Statement Generation: New Dataset and Task

Title: Noise Augmented Fine Tuning for Mitigating Hallucinations in Large Language Models

Title: Evaluating Compact LLMs for Zero-Shot Iberian Language Tasks on End-User Devices

Title: Steerable Anatomical Shape Synthesis with Implicit Neural Representations

Title: Data Augmentation of Time-Series Data in Human Movement Biomechanics: A Scoping Review

Title: QIRL: Boosting Visual Question Answering via Optimized Question-Image Relation Learning

Title: BabyLM's First Words: Word Segmentation as a Phonological Probing Task

Title: Optimizing Password Cracking for Digital Investigations

Title: Meta-DAN: towards an efficient prediction strategy for page-level handwritten text recognition

Title: SoK: Attacks on Modern Card Payments

Title: FLAIRBrainSeg: Fine-grained brain segmentation using FLAIR MRI only

Title: Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning

Title: BitHEP -- The Limits of Low-Precision ML in HEP

Title: Autonomous state-space segmentation for Deep-RL sparse reward scenarios

Title: DML-RAM: Deep Multimodal Learning Framework for Robotic Arm Manipulation using Pre-trained Models

Title: Locations of Characters in Narratives: Andersen and Persuasion Datasets

Title: ZFusion: An Effective Fuser of Camera and 4D Radar for 3D Object Perception in Autonomous Driving

Title: Know What You do Not Know: Verbalized Uncertainty Estimation Robustness on Corrupted Images in Vision-Language Models

Title: Pyramid-based Mamba Multi-class Unsupervised Anomaly Detection

Title: D-Garment: Physics-Conditioned Latent Diffusion for Dynamic Garment Deformations

Title: Dynamic Importance in Diffusion U-Net for Enhanced Image Synthesis

Title: Multi-encoder nnU-Net outperforms Transformer models with self-supervised pretraining

Title: ATM-Net: Anatomy-Aware Text-Guided Multi-Modal Fusion for Fine-Grained Lumbar Spine Segmentation

Title: Probabilistic Machine Learning for Noisy Labels in Earth Observation

Title: Online Traffic Density Estimation using Physics-Informed Neural Networks

Title: Discovering Partially Known Ordinary Differential Equations: a Case Study on the Chemical Kinetics of Cellulose Degradation

Title: BUFF: Bayesian Uncertainty Guided Diffusion Probabilistic Model for Single Image Super-Resolution

Title: Diffusion Active Learning: Towards Data-Driven Experimental Design in Computed Tomography

Title: Quantifying Robustness: A Benchmarking Framework for Deep Learning Forecasting in Cyber-Physical Systems

Title: Hierarchical Knowledge Structuring for Effective Federated Learning in Heterogeneous Environments

Title: FADConv: A Frequency-Aware Dynamic Convolution for Farmland Non-agriculturalization Identification and Segmentation

Title: Neutralizing the Narrative: AI-Powered Debiasing of Online News Articles

Title: HumanDreamer-X: Photorealistic Single-image Human Avatars Reconstruction via Gaussian Restoration

Title: Agentic Knowledgeable Self-awareness

Title: PF3Det: A Prompted Foundation Feature Assisted Visual LiDAR 3D Detector

Title: Scalable Hypergraph Structure Learning with Diverse Smoothness Priors

Title: EnrichIndex: Using LLMs to Enrich Retrieval Indices Offline

Title: Robust Human Registration with Body Part Segmentation on Noisy Point Clouds

Title: Multimodal Diffusion Bridge with Attention-Based SAR Fusion for Satellite Image Cloud Removal

Title: AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset

Title: Autonomous and Self-Adapting System for Synthetic Media Detection and Attribution

Title: Multilingual Retrieval-Augmented Generation for Knowledge-Intensive Task

Title: VISTA-OCR: Towards generative and interactive end to end OCR models

Title: Align to Structure: Aligning Large Language Models with Structural Information

Title: Quantifying the uncertainty of model-based synthetic image quality metrics

Title: Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Title: MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models