2025-03-27

Title: Robust Object Detection of Underwater Robot based on Domain Generalization

Title: VisualQuest: A Diverse Image Dataset for Evaluating Visual Recognition in LLMs

Title: Continual Learning With Quasi-Newton Methods

Title: Vanishing Depth: A Depth Adapter with Positional Depth Encoding for Generalized Image Encoders

Title: Test-Time Reasoning Through Visual Human Preferences with VLMs and Soft Rewards

Title: LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation

Title: ACVUBench: Audio-Centric Video Understanding Benchmark

Title: SLIP: Spoof-Aware One-Class Face Anti-Spoofing with Language Image Pretraining

Title: ExCoT: Optimizing Reasoning for Text-to-SQL with Execution Feedback

Title: The Coralscapes Dataset: Semantic Scene Understanding in Coral Reefs

Title: Hyperdimensional Uncertainty Quantification for Multimodal Uncertainty Fusion in Autonomous Vehicles Perception

Title: Experience Replay Addresses Loss of Plasticity in Continual Learning

Title: Deep Learning Approaches for Blood Disease Diagnosis Across Hematopoietic Lineages

Title: Poor Alignment and Steerability of Large Language Models: Evidence from College Admission Essays

Title: iNatAg: Multi-Class Classification Models Enabled by a Large-Scale Benchmark Dataset with 4.7M Images of 2,959 Crop and Weed Species

Title: Cross-Tokenizer Distillation via Approximate Likelihood Matching

Title: Can Multi-modal (reasoning) LLMs work as deepfake detectors?

Title: Generative Linguistics, Large Language Models, and the Social Nature of Scientific Success

Title: Fundamental Limits of Perfect Concept Erasure

Title: Extendable Long-Horizon Planning via Hierarchical Multiscale Diffusion

Title: Bigger But Not Better: Small Neural Language Models Outperform Large Language Models in Detection of Thought Disorder

Title: "Is There Anything Else?": Examining Administrator Influence on Linguistic Features from the Cookie Theft Picture Description Cognitive Test

Title: From Interpretation to Correction: A Decentralized Optimization Framework for Exact Convergence in Federated Learning

Title: Unlocking the Value of Decentralized Data: A Federated Dual Learning Approach for Model Aggregation

Title: AIGC-assisted Federated Learning for Edge Intelligence: Architecture Design, Research Challenges and Future Directions

Title: Guiding Human-Object Interactions with Rich Geometry and Relations

Title: Devil is in the Uniformity: Exploring Diverse Learners within Transformer for Image Restoration

Title: Offline Reinforcement Learning with Discrete Diffusion Skills

Title: Leveraging Implicit Sentiments: Enhancing Reliability and Validity in Psychological Trait Evaluation of LLMs

Title: Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face Detector

Title: Cross-Modal Prototype Allocation: Unsupervised Slide Representation Learning via Patch-Text Contrast in Computational Pathology

Title: GAPO: Learning Preferential Prompt through Generative Adversarial Policy Optimization

Title: Beyond Words: Advancing Long-Text Image Generation via Multimodal Autoregressive Models

Title: Assessing SAM for Tree Crown Instance Segmentation from Drone Imagery

Title: SARGes: Semantically Aligned Reliable Gesture Generation via Intent Chain

Title: BEAR: A Video Dataset For Fine-grained Behaviors Recognition Oriented with Action and Environment Factors

Title: Synthetic-to-Real Self-supervised Robust Depth Estimation via Learning with Motion and Structure Priors

Title: Qwen2.5-Omni Technical Report

Title: Video Motion Graphs

Title: DINeMo: Learning Neural Mesh Models with no 3D Annotations

Title: Advancements in Natural Language Processing: Exploring Transformer-Based Architectures for Text Understanding

Title: TeleLoRA: Teleporting Model-Specific Alignment Across LLMs

Title: TraNCE: Transformative Non-linear Concept Explainer for CNNs

Title: Leveraging 3D Geometric Priors in 2D Rotation Symmetry Detection

Title: Unconditional Priors Matter! Improving Conditional Generation of Fine-Tuned Diffusion Models

Title: Software Vulnerability Analysis Across Programming Language and Program Representation Landscapes: A Survey

Title: LogicQA: Logical Anomaly Detection with Vision Language Model Generated Questions

Title: How Secure is Forgetting? Linking Machine Unlearning to Machine Learning Attacks

Title: Revisit Time Series Classification Benchmark: The Impact of Temporal Information for Classification

Title: EGVD: Event-Guided Video Diffusion Model for Physically Realistic Large-Motion Frame Interpolation

Title: ViLBench: A Suite for Vision-Language Process Reward Modeling

Title: sudo rm -rf agentic_security

Title: Are We There Yet? Unraveling the State-of-the-Art Graph Network Intrusion Detection Systems

Title: Model-Based Offline Reinforcement Learning with Adversarial Data Augmentation

Title: RelTriple: Learning Plausible Indoor Layouts by Integrating Relationship Triples into the Diffusion Process

Title: Context-Aware Weakly Supervised Image Manipulation Localization with SAM Refinement

Title: Traversing Distortion-Perception Tradeoff using a Single Score-Based Generative Model

Title: Attribute-formed Class-specific Concept Space: Endowing Language Bottleneck Model with Better Interpretability and Scalability

Title: A Multilingual, Culture-First Approach to Addressing Misgendering in LLM Applications

Title: Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs

Title: Enabling Heterogeneous Adversarial Transferability via Feature Permutation Attacks

Title: Wan: Open and Advanced Large-Scale Video Generative Models

Title: SpikeDerain: Unveiling Clear Videos from Rainy Sequences Using Color Spike Streams

Title: Iterative Prompting with Persuasion Skills in Jailbreaking Large Language Models

Title: Recovering Dynamic 3D Sketches from Videos

Title: Dynamic Pyramid Network for Efficient Multimodal Large Language Model

Title: Progressive Focused Transformer for Single Image Super-Resolution

Title: Wasserstein Distributionally Robust Bayesian Optimization with Continuous Context

Title: Consistency Trajectory Matching for One-Step Generative Super-Resolution

Title: CNN+Transformer Based Anomaly Traffic Detection in UAV Networks for Emergency Rescue

Title: UnReference: analysis of the effect of spoofing on RTK reference stations for connected rovers

Title: RSRWKV: A Linear-Complexity 2D Attention Mechanism for Efficient Remote Sensing Vision Task

Title: FastFT: Accelerating Reinforced Feature Transformation via Advanced Exploration Strategies

Title: Active Data Sampling and Generation for Bias Remediation

Title: CFunModel: A "Funny" Language Model Capable of Chinese Humor Generation and Processing

Title: ITA-MDT: Image-Timestep-Adaptive Masked Diffusion Transformer Framework for Image-Based Virtual Try-On

Title: Cherry Yield Forecast: Harvest Prediction for Individual Sweet Cherry Trees

Title: TempTest: Local Normalization Distortion and the Detection of Machine-generated Text

Title: Evaluating Facial Expression Recognition Datasets for Deep Learning: A Benchmark Study with Novel Similarity Metrics

Title: Latent Beam Diffusion Models for Decoding Image Sequences

Title: Siformer: Feature-isolated Transformer for Efficient Skeleton-based Sign Language Recognition

Title: Lipschitz Constant Meets Condition Number: Learning Robust and Compact Deep Neural Networks

Title: From Trial to Triumph: Advancing Long Video Understanding via Visual Context Sample Scaling and Self-reward Alignment

Title: Dissecting and Mitigating Diffusion Bias via Mechanistic Interpretability

Title: Contrastive Learning Guided Latent Diffusion Model for Image-to-Image Translation

Title: VPO: Aligning Text-to-Video Generation Models with Prompt Optimization

Title: Towards Efficient and General-Purpose Few-Shot Misclassification Detection for Vision-Language Models

Title: Enhancing Depression Detection via Question-wise Modality Fusion

Title: MLLM-Selector: Necessity and Diversity-driven High-Value Data Selection for Enhanced Visual Instruction Tuning

Title: Vision-Amplified Semantic Entropy for Hallucination Detection in Medical Visual Question Answering

Title: Explainable ICD Coding via Entity Linking

Title: Small Object Detection: A Comprehensive Survey on Challenges, Techniques and Real-World Applications

Title: MAR-3D: Progressive Masked Auto-regressor for High-Resolution 3D Generation

Title: GAIA-2: A Controllable Multi-View Generative World Model for Autonomous Driving

Title: StableToolBench-MirrorAPI: Modeling Tool Environments as Mirrors of 7,000+ Real-World APIs

Title: TD-BFR: Truncated Diffusion Model for Efficient Blind Face Restoration

Title: A Theoretical Framework for Prompt Engineering: Approximating Smooth Functions with Transformer Prompts

Title: Low-resource Information Extraction with the European Clinical Case Corpus

Title: Feature Statistics with Uncertainty Help Adversarial Robustness

Title: Diffusion Counterfactuals for Image Regressors

Title: IAP: Improving Continual Learning of Vision-Language Models via Instance-Aware Prompting

Title: State-Aware Perturbation Optimization for Robust Deep Reinforcement Learning

Title: ProFed: a Benchmark for Proximity-based non-IID Federated Learning

Title: Collaborative Storytelling and LLM: A Linguistic Analysis of Automatically-Generated Role-Playing Game Sessions

Title: β-GNN: A Robust Ensemble Approach Against Graph Structure Perturbation

Title: PVLens: Enhancing Pharmacovigilance Through Automated Label Extraction

Title: Unlocking Efficient Long-to-Short LLM Reasoning with Model Merging

Title: MMGen: Unified Multi-modal Image Generation and Understanding in One Go

Title: Imitating Radiological Scrolling: A Global-Local Attention Model for 3D Chest CT Volumes Multi-Label Anomaly Classification

Title: DR-PETS: Learning-Based Control With Planning in Adversarial Environments

Title: ARMO: Autoregressive Rigging for Multi-Category Objects

Title: Mitigating Low-Level Visual Hallucinations Requires Self-Awareness: Database, Model and Training Strategy

Title: Flip Learning: Weakly Supervised Erase to Segment Nodules in Breast Ultrasound

Title: From Annotation to Adaptation: Metrics, Synthetic Data, and Aspect Extraction for Aspect-Based Sentiment Analysis with Large Language Models

Title: Learning Straight Flows by Learning Curved Interpolants

Title: A weakly-supervised deep learning model for fast localisation and delineation of the skeleton, internal organs, and spinal canal on Whole-Body Diffusion-Weighted MRI (WB-DWI)

Title: Dynamic Motion Blending for Versatile Motion Editing

Title: RecTable: Fast Modeling Tabular Data with Rectified Flow

Title: SChanger: Change Detection from a Semantic Change and Spatial Consistency Perspective

Title: High Quality Diffusion Distillation on a Single GPU with Relative and Absolute Position Matching

Title: MATHGLANCE: Multimodal Large Language Models Do Not Know Where to Look in Mathematical Diagrams

Title: UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines

Title: Optimal Scaling Laws for Efficiency Gains in a Theoretical Transformer-Augmented Sectional MoE Framework

Title: Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning

Title: MindfulLIME: A Stable Solution for Explanations of Machine Learning Models with Enhanced Localization Precision -- A Medical Image Case Study

Title: Reliable algorithm selection for machine learning-guided design

Title: An Empirical Study of the Impact of Federated Learning on Machine Learning Model Accuracy

Title: Disentangled Source-Free Personalization for Facial Expression Recognition with Neutral Target Data

Title: Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields

Title: BASKET: A Large-Scale Video Dataset for Fine-Grained Skill Estimation

Title: Zero-Shot Audio-Visual Editing via Cross-Modal Delta Denoising

Title: FB-4D: Spatial-Temporal Coherent Dynamic 3D Content Generation with Feature Banks

Title: Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency

Title: Mobile-MMLU: A Mobile Intelligence Language Understanding Benchmark