2025-01-28

Title: Towards Foundation Models: Evaluation of Geoscience Artificial Intelligence with Uncertainty

Title: An Ensemble Model with Attention Based Mechanism for Image Captioning

Title: Unmasking Conversational Bias in AI Multiagent Systems

Title: Wormhole Memory: A Rubik's Cube for Cross-Dialogue Retrieval

Title: On the locality bias and results in the Long Range Arena

Title: JustLogic: A Comprehensive Benchmark for Evaluating Deductive Reasoning in Large Language Models

Title: Dynamic Adaptation of LoRA Fine-Tuning for Efficient and Task-Specific Optimization of Large Language Models

Title: DrawEduMath: Evaluating Vision Language Models with Expert-Annotated Students' Hand-Drawn Math Images

Title: Verify with Caution: The Pitfalls of Relying on Imperfect Factuality Metrics

Title: Hybrid Interpretable Deep Learning Framework for Skin Cancer Diagnosis: Integrating Radial Basis Function Networks with Explainable AI

Title: Measuring and Mitigating Hallucinations in Vision-Language Dataset Generation for Remote Sensing

Title: Feasible Learning

Title: Light3R-SfM: Towards Feed-forward Structure-from-Motion

Title: Self-reflecting Large Language Models: A Hegelian Dialectical Approach

Title: Interpretability in Parameter Space: Minimizing Mechanistic Description Length with Attribution-based Parameter Decomposition

Title: Decision Making in Changing Environments: Robustness, Query-Based Learning, and Differential Privacy

Title: Motion-enhancement to Echocardiography Segmentation via Inserting a Temporal Attention Module: An Efficient, Adaptable, and Scalable Approach

Title: Context-Aware Neural Gradient Mapping for Fine-Grained Instruction Processing

Title: CASE-Bench: Context-Aware Safety Evaluation Benchmark for Large Language Models

Title: MATCHA: Towards Matching Anything

Title: E-Gen: Leveraging E-Graphs to Improve Continuous Representations of Symbolic Expressions

Title: ExPerT: Effective and Explainable Evaluation of Personalized Long-Form Text Generation

Title: LLM4DistReconfig: A Fine-tuned Large Language Model for Power Distribution Network Reconfiguration

Title: Personalized Layer Selection for Graph Neural Networks

Title: A Deep State Space Model for Rainfall-Runoff Simulations

Title: DepressionX: Knowledge Infused Residual Attention for Explainable Depression Severity Assessment

Title: Advances in Set Function Learning: A Survey of Techniques and Applications

Title: Federated Retrieval Augmented Generation for Multi-Product Question Answering

Title: VideoPure: Diffusion-based Adversarial Purification for Video Recognition

Title: MDEval: Evaluating and Enhancing Markdown Awareness in Large Language Models

Title: Towards Distributed Backdoor Attacks with Network Detection in Decentralized Federated Learning

Title: HuGDiffusion: Generalizable Single-Image Human Rendering via 3D Gaussian Diffusion

Title: On Accelerating Edge AI: Optimizing Resource-Constrained Environments

Title: Utilizing Graph Neural Networks for Effective Link Prediction in Microservice Architectures

Title: AKVQ-VL: Attention-Aware KV Cache Adaptive 2-Bit Quantization for Vision-Language Models

Title: Using Large Language Models for education managements in Vietnamese with low resources

Title: A Portable and Stealthy Inaudible Voice Attack Based on Acoustic Metamaterials

Title: Semi-supervised Anomaly Detection with Extremely Limited Labels in Dynamic Graphs

Title: Adaptive Client Selection in Federated Learning: A Network Anomaly Detection Use Case

Title: Towards Robust Unsupervised Attention Prediction in Autonomous Driving

Title: An Attempt to Unraveling Token Prediction Refinement and Identifying Essential Layers of Large Language Models

Title: KETA: Kinematic-Phrases-Enhanced Text-to-Motion Generation via Fine-grained Alignment

Title: PolaFormer: Polarity-aware Linear Attention for Vision Transformers

Title: Exact Fit Attention in Node-Holistic Graph Convolutional Network for Improved EEG-Based Driver Fatigue Detection

Title: Task Arithmetic in Trust Region: A Training-Free Model Merging Approach to Navigate Knowledge Conflicts

Title: Unifying Prediction and Explanation in Time-Series Transformers via Shapley-based Pretraining

Title: SpatioTemporal Learning for Human Pose Estimation in Sparsely-Labeled Videos

Title: PatentLMM: Large Multimodal Model for Generating Descriptions for Patent Figures

Title: Cryptanalysis via Machine Learning Based Information Theoretic Metrics

Title: NetChain: Authenticated Blockchain Top-k Graph Data Queries and its Application in Asset Management

Title: Hierarchical Pattern Decryption Methodology for Ransomware Detection Using Probabilistic Cryptographic Footprints

Title: LongReason: A Synthetic Long-Context Reasoning Benchmark via Context Expansion

Title: Speech Translation Refinement using Large Language Models

Title: Towards Better Robustness: Progressively Joint Pose-3DGS Learning for Arbitrarily Long Videos

Title: CFT-RAG: An Entity Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter

Title: Bringing RGB and IR Together: Hierarchical Multi-Modal Enhancement for Robust Transmission Line Detection

Title: Comprehensive Evaluation of Cloaking Backdoor Attacks on Object Detector in Real-World

Title: Each Rank Could be an Expert: Single-Ranked Mixture of Experts LoRA for Multi-Task Learning

Title: Knowledge Hierarchy Guided Biological-Medical Dataset Distillation for Domain LLM Training

Title: HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding

Title: Task-KV: Task-aware KV Cache Optimization via Semantic Differentiation of Attention Heads

Title: TranStable: Towards Robust Pixel-level Online Video Stabilization by Jointing Transformer and CNN

Title: Analyzing and Boosting the Power of Fine-Grained Visual Recognition for Multi-modal Large Language Models

Title: Exploring Primitive Visual Measurement Understanding and the Role of Output Format in Learning in Vision-Language Models

Title: PromptShield: Deployable Detection for Prompt Injection Attacks

Title: SpikSSD: Better Extraction and Fusion for Object Detection with Spiking Neuron Networks

Title: Enhancing Intent Understanding for Ambiguous Prompts through Human-Machine Co-Adaptation

Title: Option-ID Based Elimination For Multiple Choice Questions

Title: Uni-Sign: Toward Unified Sign Language Understanding at Scale

Title: A Floating Normalization Scheme for Deep Learning-Based Custom-Range Parameter Extraction in BSIM-CMG Compact Models

Title: A Training-free Synthetic Data Selection Method for Semantic Segmentation

Title: "Stones from Other Hills can Polish Jade": Zero-shot Anomaly Image Synthesis via Cross-domain Anomaly Injection

Title: Efficient and Interpretable Neural Networks Using Complex Lehmer Transform

Title: SEAL: Scaling to Emphasize Attention for Long-Context Retrieval

Title: Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning

Title: ASRank: Zero-Shot Re-Ranking with Answer Scent for Document Retrieval

Title: Prompting ChatGPT for Chinese Learning as L2: A CEFR and EBCL Level Study

Title: Enhancing Fetal Plane Classification Accuracy with Data Augmentation Using Diffusion Models

Title: Generalizable Deepfake Detection via Effective Local-Global Feature Extraction

Title: Lightweight and Post-Training Structured Pruning for On-Device Large Language Models

Title: Pre-trained Model Guided Mixture Knowledge Distillation for Adversarial Federated Learning

Title: Dynamic Estimation of Tea Flowering Based on an Improved YOLOv5 and ANN Model

Title: Explainable YOLO-Based Dyslexia Detection in Synthetic Handwriting Data

Title: Enhanced Intrusion Detection in IIoT Networks: A Lightweight Approach with Autoencoder-Based Feature Learning

Title: New Evaluation Paradigm for Lexical Simplification

Title: Mirage in the Eyes: Hallucination Attack on Multi-modal Large Language Models with Only Attention Sink

Title: Killing it with Zero-Shot: Adversarially Robust Novelty Detection

Title: PIP: Perturbation-based Iterative Pruning for Large Language Models

Title: Pre-training a Transformer-Based Generative Model Using a Small Sepedi Dataset

Title: Are Human Interactions Replicable by Generative Agents? A Case Study on Pronoun Usage in Hierarchical Interactions

Title: Efficient Point Clouds Upsampling via Flow Matching

Title: A Two-Stage CAE-Based Federated Learning Framework for Efficient Jamming Detection in 5G Networks

Title: Advanced Real-Time Fraud Detection Using RAG-Based LLMs

Title: Deep Learning in Early Alzheimers diseases Detection: A Comprehensive Survey of Classification, Segmentation, and Feature Extraction Methods

Title: You Only Prune Once: Designing Calibration-Free Model Compression With Policy Learning

Title: The Multicultural Medical Assistant: Can LLMs Improve Medical ASR Errors Across Borders?

Title: I Know What You Did Last Summer: Identifying VR User Activity Through VR Network Traffic

Title: ToMoE: Converting Dense Large Language Models to Mixture-of-Experts through Dynamic Structural Pruning

Title: A Post-Processing-Based Fair Federated Learning Framework

Title: Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data

Title: Large Language Models as Theory of Mind Aware Generative Agents with Counterfactual Reflection

Title: Federated Class-Incremental Learning: A Hybrid Approach Using Latent Exemplars and Data-Free Techniques to Address Local and Global Forgetting

Title: Decentralized Low-Rank Fine-Tuning of Large Language Models

Title: AI-Driven Secure Data Sharing: A Trustworthy and Privacy-Preserving Approach

Title: A Transfer Learning Framework for Anomaly Detection in Multivariate IoT Traffic Data

Title: iFormer: Integrating ConvNet and Transformer for Mobile Application

Title: Scaling Large Vision-Language Models for Enhanced Multimodal Comprehension In Biomedical Image Analysis

Title: Evaluating the Effectiveness of XAI Techniques for Encoder-Based Language Models

Title: Fine Tuning without Catastrophic Forgetting via Selective Low Rank Adaptation

Title: MetaOcc: Surround-View 4D Radar and Camera Fusion Framework for 3D Occupancy Prediction with Dual Training Strategies

Title: DDUNet: Dual Dynamic U-Net for Highly-Efficient Cloud Segmentation

Title: CP2M: Clustered-Patch-Mixed Mosaic Augmentation for Aerial Image Segmentation

Title: Doracamom: Joint 3D Detection and Occupancy Prediction with Multi-view 4D Radars and Cameras for Omnidirectional Perception

Title: Hiding in Plain Sight: An IoT Traffic Camouflage Framework for Enhanced Privacy

Title: How Green are Neural Language Models? Analyzing Energy Consumption in Text Summarization Fine-tuning

Title: Semantic Layered Embedding Diffusion in Large Language Models for Multi-Contextual Consistency

Title: Episodic Novelty Through Temporal Distance

Title: Visual Generation Without Guidance

Title: OpenCharacter: Training Customizable Role-Playing LLMs with Large-Scale Synthetic Personas

Title: Self-supervised Benchmark Lottery on ImageNet: Do Marginal Improvements Translate to Improvements on Similar Datasets?

Title: Mitigating Spurious Negative Pairs for Robust Industrial Anomaly Detection

Title: Making Sense Of Distributed Representations With Activation Spectroscopy

Title: Dfilled: Repurposing Edge-Enhancing Diffusion for Guided DSM Void Filling

Title: InfoBFR: Real-World Blind Face Restoration via Information Bottleneck

Title: StochSync: Stochastic Diffusion Synchronization for Image Generation in Arbitrary Spaces

Title: Token Democracy: The Architectural Limits of Alignment in Transformer-Based Language Models

Title: SQ-DM: Accelerating Diffusion Models with Aggressive Quantization and Temporal Sparsity

Title: STATE ToxiCN: A Benchmark for Span-level Target-Aware Toxicity Extraction in Chinese Hate Speech Detection

Title: Identifying Critical Tokens for Accurate Predictions in Transformer-based Medical Imaging Models

Title: Data-adaptive Safety Rules for Training Reward Models

Title: FiberPool: Leveraging Multiple Blockchains for Decentralized Pooled Mining

Title: TractoGPT: A GPT architecture for White Matter Segmentation

Title: Low-altitude Friendly-Jamming for Satellite-Maritime Communications via Generative AI-enabled Deep Reinforcement Learning

Title: CISOL: An Open and Extensible Dataset for Table Structure Recognition in the Construction Industry

Title: LoRAGuard: An Effective Black-box Watermarking Approach for LoRAs

Title: FedAlign: Federated Domain Generalization with Cross-Client Feature Alignment

Title: Color Flow Imaging Microscopy Improves Identification of Stress Sources of Protein Aggregates in Biopharmaceuticals

Title: Domain Adaptation from Generated Multi-Weather Images for Unsupervised Maritime Object Classification

Title: FIT-Print: Towards False-claim-resistant Model Ownership Verification via Targeted Fingerprint

Title: Universal Image Restoration Pre-training via Degradation Classification

Title: Fuzzy-aware Loss for Source-free Domain Adaptation in Visual Emotion Recognition

Title: UNIDOOR: A Universal Framework for Action-Level Backdoor Attacks in Deep Reinforcement Learning

Title: Advancing Generative Artificial Intelligence and Large Language Models for Demand Side Management with Electric Vehicles

Title: Building Efficient Lightweight CNN Models

Title: Optimal Transport on Categorical Data for Counterfactuals using Compositional Data and Dirichlet Transport

Title: Real-CATS: A Practical Training Ground for Emerging Research on Cryptocurrency Cybercrime Detection

Title: BoTier: Multi-Objective Bayesian Optimization with Tiered Composite Objectives

Title: Distributionally Robust Graph Out-of-Distribution Recommendation via Diffusion Model

Title: Ocean-OCR: Towards General OCR Application via a Vision-Language Model

Title: CE-SDWV: Effective and Efficient Concept Erasure for Text-to-Image Diffusion Models via a Semantic-Driven Word Vocabulary

Title: PCAP-Backdoor: Backdoor Poisoning Generator for Network Traffic in CPS/IoT Environments

Title: ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer

Title: Cross-Cultural Fashion Design via Interactive Large Language Models and Diffusion Models

Title: Approximate Message Passing for Bayesian Neural Networks

Title: Instruction Tuning for Story Understanding and Generation with Weak Supervision

Title: A Complexity-Informed Approach to Optimise Cyber Defences

Title: ConceptCLIP: Towards Trustworthy Medical AI via Concept-Enhanced Contrastive Language-Image Pre-training

Title: Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework

Title: SCP-116K: A High-Quality Problem-Solution Dataset and a Generalized Pipeline for Automated Extraction in the Higher Education Science Domain

Title: SedarEval: Automated Evaluation using Self-Adaptive Rubrics

Title: IPVTON: Image-based 3D Virtual Try-on with Image Prompt Adapter

Title: Improving Estonian Text Simplification through Pretrained Language Models and Custom Datasets

Title: HardML: A Benchmark For Evaluating Data Science And Machine Learning knowledge and reasoning in AI

Title: Quantum-Enhanced Attention Mechanism in NLP: A Hybrid Classical-Quantum Approach

Title: A Comprehensive Survey on Self-Interpretable Neural Networks

Title: Bringing Characters to New Stories: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting

Title: Can Pose Transfer Models Generate Realistic Human Motion?

Title: A Privacy Enhancing Technique to Evade Detection by Street Video Cameras Without Using Adversarial Accessories

Title: People who frequently use ChatGPT for writing tasks are accurate and robust detectors of AI-generated text

Title: A Machine Learning Approach to Automatic Fall Detection of Combat Soldiers

Title: Classifying Deepfakes Using Swin Transformers

Title: StagFormer: Time Staggering Transformer Decoding for Running Layers In Parallel

Title: MimicGait: A Model Agnostic approach for Occluded Gait Recognition using Correlational Knowledge Distillation

Title: TensorLLM: Tensorising Multi-Head Attention for Enhanced Reasoning and Compression in LLMs

Title: Exploring the Feasibility of Deep Learning Models for Long-term Disease Prediction: A Case Study for Wheat Yellow Rust in England

Title: Transformer-Based Multimodal Knowledge Graph Completion with Link-Aware Contexts

Title: Random Walk Guided Hyperbolic Graph Distillation

Title: Adapting Biomedical Abstracts into Plain language using Large Language Models

Title: Disentanglement Analysis in Deep Latent Variable Models Matching Aggregate Posterior Distributions

Title: StaICC: Standardized Evaluation for Classification Task in In-context Learning

Title: CENSOR: Defense Against Gradient Inversion via Orthogonal Subspace Bayesian Sampling

Title: A Survey on Computational Pathology Foundation Models: Datasets, Adaptation Strategies, and Evaluation Tasks

Title: Integrating Personalized Federated Learning with Control Systems for Enhanced Performance

Title: Renewable Energy Prediction: A Comparative Study of Deep Learning Models for Complex Dataset Analysis

Title: IndicMMLU-Pro: Benchmarking the Indic Large Language Models

Title: A Privacy Model for Classical & Learned Bloom Filters

Title: Weight-based Analysis of Detokenization in Language Models: Understanding the First Stage of Inference Without Inference

Title: GraphICL: Unlocking Graph Learning Potential in LLMs through Structured Prompt Design

Title: Efficiency Bottlenecks of Convolutional Kolmogorov-Arnold Networks: A Comprehensive Scrutiny with ImageNet, AlexNet, LeNet and Tabular Classification

Title: Investigating Application of Deep Neural Networks in Intrusion Detection System Design

Title: Is It Navajo? Accurate Language Detection in Endangered Athabaskan Languages

Title: Efficient Attention-Sharing Information Distillation Transformer for Lightweight Single Image Super-Resolution

Title: Do Existing Testing Tools Really Uncover Gender Bias in Text-to-Image Models?

Title: Large Language Models to Diffusion Finetuning

Title: Memorization and Regularization in Generative Diffusion Models

Title: Can Multimodal Large Language Models be Guided to Improve Industrial Anomaly Detection?

Title: LemmaHead: RAG Assisted Proof Generation Using Large Language Models

Title: MM-Retinal V2: Transfer an Elite Knowledge Spark into Fundus Vision-Language Pretraining

Title: ClearSight: Human Vision-Inspired Solutions for Event-Based Motion Deblurring

Title: MADP: Multi-Agent Deductive Planning for Enhanced Cognitive-Behavioral Mental Health Question Answer

Title: Intelligent Code Embedding Framework for High-Precision Ransomware Detection via Multimodal Execution Path Analysis

Title: Controllable Hand Grasp Generation for HOI and Efficient Evaluation Methods

Title: Beyond In-Distribution Performance: A Cross-Dataset Study of Trajectory Prediction Robustness

Title: Can Location Embeddings Enhance Super-Resolution of Satellite Imagery?

Title: LLM-attacker: Enhancing Closed-loop Adversarial Scenario Generation for Autonomous Driving with Large Language Models

Title: D-PLS: Decoupled Semantic Segmentation for 4D-Panoptic-LiDAR-Segmentation

Title: LCTG Bench: LLM Controlled Text Generation Benchmark

Title: Slot-Guided Adaptation of Pre-trained Diffusion Models for Object-Centric Learning and Compositional Generation

Title: A Data-Centric Approach: Dimensions of Visual Complexity and How to find Them

Title: Investigating the Sensitivity of Pre-trained Audio Embeddings to Common Effects

Title: The Sample Complexity of Online Reinforcement Learning: A Multi-model Perspective

Title: Web Execution Bundles: Reproducible, Accurate, and Archivable Web Measurements

Title: Parametric Retrieval Augmented Generation

Title: TimeHF: Billion-Scale Time Series Models Guided by Human Feedback

Title: Enhancing the Convergence of Federated Learning Aggregation Strategies with Limited Data

Title: Inverse Reinforcement Learning via Convex Optimization

Title: An Explainable Disease Surveillance System for Early Prediction of Multiple Chronic Diseases

Title: Integrating Probabilistic Trees and Causal Networks for Clinical and Epidemiological Data

Title: Provisioning Time-Based Subscription in NDN: A Secure and Efficient Access Control Scheme

Title: MatCLIP: Light- and Shape-Insensitive Assignment of PBR Material Models

Title: 3CEL: A corpus of legal Spanish contract clauses

Title: Improving Tropical Cyclone Forecasting With Video Diffusion Models

Title: TOPLOC: A Locality Sensitive Hashing Scheme for Trustless Verifiable Inference

Title: Freestyle Sketch-in-the-Loop Image Segmentation

Title: FDLLM: A Text Fingerprint Detection Method for LLMs in Multi-Language, Multi-Domain Black-Box Environments

Title: Addressing Out-of-Label Hazard Detection in Dashcam Videos: Insights from the COOOL Challenge

Title: CILP-FGDI: Exploiting Vision-Language Model for Generalizable Person Re-Identification

Title: PISCO: Pretty Simple Compression for Retrieval-Augmented Generation

Title: RelCAT: Advancing Extraction of Clinical Inter-Entity Relationships from Unstructured Electronic Health Records

Title: Integration of LLM Quality Assurance into an NLG System

Title: Generating Spatial Synthetic Populations Using Wasserstein Generative Adversarial Network: A Case Study with EU-SILC Data for Helsinki and Thessaloniki

Title: Automated Detection of Sport Highlights from Audio and Video Sources

Title: Towards Explainable Multimodal Depression Recognition for Clinical Interviews

Title: A Unified Analysis of Stochastic Gradient Descent with Arbitrary Data Permutations and Beyond

Title: Efficient Portrait Matte Creation With Layer Diffusion and Connectivity Priors

Title: MILP initialization for solving parabolic PDEs with PINNs

Title: AdaCoT: Rethinking Cross-Lingual Factual Reasoning through Adaptive Chain-of-Thought

Title: Demystifying OS Kernel Fuzzing with a Novel Taxonomy

Title: BAG: Body-Aligned 3D Wearable Asset Generation

Title: SWIFT: Mapping Sub-series with Wavelet Decomposition Improves Time Series Forecasting

Title: The Linear Attention Resurrection in Vision Transformer

Title: UDBE: Unsupervised Diffusion-based Brightness Enhancement in Underwater Images

Title: Provence: efficient and robust context pruning for retrieval-augmented generation

Title: Automatic Calibration of a Multi-Camera System with Limited Overlapping Fields of View for 3D Surgical Scene Reconstruction

Title: Language-Based Bayesian Optimization Research Assistant (BORA)

Title: PDC-ViT: Source Camera Identification using Pixel Difference Convolution and Vision Transformer

Title: Application of Structured State Space Models to High energy physics with locality-sensitive hashing

Title: Distilling foundation models for robust and efficient models in digital pathology

Title: Phase Transitions in Large Language Models and the $O(N)$ Model

Title: CLISC: Bridging CLIP and SAM by Enhanced CAM for Unsupervised Brain Tumor Segmentation

Title: Zero-Shot Decision Tree Construction via Large Language Models

Title: Multi-Agent Geospatial Copilots for Remote Sensing Workflows

Title: A foundation model for human-AI collaboration in medical literature mining

Title: URAG: Implementing a Unified Hybrid RAG for Precise Answers in University Admission Chatbots -- A Case Study at HCMUT

Title: Multi-view Structural Convolution Network for Domain-Invariant Point Cloud Recognition of Autonomous Vehicles

Title: Mixture-of-Mamba: Enhancing Multi-Modal State-Space Models with Modality-Aware Sparsity

Title: FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers

Title: Large Models in Dialogue for Active Perception and Anomaly Detection

Title: Matryoshka Re-Ranker: A Flexible Re-Ranking Architecture With Configurable Depth and Width

Title: RAPID: Retrieval-Augmented Parallel Inference Drafting for Text-Based Video Event Retrieval

Title: Tailored Forecasting from Short Time Series via Meta-learning

Title: sDREAMER: Self-distilled Mixture-of-Modality-Experts Transformer for Automatic Sleep Staging

Title: RelightVid: Temporal-Consistent Diffusion Model for Video Relighting