2025-02-24

Title: KKA: Improving Vision Anomaly Detection through Anomaly-related Knowledge from Large Language Models

Title: A Survey of Safety on Large Vision-Language Models: Attacks, Defenses and Evaluations

Title: From 16-Bit to 1-Bit: Visual KV Cache Quantization for Memory-Efficient Multimodal Large Language Models

Title: SEM-CLIP: Precise Few-Shot Learning for Nanoscale Defect Detection in Scanning Electron Microscope Image

Title: Surgical Scene Understanding in the Era of Foundation AI Models: A Comprehensive Review

Title: Vision-Enhanced Time Series Forecasting via Latent Diffusion Models

Title: The Multi-Faceted Monosemanticity in Multimodal Representations

Title: Narrowing Information Bottleneck Theory for Multimodal Image-Text Representations Interpretability

Title: WeedVision: Multi-Stage Growth and Classification of Weeds using DETR and RetinaNet for Precision Agriculture

Title: CoDiff: Conditional Diffusion Model for Collaborative 3D Object Detection

Title: NOTA: Multimodal Music Notation Understanding for Visual Large Language Model

Title: FOCUS on Contamination: A Geospatial Deep Learning Framework with a Noise-Aware Loss for Surface Water PFAS Prediction

Title: A Comprehensive Survey on Concept Erasure in Text-to-Image Diffusion Models

Title: Retrieval-augmented systems can be dangerous medical communicators

Title: Can AI mimic the human ability to define neologisms?

Title: PathRAG: Pruning Graph-based Retrieval Augmented Generation with Relational Paths

Title: Think Inside the JSON: Reinforcement Strategy for Strict LLM Schema Adherence

Title: Beyond Words: Exploring Cultural Value Sensitivity in Multimodal Models

Title: GneissWeb: Preparing High Quality Data for LLMs at Scale

Title: KOALA: Knowledge Conflict Augmentations for Robustness in Vision Language Models

Title: EvoP: Robust LLM Inference via Evolutionary Pruning

Title: Batayan: A Filipino NLP benchmark for evaluating Large Language Models

Title: Universal Semantic Embeddings of Chemical Elements for Enhanced Materials Inference and Discovery

Title: OpenSearch-SQL: Enhancing Text-to-SQL with Dynamic Few-shot and Consistency Alignment

Title: What Is a Good Caption? A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Coverage of MLLMs

Title: Sce2DriveX: A Generalized MLLM Framework for Scene-to-Drive Learning

Title: RAPTOR: Refined Approach for Product Table Object Recognition

Title: The Canary's Echo: Auditing Privacy Risks of LLM-Generated Synthetic Text

Title: SIFT: Grounding LLM Reasoning in Contexts via Stickers

Title: A Tale of Two Structures: Do LLMs Capture the Fractal Complexity of Language?

Title: Learning to Retrieve and Reason on Knowledge Graph through Active Self-Reflection

Title: Online hand gesture recognition using Continual Graph Transformers

Title: FacaDiffy: Inpainting Unseen Facade Parts Using Diffusion Models

Title: KITAB-Bench: A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding

Title: CyberSentinel: An Emergent Threat Detection System for AI Security

Title: Beyond No: Quantifying AI Over-Refusal and Emotional Attachment Boundaries

Title: EigenShield: Causal Subspace Filtering via Random Matrix Theory for Adversarially Robust Vision-Language Models

Title: LAVID: An Agentic LVLM Framework for Diffusion-Generated Video Detection

Title: Generative Modeling of Individual Behavior at Scale

Title: LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Title: Contextualizing Search Queries In-Context Learning for Conversational Rewriting with LLMs

Title: Obliviate: Efficient Unmemorization for Protecting Intellectual Property in Large Language Models

Title: CrossOver: 3D Scene Cross-Modal Alignment

Title: Graph in the Vault: Protecting Edge GNN Inference with Trusted Execution Environment

Title: Accelerating Neural Network Training: An Analysis of the AlgoPerf Competition

Title: TimeDistill: Efficient Long-Term Time Series Forecasting with MLP via Cross-Architecture Distillation

Title: Interpreting Adversarial Attacks and Defences using Architectures with Enhanced Interpretability

Title: Using tournaments to calculate AUROC for zero-shot classification with LLMs

Title: MACPruning: Dynamic Operation Pruning to Mitigate Side-Channel DNN Model Extraction

Title: Simpler Fast Vision Transformers with a Jumbo CLS Token

Title: A Meta-Evaluation of Style and Attribute Transfer Metrics

Title: GeoAggregator: An Efficient Transformer Model for Geo-Spatial Tabular Data

Title: Reducing Hallucinations of Medical Multimodal Large Language Models with Visual Retrieval-Augmented Generation

Title: Benchmarking Android Malware Detection: Rethinking the Role of Traditional and Deep Learning Models

Title: Rare Disease Differential Diagnosis with Large Language Models at Scale: From Abdominal Actinomycosis to Wilson's Disease

Title: Visualizing Machine Learning Models for Enhanced Financial Decision-Making and Risk Management

Title: More for Keys, Less for Values: Adaptive KV Cache Quantization

Title: Hardware-Friendly Static Quantization Method for Video Diffusion Transformers

Title: UPCORE: Utility-Preserving Coreset Selection for Balanced Unlearning

Title: Is Safety Standard Same for Everyone? User-Specific Safety Evaluation of Large Language Models

Title: Analyze the Neurons, not the Embeddings: Understanding When and Where LLM Representations Align with Humans

Title: Optimizing Singular Spectrum for Large Language Model Compression

Title: Judging It, Washing It: Scoring and Greenwashing Corporate Climate Disclosures using Large Language Models

Title: LUME: LLM Unlearning with Multitask Evaluations

Title: Leveraging ChatGPT for Sponsored Ad Detection and Keyword Extraction in YouTube Videos

Title: Assessing a Single Student's Concentration on Learning Platforms: A Machine Learning-Enhanced EEG-Based Framework

Title: Unveiling Reasoning Thresholds in Language Models: Scaling, Fine-Tuning, and Interpretability through Attention Maps

Title: DAM-Seg: Anatomically accurate cardiac segmentation using Dense Associative Networks

Title: TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba

Title: CoT-ICL Lab: A Petri Dish for Studying Chain-of-Thought Learning from In-Context Demonstrations

Title: Chain-of-Rank: Enhancing Large Language Models for Domain-Specific RAG in Edge Device

Title: Do LLMs Make Mistakes Like Students? Exploring Natural Alignment between Language Models and Human Error Patterns

Title: Confidence-Weighted Boundary-Aware Learning for Semi-Supervised Semantic Segmentation

Title: Investigating the Adaptive Robustness with Knowledge Conflicts in LLM-based Multi-Agent Systems

Title: Extreme Speech Classification in the Era of LLMs: Exploring Open-Source and Proprietary Models

Title: M3-AGIQA: Multimodal, Multi-Round, Multi-Aspect AI-Generated Image Quality Assessment

Title: Methods and Trends in Detecting Generated Images: A Comprehensive Review

Title: Optimizing Product Provenance Verification using Data Valuation Methods

Title: Nonlinear Dynamical Systems for Automatic Face Annotation in Head Tracking and Pose Estimation

Title: Hierarchical Context Transformer for Multi-level Semantic Scene Understanding

Title: Image Translation-Based Unsupervised Cross-Modality Domain Adaptation for Medical Image Segmentation

Title: TETRIS: Optimal Draft Token Selection for Batch Speculative Decoding

Title: UrbanSAM: Learning Invariance-Inspired Adapters for Segment Anything Models in Urban Construction

Title: FlipConcept: Tuning-Free Multi-Concept Personalization for Text-to-Image Generation

Title: Unveiling Attractor Cycles in Large Language Models: A Dynamical Systems View of Successive Paraphrasing

Title: The Evolving Landscape of LLM- and VLM-Integrated Reinforcement Learning

Title: Auto-Bench: An Automated Benchmark for Scientific Discovery in LLMs

Title: Understand User Opinions of Large Language Models via LLM-Powered In-the-Moment User Experience Interviews

Title: AutoMR: A Universal Time Series Motion Recognition Pipeline

Title: A General Pseudonymization Framework for Cloud-Based LLMs: Replacing Privacy Information in Controlled Text Generation

Title: Multi-agent Multi-armed Bandits with Minimum Reward Guarantee Fairness

Title: Real-Time Moving Flock Detection in Pedestrian Trajectories Using Sequential Deep Learning Models

Title: LightMamba: Efficient Mamba Acceleration on FPGA with Quantization and Hardware Co-design

Title: Corrections Meet Explanations: A Unified Framework for Explainable Grammatical Error Correction

Title: Retrieval-Augmented Speech Recognition Approach for Domain Challenges

Title: A Training-free LLM-based Approach to General Chinese Character Error Correction

Title: On the (In)Security of Non-resettable Device Identifiers in Custom Android Systems

Title: Analyzing the Inner Workings of Transformers in Compositional Generalization

Title: CopyJudge: Automated Copyright Infringement Identification and Mitigation in Text-to-Image Diffusion Models

Title: DITING: A Static Analyzer for Identifying Bad Partitioning Issues in TEE Applications

Title: Soybean pod and seed counting in both outdoor fields and indoor laboratories using unions of deep neural networks

Title: Round Attention: A Novel Round-Level Attention Mechanism to Accelerate LLM Inference

Title: SVDq: 1.25-bit and 410x Key Cache Compression for LLM Attention

Title: Road Traffic Sign Recognition method using Siamese network Combining Efficient-CNN based Encoder

Title: Tight Clusters Make Specialized Experts

Title: SentiFormer: Metadata Enhanced Transformer for Image Sentiment Analysis

Title: Detecting Future-related Contexts of Entity Mentions

Title: Attention Eclipse: Manipulating Attention to Bypass LLM Safety-Alignment

Title: Stepwise Informativeness Search for Improving LLM Reasoning

Title: Learning with Limited Shared Information in Multi-agent Multi-armed Bandit

Title: PFSD: A Multi-Modal Pedestrian-Focus Scene Dataset for Rich Tasks in Semi-Structured Environments

Title: Tokenization is Sensitive to Language Variation

Title: Efficiently Solving Discounted MDPs with Predictions on Transition Matrices

Title: Constructing a Norm for Children's Scientific Drawing: Distribution Features Based on Semantic Similarity of Large Language Models

Title: AttentionEngine: A Versatile Framework for Efficient Attention Mechanisms on Diverse Hardware Platforms

Title: Evaluating Social Biases in LLM Reasoning

Title: Weakly Supervised Video Scene Graph Generation via Natural Language Supervision

Title: MOVE: A Mixture-of-Vision-Encoders Approach for Domain-Focused Vision-Language Processing

Title: Enhancing Vehicle Make and Model Recognition with 3D Attention Modules

Title: Problem-Solving Logic Guided Curriculum In-Context Learning for LLMs Complex Reasoning

Title: Evaluate with the Inverse: Efficient Approximation of Latent Explanation Quality Distribution

Title: HiFi-KPI: A Dataset for Hierarchical KPI Extraction from Earnings Filings

Title: MHQA: A Diverse, Knowledge Intensive Mental Health Question Answering Challenge for Language Models

Title: Beyond Translation: LLM-Based Data Generation for Multilingual Fact-Checking

Title: Evaluating Multimodal Generative AI with Korean Educational Standards

Title: Adversarial Prompt Evaluation: Systematic Benchmarking of Guardrails Against Prompt Input Attacks on LLMs

Title: Pub-Guard-LLM: Detecting Fraudulent Biomedical Articles with Reliable Explanations

Title: Mixup Model Merge: Enhancing Model Merging Performance through Randomized Linear Interpolation

Title: Single-pass Detection of Jailbreaking Input in Large Language Models

Title: Fed-SB: A Silver Bullet for Extreme Communication Efficiency and Performance in (Private) Federated LoRA Fine-Tuning

Title: When Compression Meets Model Compression: Memory-Efficient Double Compression for Large Language Models

Title: MVIP -- A Dataset and Methods for Application Oriented Multi-View and Multi-Modal Industrial Part Recognition

Title: A fast convergence algorithm based on binary integer programming for expert load balancing in MoE LLMs

Title: R-LoRA: Random Initialization of Multi-Head LoRA for Multi-Task Learning

Title: Memory Helps, but Confabulation Misleads: Understanding Streaming Events in Videos with MLLMs

Title: Decoding for Punctured Convolutional and Turbo Codes: A Deep Learning Solution for Protocols Compliance

Title: Confidence-Based Annotation Of Brain Tumours In Ultrasound

Title: ExpliCa: Evaluating Explicit Causal Reasoning in Large Language Models

Title: Network Resource Optimization for ML-Based UAV Condition Monitoring with Vibration Analysis

Title: Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models

Title: Construction and Evaluation of LLM-based agents for Semi-Autonomous penetration testing

Title: Activation Steering in Neural Theorem Provers

Title: SALSA-RL: Stability Analysis in the Latent Space of Actions for Reinforcement Learning

Title: Depth-aware Fusion Method based on Image and 4D Radar Spectrum for 3D Object Detection

Title: PIP-KAG: Mitigating Knowledge Conflicts in Knowledge-Augmented Generation via Parametric Pruning

Title: Estimating Vehicle Speed on Roadways Using RNNs and Transformers: A Video-based Approach

Title: A Defensive Framework Against Adversarial Attacks on Machine Learning-Based Network Intrusion Detection Systems

Title: Model Privacy: A Unified Framework to Understand Model Stealing Attacks and Defenses

Title: A Cautionary Tale About "Neutrally" Informative AI Tools Ahead of the 2025 Federal Elections in Germany

Title: DReSD: Dense Retrieval for Speculative Decoding

Title: Interpreting and Steering LLMs with Mutual Information-based Explanations on Sparse Autoencoders

Title: FLARE: Fault Attack Leveraging Address Reconfiguration Exploits in Multi-Tenant FPGAs

Title: Chats-Grid: An Iterative Retrieval Q&A Optimization Scheme Leveraging Large Model and Retrieval Enhancement Generation in smart grid

Title: LightThinker: Thinking Step-by-Step Compression

Title: Generalizing From Short to Long: Effective Data Synthesis for Long-Context Instruction Tuning

Title: SafeInt: Shielding Large Language Models from Jailbreak Attacks via Safety-Aware Representation Intervention

Title: Robust Bias Detection in MLMs and its Application to Human Trait Ratings

Title: WorldCraft: Photo-Realistic 3D World Creation and Customization via LLM Agents

Title: Do Multilingual LLMs Think In English?

Title: On the Robustness of Transformers against Context Hijacking for Linear Classification

Title: PDeepPP:A Deep learning framework with Pretrained Protein language for peptide classification

Title: LaTIM: Measuring Latent Token-to-Token Interactions in Mamba Models

Title: Probe Pruning: Accelerating LLMs through Dynamic Pruning via Model-Probing

Title: Extraction multi-étiquettes de relations en utilisant des couches de Transformer

Title: Mildly Accurate Computationally Differentially Private Inner Product Protocols Imply Oblivious Transfer

Title: The Relationship Between Reasoning and Performance in Large Language Models -- o3 (mini) Thinks Harder, Not Longer

Title: Continual Person Identification using Footstep-Induced Floor Vibrations on Heterogeneous Floor Structures

Title: RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes

Title: Mantis: Lightweight Calibrated Foundation Model for User-Friendly Time Series Classification

Title: Steering into New Embedding Spaces: Analyzing Cross-Lingual Alignment Induced by Model Interventions in Multilingual Language Models

Title: AutoTandemML: Active Learning Enhanced Tandem Neural Networks for Inverse Design Problems

Title: Predicting gene essentiality and drug response from perturbation screens in preclinical cancer models with LEAP: Layered Ensemble of Autoencoders and Predictors

Title: Blockchain-based Trust Management in Security Credential Management System for Vehicular Network

Title: Machine-generated text detection prevents language model collapse

Title: Almost AI, Almost Human: The Challenge of Detecting AI-Polished Writing

Title: VaViM and VaVAM: Autonomous Driving through Video Generative Modeling

Title: FLEKE: Federated Locate-then-Edit Knowledge Editing

Title: Testing the limits of fine-tuning to improve reasoning in vision language models

Title: Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training

Title: One-step Diffusion Models with $f$-Divergence Distribution Matching