2024-08-06

Title: Siamese Transformer Networks for Few-shot Image Classification

Title: Transferable Adversarial Facial Images for Privacy Protection

Title: SUSTechGAN: Image Generation for Object Recognition in Adverse Conditions of Autonomous Driving

Title: VLG-CBM: Training Concept Bottleneck Models with Vision-Language Guidance

Title: Evaluating and Enhancing Trustworthiness of LLMs in Perception Tasks

Title: MoDE: Effective Multi-task Parameter Efficient Fine-Tuning with a Mixture of Dyadic Experts

Title: Blockchain Economic Denial of Sustainability Attack: Exploiting Latency Optimization in Ethereum Transaction Forwarding

Title: Multi-Unit Floor Plan Recognition and Reconstruction Using Improved Semantic Segmentation of Raster-Wise Floor Plans

Title: Guardians of Image Quality: Benchmarking Defenses Against Adversarial Attacks on Image Quality Metrics

Title: Trainable Pointwise Decoder Module for Point Cloud Segmentation

Title: Multi-task SAR Image Processing via GAN-based Unsupervised Manipulation

Title: Counterfactual Explanations for Medical Image Classification and Regression using Diffusion Autoencoder

Title: THOR2: Leveraging Topological Soft Clustering of Color Space for Human-Inspired Object Recognition in Unseen Environments

Title: Deep Learning Approach for Ear Recognition and Longitudinal Evaluation in Children

Title: Trustworthy Machine Learning under Social and Adversarial Data Sources

Title: CYBERSECEVAL 3: Advancing the Evaluation of Cybersecurity Risks and Capabilities in Large Language Models

Title: JambaTalk: Speech-Driven 3D Talking Head Generation Based on Hybrid Transformer-Mamba Language Model

Title: Fair Risk Minimization under Causal Path-Specific Effect Constraints

Title: Transforming Slot Schema Induction with Generative Dialogue State Inference

Title: Leveraging GNSS and Onboard Visual Data from Consumer Vehicles for Robust Road Network Estimation

Title: SAT3D: Image-driven Semantic Attribute Transfer in 3D

Title: Automated Phishing Detection Using URLs and Webpages

Title: Multiple Contexts and Frequencies Aggregation Network for Deepfake Detection

Title: iControl3D: An Interactive System for Controllable 3D Scene Generation

Title: MMPKUBase: A Comprehensive and High-quality Chinese Multi-modal Knowledge Graph

Title: SiamMo: Siamese Motion-Centric 3D Object Tracking

Title: Controllable Unlearning for Image-to-Image Generative Models via $\varepsilon$-Constrained Optimization

Title: IDNet: A Novel Dataset for Identity Document Analysis and Fraud Detection

Title: TreeCSS: An Efficient Framework for Vertical Federated Learning

Title: A Comparative Analysis of CNN-based Deep Learning Models for Landslide Detection

Title: Bayesian Active Learning for Semantic Segmentation

Title: Signal-SGN: A Spiking Graph Convolutional Network for Skeletal Action Recognition via Learning Temporal-Frequency Dynamics

Title: Downstream Transfer Attack: Adversarial Attacks on Downstream Models with Pre-trained Vision Transformers

Title: AVESFormer: Efficient Transformer Design for Real-Time Audio-Visual Segmentation

Title: A General Ambiguity Model for Binary Edge Images with Edge Tracing and its Implementation

Title: Intuitionistic Fuzzy Generalized Eigenvalue Proximal Support Vector Machine

Title: Joint Universal Adversarial Perturbations with Interpretations

Title: A Novel Evaluation Framework for Image2Text Generation

Title: Landmark-guided Diffusion Model for High-fidelity and Temporally Coherent Talking Head Generation

Title: LAM3D: Leveraging Attention for Monocular 3D Object Detection

Title: Summarization of Investment Reports Using Pre-trained Model

Title: Indexing and Visualization of Climate Change Narratives Using BERT and Causal Extraction

Title: Domain penalisation for improved Out-of-Distribution Generalisation

Title: Advancing Green AI: Efficient and Accurate Lightweight CNNs for Rice Leaf Disease Identification

Title: Joint Model Pruning and Resource Allocation for Wireless Time-triggered Federated Learning

Title: MultiFuser: Multimodal Fusion Transformer for Enhanced Driver Action Recognition

Title: Comparison of Embedded Spaces for Deep Learning Classification

Title: STDA: Spatio-Temporal Dual-Encoder Network Incorporating Driver Attention to Predict Driver Behaviors Under Safety-Critical Scenarios

Title: MathLearner: A Large Language Model Agent Framework for Learning to Solve Mathematical Problems

Title: Towards an ontology of state actors in cyberspace

Title: Optimizing Intrusion Detection System Performance Through Synergistic Hyperparameter Tuning and Advanced Data Processing

Title: MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Title: STBLLM: Breaking the 1-Bit Barrier with Structured Binary LLMs

Title: ALIF: Low-Cost Adversarial Audio Attacks on Black-Box Speech Platforms using Linguistic Features

Title: SkyDiffusion: Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm

Title: GLDiTalker: Speech-Driven 3D Facial Animation with Graph Latent Diffusion Transformer

Title: TS-SAM: Fine-Tuning Segment-Anything Model for Downstream Tasks

Title: Supervised Image Translation from Visible to Infrared Domain for Object Detection

Title: Efficient Solutions For An Intriguing Failure of LLMs: Long Context Window Does Not Mean LLMs Can Analyze Long Sequences Flawlessly

Title: MALADE: Orchestration of LLM-powered Agents with Retrieval Augmented Generation for Pharmacovigilance

Title: Re-Invoke: Tool Invocation Rewriting for Zero-Shot Tool Retrieval

Title: Cross-layer Attention Sharing for Large Language Models

Title: Remote Staking with Economic Safety

Title: CAF-YOLO: A Robust Framework for Multi-Scale Lesion Detection in Biomedical Imagery

Title: DiReCT: Diagnostic Reasoning for Clinical Notes via Large Language Models

Title: A Survey and Evaluation of Adversarial Attacks for Object Detection

Title: Defining and Evaluating Decision and Composite Risk in Language Models Applied to Natural Language Inference

Title: RobNODDI: Robust NODDI Parameter Estimation with Adaptive Sampling under Continuous Representation

Title: CACE-Net: Co-guidance Attention and Contrastive Enhancement for Effective Audio-Visual Event Localization

Title: Dataset Scale and Societal Consistency Mediate Facial Impression Bias in Vision-Language AI

Title: AnomalySD: Few-Shot Multi-Class Anomaly Detection with Stable Diffusion Model

Title: A Novel Metric for Measuring the Robustness of Large Language Models in Non-adversarial Scenarios

Title: Top K Enhanced Reinforcement Learning Attacks on Heterogeneous Graph Node Classification

Title: Optimal and efficient text counterfactuals using Graph Neural Networks

Title: Single-Point Supervised High-Resolution Dynamic Network for Infrared Small Target Detection

Title: Label Augmentation for Neural Networks Robustness

Title: AdvQDet: Detecting Query-Based Adversarial Attacks with Adversarial Contrastive Prompt Tuning

Title: DeMansia: Mamba Never Forgets Any Tokens

Title: Towards Automatic Hands-on-Keyboard Attack Detection Using LLMs in EDR Solutions

Title: Reinforcement Learning for an Efficient and Effective Malware Investigation during Cyber Incident Response

Title: AdaCBM: An Adaptive Concept Bottleneck Model for Explainable and Accurate Diagnosis

Title: LLaSA: Large Language and E-Commerce Shopping Assistant

Title: Personalized Federated Learning on Heterogeneous and Long-Tailed Data via Expert Collaborative Learning

Title: Scenario-based Thermal Management Parametrization Through Deep Reinforcement Learning

Title: A Smart City Infrastructure Ontology for Threats, Cybercrime, and Digital Forensic Investigation

Title: Faster Diffusion Action Segmentation

Title: Self-Introspective Decoding: Alleviating Hallucinations for Large Vision-Language Models

Title: Enhancing Human Action Recognition and Violence Detection Through Deep Learning Audiovisual Fusion

Title: Mini-Monkey: Alleviate the Sawtooth Effect by Multi-Scale Adaptive Cropping

Title: Robustness of Watermarking on Text-to-Image Diffusion Models

Title: Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation

Title: Deep Spectral Methods for Unsupervised Ultrasound Image Interpretation

Title: Fine-tuning multilingual language models in Twitter/X sentiment analysis: a study on Eastern-European V4 languages

Title: PanicleNeRF: low-cost, high-precision in-field phenotyping of rice panicles with smartphone

Title: Step Saver: Predicting Minimum Denoising Steps for Diffusion Model Image Generation

Title: MedSyn: LLM-based Synthetic Medical Text Generation Framework

Title: ParkingE2E: Camera-based End-to-end Parking Network, from Images to Planning

Title: PromptSAM+: Malware Detection based on Prompt Segment Anything Model

Title: Case-based reasoning approach for diagnostic screening of children with developmental delays

Title: FDiff-Fusion: Denoising diffusion fusion network based on fuzzy learning for 3D medical image segmentation

Title: LDFaceNet: Latent Diffusion-based Network for High-Fidelity Deepfake Generation

Title: Secure and Transparent Medical Record Management System Using Python and Blockchain

Title: Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models

Title: KAN-RCBEVDepth: A multi-modal fusion algorithm in object detection for autonomous driving

Title: View-consistent Object Removal in Radiance Fields

Title: Effective Demonstration Annotation for In-Context Learning via Language Model-Based Determinantal Point Process

Title: AvatarPose: Avatar-guided 3D Pose Estimation of Close Human Interaction from Sparse Multi-view Videos

Title: Assessing the XDC Network: A Comprehensive Evaluation of its qualitative and technical aspects

Title: FovEx: Human-inspired Explanations for Vision Transformers and Convolutional Neural Networks

Title: Table Transformers for Imputing Textual Attributes

Title: Model Hijacking Attack in Federated Learning

Title: VidModEx: Interpretable and Efficient Black Box Model Extraction for High-Dimensional Spaces

Title: Analyzing Cultural Representations of Emotions in LLMs through Mixed Emotion Survey

Title: ARVO: Atlas of Reproducible Vulnerabilities for Open Source Software

Title: Rethinking Affect Analysis: A Protocol for Ensuring Fairness and Consistency

Title: X.509 Information Security Certification Based on Post-Quantum Cryptography

Title: AssemAI: Interpretable Image-Based Anomaly Detection for Manufacturing Pipelines

Title: CodeACT: Code Adaptive Compute-efficient Tuning Framework for Code LLMs

Title: Evaluating the Performance of Large Language Models for SDG Mapping (Technical Report)

Title: Source-Free Domain-Invariant Performance Prediction

Title: ExoViP: Step-by-step Verification and Exploration with Exoskeleton Modules for Compositional Visual Reasoning

Title: Climate-Driven Doubling of Maize Loss Probability in U.S. Crop Insurance: Spatiotemporal Prediction and Possible Policy Responses

Title: SoK: Fighting Counterfeits with Cyber-Physical Synergy Based on Physically-Unclonable Identifiers of Paper Surface

Title: Cross-modulated Attention Transformer for RGBT Tracking

Title: Large Language Model Aided QoS Prediction for Service Recommendation

Title: ProCreate, Don't Reproduce! Propulsive Energy Diffusion for Creative Generation

Title: REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models

Title: A Multi-Source Heterogeneous Knowledge Injected Prompt Learning Method for Legal Charge Prediction

Title: Do Large Language Models Speak All Languages Equally? A Comparative Study in Low-Resource Settings

Title: BOTS-LM: Training Large Language Models for Setswana

Title: Curriculum learning based pre-training using Multi-Modal Contrastive Masked Autoencoders

Title: Contrastive Learning and Abstract Concepts: The Case of Natural Numbers

Title: ReDel: A Toolkit for LLM-Powered Recursive Multi-Agent Systems

Title: Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs

Title: VoxelTrack: Exploring Voxel Representation for 3D Point Cloud Object Tracking

Title: One-Shot Collaborative Data Distillation

Title: Geometric Algebra Meets Large Language Models: Instruction-Based Transformations of Separate Meshes in 3D, Interactive and Controllable Scenes

Title: DRFormer: Multi-Scale Transformer Utilizing Diverse Receptive Fields for Long Time-Series Forecasting

Title: Joint-Motion Mutual Learning for Pose Estimation in Videos

Title: Generalized Gaussian Temporal Difference Error For Uncertainty-aware Reinforcement Learning

Title: SNFinLLM: Systematic and Nuanced Financial Domain Adaptation of Chinese Large Language Models

Title: PROF: Protected Order Flow in a Profit-Seeking World

Title: Mixture-of-Noises Enhanced Forgery-Aware Predictor for Multi-Face Manipulation Detection and Localization

Title: On the Robustness of Malware Detectors to Adversarial Samples

Title: A Lean Transformer Model for Dynamic Malware Analysis and Detection

Title: XDC Network Assessment: Decentralization, Scalability and Security

Title: A Sharp Convergence Theory for The Probability Flow ODEs of Diffusion Models

Title: From Generalist to Specialist: Exploring CWE-Specific Vulnerability Detection

Title: Infusing Environmental Captions for Long-Form Video Language Grounding

Title: Machine Learning Applications in Medical Prognostics: A Comprehensive Review

Title: Earth System Data Cubes: Avenues for advancing Earth system research

Title: Dialogue Ontology Relation Extraction via Constrained Chain-of-Thought Decoding

Title: The NPU-ASLP System Description for Visual Speech Recognition in CNVSRC 2024

Title: A Few-Shot Approach for Relation Extraction Domain Adaptation using Large Language Models

Title: Cross Pseudo Supervision Framework for Sparsely Labelled Geo-spatial Images

Title: Strategic Federated Learning: Application to Smart Meter Data Clustering

Title: CMR-Agent: Learning a Cross-Modal Agent for Iterative Image-to-Point Cloud Registration

Title: Terracorder: Sense Long and Prosper

Title: Multi-weather Cross-view Geo-localization Using Denoising Diffusion Models

Title: Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats in Customized Large Language Models

Title: Attenuation-adjusted deep learning of pore defects in 2D radiographs of additive manufacturing powders

Title: Long Input Benchmark for Russian Analysis

Title: Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models

Title: Enhancing Heterogeneous Knowledge Graph Completion with a Novel GAT-based Approach

Title: Fairness and Bias Mitigation in Computer Vision: A Survey

Title: Exploring Conditional Multi-Modal Prompts for Zero-shot HOI Detection

Title: UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model

Title: Estimating Pore Location of PBF-LB/M Processes with Segmentation Models

Title: Practical Attacks against Black-box Code Completion Engines

Title: Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions

Title: RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation

Title: MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh Tokenization

Title: Evaluating and Enhancing LLMs Agent based on Theory of Mind in Guandan: A Multi-Player Cooperative Game under Imperfect Information

Title: Contrastive Learning-based Multi Modal Architecture for Emoticon Prediction by Employing Image-Text Pairs

Title: Operational range bounding of spectroscopy models with anomaly detection

Title: Leveraging the Power of LLMs: A Fine-Tuning Approach for High-Quality Aspect-Based Summarization

Title: Modelling Visual Semantics via Image Captioning to extract Enhanced Multi-Level Cross-Modal Semantic Incongruity Representation with Attention for Multimodal Sarcasm Detection

Title: Progressively Selective Label Enhancement for Language Model Alignment

Title: LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local Attention and Mamba

Title: Language Model Can Listen While Speaking

Title: SEAS: Self-Evolving Adversarial Safety Optimization for Large Language Models

Title: Interactive 3D Medical Image Segmentation with SAM 2

Title: Command-line Obfuscation Detection using Small Language Models

Title: Detection of Compromised Functions in a Serverless Cloud Environment

Title: Can Reinforcement Learning Unlock the Hidden Dangers in Aligned Large Language Models?

Title: On Using Quasirandom Sequences in Machine Learning for Model Weight Initialization

Title: Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining