2024-07-09

Title: QMViT: A Mushroom is worth 16x16 Words

Title: MetaFruit Meets Foundation Models: Leveraging a Comprehensive Multi-Fruit Dataset for Advancing Agricultural Foundation Models

Title: AgriLLM: Harnessing Transformers for Farmer Queries

Title: PhishNet: A Phishing Website Detection Tool using XGBoost

Title: A Unified Learn-to-Distort-Data Framework for Privacy-Utility Trade-off in Trustworthy Federated Learning

Title: SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking

Title: Secure Rewind and Discard on ARM Morello

Title: SPINEX: Similarity-based Predictions with Explainable Neighbors Exploration for Anomaly and Outlier Detection

Title: Re-Tuning: Overcoming the Compositionality Limits of Large Language Models with Recursive Tuning

Title: On Evaluating The Performance of Watermarked Machine-Generated Texts Under Adversarial Attacks

Title: Segmentation-Free Guidance for Text-to-Image Diffusion Models

Title: The Impact of Quantization and Pruning on Deep Reinforcement Learning Models

Title: Fair Submodular Cover

Title: NSD-DIL: Null-Shot Deblurring Using Deep Identity Learning

Title: 3D Adaptive Structural Convolution Network for Domain-Invariant Point Cloud Recognition

Title: K-Nearest Neighbor Classification over Semantically Secure Encrypted Relational Data

Title: Associative Recurrent Memory Transformer

Title: MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?

Title: Amazing Things Come From Having Many Good Models

Title: Statistical investigations into the geometry and homology of random programs

Title: Towards Enhancing Coherence in Extractive Summarization: Dataset and Experiments with LLMs

Title: Late Breaking Results: Fortifying Neural Networks: Safeguarding Against Adversarial Attacks with Stochastic Computing

Title: Explainable Metric Learning for Deflating Data Bias

Title: KESIC: Kerberos Extensions for Smart, IoT and CPS Devices

Title: Differentially Private Convex Approximation of Two-Layer ReLU Networks

Title: Automating Venture Capital: Founder assessment using LLM-powered segmentation, feature engineering and automated labeling techniques

Title: MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension

Title: Privacy or Transparency? Negotiated Smartphone Access as a Signifier of Trust in Romantic Relationships

Title: SID: Stereo Image Dataset for Autonomous Driving in Adverse Conditions

Title: qlty: handling large tensors in scientific imaging

Title: CLIPVQA:Video Quality Assessment via CLIP

Title: SAM-Med3D-MoE: Towards a Non-Forgetting Segment Anything Model via Mixture of Experts for 3D Medical Image Segmentation

Title: Quantizing YOLOv7: A Comprehensive Study

Title: FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior

Title: Beyond the Federation: Topology-aware Federated Learning for Generalization to Unseen Clients

Title: Granular Privacy Control for Geolocation with Vision Language Models

Title: Asynchronous Multimodal Video Sequence Fusion via Learning Modality-Exclusive and -Agnostic Representations

Title: Entropy-Informed Weighting Channel Normalizing Flow

Title: Beyond Perplexity: Multi-dimensional Safety Evaluation of LLM Compression

Title: EVA-Score: Evaluation of Long-form Summarization on Informativeness through Extraction and Validation

Title: TRACE: TRansformer-based Attribution using Contrastive Embeddings in LLMs

Title: The Solution for the AIGC Inference Performance Optimization Competition

Title: The Solution for Language-Enhanced Image New Category Discovery

Title: The Solution for the sequential task continual learning track of the 2nd Greater Bay Area International Algorithm Competition

Title: Rethinking the Effectiveness of Graph Classification Datasets in Benchmarks for Assessing GNNs

Title: Personalized Federated Domain-Incremental Learning based on Adaptive Knowledge Matching

Title: Recent Advancements and Challenges of Turkic Central Asian Language Processing

Title: BlessemFlood21: Advancing Flood Analysis with a High-Resolution Georeferenced Dataset for Humanitarian Aid Support

Title: PRANCE: Joint Token-Optimization and Structural Channel-Pruning for Adaptive ViT Inference

Title: Progress or Regress? Self-Improvement Reversal in Post-training

Title: How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions

Title: GCON: Differentially Private Graph Convolutional Network via Objective Perturbation

Title: Enhance the Robustness of Text-Centric Multimodal Alignments

Title: Robust Skin Color Driven Privacy Preserving Face Recognition via Function Secret Sharing

Title: BrainMetDetect: Predicting Primary Tumor from Brain Metastasis MRI Data Using Radiomic Features and Machine Learning Algorithms

Title: A Study of Test-time Contrastive Concepts for Open-world, Open-vocabulary Semantic Segmentation

Title: Reverse Engineered MiniFS File System

Title: FedTSA: A Cluster-based Two-Stage Aggregation Method for Model-heterogeneous Federated Learning

Title: DailyDVS-200: A Comprehensive Benchmark Dataset for Event-Based Action Recognition

Title: Releasing Malevolence from Benevolence: The Menace of Benign Data on Machine Unlearning

Title: SHINE: Saliency-aware HIerarchical NEgative Ranking for Compositional Temporal Grounding

Title: SCSA: Exploring the Synergistic Effects Between Spatial and Channel Attention

Title: Impact of Network Topology on Byzantine Resilience in Decentralized Federated Learning

Title: DehazeDCT: Towards Effective Non-Homogeneous Dehazing via Deformable Convolutional Transformer

Title: Synthetic Data Aided Federated Learning Using Foundation Models

Title: R-Trans -- A Recurrent Transformer Model for Clinical Feedback in Surgical Skill Assessment

Title: A Novel Bifurcation Method for Observation Perturbation Attacks on Reinforcement Learning Agents: Load Altering Attacks on a Cyber Physical Power System

Title: CBM: Curriculum by Masking

Title: LLMCloudHunter: Harnessing LLMs for Automated Extraction of Detection Rules from Cloud-Based CTI

Title: Helios: An extremely low power event-based gesture recognition for always-on smart eyewear

Title: VisioBlend: Sketch and Stroke-Guided Denoising Diffusion Probabilistic Model for Realistic Image Generation

Title: BadCLM: Backdoor Attack in Clinical Language Models for Electronic Health Records

Title: Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course

Title: Effect of Rotation Angle in Self-Supervised Pre-training is Dataset-Dependent

Title: Flood of Techniques and Drought of Theories: Emotion Mining in Disasters

Title: Tracking Reflected Objects: A Benchmark

Title: Privacy of the last iterate in cyclically-sampled DP-SGD on nonconvex composite losses

Title: P2P: Part-to-Part Motion Cues Guide a Strong Tracking Framework for LiDAR Point Clouds

Title: Deep Probability Aggregation Clustering

Title: Self-Paced Sample Selection for Barely-Supervised Medical Image Segmentation

Title: CLIMB: A Benchmark of Clinical Bias in Large Language Models

Title: CLAMP-ViT: Contrastive Data-Free Learning for Adaptive Post-Training Quantization of ViTs

Title: DTR: A Unified Deep Tensor Representation Framework for Multimedia Data Recovery

Title: Federated Knowledge Transfer Fine-tuning Large Server Model with Resource-Constrained IoT Clients

Title: Beyond Binary Gender Labels: Revealing Gender Biases in LLMs through Gender-Neutral Name Predictions

Title: HyperKAN: Kolmogorov-Arnold Networks make Hyperspectral Image Classificators Smarter

Title: UltraEdit: Instruction-based Fine-Grained Image Editing at Scale

Title: SCIPaD: Incorporating Spatial Clues into Unsupervised Pose-Depth Joint Learning

Title: Gradient Diffusion: A Perturbation-Resilient Gradient Leakage Attack

Title: Model-agnostic meta-learners for estimating heterogeneous treatment effects over time

Title: Lack of Systematic Approach to Security of IoT Context Sharing Platforms

Title: Mamba Hawkes Process

Title: An Improved Method for Personalizing Diffusion Models

Title: Leveraging Topological Guidance for Improved Knowledge Distillation

Title: Vulnerability-Hunter: An Adaptive Feature Perception Attention Network for Smart Contract Vulnerabilities

Title: Rethinking Targeted Adversarial Attacks For Neural Machine Translation

Title: Can Model Uncertainty Function as a Proxy for Multiple-Choice Question Item Difficulty?

Title: Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model

Title: VideoCoT: A Video Chain-of-Thought Dataset with Active Annotation Tool

Title: CPM: Class-conditional Prompting Machine for Audio-visual Segmentation

Title: Multi-branch Collaborative Learning Network for 3D Visual Grounding

Title: PTaRL: Prototype-based Tabular Representation Learning via Space Calibration

Title: Multimodal Prompt Learning with Missing Modalities for Sentiment Analysis and Emotion Recognition

Title: Online Drift Detection with Maximum Concept Discrepancy

Title: Learning Motion Blur Robust Vision Transformers with Dynamic Early Exit for Real-Time UAV Tracking

Title: Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis

Title: Image-Conditional Diffusion Transformer for Underwater Image Enhancement

Title: Evolutionary Trigger Detection and Lightweight Model Repair Based Backdoor Defense

Title: FM-OSD: Foundation Model-Enabled One-Shot Detection of Anatomical Landmarks

Title: DIVESPOT: Depth Integrated Volume Estimation of Pile of Things Based on Point Cloud

Title: Cross Prompting Consistency with Segment Anything Model for Semi-supervised Medical Image Segmentation

Title: EMBANet: A Flexible Efffcient Multi-branch Attention Network

Title: Multimodal Language Models for Domain-Specific Procedural Video Summarization

Title: LTLBench: Towards Benchmarks for Evaluating Temporal Logic Reasoning in Large Language Models

Title: Self-supervised Learning via Cluster Distance Prediction for Operating Room Context Awareness

Title: SmurfCat at PAN 2024 TextDetox: Alignment of Multilingual Transformers for Text Detoxification

Title: Semantic Segmentation for Real-World and Synthetic Vehicle's Forward-Facing Camera Images

Title: Training Task Experts through Retrieval Based Distillation

Title: Biomedical Nested NER with Large Language Model and UMLS Heuristics

Title: Just read twice: closing the recall gap for recurrent language models

Title: How Effective are State Space Models for Machine Translation?

Title: Faux Polyglot: A Study on Information Disparity in Multilingual Large Language Models

Title: Addressing single object tracking in satellite imagery through prompt-engineered solutions

Title: Rethinking Image Skip Connections in StyleGAN2

Title: An accurate detection is not all you need to combat label noise in web-noisy datasets

Title: LaSe-E2V: Towards Language-guided Semantic-Aware Event-to-Video Reconstruction

Title: Read, Watch and Scream! Sound Generation from Text and Video

Title: Ada-adapter:Fast Few-shot Style Personlization of Diffusion Model with Pre-trained Image Encoder

Title: PANS: Probabilistic Airway Navigation System for Real-time Robust Bronchoscope Localization

Title: LLMBox: A Comprehensive Library for Large Language Models

Title: GMC: A General Framework of Multi-stage Context Learning and Utilization for Visual Detection Tasks

Title: Spatio-Temporal Encoding and Decoding-Based Method for Future Human Activity Skeleton Synthesis

Title: ORMNet: Object-centric Relationship Modeling for Egocentric Hand-object Segmentation

Title: $\mathrm{E^{2}CFD}$: Towards Effective and Efficient Cost Function Design for Safe Reinforcement Learning via Large Language Model

Title: On the Power of Convolution Augmented Transformer

Title: An Experimental Comparison of Transfer Learning against Self-supervised Learning

Title: SLIM: Spuriousness Mitigation with Minimal Human Annotations

Title: GeoNLF: Geometry guided Pose-Free Neural LiDAR Fields

Title: Generative Debunking of Climate Misinformation

Title: GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing

Title: WSI-VQA: Interpreting Whole Slide Images by Generative Visual Question Answering

Title: Open-world Multi-label Text Classification with Extremely Weak Supervision

Title: AdaPI: Facilitating DNN Model Adaptivity for Efficient Private Inference in Edge Computing

Title: Deep Learning-based Anomaly Detection and Log Analysis for Computer Networks

Title: OneDiff: A Generalist Model for Image Difference

Title: Graph Attention with Random Rewiring

Title: The Dynamic Net Architecture: Learning Robust and Holistic Visual Representations Through Self-Organizing Networks

Title: MSTF: Multiscale Transformer for Incomplete Trajectory Prediction

Title: BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent Space

Title: Fine-Grained Multi-View Hand Reconstruction Using Inverse Rendering

Title: Retrieved In-Context Principles from Previous Mistakes

Title: Pruning Large Language Models to Intra-module Low-rank Architecture with Transitional Activations

Title: Sub-SA: Strengthen In-context Learning via Submodular Selective Annotation

Title: InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct

Title: LGRNet: Local-Global Reciprocal Network for Uterine Fibroid Segmentation in Ultrasound Videos

Title: Short-term Object Interaction Anticipation with Disentangled Object Detection @ Ego4D Short Term Object Interaction Anticipation Challenge

Title: PsycoLLM: Enhancing LLM for Psychological Understanding and Evaluation

Title: FairPFN: Transformers Can do Counterfactual Fairness

Title: Is GPT-4 Alone Sufficient for Automated Essay Scoring?: A Comparative Judgment Approach Based on Rater Cognition

Title: Empirical Study of Symmetrical Reasoning in Conversational Chatbots

Title: Do Multilingual Large Language Models Mitigate Stereotype Bias?

Title: Large Language Models Understand Layouts

Title: Enlarging Feature Support Overlap for Domain Generalization

Title: Multi-agent Reinforcement Learning-based Network Intrusion Detection System

Title: When is the consistent prediction likely to be a correct prediction?

Title: Large Language Models for Judicial Entity Extraction: A Comparative Study

Title: FedMRL: Data Heterogeneity Aware Federated Multi-agent Deep Reinforcement Learning for Medical Imaging

Title: MapsTP: HD Map Images Based Multimodal Trajectory Prediction for Automated Vehicles

Title: Cross-domain Few-shot In-context Learning for Enhancing Traffic Sign Recognition

Title: 3D Vessel Graph Generation Using Denoising Diffusion

Title: Evaluating the Fairness of Neural Collapse in Medical Image Classification

Title: Anatomy-guided Pathology Segmentation

Title: Wavelet Convolutions for Large Receptive Fields

Title: Bringing Masked Autoencoders Explicit Contrastive Properties for Point Cloud Self-Supervised Learning

Title: KG-FPQ: Evaluating Factuality Hallucination in LLMs with Knowledge Graph-based False Premise Questions

Title: Scaling Exponents Across Parameterizations and Optimizers

Title: Minutes to Seconds: Speeded-up DDPM-based Image Inpainting with Coarse-to-Fine Sampling

Title: HiT-SR: Hierarchical Transformer for Efficient Image Super-Resolution

Title: Generation and De-Identification of Indian Clinical Discharge Summaries using LLMs

Title: Submodular video object proposal selection for semantic object segmentation

Title: Non-parametric Contextual Relationship Learning for Semantic Video Object Segmentation

Title: Fostering Trust and Quantifying Value of AI and ML

Title: Towards Optimizing and Evaluating a Retrieval Augmented QA Chatbot using LLMs with Human in the Loop

Title: Reducing Vision Transformer Latency on Edge Devices via GPU Tail Effect and Training-free Token Pruning

Title: What Do We Know About the Psychology of Insider Threats?

Title: Redactable Blockchain Solutions for IoT: A Review of Mechanisms and Applications

Title: Causality-driven Sequence Segmentation for Enhancing Multiphase Industrial Process Data Analysis and Soft Sensing

Title: T2VSafetyBench: Evaluating the Safety of Text-to-Video Generative Models

Title: On Bellman equations for continuous-time policy evaluation I: discretization and approximation

Title: STMR: Spiral Transformer for Hand Mesh Reconstruction

Title: Deform-Mamba Network for MRI Super-Resolution

Title: Active Label Refinement for Robust Training of Imbalanced Medical Image Classification Tasks in the Presence of High Label Noise

Title: LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages

Title: Towards A Comprehensive Visual Saliency Explanation Framework for AI-based Face Recognition Systems

Title: Self-Prior Guided Mamba-UNet Networks for Medical Image Super-Resolution

Title: Pseudo-triplet Guided Few-shot Composed Image Retrieval

Title: Perceptions to Beliefs: Exploring Precursory Inferences for Theory of Mind in Large Language Models

Title: Advancing Automated Deception Detection: A Multimodal Approach to Feature Extraction and Analysis

Title: Evaluating Predictive Models in Cybersecurity: A Comparative Analysis of Machine and Deep Learning Techniques for Threat Detection

Title: RHRSegNet: Relighting High-Resolution Night-Time Semantic Segmentation

Title: Leveraging Transformers for Weakly Supervised Object Localization in Unconstrained Videos

Title: Distilling System 2 into System 1

Title: PAS: Data-Efficient Plug-and-Play Prompt Augmentation System

Title: Enabling Performant and Secure EDA as a Service in Public Clouds Using Confidential Containers

Title: MST5 -- Multilingual Question Answering over Knowledge Graphs

Title: Test-time adaptation for geospatial point cloud semantic segmentation with distinct domain shifts

Title: From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty

Title: Layered Diffusion Model for One-Shot High Resolution Text-to-Image Synthesis

Title: Semantic Communication Networks Empowered Artificial Intelligence of Things

Title: A Survey of Controllable Learning: Methods and Applications in Information Retrieval

Title: 3D Vision and Language Pretraining with Large-Scale Synthetic Data

Title: LLMcap: Large Language Model for Unsupervised PCAP Failure Detection

Title: Analytic Convolutional Layer: A Step to Analytic Neural Network

Title: Merge, Ensemble, and Cooperate! A Survey on Collaborative Strategies in the Era of Large Language Models

Title: Accelerating Diffusion for SAR-to-Optical Image Translation via Adversarial Consistency Distillation

Title: Epistemological Bias As a Means for the Automated Detection of Injustices in Text

Title: Physics-Informed Machine Learning Towards A Real-Time Spacecraft Thermal Simulator

Title: PerlDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models

Title: FGA: Fourier-Guided Attention Network for Crowd Count Estimation

Title: Structured Generations: Using Hierarchical Clusters to guide Diffusion Models

Title: Better Sampling, towards Better End-to-end Small Object Detection

Title: Towards SAR Automatic Target Recognition MultiCategory SAR Image Classification Based on Light Weight Vision Transformer

Title: ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation

Title: Mamba-FSCIL: Dynamic Adaptation with Selective State Space Model for Few-Shot Class-Incremental Learning

Title: OMuSense-23: A Multimodal Dataset for Contactless Breathing Pattern Recognition and Biometric Analysis

Title: CHAMP: Conformalized 3D Human Multi-Hypothesis Pose Estimators

Title: Using Grammar Masking to Ensure Syntactic Validity in LLM-based Modeling Tasks

Title: Temporal Grounding of Activities using Multimodal Large Language Models

Title: A Semantic-Aware and Multi-Guided Network for Infrared-Visible Image Fusion

Title: RNNs, CNNs and Transformers in Human Action Recognition: A Survey and A Hybrid Model

Title: The Tug-of-War Between Deepfake Generation and Detection

Title: Contour-weighted loss for class-imbalanced image segmentation

Title: Transfer Learning with Self-Supervised Vision Transformers for Snake Identification

Title: Compositional Video Generation as Flow Equalization

Title: JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation

Title: CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation

Title: Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision

Title: Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images