2024-08-21

Title: A Comprehensive Survey on Diffusion Models and Their Applications

Title: A Survey on Symbolic Knowledge Distillation of Large Language Models

Title: VyAnG-Net: A Novel Multi-Modal Sarcasm Recognition Model by Uncovering Visual, Acoustic and Glossary Features

Title: NeRF-US: Removing Ultrasound Imaging Artifacts from Neural Radiance Fields in the Wild

Title: Contrastive Learning on Medical Intents for Sequential Prescription Recommendation

Title: Relational Graph Convolutional Networks Do Not Learn Sound Rules

Title: Kolmogorov Arnold Networks in Fraud Detection: Bridging the Gap Between Theory and Practice

Title: Diffusion Model for Planning: A Systematic Literature Review

Title: Towards Efficient Machine Learning Method for IoT DDoS Attack Detection

Title: OpenCity: Open Spatio-Temporal Foundation Models for Traffic Prediction

Title: SEAL: Systematic Error Analysis for Value ALignment

Title: FedKBP: Federated dose prediction framework for knowledge-based planning in radiation therapy

Title: FEDKIM: Adaptive Federated Knowledge Injection into Medical Foundation Models

Title: Increasing transformer token length with a Maximum Entropy Principle Method

Title: NoRA: Nested Low-Rank Adaptation for Efficient Fine-Tuning Large Models

Title: AdapMoE: Adaptive Sensitivity-based Expert Gating and Management for Efficient MoE Inference

Title: BatGPT-Chem: A Foundation Large Model For Retrosynthesis Prediction

Title: GPT-Augmented Reinforcement Learning with Intelligent Control for Vehicle Dispatching

Title: Leveraging Superfluous Information in Contrastive Representation Learning

Title: On the Identifiability of Sparse ICA without Assuming Non-Gaussianity

Title: Diversity and stylization of the contemporary user-generated visual arts in the complexity-entropy plane

Title: Beyond Relevant Documents: A Knowledge-Intensive Approach for Query-Focused Summarization using Large Language Models

Title: HaSPeR: An Image Repository for Hand Shadow Puppet Recognition

Title: Security Risks Due to Data Persistence in Cloud FPGA Platforms

Title: Value Alignment from Unstructured Text

Title: Evaluating Image-Based Face and Eye Tracking with Event Cameras

Title: Parallel Processing of Point Cloud Ground Segmentation for Mechanical and Solid-State LiDARs

Title: CLIP-DPO: Vision-Language Models as a Source of Preference for Fixing Hallucinations in LVLMs

Title: Understanding Generative AI Content with Embedding Models

Title: Private Means and the Curious Incident of the Free Lunch

Title: Goldfish: Monolingual Language Models for 350 Languages

Title: Federated Learning of Large ASR Models in the Real World

Title: The Brittleness of AI-Generated Image Watermarking Techniques: Examining Their Robustness Against Visual Paraphrasing Attacks

Title: Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation

Title: Differentially Private Stochastic Gradient Descent with Fixed-Size Minibatches: Tighter RDP Guarantees with or without Replacement

Title: Parkinson's Disease Classification via EEG: All You Need is a Single Convolutional Layer

Title: Learning Multimodal Latent Space with EBM Prior and MCMC Inference

Title: Tracing Privacy Leakage of Language Models to Training Data via Adjusted Influence Functions

Title: LSVOS Challenge 3rd Place Report: SAM2 and Cutie based VOS

Title: Enhancing One-shot Pruned Pre-trained Language Models through Sparse-Dense-Sparse Mechanism

Title: PRformer: Pyramidal Recurrent Transformer for Multivariate Time Series Forecasting

Title: MambaEVT: Event Stream based Visual Object Tracking using State Space Model

Title: Event Stream based Sign Language Translation: A High-Definition Benchmark Dataset and A New Algorithm

Title: QUITO-X: An Information Bottleneck-based Compression Algorithm with Cross-Attention

Title: Adaptive Knowledge Distillation for Classification of Hand Images using Explainable Vision Transformers

Title: Data Augmentation Integrating Dialogue Flow and Style to Adapt Spoken Dialogue Systems to Low-Resource User Groups

Title: Integrating Multi-Modal Input Token Mixer Into Mamba-Based Decision Models: Decision MetaMamba

Title: EdgeNAT: Transformer for Efficient Edge Detection

Title: FAGStyle: Feature Augmentation on Geodesic Surface for Zero-shot Text-guided Diffusion Image Style Transfer

Title: Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation

Title: The Instance-centric Transformer for the RVOS Track of LSVOS Challenge: 3rd Place Solution

Title: Diff-PCC: Diffusion-based Neural Compression for 3D Point Clouds

Title: Language Modeling on Tabular Data: A Survey of Foundations, Techniques and Evolution

Title: Target-Prompt Online Graph Collaborative Learning for Temporal QoS Prediction

Title: Speech Representation Learning Revisited: The Necessity of Separate Learnable Parameters and Robust Data Augmentation

Title: Prompt-Agnostic Adversarial Perturbation for Customized Diffusion Models

Title: Putting People in LLMs' Shoes: Generating Better Answers via Question Rewriter

Title: Multi-view Hand Reconstruction with a Point-Embedded Transformer

Title: An Efficient Sign Language Translation Using Spatial Configuration and Motion Dynamics with LLMs

Title: MV-MOS: Multi-View Feature Fusion for 3D Moving Object Segmentation

Title: MUSES: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration

Title: Promoting Equality in Large Language Models: Identifying and Mitigating the Implicit Bias based on Bayesian Theory

Title: PerturBench: Benchmarking Machine Learning Models for Cellular Perturbation Analysis

Title: Enhancing Robustness in Large Language Models: Prompting for Mitigating the Impact of Irrelevant Information

Title: Novel Change Detection Framework in Remote Sensing Imagery Using Diffusion Models and Structural Similarity Index (SSIM)

Title: TextMastero: Mastering High-Quality Scene Text Editing in Diverse Languages and Styles

Title: WRIM-Net: Wide-Ranging Information Mining Network for Visible-Infrared Person Re-Identification

Title: Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended?

Title: Finding the DeepDream for Time Series: Activation Maximization for Univariate Time Series

Title: LLM-Barber: Block-Aware Rebuilder for Sparsity Mask in One-Shot for Large Language Models

Title: Interactive Counterfactual Generation for Univariate Time Series

Title: Industry Perception of Security Challenges with Identity Access Management Solutions

Title: Beneath the Surface of Consistency: Exploring Cross-lingual Knowledge Representation Sharing in LLMs

Title: Privacy-preserving Universal Adversarial Defense for Black-box Models

Title: Smart Contract Coordinated Privacy Preserving Crowd-Sensing Campaigns

Title: Vocabulary-Free 3D Instance Segmentation with Vision and Language Assistant

Title: UIE-UnFold: Deep Unfolding Network with Color Priors and Vision Transformer for Underwater Image Enhancement

Title: ETGuard: Malicious Encrypted Traffic Detection in Blockchain-based Power Grid Systems

Title: REInstruct: Building Instruction Data from Unlabeled Corpus

Title: Federated Clustering: An Unsupervised Cluster-Wise Training for Decentralized Data Distributions

Title: Probing the Safety Response Boundary of Large Language Models via Unsafe Decoding Path Generation

Title: Tensor tree learns hidden relational structures in data to construct generative models

Title: A Noncontact Technique for Wave Measurement Based on Thermal Stereography and Deep Learning

Title: Neural Exploratory Landscape Analysis

Title: Iterative Window Mean Filter: Thwarting Diffusion-based Adversarial Purification

Title: Towards Robust Knowledge Unlearning: An Adversarial Framework for Assessing and Improving Unlearning Robustness in Large Language Models

Title: Unconditional Truthfulness: Learning Conditional Dependency for Uncertainty Quantification of Large Language Models

Title: MsMemoryGAN: A Multi-scale Memory GAN for Palm-vein Adversarial Purification

Title: AnyGraph: Graph Foundation Model in the Wild

Title: Ferret: Faster and Effective Automated Red Teaming with Reward-Based Scoring Technique

Title: Large Language Models for Multimodal Deformable Image Registration

Title: Variable Assignment Invariant Neural Networks for Learning Logic Programs

Title: Coarse-to-Fine Detection of Multiple Seams for Robotic Welding

Title: MEGen: Generative Backdoor in Large Language Models via Model Editing

Title: Crafting Tomorrow's Headlines: Neural News Generation and Detection in English, Turkish, Hungarian, and Persian

Title: Towards Efficient Large Language Models for Scientific Text: A Review

Title: PhishAgent: A Robust Multimodal Agent for Phishing Webpage Detection

Title: Security Assessment of Hierarchical Federated Deep Learning

Title: Generating Synthetic Fair Syntax-agnostic Data by Learning and Distilling Fair Representation

Title: Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model

Title: An Open Source Python Library for Anonymizing Sensitive Data

Title: Detection of Intracranial Hemorrhage for Trauma Patients

Title: Generative AI in Industrial Machine Vision -- A Review

Title: LightMDETR: A Lightweight Approach for Low-Cost Open-Vocabulary Object Detection Training

Title: Tapping in a Remote Vehicle's onboard LLM to Complement the Ego Vehicle's Field-of-View

Title: Adversarial Attack for Explanation Robustness of Rationalization Models

Title: Honeyquest: Rapidly Measuring the Enticingness of Cyber Deception Techniques with Code-based Questionnaires

Title: MPL: Lifting 3D Human Pose from Multi-view 2D Poses

Title: ColBERT Retrieval and Ensemble Response Scoring for Language Model Question Answering

Title: Beyond English-Centric LLMs: What Language Do Multilingual Language Models Think in?

Title: Learning Randomized Algorithms with Transformers

Title: Exploiting Large Language Models Capabilities for Question Answer-Driven Knowledge Graph Completion Across Static and Temporal Domains

Title: Navigating Spatio-Temporal Heterogeneity: A Graph Transformer Approach for Traffic Forecasting

Title: Trustworthy Compression? Impact of AI-based Codecs on Biometrics for Law Enforcement

Title: Benchmarking Large Language Models for Math Reasoning Tasks

Title: Detecting Wildfires on UAVs with Real-time Segmentation Trained by Larger Teacher Models

Title: CoVLA: Comprehensive Vision-Language-Action Dataset for Autonomous Driving

Title: Harmonizing Attention: Training-free Texture-aware Geometry Transfer

Title: Perception-guided Jailbreak against Text-to-Image Models

Title: Knowledge Sharing and Transfer via Centralized Reward Agent for Multi-Task Reinforcement Learning

Title: Feature Selection from Differentially Private Correlations

Title: Open 3D World in Autonomous Driving

Title: Low-Quality Image Detection by Hierarchical VAE

Title: ViLReF: A Chinese Vision-Language Retinal Foundation Model

Title: A Grey-box Attack against Latent Diffusion Model-based Image Editing by Posterior Collapse

Title: Soda-Eval: Open-Domain Dialogue Evaluation in the age of LLMs

Title: BEYOND DIALOGUE: A Profile-Dialogue Alignment Framework Towards General Role-Playing Language Model

Title: ShapeSplat: A Large-scale Dataset of Gaussian Splats and Their Self-Supervised Pretraining

Title: To Code, or Not To Code? Exploring Impact of Code in Pre-training

Title: CrossFi: A Cross Domain Wi-Fi Sensing Framework Based on Siamese Network

Title: Recurrent Neural Networks Learn to Store and Generate Sequences using Non-Linear Representations

Title: LBC: Language-Based-Classifier for Out-Of-Variable Generalization

Title: Large Point-to-Gaussian Model for Image-to-3D Generation

Title: Robust Regression with Ensembles Communicating over Noisy Channels

Title: SysBench: Can Large Language Models Follow System Messages?

Title: HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language Models in Resource-Constrained Environments

Title: GAIM: Attacking Graph Neural Networks via Adversarial Influence Maximization

Title: KeySpace: Public Key Infrastructure Considerations in Interplanetary Networks

Title: Facial Demorphing via Identity Preserving Image Decomposition

Title: CTP-LLM: Clinical Trial Phase Transition Prediction Using Large Language Models

Title: SenPa-MAE: Sensor Parameter Aware Masked Autoencoder for Multi-Satellite Self-Supervised Pretraining

Title: MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning

Title: While GitHub Copilot Excels at Coding, Does It Ensure Responsible Output?

Title: Athena: Safe Autonomous Agents with Verbal Contrastive Learning

Title: Scaling Law with Learning Rate Annealing

Title: Atmospheric Transport Modeling of CO$_2$ with Neural Networks

Title: Inside the Black Box: Detecting Data Leakage in Pre-trained Language Encoders

Title: MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding

Title: FLAME: Learning to Navigate with Multimodal LLM in Urban Environments

Title: NeCo: Improving DINOv2's spatial representations in 19 GPU hours with Patch Neighbor Consistency