2024-07-23

Title: A Foundation Model for Soccer

Title: Automated and Holistic Co-design of Neural Networks and ASICs for Enabling In-Pixel Intelligence

Title: Learning Visual Grounding from Generative Vision and Language Model

Title: SQLfuse: Enhancing Text-to-SQL Performance through Comprehensive LLM Synergy

Title: Trading Devil Final: Backdoor attack via Stock market and Bayesian Optimization

Title: Adversarial Databases Improve Success in Retrieval-based Large Language Models

Title: Evaluating language models as risk scores

Title: The Research of Group Re-identification from Multiple Cameras

Title: BOND: Aligning LLMs with Best-of-N Distillation

Title: CVE-LLM : Automatic vulnerability evaluation in medical device industry using large language models

Title: Human-Interpretable Adversarial Prompt Attack on Large Language Models with Situational Context

Title: The Collection of a Human Robot Collaboration Dataset for Cooperative Assembly in Glovebox Environments

Title: OASIS: Conditional Distribution Shaping for Offline Safe Reinforcement Learning

Title: LORTSAR: Low-Rank Transformer for Skeleton-based Action Recognition

Title: Is $F_1$ Score Suboptimal for Cybersecurity Models? Introducing $C_{score}$, a Cost-Aware Alternative for Model Assessment

Title: DefTesPY: Cyber defense model with enhanced data modeling and analysis for Tesla company via Python Language

Title: Compact Language Models via Pruning and Knowledge Distillation

Title: Data Poisoning: An Overlooked Threat to Power Grid Resilience

Title: A Comprehensive Guide to Combining R and Python code for Data Science, Machine Learning and Reinforcement Learning

Title: $\infty$-Brush: Controllable Large Image Synthesis with Diffusion Models in Infinite Dimensions

Title: Universally Harmonizing Differential Privacy Mechanisms for Federated Learning: Boosting Accuracy and Convergence

Title: Differential Privacy of Cross-Attention with Provable Guarantee

Title: Downstream-Pretext Domain Knowledge Traceback for Active Learning

Title: CrowdMAC: Masked Crowd Density Completion for Robust Crowd Density Forecasting

Title: FedDM: Enhancing Communication Efficiency and Handling Data Heterogeneity in Federated Diffusion Models

Title: Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL

Title: Flatness-aware Sequential Learning Generates Resilient Backdoors

Title: Difflare: Removing Image Lens Flare with Latent Diffusion Model

Title: Enhancing Skin Disease Classification Leveraging Transformer-based Deep Learning Architectures and Explainable AI

Title: Data Augmentation in Graph Neural Networks: The Role of Generated Synthetic Graphs

Title: Implementing Fairness: the view from a FairDream

Title: Subgraph Clustering and Atom Learning for Improved Image Classification

Title: On the Design and Analysis of LLM-Based Algorithms

Title: FedPartWhole: Federated domain generalization via consistent part-whole hierarchies

Title: PASSION: Towards Effective Incomplete Multi-Modal Medical Image Segmentation with Imbalanced Missing Rates

Title: FairViT: Fair Vision Transformer via Adaptive Masking

Title: WiFaKey: Generating Cryptographic Keys from Face in the Wild

Title: Decoupled Prompt-Adapter Tuning for Continual Activity Recognition

Title: GaitMA: Pose-guided Multi-modal Feature Fusion for Gait Recognition

Title: FMamba: Mamba based on Fast-attention for Multivariate Time-series Forecasting

Title: Blind Image Deconvolution by Generative-based Kernel Prior and Initializer via Latent Encoding

Title: CrossDehaze: Scaling Up Image Dehazing with Cross-Data Vision Alignment and Augmentation

Title: Can VLMs be used on videos for action recognition? LLMs are Visual Reasoning Coordinators

Title: Retrieval Augmented Generation Integrated Large Language Models in Smart Contract Vulnerability Detection

Title: Text-based Talking Video Editing with Cascaded Conditional Diffusion

Title: Understanding the Relationship between Prompts and Response Uncertainty in Large Language Models

Title: CBCTLiTS: A Synthetic, Paired CBCT/CT Dataset For Segmentation And Style Transfer

Title: Enhancing High-Energy Particle Physics Collision Analysis through Graph Data Attribution Techniques

Title: An Explainable Fast Deep Neural Network for Emotion Recognition

Title: Dual High-Order Total Variation Model for Underwater Image Restoration

Title: Adapt2Reward: Adapting Video-Language Models to Generalizable Robotic Rewards via Failure Prompts

Title: Seal: Advancing Speech Language Models to be Few-Shot Learners

Title: Thompson Sampling Itself is Differentially Private

Title: Reduced Effectiveness of Kolmogorov-Arnold Networks on Functions with Noise

Title: Latent Pollution Model: The Hidden Carbon Footprint in 3D Image Synthesis

Title: AGLLDiff: Guiding Diffusion Models Towards Unsupervised Training-free Real-world Low-light Image Enhancement

Title: Self-supervised transformer-based pre-training method with General Plant Infection dataset

Title: PolyR-CNN: R-CNN for end-to-end polygonal building outline extraction

Title: RoIPoly: Vectorized Building Outline Extraction Using Vertex and Logit Embeddings

Title: RayFormer: Improving Query-Based Multi-Camera 3D Object Detection via Ray-Centric Strategies

Title: POGEMA: A Benchmark Platform for Cooperative Multi-Agent Navigation

Title: Operationalizing a Threat Model for Red-Teaming Large Language Models (LLMs)

Title: From Ad Identifiers to Global Privacy Control: The Status Quo and Future of Opting Out of Ad Tracking on Android

Title: Automatic Generation of Fashion Images using Prompting in Generative Machine Learning Models

Title: Addressing Data Heterogeneity in Federated Learning of Cox Proportional Hazards Models

Title: Recent Advances in Generative AI and Large Language Models: Current Status, Challenges, and Perspectives

Title: Base and Exponent Prediction in Mathematical Expressions using Multi-Output CNN

Title: Technical report: Improving the properties of molecules generated by LIMO

Title: Sim-CLIP: Unsupervised Siamese Adversarial Fine-Tuning for Robust and Semantically-Rich Vision-Language Models

Title: ARoFace: Alignment Robustness to Improve Low-Quality Face Recognition

Title: Out of spuriousity: Improving robustness to spurious correlations without group annotations

Title: RGB2Point: 3D Point Cloud Generation from Single RGB Images

Title: GreenStableYolo: Optimizing Inference Time and Image Quality of Text-to-Image Generation

Title: Generalization v.s. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data

Title: All Against Some: Efficient Integration of Large Language Models for Message Passing in Graph Neural Networks

Title: Requiem for a drone: a machine-learning based framework for stealthy attacks against unmanned autonomous vehicles

Title: Knowledge Mechanisms in Large Language Models: A Survey and Perspective

Title: Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions

Title: Enhancing Incremental Summarization with Structured Representations

Title: ViT LoS V2X: Vision Transformers for Environment-aware LoS Blockage Prediction for 6G Vehicular Networks

Title: MedSAGa: Few-shot Memory Efficient Medical Image Segmentation using Gradient Low-Rank Projection in SAM

Title: Arondight: Red Teaming Large Vision Language Models with Auto-generated Multi-modal Jailbreak Prompts

Title: Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval

Title: AGORA: Open More and Trust Less in Binary Verification Service

Title: Learn to Preserve and Diversify: Parameter-Efficient Group with Orthogonal Regularization for Domain Generalization

Title: Navigation Instruction Generation with BEV Perception and Large Language Models

Title: SeqMIA: Sequential-Metric Based Membership Inference Attack

Title: A General Framework for Data-Use Auditing of ML Models

Title: D$^4$-VTON: Dynamic Semantics Disentangling for Differential Diffusion based Virtual Try-On

Title: DOPRA: Decoding Over-accumulation Penalization and Re-allocation in Specific Weighting Layer

Title: Proximal Policy Distillation

Title: A multi-level multi-label text classification dataset of 19th century Ottoman and Russian literary and critical texts

Title: D$^4$M: Dataset Distillation via Disentangled Diffusion Model

Title: Rethinking Feature Backbone Fine-tuning for Remote Sensing Object Detection

Title: SNNGX: Securing Spiking Neural Networks with Genetic XOR Encryption on RRAM-based Neuromorphic Accelerator

Title: Anchored Diffusion for Video Face Reenactment

Title: Fine-grained Gender Control in Machine Translation with Large Language Models

Title: Distilling Vision-Language Foundation Models: A Data-Free Approach via Prompt Diversification

Title: HERGen: Elevating Radiology Report Generation with Longitudinal Data

Title: When Can Transformers Count to n?

Title: Adversarial Circuit Evaluation

Title: Semi-Supervised Pipe Video Temporal Defect Interval Localization

Title: Assessing Sample Quality via the Latent Space of Generative Models

Title: TADA: Temporal Adversarial Data Augmentation for Time Series Data

Title: Farewell to Length Extrapolation, a Training-Free Infinite Context with Finite Attention Scope

Title: A Survey on Employing Large Language Models for Text-to-SQL Tasks

Title: HoloDreamer: Holistic 3D Panoramic World Generation from Text Descriptions

Title: HyperbolicLR: Epoch insensitive learning rate scheduler

Title: When Do Universal Image Jailbreaks Transfer Between Vision-Language Models?

Title: Efficient Visual Transformer by Learnable Token Merging

Title: PUFFLE: Balancing Privacy, Utility, and Fairness in Federated Learning

Title: The Hitchhiker's Guide to Human Alignment with *PO

Title: CGB-DM: Content and Graphic Balance Layout Generation with Transformer-based Diffusion Model

Title: TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data

Title: Variational Potential Flow: A Novel Probabilistic Framework for Energy-Based Generative Modelling

Title: BIGbench: A Unified Benchmark for Social Bias in Text-to-Image Generative Models Based on Multi-modal LLM

Title: XAI meets LLMs: A Survey of the Relation between Explainable AI and Large Language Models

Title: An Adaptive System for Wearable Devices to Detect Stress Using Physiological Signals

Title: Weakly SSM : On the Viability of Weakly Supervised Segmentations for Statistical Shape Modeling

Title: A Learning-Based Attack Framework to Break SOTA Poisoning Defenses in Federated Learning

Title: MIBench: Evaluating Multimodal Large Language Models over Multiple Images

Title: Minimizing the Number of Roles in Bottom-Up Role-Mining using Maximal Biclique Enumeration

Title: SynCPKL: Harnessing LLMs to Generate Synthetic Data for Commonsense Persona Knowledge Linking

Title: Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation

Title: Enhancing Hardware Fault Tolerance in Machines with Reinforcement Learning Policy Gradient Algorithms

Title: Intrinsic Self-correction for Enhanced Morality: An Analysis of Internal Mechanisms and the Superficial Hypothesis

Title: Weak-to-Strong Compositional Learning from Generative Models for Language-based Object Detection

Title: FMDNN: A Fuzzy-guided Multi-granular Deep Neural Network for Histopathological Image Classification

Title: Iterative Ensemble Training with Anti-Gradient Control for Mitigating Memorization in Diffusion Models

Title: Explore the LiDAR-Camera Dynamic Adjustment Fusion for 3D Object Detection

Title: Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models

Title: RoadPainter: Points Are Ideal Navigators for Topology transformER

Title: LLMExplainer: Large Language Model based Bayesian Inference for Graph Explanation Generation

Title: MAVEN-Fact: A Large-scale Event Factuality Detection Dataset

Title: Customized Retrieval Augmented Generation and Benchmarking for EDA Tool Documentation QA

Title: Attention Beats Linear for Fast Implicit Neural Representation Generation

Title: X-Recon: Learning-based Patient-specific High-Resolution CT Reconstruction from Orthogonal X-Ray Images

Title: Dissecting Multiplication in Transformers: Insights into LLMs

Title: Walking in Others' Shoes: How Perspective-Taking Guides Large Language Models in Reducing Toxicity and Bias

Title: Is user feedback always informative? Retrieval Latent Defending for Semi-Supervised Domain Adaptation without Source Data

Title: Towards Robust Vision Transformer via Masked Adaptive Ensemble

Title: Poisoning with A Pill: Circumventing Detection in Federated Learning

Title: ALLaM: Large Language Models for Arabic and English

Title: Imposter.AI: Adversarial Attacks with Hidden Intentions towards Aligned Large Language Models

Title: Tackling Selfish Clients in Federated Learning

Title: A Solution toward Transparent and Practical AI Regulation: Privacy Nutrition Labels for Open-source Generative AI-based Applications

Title: Chronologically Accurate Retrieval for Temporal Grounding of Motion-Language Models

Title: Weights Shuffling for Improving DPSGD in Transformer-based Models

Title: LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models

Title: Local All-Pair Correspondence for Point Tracking

Title: Planning behavior in a recurrent neural network that plays Sokoban

Title: Bidirectional skip-frame prediction for video anomaly detection with intra-domain disparity-driven attention

Title: Empirical Capacity Model for Self-Attention Neural Networks

Title: Resource-Efficient Federated Multimodal Learning via Layer-wise and Progressive Training

Title: YOLO-pdd: A Novel Multi-scale PCB Defect Detection Method Using Deep Representations with Sequential Images

Title: Decoding BACnet Packets: A Large Language Model Approach for Packet Interpretation

Title: Learning at a Glance: Towards Interpretable Data-limited Continual Semantic Segmentation via Semantic-Invariance Modelling

Title: Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures

Title: Merit-based Fair Combinatorial Semi-Bandit with Unrestricted Feedback Delays

Title: Developing a Reliable, General-Purpose Hallucination Detection and Mitigation Service: Insights and Lessons Learned

Title: Text2Place: Affordance-aware Text Guided Human Placement

Title: SIGMA:Sinkhorn-Guided Masked Video Modeling

Title: Text-to-Battery Recipe: A language modeling-based protocol for automatic battery recipe extraction and retrieval

Title: The Diversity Bonus: Learning from Dissimilar Distributed Clients in Personalized Federated Learning

Title: Learning deep illumination-robust features from multispectral filter array images

Title: In-Context Learning Improves Compositional Understanding of Vision-Language Models

Title: DiffX: Guide Your Layout to Cross-Modal Generative Modeling

Title: Fast computation of 2-isogenies in dimension 4 and cryptographic applications

Title: Refining Corpora from a Model Calibration Perspective for Chinese Spelling Correction

Title: TextureCrop: Enhancing Synthetic Image Detection through Texture-based Cropping

Title: WebRPG: Automatic Web Rendering Parameters Generation for Visual Presentation

Title: Fundamental Limits of Prompt Compression: A Rate-Distortion Framework for Black-Box Language Models

Title: SpotDiffusion: A Fast Approach For Seamless Panorama Generation Over Time

Title: Compensate Quantization Errors+: Quantized Models Are Inquisitive Learners

Title: Increasing the Robustness of Model Predictions to Missing Sensors in Earth Observation

Title: Attention Is All You Need But You Don't Need All Of It For Inference of Large Language Models

Title: Towards Efficient Transferable Preemptive Adversarial Defense

Title: Synthetic Image Learning: Preserving Performance and Preventing Membership Inference Attacks

Title: Inverted Activations

Title: Targeted Latent Adversarial Training Improves Robustness to Persistent Harmful Behaviors in LLMs

Title: SETTP: Style Extraction and Tunable Inference via Dual-level Transferable Prompt Learning

Title: A New Theoretical Perspective on Data Heterogeneity in Federated Optimization

Title: An Empirical Study of Retrieval Augmented Generation with Chain-of-Thought

Title: Unsupervised Robust Cross-Lingual Entity Alignment via Joint Modeling of Entity and Relation Texts

Title: Discrete Flow Matching

Title: Semi-Supervised Learning for Anomaly Detection in Blockchain-based Supply Chains

Title: StylusAI: Stylistic Adaptation for Robust German Handwritten Text Generation

Title: RadioRAG: Factual Large Language Models for Enhanced Diagnostics in Radiology Using Dynamic Retrieval Augmented Generation

Title: Reinforcement Learning Meets Visual Odometry

Title: Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models

Title: SS-SFR: Synthetic Scenes Spatial Frequency Response on Virtual KITTI and Degraded Automotive Simulations for Object Detection

Title: TreeSBA: Tree-Transformer for Self-Supervised Sequential Brick Assembly

Title: Evaluation of Reinforcement Learning for Autonomous Penetration Testing using A3C, Q-learning and DQN

Title: DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving

Title: MSSPlace: Multi-Sensor Place Recognition with Visual and Text Semantics

Title: HaloQuest: A Visual Hallucination Dataset for Advancing Multimodal Reasoning

Title: Enhancing Transferability of Targeted Adversarial Examples: A Self-Universal Perspective

Title: AI-Driven Fast and Early Detection of IoT Botnet Threats: A Comprehensive Network Traffic Analysis Approach

Title: Counter Turing Test ($CT^2$): Investigating AI-Generated Text Detection for Hindi -- Ranking LLMs based on Hindi AI Detectability Index ($ADI_{hi}$)

Title: A Life-long Learning Intrusion Detection System for 6G-Enabled IoV

Title: Estimating Probability Densities with Transformer and Denoising Diffusion

Title: Multi-Modality Co-Learning for Efficient Skeleton-based Action Recognition

Title: SwinSF: Image Reconstruction from Spatial-Temporal Spike Streams

Title: Mamba meets crack segmentation

Title: Harmonizing Flows: Leveraging normalizing flows for unsupervised and source-free MRI harmonization

Title: GFE-Mamba: Mamba-based AD Multi-modal Progression Assessment via Generative Feature Extraction from MCI

Title: Do Large Language Models Have Compositional Ability? An Investigation into Limitations and Scalability

Title: DStruct2Design: Data and Benchmarks for Data Structure Driven Generative Floor Plan Design

Title: OMoS-QA: A Dataset for Cross-Lingual Extractive Question Answering in a German Migration Context

Title: Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond

Title: MoRSE: Bridging the Gap in Cybersecurity Expertise with Retrieval Augmented Generation

Title: Towards Open-World Object-based Anomaly Detection via Self-Supervised Outlier Synthesis

Title: Concept-Based Interpretable Reinforcement Learning with Limited to No Human Labels

Title: Extracting Structured Insights from Financial News: An Augmented LLM Driven Approach

Title: RADA: Robust and Accurate Feature Learning with Domain Adaptation

Title: Robust Mixture Learning when Outliers Overwhelm Small Groups

Title: CLIP with Generative Latent Replay: a Strong Baseline for Incremental Learning

Title: Disentangling spatio-temporal knowledge for weakly supervised object detection and segmentation in surgical video

Title: MILAN: Milli-Annotations for Lidar Semantic Segmentation

Title: Robust Facial Reactions Generation: An Emotion-Aware Framework with Modality Compensation

Title: Breaking the Global North Stereotype: A Global South-centric Benchmark Dataset for Auditing and Mitigating Biases in Facial Recognition Systems

Title: Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget

Title: Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight

Title: dMel: Speech Tokenization made Simple

Title: MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity

Title: SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models

Title: Artist: Aesthetically Controllable Text-Driven Stylization without Training

Title: CarFormer: Self-Driving with Learned Object-Centric Representations

Title: Reconstructing Training Data From Real World Models Trained with Transfer Learning

Title: LLMmap: Fingerprinting For Large Language Models

Title: AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description