2024-09-17

Title: AutoGeo: Automating Geometric Image Dataset Creation for Enhanced Geometry Understanding

Title: HESSO: Towards Automatic Efficient and User Friendly Any Neural Network Training and Pruning

Title: Inf-MLLM: Efficient Streaming Inference of Multimodal Large Language Models on a Single GPU

Title: Y-Drop: A Conductance based Dropout for fully connected layers

Title: Trimming the Risk: Towards Reliable Continuous Training for Deep Learning Inspection Systems

Title: Neural Message Passing Induced by Energy-Constrained Diffusion

Title: DomURLs_BERT: Pre-trained BERT-based Model for Malicious Domains and URLs Detection and Classification

Title: PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage

Title: Adaptive Multi-Modal Control of Digital Human Hand Synthesis Using a Region-Aware Cycle Loss

Title: Curricula for Learning Robust Policies over Factored State Representations in Changing Environments

Title: Incorporation of Verifier Functionality in the Software for Operations and Network Attack Results Review and the Autonomous Penetration Testing System

Title: Cybersecurity Software Tool Evaluation Using a 'Perfect' Network Model

Title: Transformer with Controlled Attention for Synchronous Motion Captioning

Title: ProcessTBench: An LLM Plan Generation Dataset for Process Mining

Title: Batched Online Contextual Sparse Bandits with Sequential Inclusion of Features

Title: Contextual Evaluation of Large Language Models for Classifying Tropical and Infectious Diseases

Title: Multi-modal Speech Transformer Decoders: When Do Multiple Modalities Improve Accuracy?

Title: Autoregressive + Chain of Thought (CoT) $\simeq$ Recurrent: Recurrence's Role in Language Models and a Revist of Recurrent Transformer

Title: Investigation of Hierarchical Spectral Vision Transformer Architecture for Classification of Hyperspectral Imagery

Title: Robust Training of Neural Networks at Arbitrary Precision and Sparsity

Title: NovAScore: A New Automated Metric for Evaluating Document Level Novelty

Title: ETAGE: Enhanced Test Time Adaptation with Integrated Entropy and Gradient Norms for Robust Model Performance

Title: VSFormer: Mining Correlations in Flexible View Set for Multi-view 3D Shape Understanding

Title: Active Learning to Guide Labeling Efforts for Question Difficulty Estimation

Title: Informative Subgraphs Aware Masked Auto-Encoder in Dynamic Graphs

Title: SafeEar: Content Privacy-Preserving Audio Deepfake Detection

Title: LabellessFace: Fair Metric Learning for Face Recognition without Attribute Labels

Title: Language Models "Grok" to Copy

Title: SAM-OCTA2: Layer Sequence OCTA Segmentation with Fine-tuned Segment Anything Model 2

Title: Generating API Parameter Security Rules with LLM for API Misuse Detection

Title: Associate Everything Detected: Facilitating Tracking-by-Detection to the Unknown

Title: ManiDext: Hand-Object Manipulation Synthesis via Continuous Correspondence Embeddings and Residual-Guided Diffusion

Title: Registration between Point Cloud Streams and Sequential Bounding Boxes via Gradient Descent

Title: ODE: Open-Set Evaluation of Hallucinations in Multimodal Large Language Models

Title: ChildPlay-Hand: A Dataset of Hand Manipulations in the Wild

Title: A Compressive Memory-based Retrieval Approach for Event Argument Extraction

Title: Efficient Fine-Tuning of Large Language Models for Automated Medical Documentation

Title: LawDNet: Enhanced Audio-Driven Lip Synthesis via Local Affine Warping Deformation

Title: Schr\"odinger Bridge Flow for Unpaired Data Translation

Title: OPUS: Occupancy Prediction Using a Sparse Set

Title: Towards Robust Detection of Open Source Software Supply Chain Poisoning Attacks in Industry Environments

Title: Symbolic Regression with a Learned Concept Library

Title: LACOSTE: Exploiting stereo and temporal contexts for surgical instrument segmentation

Title: Beta-Sigma VAE: Separating beta and decoder variance in Gaussian variational autoencoder

Title: Generating Event-oriented Attribution for Movies via Two-Stage Prefix-Enhanced Multimodal LLM

Title: Models Are Codes: Towards Measuring Malicious Code Poisoning Attacks on Pre-trained Model Hubs

Title: BM$^2$: Coupled Schr\"{o}dinger Bridge Matching

Title: The Midas Touch: Triggering the Capability of LLMs for RM-API Misuse Detection

Title: LLM-Powered Ensemble Learning for Paper Source Tracing: A GPU-Free Approach

Title: AMBER -- Advanced SegFormer for Multi-Band Image Segmentation: an application to Hyperspectral Imaging

Title: Tran-GCN: A Transformer-Enhanced Graph Convolutional Network for Person Re-Identification in Monitoring Videos

Title: Towards Diverse and Efficient Audio Captioning via Diffusion Models

Title: AI-Driven Virtual Teacher for Enhanced Educational Efficiency: Leveraging Large Pretrain Models for Autonomous Error Analysis and Correction

Title: Real-world Adversarial Defense against Patch Attacks based on Diffusion Model

Title: Weather Prediction Using CNN-LSTM for Time Series Analysis: A Case Study on Delhi Temperature Data

Title: Enhancing LLM Problem Solving with REAP: Reflection, Explicit Problem Deconstruction, and Advanced Prompting

Title: MulCPred: Learning Multi-modal Concepts for Explainable Pedestrian Action Prediction

Title: Learning Keypoints for Multi-Agent Behavior Analysis using Self-Supervision

Title: TX-Gen: Multi-Objective Optimization for Sparse Counterfactual Explanations for Time-Series Classification

Title: Keeping Humans in the Loop: Human-Centered Automated Annotation with Generative AI

Title: Hacking, The Lazy Way: LLM Augmented Pentesting

Title: Protecting Vehicle Location Privacy with Contextually-Driven Synthetic Location Generation

Title: Multi-Scale Grouped Prototypes for Interpretable Semantic Segmentation

Title: One missing piece in Vision and Language: A Survey on Comics Understanding

Title: Comparing Retrieval-Augmentation and Parameter-Efficient Fine-Tuning for Privacy-Preserving Personalization of Large Language Models

Title: Planning Transformer: Long-Horizon Offline Reinforcement Learning with Planning Tokens

Title: Deep Learning Under Siege: Identifying Security Vulnerabilities and Risk Mitigation Strategies

Title: Enhancing Skin Disease Diagnosis: Interpretable Visual Concept Discovery with SAM Empowerment

Title: An Augmentation-based Model Re-adaptation Framework for Robust Image Segmentation

Title: Using Synthetic Data to Mitigate Unfairness and Preserve Privacy through Single-Shot Federated Learning

Title: COMFORT: A Continual Fine-Tuning Framework for Foundation Models Targeted at Consumer Healthcare

Title: ASR Error Correction using Large Language Models

Title: A Statistical Viewpoint on Differential Privacy: Hypothesis Testing, Representation and Blackwell's Theorem

Title: Thesis proposal: Are We Losing Textual Diversity to Natural Language Processing?

Title: Bias Begets Bias: The Impact of Biased Embeddings on Diffusion Models

Title: NEVLP: Noise-Robust Framework for Efficient Vision-Language Pre-training

Title: Open-World Test-Time Training: Self-Training with Contrast Learning

Title: Security Testbed for Preempting Attacks against Supercomputing Infrastructure

Title: DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion

Title: BULKHEAD: Secure, Scalable, and Efficient Kernel Compartmentalization with PKS

Title: TextureDiffusion: Target Prompt Disentangled Editing for Various Texture Transfer

Title: Rethinking KenLM: Good and Bad Model Ensembles for Efficient Text Quality Filtering in Large Web Corpora

Title: HJ-sampler: A Bayesian sampler for inverse problems of a stochastic process by leveraging Hamilton-Jacobi PDEs and score-based generative models

Title: Enhancing Text Annotation through Rationale-Driven Collaborative Few-Shot Prompting

Title: Can Large Language Models Grasp Event Signals? Exploring Pure Zero-Shot Event-based Recognition

Title: Confidence Estimation for LLM-Based Dialogue State Tracking

Title: A Simple HMM with Self-Supervised Representations for Phone Segmentation

Title: SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks

Title: Unveiling Gender Bias in Large Language Models: Using Teacher's Evaluation in Higher Education As an Example

Title: Leveraging Open-Source Large Language Models for Native Language Identification

Title: EditBoard: Towards A Comprehensive Evaluation Benchmark for Text-based Video Editing Models

Title: SITSMamba for Crop Classification based on Satellite Image Time Series

Title: Nebula: Efficient, Private and Accurate Histogram Estimation

Title: E-Commerce Inpainting with Mask Guidance in Controlnet for Reducing Overcompletion

Title: Training Safe Neural Networks with Global SDP Bounds

Title: Predicting building types and functions at transnational scale

Title: GFlowNet Pretraining with Inexpensive Rewards

Title: AlpaPICO: Extraction of PICO Frames from Clinical Trial Documents Using LLMs

Title: ELSA: Exploiting Layer-wise N:M Sparsity for Vision Transformer Acceleration

Title: Finetuning CLIP to Reason about Pairwise Differences

Title: MFCLIP: Multi-modal Fine-grained CLIP for Generalizable Diffusion Face Forgery Detection

Title: From Challenges and Pitfalls to Recommendations and Opportunities: Implementing Federated Learning in Healthcare

Title: PersonaMark: Personalized LLM watermarking for model protection and user attribution

Title: OML-AD: Online Machine Learning for Anomaly Detection in Time Series Data

Title: Taming the Ransomware Threats: Leveraging Prospect Theory for Rational Payment Decisions

Title: Explore the Hallucination on Low-level Perception for MLLMs

Title: Automated Lesion Segmentation in Whole-Body PET/CT in a multitracer setting

Title: Towards Multi-view Graph Anomaly Detection with Similarity-Guided Contrastive Clustering

Title: Generalizing Alignment Paradigm of Text-to-Image Generation with Preferences through $f$-divergence Minimization

Title: DiFSD: Ego-Centric Fully Sparse Paradigm with Uncertainty Denoising and Iterative Refinement for Efficient End-to-End Autonomous Driving

Title: Rewind-to-Delete: Certified Machine Unlearning for Nonconvex Functions

Title: Underwater Image Enhancement via Dehazing and Color Restoration

Title: Enhancing Lesion Segmentation in PET/CT Imaging with Deep Learning and Advanced Data Preprocessing Techniques

Title: Large Language Model Based Generative Error Correction: A Challenge and Baselines forSpeech Recognition, Speaker Tagging, and Emotion Recognition

Title: BEnDEM:A Boltzmann Sampler Based on Bootstrapped Denoising Energy Matching

Title: Multiple Rotation Averaging with Constrained Reweighting Deep Matrix Factorization

Title: Enhancing Data Quality through Self-learning on Imbalanced Financial Risk Data

Title: Federated Learning in Adversarial Environments: Testbed Design and Poisoning Resilience in Cybersecurity

Title: Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion

Title: PROSE-FD: A Multimodal PDE Foundation Model for Learning Multiple Operators for Forecasting Fluid Dynamics

Title: Causal Inference with Large Language Model: A Survey

Title: GP-GPT: Large Language Model for Gene-Phenotype Mapping

Title: Latent Diffusion Models for Controllable RNA Sequence Generation

Title: Generating Synthetic Free-text Medical Records with Low Re-identification Risk using Masked Language Modeling

Title: A Benchmark Dataset with Larger Context for Non-Factoid Question Answering over Islamic Text

Title: A Survey of Out-of-distribution Generalization for Graph Machine Learning from a Causal View

Title: Revisiting Physical-World Adversarial Attack on Traffic Sign Recognition: A Commercial Systems Perspective

Title: Towards Kinetic Manipulation of the Latent Space

Title: REG: Refined Generalized Focal Loss for Road Asset Detection on Thai Highways Using Vision-Based Detection and Segmentation Models

Title: Proximal Ranking Policy Optimization for Practical Safety in Counterfactual Learning to Rank

Title: Flexible Diffusion Scopes with Parameterized Laplacian for Heterophilic Graph Learning

Title: Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation

Title: Estimating Wage Disparities Using Foundation Models

Title: GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion

Title: Rediscovering the Latent Dimensions of Personality with Large Language Models as Trait Descriptors

Title: Rapid Adaptation of Earth Observation Foundation Models for Segmentation

Title: SFR-RAG: Towards Contextually Faithful LLMs

Title: Towards Data Contamination Detection for Modern Large Language Models: Limitations, Inconsistencies, and Oracle Challenges

Title: High-Security Hardware Module with PUF and Hybrid Cryptography for Data Security

Title: Fault Analysis And Predictive Maintenance Of Induction Motor Using Machine Learning

Title: Gaps or Hallucinations? Gazing into Machine-Generated Legal Analysis for Fine-grained Text Evaluations

Title: Enhancing Industrial Cybersecurity: SoftHSM Implementation on SBCs for Mitigating MITM Attacks

Title: Optimal ablation for interpretability

Title: Artificial Intelligence-Based Opportunistic Coronary Calcium Screening in the Veterans Affairs National Healthcare System

Title: 2S-ODIS: Two-Stage Omni-Directional Image Synthesis by Geometric Distortion Correction

Title: Comprehensive Study on Sentiment Analysis: From Rule-based to modern LLM based system

Title: SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning

Title: FreeMark: A Non-Invasive White-Box Watermarking for Deep Neural Networks

Title: SelECT-SQL: Self-correcting ensemble Chain-of-Thought for Text-to-SQL

Title: HALO: Hallucination Analysis and Learning Optimization to Empower LLMs with Retrieval-Augmented Context for Guided Clinical Decision Making

Title: AttnMod: Attention-Based New Art Styles

Title: On the Diagram of Thought

Title: Benchmarking Large Language Model Uncertainty for Prompt Optimization

Title: Householder Pseudo-Rotation: A Novel Approach to Activation Editing in LLMs with Direction-Magnitude Perspective

Title: A Response to: A Note on "Privacy Preserving n-Party Scalar Product Protocol"

Title: Towards Physically-Realizable Adversarial Attacks in Embodied Vision Navigation

Title: Steinmetz Neural Networks for Complex-Valued Data

Title: LLM-DER:A Named Entity Recognition Method Based on Large Language Models for Chinese Coal Chemical Domain

Title: DAE-Fuse: An Adaptive Discriminative Autoencoder for Multi-Modality Image Fusion

Title: MotionCom: Automatic and Motion-Aware Image Composition with LLM and Video Diffusion Prior

Title: DDoS: Diffusion Distribution Similarity for Out-of-Distribution Detection

Title: Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference

Title: Robust Reinforcement Learning with Dynamic Distortion Risk Measures

Title: Adaptive Segmentation-Based Initialization for Steered Mixture of Experts Image Regression

Title: Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT

Title: Analysing Attacks on Blockchain Systems in a Layer-based Approach

Title: Evaluating the Efficacy of Instance Incremental vs. Batch Learning in Delayed Label Environments: An Empirical Study on Tabular Data Streaming for Fraud Detection

Title: StruEdit: Structured Outputs Enable the Fast and Accurate Knowledge Editing for Large Language Models

Title: PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion

Title: AALF: Almost Always Linear Forecasting

Title: LLMs4OL 2024 Overview: The 1st Large Language Models for Ontology Learning Challenge

Title: AutoPET Challenge III: Testing the Robustness of Generalized Dice Focal Loss trained 3D Residual UNet for FDG and PSMA Lesion Segmentation from Whole-Body PET/CT Images

Title: Quantile Regression for Distributional Reward Models in RLHF

Title: ExelMap: Explainable Element-based HD-Map Change Detection and Update

Title: RealDiff: Real-world 3D Shape Completion using Self-Supervised Diffusion Models

Title: Enhancing RL Safety with Counterfactual LLM Reasoning

Title: LLMs for clinical risk prediction

Title: PrePaMS: Privacy-Preserving Participant Management System for Studies with Rewards and Prerequisites

Title: Fit and Prune: Fast and Training-free Visual Token Pruning for Multi-modal Large Language Models

Title: Garment Attribute Manipulation with Multi-level Attention

Title: Robust Bird's Eye View Segmentation by Adapting DINOv2

Title: From Text to Emoji: How PEFT-Driven Personality Manipulation Unleashes the Emoji Potential in LLMs

Title: BAFNet: Bilateral Attention Fusion Network for Lightweight Semantic Segmentation of Urban Remote Sensing Images

Title: Performance of Human Annotators in Object Detection and Segmentation of Remotely Sensed Data

Title: Enhancing Image Classification in Small and Unbalanced Datasets through Synthetic Data Augmentation

Title: On Synthetic Texture Datasets: Challenges, Creation, and Curation

Title: Fuse4Seg: Image-Level Fusion Based Multi-Modality Medical Image Segmentation

Title: InfoDisent: Explainability of Image Classification Models by Information Disentanglement

Title: Execution-time opacity control for timed automata

Title: Security, Trust and Privacy challenges in AI-driven 6G Networks

Title: The 20 questions game to distinguish large language models

Title: Hyperedge Modeling in Hypergraph Neural Networks by using Densest Overlapping Subgraphs

Title: Detecting Sexism in German Online Newspaper Comments with Open-Source Text Embeddings (Team GDA, GermEval2024 Shared Task 1: GerMS-Detect, Subtasks 1 and 2, Closed Track)

Title: Taming Diffusion Models for Image Restoration: A Review

Title: 2D or not 2D: How Does the Dimensionality of Gesture Representation Affect 3D Co-Speech Gesture Generation?

Title: Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning

Title: Robust image representations with counterfactual contrastive learning

Title: Mamba-ST: State Space Model for Efficient Style Transfer

Title: Prompt-and-Transfer: Dynamic Class-aware Enhancement for Few-shot Segmentation

Title: A Knowledge-Enhanced Disease Diagnosis Method Based on Prompt Learning and BERT Integration

Title: A Large-Scale Privacy Assessment of Android Third-Party SDKs

Title: Learning Semi-Supervised Medical Image Segmentation from Spatial Registration

Title: Signed Graph Autoencoder for Explainable and Polarization-Aware Network Embeddings

Title: MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion

Title: SimInversion: A Simple Framework for Inversion-Based Text-to-Image Editing

Title: Schrodinger's Memory: Large Language Models

Title: Do Pre-trained Vision-Language Models Encode Object States?

Title: Flash STU: Fast Spectral Transform Units

Title: Partial Distribution Matching via Partial Wasserstein Adversarial Networks

Title: Causal Language Modeling Can Elicit Search and Reasoning Capabilities on Logic Puzzles

Title: DILA: Dictionary Label Attention for Mechanistic Interpretability in High-dimensional Multi-label Medical Coding Prediction

Title: RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval