2024-11-19

Title: Large Language Models for Constructing and Optimizing Machine Learning Workflows: A Survey

Title: Challenges in the Differential Classification of Individual Diagnoses from Co-Occurring Autism and ADHD Using Survey Data

Title: Biometrics in Extended Reality: A Review

Title: MFP3D: Monocular Food Portion Estimation Leveraging 3D Point Clouds

Title: Boundary Attention Constrained Zero-Shot Layout-To-Image Generation

Title: Structure Tensor Representation for Robust Oriented Object Detection

Title: Prompt-Guided Environmentally Consistent Adversarial Patch

Title: FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on

Title: Edge-Only Universal Adversarial Attacks in Distributed Learning

Title: OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models

Title: USP-Gaussian: Unifying Spike-based Image Reconstruction, Pose Correction and Gaussian Splatting

Title: DR-BFR: Degradation Representation with Diffusion Models for Blind Face Restoration

Title: TESGNN: Temporal Equivariant Scene Graph Neural Networks for Efficient and Robust Multi-View 3D Scene Understanding

Title: SmoothCache: A Universal Inference Acceleration Technique for Diffusion Transformers

Title: On the Privacy Risk of In-context Learning

Title: Any2Any: Incomplete Multimodal Retrieval with Conformal Prediction

Title: "On the goals of linguistic theory": Revisiting Chomskyan theories in the era of AI

Title: Does Prompt Formatting Have Any Impact on LLM Performance?

Title: SoftLMs: Efficient Adaptive Low-Rank Approximation of Language Models using Soft-Thresholding Mechanism

Title: Debias-CLR: A Contrastive Learning Based Debiasing Method for Algorithmic Fairness in Healthcare Applications

Title: Efficient Alignment of Large Language Models via Data Sampling

Title: Low-Rank Optimal Transport through Factor Relaxation with Latent Coupling

Title: mlan: language-based instruction tuning improves zero-shot generalization of multimodal large language models

Title: Vision Eagle Attention: A New Lens for Advancing Image Classification

Title: Motion Diffusion-Guided 3D Global HMR from a Dynamic Camera

Title: Creation and Evaluation of a Food Product Image Dataset for Product Property Extraction

Title: FedAli: Personalized Federated Learning with Aligned Prototypes through Optimal Transport

Title: AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment

Title: Contextualizing Security and Privacy of Software-Defined Vehicles: State of the Art and Industry Perspectives

Title: To Shuffle or not to Shuffle: Auditing DP-SGD with Shuffling

Title: Electrical Load Forecasting in Smart Grid: A Personalized Federated Learning Approach

Title: Is thermography a viable solution for detecting pressure injuries in dark skin patients?

Title: Leveraging large language models for efficient representation learning for entity resolution

Title: MTA: Multimodal Task Alignment for BEV Perception and Captioning

Title: BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices

Title: Enhancing PTSD Outcome Prediction with Ensemble Models in Disaster Contexts

Title: AutoIoT: Automated IoT Platform Using Large Language Models

Title: SAM Decoding: Speculative Decoding via Suffix Automaton

Title: Segmentation of Ink and Parchment in Dead Sea Scroll Fragments

Title: Awaker2.5-VL: Stably Scaling MLLMs with Parameter-Efficient Mixture of Experts

Title: IntentGPT: Few-shot Intent Discovery with Large Language Models

Title: How to Defend Against Large-scale Model Poisoning Attacks in Federated Learning: A Vertical Solution

Title: Two-layer consensus based on master-slave consortium chain data sharing for Internet of Vehicles

Title: Structured Dialogue System for Mental Health: An LLM Chatbot Leveraging the PM+ Guidelines

Title: I'm Spartacus, No, I'm Spartacus: Measuring and Understanding LLM Identity Confusion

Title: MaskMedPaint: Masked Medical Image Inpainting with Diffusion Models for Mitigation of Spurious Correlations

Title: DEBUG-HD: Debugging TinyML models on-device using Hyper-Dimensional computing

Title: HELENE: Hessian Layer-wise Clipping and Gradient Annealing for Accelerating Fine-tuning LLM with Zeroth-order Optimization

Title: Diffusion-based Layer-wise Semantic Reconstruction for Unsupervised Out-of-Distribution Detection

Title: Hybrid Attention Model Using Feature Decomposition and Knowledge Distillation for Glucose Forecasting

Title: AllRestorer: All-in-One Transformer for Image Restoration under Composite Degradations

Title: A Regularized LSTM Method for Detecting Fake News Articles

Title: EVT: Efficient View Transformation for Multi-Modal 3D Object Detection

Title: FlowScope: Enhancing Decision Making by Time Series Forecasting based on Prediction Optimization using HybridFlow Forecast Framework

Title: Multi Scale Graph Neural Network for Alzheimer's Disease

Title: HJ-Ky-0.1: an Evaluation Dataset for Kyrgyz Word Embeddings

Title: On-device Anomaly Detection in Conveyor Belt Operations

Title: MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map

Title: It Takes Two: Accurate Gait Recognition in the Wild via Cross-granularity Alignment

Title: TDSM:Triplet Diffusion for Skeleton-Text Matching in Zero-Shot Action Recognition

Title: LTCXNet: Advancing Chest X-Ray Analysis with Solutions for Long-Tailed Multi-Label Classification and Fairness Challenges

Title: Can Generic LLMs Help Analyze Child-adult Interactions Involving Children with Autism in Clinical Observation?

Title: Steam Turbine Anomaly Detection: An Unsupervised Learning Approach Using Enhanced Long Short-Term Memory Variational Autoencoder

Title: Task Offloading for Vehicular Edge Computing Based on Improved Hotstuff under Parking Assistance

Title: Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer

Title: C-DiffSET: Leveraging Latent Diffusion for SAR-to-EO Image Translation with Confidence-Guided Reliable Object Generation

Title: Anatomy-Guided Radiology Report Generation with Pathology-Aware Regional Prompts

Title: Test-time Conditional Text-to-Image Synthesis Using Diffusion Models

Title: Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model

Title: Stable Continual Reinforcement Learning via Diffusion-based Trajectory Replay

Title: Information Anxiety in Large Language Models

Title: DEAL: Decoupled Classifier with Adaptive Linear Modulation for Group Robust Early Diagnosis of MCI to AD Conversion

Title: Conformation Generation using Transformer Flows

Title: An Oversampling-enhanced Multi-class Imbalanced Classification Framework for Patient Health Status Prediction Using Patient-reported Outcomes

Title: A Data-Efficient Sequential Learning Framework for Melt Pool Defect Classification in Laser Powder Bed Fusion

Title: ARM: Appearance Reconstruction Model for Relightable 3D Generation

Title: One-Layer Transformer Provably Learns One-Nearest Neighbor In Context

Title: Automatic Discovery and Assessment of Interpretable Systematic Errors in Semantic Segmentation

Title: NeuroNURBS: Learning Efficient Surface Representations for 3D Solids

Title: On the Verification of Control Flow Attestation Evidence

Title: Large Vision-Language Models for Remote Sensing Visual Question Answering

Title: See-Saw Generative Mechanism for Scalable Recursive Code Generation with Generative AI

Title: Improvement in Facial Emotion Recognition using Synthetic Data Generated by Diffusion Model

Title: ViBe: A Text-to-Video Benchmark for Evaluating Hallucination in Large Multimodal Models

Title: Large Language Models (LLMs) as Traffic Control Systems at Urban Intersections: A New Paradigm

Title: Empowering Meta-Analysis: Leveraging Large Language Models for Scientific Synthesis

Title: BanglaDialecto: An End-to-End AI-Powered Regional Speech Standardization

Title: FIAS: Feature Imbalance-Aware Medical Image Segmentation with Dynamic Fusion and Mixing Attention

Title: I Know What You Sync: Covert and Side Channel Attacks on File Systems via syncfs

Title: MetricGold: Leveraging Text-To-Image Latent Diffusion Models for Metric Depth Estimation

Title: Practitioner Paper: Decoding Intellectual Property: Acoustic and Magnetic Side-channel Attack on a 3D Printer

Title: Watermarking Generative Categorical Data

Title: Attention-based U-Net Method for Autonomous Lane Detection

Title: SPICA: Retrieving Scenarios for Pluralistic In-Context Alignment

Title: Generating Compositional Scenes via Text-to-image RGBA Instance Generation

Title: BPO: Towards Balanced Preference Optimization between Knowledge Breadth and Depth in Alignment

Title: Bias in Large Language Models: Origin, Evaluation, and Mitigation

Title: LLM-assisted Physical Invariant Extraction for Cyber-Physical Systems Anomaly Detection

Title: Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection

Title: Hyperspectral Imaging-Based Grain Quality Assessment With Limited Labelled Data

Title: Learn from Downstream and Be Yourself in Multimodal Large Language Model Fine-Tuning

Title: Constrained Diffusion with Trust Sampling

Title: Analyzing Pok\'emon and Mario Streamers' Twitch Chat with LLM-based User Embeddings

Title: Iterative Camera-LiDAR Extrinsic Optimization via Surrogate Diffusion

Title: Memory-Augmented Multimodal LLMs for Surgical VQA via Self-Contained Inquiry

Title: Anomaly Detection for People with Visual Impairments Using an Egocentric 360-Degree Camera

Title: Direct and Explicit 3D Generation from a Single Image

Title: Towards Accurate and Efficient Sub-8-Bit Integer Training

Title: Understanding Multimodal LLMs: the Mechanistic Interpretability of Llava in Visual Question Answering

Title: TSFormer: A Robust Framework for Efficient UHD Image Restoration

Title: V2X-Radar: A Multi-modal Dataset with 4D Radar for Cooperative Perception

Title: VidComposition: Can MLLMs Analyze Compositions in Compiled Videos?

Title: Towards a framework on tabular synthetic data generation: a minimalist approach: theory, use cases, and limitations

Title: Framework for developing and evaluating ethical collaboration between expert and machine

Title: EROAM: Event-based Camera Rotational Odometry and Mapping in Real-time

Title: BackdoorMBTI: A Backdoor Learning Multimodal Benchmark Tool Kit for Backdoor Defense Evaluation

Title: CCi-YOLOv8n: Enhanced Fire Detection with CARAFE and Context-Guided Modules

Title: Time Step Generating: A Universal Synthesized Deepfake Image Detector

Title: A Study of Malware Prevention in Linux Distributions

Title: Training a Label-Noise-Resistant GNN with Reduced Complexity

Title: BianCang: A Traditional Chinese Medicine Large Language Model

Title: Wafer Map Defect Classification Using Autoencoder-Based Data Augmentation and Convolutional Neural Network

Title: EfQAT: An Efficient Framework for Quantization-Aware Training

Title: FedUHB: Accelerating Federated Unlearning via Polyak Heavy Ball Method

Title: Efficient Federated Unlearning with Adaptive Differential Privacy Preservation

Title: StableV2V: Stablizing Shape Consistency in Video-to-Video Editing

Title: Knowledge-enhanced Transformer for Multivariate Long Sequence Time-series Forecasting

Title: SRA-MCTS: Self-driven Reasoning Aurmentation with Monte Carlo Tree Search for Enhanced Code Generation

Title: FastDraft: How to Train Your Draft

Title: Patching FPGAs: The Security Implications of Bitstream Modifications

Title: Beyond Human-Like Processing: Large Language Models Perform Equivalently on Forward and Backward Scientific Text

Title: TS-LLaVA: Constructing Visual Tokens through Thumbnail-and-Sampling for Training-Free Video Large Language Models

Title: Skeleton-Guided Spatial-Temporal Feature Learning for Video-Based Visible-Infrared Person Re-Identification

Title: Multilingual Large Language Models: A Systematic Survey

Title: The Promises and Pitfalls of LLM Annotations in Dataset Labeling: a Case Study on Media Bias Detection

Title: D-Cube: Exploiting Hyper-Features of Diffusion Model for Robust Medical Classification

Title: MolParser: End-to-end Visual Recognition of Molecule Structures in the Wild

Title: Different Horses for Different Courses: Comparing Bias Mitigation Algorithms in ML

Title: Label Sharing Incremental Learning Framework for Independent Multi-Label Segmentation Tasks

Title: JailbreakLens: Interpreting Jailbreak Mechanism in the Lens of Representation and Circuit

Title: Oscillation Inversion: Understand the structure of Large Flow Model through the Lens of Inversion Method

Title: CLMIA: Membership Inference Attacks via Unsupervised Contrastive Learning

Title: From Primes to Paths: Enabling Fast Multi-Relational Graph Analysis

Title: Person Segmentation and Action Classification for Multi-Channel Hemisphere Field of View LiDAR Sensors

Title: Federated Learning for UAV-Based Spectrum Sensing: Enhancing Accuracy Through SNR-Weighted Model Aggregation

Title: MPLite: Multi-Aspect Pretraining for Mining Clinical Health Records

Title: RPN 2: On Interdependence Function Learning Towards Unifying and Advancing CNN, RNN, GNN, and Transformer

Title: Enhanced Anime Image Generation Using USE-CMHSA-GAN

Title: AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers

Title: Careless Whisper: Exploiting Stealthy End-to-End Leakage in Mobile Instant Messengers

Title: SoK: Unifying Cybersecurity and Cybersafety of Multimodal Foundation Models with an Information Theory Approach

Title: Stealing Training Graphs from Graph Neural Networks

Title: Countering Backdoor Attacks in Image Recognition: A Survey and Evaluation of Mitigation Strategies

Title: Capturing Sparks of Abstraction for the ARC Challenge

Title: Making Sigmoid-MSE Great Again: Output Reset Challenges Softmax Cross-Entropy in Neural Network Classification

Title: DeforHMR: Vision Transformer with Deformable Cross-Attention for 3D Human Mesh Recovery

Title: Efficient Transfer Learning for Video-language Foundation Models

Title: MEMO-Bench: A Multiple Benchmark for Text-to-Image and Multimodal Large Language Models on Human Emotion Analysis

Title: ZeFaV: Boosting Large Language Models for Zero-shot Fact Verification

Title: EXCON: Extreme Instance-based Contrastive Representation Learning of Severely Imbalanced Multivariate Time Series for Solar Flare Prediction

Title: Large corpora and large language models: a replicable method for automating grammatical annotation

Title: VersaTune: Fine-Tuning Multi-Ability LLMs Efficiently

Title: Zero-Shot Automatic Annotation and Instance Segmentation using LLM-Generated Datasets: Eliminating Field Imaging and Manual Annotation for Deep Learning Model Development

Title: Reducing Label Dependency for Underwater Scene Understanding: A Survey of Datasets, Techniques and Applications

Title: Neuron: Learning Context-Aware Evolving Representations for Zero-Shot Skeleton Action Recognition

Title: LP Data Pipeline: Lightweight, Purpose-driven Data Pipeline for Large Language Models

Title: SADDE: Semi-supervised Anomaly Detection with Dependable Explanations

Title: Transcending Language Boundaries: Harnessing LLMs for Low-Resource Language Translation

Title: Steering Language Model Refusal with Sparse Autoencoders

Title: Toward Personalized Federated Node Classification in One-shot Communication

Title: TP-UNet: Temporal Prompt Guided UNet for Medical Image Segmentation

Title: A Review on Machine Unlearning

Title: Establishing Minimum Elements for Effective Vulnerability Management in AI Software

Title: Enhancing Decision Transformer with Diffusion-Based Trajectory Branch Generation

Title: Teaching Video Diffusion Model with Latent Physical Phenomenon Knowledge

Title: Zero-Shot Load Forecasting with Large Language Models

Title: Visual-Semantic Graph Matching Net for Zero-Shot Learning

Title: CCExpert: Advancing MLLM Capability in Remote Sensing Change Captioning with Difference-Aware Integration and a Foundational Dataset

Title: MAIRA-Seg: Enhancing Radiology Report Generation with Segmentation-Aware Multimodal Large Language Models

Title: Continual Task Learning through Adaptive Policy Self-Composition

Title: Adapting to Cyber Threats: A Phishing Evolution Network (PEN) Framework for Phishing Generation and Analyzing Evolution Patterns using Large Language Models

Title: The GECo algorithm for Graph Neural Networks Explanation

Title: Bridging the Resource Gap: Deploying Advanced Imitation Learning Models onto Affordable Embedded Platforms

Title: The Dark Side of Trust: Authority Citation-Driven Jailbreak Attacks on Large Language Models

Title: IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos

Title: TEEMATE: Fast and Efficient Confidential Container using Shared Enclave

Title: Membership Inference Attack against Long-Context Large Language Models

Title: CLUE-MARK: Watermarking Diffusion Models using CLWE

Title: Unveiling the Inflexibility of Adaptive Embedding in Traffic Forecasting

Title: Upside-Down Reinforcement Learning for More Interpretable Optimal Control

Title: Re-examining learning linear functions in context

Title: MGNiceNet: Unified Monocular Geometric Scene Understanding

Title: Generalizable Person Re-identification via Balancing Alignment and Uniformity

Title: Graph Artificial Intelligence for Quantifying Compatibility Mechanisms in Traditional Chinese Medicine

Title: MVLight: Relightable Text-to-3D Generation via Light-conditioned Multi-View Diffusion

Title: SoK: On the Role and Future of AIGC Watermarking in the Era of Gen-AI

Title: Exploring Emerging Trends and Research Opportunities in Visual Place Recognition

Title: Safe + Safe = Unsafe? Exploring How Safe Images Can Be Exploited to Jailbreak Large Vision-Language Models

Title: LaVin-DiT: Large Vision Diffusion Transformer

Title: Cascaded Diffusion Models for 2D and 3D Microscopy Image Synthesis to Enhance Cell Segmentation

Title: Preempting Text Sanitization Utility in Resource-Constrained Privacy-Preserving LLM Interactions

Title: Reliable Poisoned Sample Detection against Backdoor Attacks Enhanced by Sharpness Aware Minimization

Title: Addressing Hallucinations in Language Models with Knowledge Graph Embeddings as an Additional Modality

Title: Enhancing Vision-Language Model Safety through Progressive Concept-Bottleneck-Driven Alignment

Title: Real-Time Fitness Exercise Classification and Counting from Video Frames

Title: Simple But Not Secure: An Empirical Security Analysis of Two-factor Authentication Systems

Title: GNN-Based Code Annotation Logic for Establishing Security Boundaries in C Code

Title: OASIS: Open Agents Social Interaction Simulations on One Million Agents

Title: Generative Spatio-temporal GraphNet for Transonic Wing Pressure Distribution Forecasting

Title: Feature Selection for Network Intrusion Detection

Title: ST-Tree with Interpretability for Multivariate Time Series Classification

Title: Federated Incremental Named Entity Recognition

Title: Teapot: Efficiently Uncovering Spectre Gadgets in COTS Binaries

Title: Chapter 7 Review of Data-Driven Generative AI Models for Knowledge Extraction from Scientific Literature in Healthcare

Title: SP${ }^3$ : Superpixel-propagated pseudo-label learning for weakly semi-supervised medical image segmentation

Title: TSINR: Capturing Temporal Continuity via Implicit Neural Representations for Time Series Anomaly Detection

Title: Can Highlighting Help GitHub Maintainers Track Security Fixes?

Title: No-regret Exploration in Shuffle Private Reinforcement Learning

Title: Dissecting Misalignment of Multimodal Large Language Models via Influence Function

Title: Efficient and Robust Continual Graph Learning for Graph Classification in Biology

Title: Few-shot Model Extraction Attacks against Sequential Recommender Systems

Title: Conceptwm: A Diffusion Model Watermark for Concept Protection

Title: Towards Degradation-Robust Reconstruction in Generalizable NeRF

Title: Technical Report: Enhancing LLM Reasoning with Reward-guided Tree Search

Title: Robust Reinforcement Learning under Diffusion Models for Data with Jumps

Title: Bitcoin Under Volatile Block Rewards: How Mempool Statistics Can Influence Bitcoin Mining

Title: FedCoLLM: A Parameter-Efficient Federated Co-tuning Framework for Large and Small Language Models

Title: FLMarket: Enabling Privacy-preserved Pre-training Data Pricing for Federated Learning

Title: RAWMamba: Unified sRGB-to-RAW De-rendering With State Space Model

Title: Aligning Few-Step Diffusion Models with Dense Reward Difference Learning

Title: Moral Persuasion in Large Language Models: Evaluating Susceptibility and Ethical Alignment

Title: Advacheck at GenAI Detection Task 1: AI Detection Powered by Domain-Aware Multi-Tasking

Title: BitMoD: Bit-serial Mixture-of-Datatype LLM Acceleration

Title: Freezing of Gait Detection Using Gramian Angular Fields and Federated Learning from Wearable Sensors

Title: LLM-IE: A Python Package for Generative Information Extraction with Large Language Models

Title: A Potential Game Perspective in Federated Learning

Title: Tackling prediction tasks in relational databases with LLMs

Title: Bi-Mamba: Towards Accurate 1-Bit State Space Models

Title: Generative World Explorer