2024-12-06

Title: Not All Adapters Matter: Selective Adapter Freezing for Memory-Efficient Fine-Tuning of Language Models

Title: Enhancing Document AI Data Generation Through Graph-Based Synthetic Layouts

Title: CovidLLM: A Robust Large Language Model with Missing Value Adaptation and Multi-Objective Learning Strategy for Predicting Disease Severity and Clinical Outcomes in COVID-19 Patients

Title: The Vulnerability of Language Model Benchmarks: Do They Accurately Reflect True LLM Performance?

Title: CPTQuant -- A Novel Mixed Precision Post-Training Quantization Techniques for Large Language Models

Title: HunyuanVideo: A Systematic Framework For Large Video Generative Models

Title: CBEval: A framework for evaluating and interpreting cognitive biases in LLMs

Title: Multimodal Sentiment Analysis Based on BERT and ResNet

Title: Evaluating Single Event Upsets in Deep Neural Networks for Semantic Segmentation: an embedded system perspective

Title: MV-Adapter: Multi-view Consistent Image Generation Made Easy

Title: Explainable Malware Detection through Integrated Graph Reduction and Learning Techniques

Title: Personalizing Multimodal Large Language Models for Image Captioning: An Experimental Analysis

Title: Hyperparameter Tuning Through Pessimistic Bilevel Optimization

Title: Acquired TASTE: Multimodal Stance Detection with Textual and Structural Embeddings

Title: Designing DNNs for a trade-off between robustness and processing performance in embedded devices

Title: Interpretable Hierarchical Attention Network for Medical Condition Identification

Title: Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension

Title: Fairness without Demographics through Learning Graph of Gradients

Title: Securing RC Based P2P Networks: A Blockchain-based Access Control Framework utilizing Ethereum Smart Contracts for IoT and Web 3.0

Title: PathletRL++: Optimizing Trajectory Pathlet Extraction and Dictionary Formation via Reinforcement Learning

Title: A Water Efficiency Dataset for African Data Centers

Title: Electrocardiogram-based diagnosis of liver diseases: an externally validated and explainable machine learning approach

Title: VidHalluc: Evaluating Temporal Hallucinations in Multimodal Large Language Models for Video Understanding

Title: Domain-specific Question Answering with Hybrid Search

Title: Multi-view Image Diffusion via Coordinate Noise and Fourier Attention

Title: Advancing Auto-Regressive Continuation for Video Frames

Title: Language Model Meets Prototypes: Towards Interpretable Text Classification Models through Prototypical Networks

Title: End to End Collaborative Synthetic Data Generation

Title: Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning

Title: Modular addition without black-boxes: Compressing explanations of MLPs that compute numerical integration

Title: Coordinate In and Value Out: Training Flow Transformers in Ambient Space

Title: Agent AI with LangGraph: A Modular Framework for Enhancing Machine Translation Using Large Language Models

Title: EditScout: Locating Forged Regions from Diffusion-based Edited Images with Multimodal LLM

Title: I$^2$OL-Net: Intra-Inter Objectness Learning Network for Point-Supervised X-Ray Prohibited Item Detection

Title: Pinco: Position-induced Consistent Adapter for Diffusion Transformer in Foreground-conditioned Inpainting

Title: Exploring Real&Synthetic Dataset and Linear Attention in Image Restoration

Title: Beyond the Binary: Capturing Diverse Preferences With Reward Regularization

Title: Residual Hyperbolic Graph Convolution Networks

Title: A large language model-type architecture for high-dimensional molecular potential energy surfaces

Title: LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model

Title: CCxTrust: Confidential Computing Platform Based on TEE and TPM Collaborative Trust

Title: HybridGS: Decoupling Transients and Statics with 2D and 3D Gaussian Splatting

Title: Educational-Psychological Dialogue Robot Based on Multi-Agent Collaboration

Title: Automated LaTeX Code Generation from Handwritten Math Expressions Using Vision Transformer

Title: CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation

Title: GP-FL: Model-Based Hessian Estimation for Second-Order Over-the-Air Federated Learning

Title: CLIP-PING: Boosting Lightweight Vision-Language Models with Proximus Intrinsic Neighbors Guidance

Title: Safeguarding Text-to-Image Generation via Inference-Time Prompt-Noise Optimization

Title: AyutthayaAlpha: A Thai-Latin Script Transliteration Transformer

Title: DiffSign: AI-Assisted Generation of Customizable Sign Language Videos With Enhanced Realism

Title: Uniform Discretized Integrated Gradients: An effective attribution based method for explaining large language models

Title: Machine Learning-based Android Intrusion Detection System

Title: A Noise is Worth Diffusion Guidance

Title: Can Targeted Clean-Label Poisoning Attacks Generalize?

Title: Quantized and Interpretable Learning Scheme for Deep Neural Networks in Classification Task

Title: A Survey on Large Language Model-Based Social Agents in Game-Theoretic Scenarios

Title: Privacy-Preserving in Medical Image Analysis: A Review of Methods and Applications

Title: MT3DNet: Multi-Task learning Network for 3D Surgical Scene Reconstruction

Title: InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models

Title: AIpparel: A Large Multimodal Generative Model for Digital Garments

Title: Enhancing and Accelerating Diffusion-Based Inverse Problem Solving through Measurements Optimization

Title: WACANA: A Concolic Analyzer for Detecting On-chain Data Vulnerabilities in WASM Smart Contracts

Title: BEFL: Balancing Energy Consumption in Federated Learning for Mobile Edge IoT

Title: A Framework For Image Synthesis Using Supervised Contrastive Learning

Title: Local Curvature Smoothing with Stein's Identity for Efficient Score Matching

Title: Exact: Exploring Space-Time Perceptive Clues for Weakly Supervised Satellite Image Time Series Semantic Segmentation

Title: HyperDefect-YOLO: Enhance YOLO with HyperGraph Computation for Industrial Defect Detection

Title: Digital Twin for Evaluating Detective Countermeasures in Smart Grid Cybersecurity

Title: AI-based Attacker Models for Enhancing Multi-Stage Cyberattack Simulations in Smart Grids Using Co-Simulation Environments

Title: Exploring Fully Convolutional Networks for the Segmentation of Hyperspectral Imaging Applied to Advanced Driver Assistance Systems

Title: MTMT: Consolidating Multiple Thinking Modes to Form a Thought Tree for Strengthening LLM

Title: LaserGuider: A Laser Based Physical Backdoor Attack against Deep Neural Networks

Title: IF-MDM: Implicit Face Motion Diffusion Model for High-Fidelity Realtime Talking Head Generation

Title: Marco-LLM: Bridging Languages via Massive Multilingual Training for Cross-Lingual Enhancement

Title: (Blind) Users Really Do Heed Aural Telephone Scam Warnings

Title: PriorMotion: Generative Class-Agnostic Motion Prediction with Raster-Vector Motion Field Priors

Title: M$^{3}$D: A Multimodal, Multilingual and Multitask Dataset for Grounded Document-level Information Extraction

Title: Mask of truth: model sensitivity to unexpected regions of medical images

Title: Dimension Reduction via Random Projection for Privacy in Multi-Agent Systems

Title: Dynamic Graph Representation with Contrastive Learning for Financial Market Prediction: Integrating Temporal Evolution and Static Relations

Title: AI4EF: Artificial Intelligence for Energy Efficiency in the Building Sector

Title: Hostility Detection in UK Politics: A Dataset on Online Abuse Targeting MPs

Title: How to design a Public Key Infrastructure for a Central Bank Digital Currency

Title: TransAdapter: Vision Transformer for Feature-Centric Unsupervised Domain Adaptation

Title: SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation Learning

Title: Towards Generalizable Autonomous Penetration Testing via Domain Randomization and Meta-Reinforcement Learning

Title: Federated Learning in Mobile Networks: A Comprehensive Case Study on Traffic Forecasting

Title: LossAgent: Towards Any Optimization Objectives for Image Processing with LLM Agents

Title: MRGen: Diffusion-based Controllable Data Engine for MRI Segmentation towards Unannotated Modalities

Title: MVUDA: Unsupervised Domain Adaptation for Multi-view Pedestrian Detection

Title: DeepFEA: Deep Learning for Prediction of Transient Finite Element Analysis Solutions

Title: Deep priors for satellite image restoration with accurate uncertainties

Title: Compositional Generative Multiphysics and Multi-component Simulation

Title: Text Change Detection in Multilingual Documents Using Image Comparison

Title: Understanding Memorization in Generative Models via Sharpness in Probability Landscapes

Title: Reducing Tool Hallucination via Reliability Alignment

Title: AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models

Title: Frequency-Adaptive Low-Latency Object Detection Using Events and Frames

Title: On the Lack of Robustness of Binary Function Similarity Systems

Title: Multi-Layer Privacy-Preserving Record Linkage with Clerical Review based on gradual information disclosure

Title: SKIM: Any-bit Quantization Pushing The Limits of Post-Training Quantization

Title: Linear Discriminant Analysis in Credit Scoring: A Transparent Hybrid Model Approach

Title: Instructional Video Generation

Title: AL-QASIDA: Analyzing LLM Quality and Accuracy Systematically in Dialectal Arabic

Title: PANGAEA: A Global and Inclusive Benchmark for Geospatial Foundation Models

Title: A Context-aware Framework for Translation-mediated Conversations

Title: Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Experts

Title: DistB-VNET: Distributed Cluster-based Blockchain Vehicular Ad-Hoc Networks through SDN-NFV for Smart City

Title: DEIM: DETR with Improved Matching for Fast Convergence

Title: Addressing Hallucinations with RAG and NMISS in Italian Healthcare LLM Chatbots

Title: VASCAR: Content-Aware Layout Generation via Visual-Aware Self-Correction

Title: LMDM:Latent Molecular Diffusion Model For 3D Molecule Generation

Title: Quantifying the Limits of Segment Anything Model: Analyzing Challenges in Segmenting Tree-Like and Low-Contrast Structures

Title: Intriguing Properties of Robust Classification

Title: 3D Part Segmentation via Geometric Aggregation of 2D Visual Features

Title: CLINICSUM: Utilizing Language Models for Generating Clinical Summaries from Patient-Doctor Conversations

Title: SCADE: Scalable Command-line Anomaly Detection Engine

Title: Enhancing Whole Slide Image Classification through Supervised Contrastive Domain Adaptation

Title: SynFinTabs: A Dataset of Synthetic Financial Tables for Information and Table Extraction

Title: Arabic Stable LM: Adapting Stable LM 2 1.6B to Arabic

Title: Learnable Infinite Taylor Gaussian for Dynamic View Rendering

Title: Evolutionary Pre-Prompt Optimization for Mathematical Reasoning

Title: SIDA: Social Media Image Deepfake Detection, Localization and Explanation with Large Multimodal Model

Title: SwiftEdit: Lightning Fast Text-Guided Image Editing via One-Step Diffusion

Title: Towards Zero-shot 3D Anomaly Localization

Title: ALMA: Alignment with Minimal Annotation

Title: FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression

Title: The Hyperfitting Phenomenon: Sharpening and Stabilizing LLMs for Open-Ended Text Generation

Title: GRAM: Generalization in Deep RL with a Robust Adaptation Module

Title: Understanding Student Sentiment on Mental Health Support in Colleges Using Large Language Models

Title: Liquid: Language Models are Scalable Multi-modal Generators

Title: Reflective Teacher: Semi-Supervised Multimodal 3D Object Detection in Bird's-Eye-View via Uncertainty Measure

Title: Retrieval-Augmented Machine Translation with Unstructured Knowledge

Title: RMD: A Simple Baseline for More General Human Motion Generation via Training-free Retrieval-Augmented Motion Diffuse

Title: Distributionally Robust Performative Prediction

Title: VMGuard: Reputation-Based Incentive Mechanism for Poisoning Attack Detection in Vehicular Metaverse

Title: ActFusion: a Unified Diffusion Model for Action Segmentation and Anticipation

Title: Machine Theory of Mind for Autonomous Cyber-Defence

Title: A Hitchhiker's Guide to Understanding Performances of Two-Class Classifiers

Title: Discriminative Fine-tuning of LVLMs

Title: Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction

Title: Federated Automated Feature Engineering

Title: FedDUAL: A Dual-Strategy with Adaptive Loss and Dynamic Aggregation for Mitigating Data Heterogeneity in Federated Learning

Title: Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

Title: Grounding Descriptions in Images informs Zero-Shot Visual Recognition

Title: Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Title: Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation

Title: Towards Real-Time Open-Vocabulary Video Instance Segmentation

Title: Learning Artistic Signatures: Symmetry Discovery and Style Transfer

Title: DiCoDe: Diffusion-Compressed Deep Tokens for Autoregressive Video Generation with Language Models

Title: MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation

Title: p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay

Title: Four-Plane Factorized Video Autoencoders

Title: HeatFormer: A Neural Optimizer for Multiview Human Mesh Recovery

Title: Cubify Anything: Scaling Indoor 3D Object Detection

Title: LayerFusion: Harmonized Multi-Layer Text-to-Image Generation with Generative Priors

Title: 4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion

Title: MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos

Title: Turbo3D: Ultra-fast Text-to-3D Generation

Title: PaintScene4D: Consistent 4D Scene Generation from Text Prompts

Title: Stereo Anywhere: Robust Zero-Shot Deep Stereo Matching Even Where Either Stereo or Mono Fail