2024-12-31

Title: Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing

Title: GaLore$+$: Boosting Low-Rank Adaptation for LLMs with Cross-Head Projection

Title: Back To The Future: A Hybrid Transformer-XGBoost Model for Action-oriented Future-proofing Nowcasting

Title: Multi-atlas Ensemble Graph Neural Network Model For Major Depressive Disorder Detection Using Functional MRI Data

Title: RoboSignature: Robust Signature and Watermarking on Network Attacks

Title: Data Poisoning Attacks to Local Differential Privacy Protocols for Graphs

Title: Multi-View Fusion Neural Network for Traffic Demand Prediction

Title: ERPA: Efficient RPA Model Integrating OCR and LLMs for Intelligent Document Processing

Title: Multimodal joint prediction of traffic spatial-temporal data with graph sparse attention mechanism and bidirectional temporal convolutional network

Title: A Review of Latent Representation Models in Neuroimaging

Title: Symbolic Disentangled Representations for Images

Title: Generative Landmarks Guided Eyeglasses Removal 3D Face Reconstruction

Title: Conditional Balance: Improving Multi-Conditioning Trade-Offs in Image Generation

Title: Data-Free Group-Wise Fully Quantized Winograd Convolution via Learnable Scales

Title: Neighbor Does Matter: Density-Aware Contrastive Learning for Medical Semi-supervised Segmentation

Title: Minimax-Optimal Multi-Agent Robust Reinforcement Learning

Title: YOLO-MST: Multiscale deep learning method for infrared small target detection based on super-resolution and YOLO

Title: Leveraging Scene Geometry and Depth Information for Robust Image Deraining

Title: Char-SAM: Turning Segment Anything Model into Scene Text Segmentation Annotator with Character-level Visual Prompts

Title: Not all Views are Created Equal: Analyzing Viewpoint Instabilities in Vision Foundation Models

Title: HADES: Hardware Accelerated Decoding for Efficient Speculation in Large Language Models

Title: Assessing Text Classification Methods for Cyberbullying Detection on Social Media Platforms

Title: Outfox: a Packet Format for a Layered Mixnet

Title: Standard-Deviation-Inspired Regularization for Improving Adversarial Robustness

Title: ErgoChat: a Visual Query System for the Ergonomic Risk Assessment of Construction Workers

Title: DepthMamba with Adaptive Fusion

Title: Bridging Context Gaps: Enhancing Comprehension in Long-Form Social Conversations Through Contextualized Excerpts

Title: MobileNetV2: A lightweight classification model for home-based sleep apnea screening

Title: MAKIMA: Tuning-free Multi-Attribute Open-domain Video Editing via Mask-Guided Attention Modulation

Title: Explainable Semantic Federated Learning Enabled Industrial Edge Network for Fire Surveillance

Title: The Fifth International Verification of Neural Networks Competition (VNN-COMP 2024): Summary and Results

Title: Delayed Random Partial Gradient Averaging for Federated Learning

Title: Caesar: A Low-deviation Compression Approach for Efficient Federated Learning

Title: A Robust Federated Learning Framework for Undependable Devices at Scale

Title: An Ordinary Differential Equation Sampler with Stochastic Start for Diffusion Bridge Models

Title: Discrete Curvature Graph Information Bottleneck

Title: Comprehensive Review of EEG-to-Output Research: Decoding Neural Signals into Images, Videos, and Audio

Title: Learning Adaptive and View-Invariant Vision Transformer with Multi-Teacher Knowledge Distillation for Real-Time UAV Tracking

Title: OneKE: A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System

Title: Adversarial Robustness for Deep Learning-based Wildfire Detection Models

Title: Calibre: Towards Fair and Accurate Personalized Federated Learning with Self-Supervised Learning

Title: A Robust Adversarial Ensemble with Causal (Feature Interaction) Interpretations for Image Classification

Title: STAYKATE: Hybrid In-Context Example Selection Combining Representativeness Sampling and Retrieval-based Approach -- A Case Study on Science Domains

Title: Enhancing Diffusion Models for Inverse Problems with Covariance-Aware Posterior Sampling

Title: GSplatLoc: Ultra-Precise Camera Localization via 3D Gaussian Splatting

Title: "My life is miserable, have to sign 500 autographs everyday": Exposing Humblebragging, the Brags in Disguise

Title: Comparative Analysis of Listwise Reranking with Large Language Models in Limited-Resource Language Contexts

Title: MADiff: Text-Guided Fashion Image Editing with Mask Prediction and Attention-Enhanced Diffusion

Title: VELoRA: A Low-Rank Adaptation Approach for Efficient RGB-Event based Recognition

Title: On the Compositional Generalization of Multimodal LLMs for Medical Imaging

Title: Extract Information from Hybrid Long Documents Leveraging LLMs: A Framework and Dataset

Title: MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing

Title: STNMamba: Mamba-based Spatial-Temporal Normality Learning for Video Anomaly Detection

Title: Enhancing Marine Debris Acoustic Monitoring by Optical Flow-Based Motion Vector Analysis

Title: MAFT: Efficient Model-Agnostic Fairness Testing for Deep Neural Networks via Zero-Order Gradient Search

Title: On the Validity of Traditional Vulnerability Scoring Systems for Adversarial Attacks against LLMs

Title: SyncDiff: Synchronized Motion Diffusion for Multi-Body Human-Object Interaction Synthesis

Title: ST$^3$: Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming

Title: M-MAD: Multidimensional Multi-Agent Debate Framework for Fine-grained Machine Translation Evaluation

Title: Efficient Multi-Agent Collaboration with Tool Use for Online Planning in Complex Table Question Answering

Title: Distilled Transformers with Locally Enhanced Global Representations for Face Forgery Detection

Title: UniRestorer: Universal Image Restoration via Adaptively Estimating Image Degradation at Proper Granularity

Title: Multi-Modality Driven LoRA for Adverse Condition Depth Estimation

Title: StyleAutoEncoder for manipulating image attributes using pre-trained StyleGAN

Title: Real-time Calibration Model for Low-cost Sensor in Fine-grained Time series

Title: Geo-ConvGRU: Geographically Masked Convolutional Gated Recurrent Unit for Bird-Eye View Segmentation

Title: Pushing the Envelope of Low-Bit LLM via Dynamic Error Compensation

Title: Lower bounds on transformers with infinite precision

Title: Federated Unlearning with Gradient Descent and Conflict Mitigation

Title: Injecting Explainability and Lightweight Design into Weakly Supervised Video Anomaly Detection Systems

Title: Towards Visual Grounding: A Survey

Title: Towards Real-Time 2D Mapping: Harnessing Drones, AI, and Computer Vision for Advanced Insights

Title: Generative Regression Based Watch Time Prediction for Video Recommendation: Model and Performance

Title: Building a Rich Dataset to Empower the Persian Question Answering Systems

Title: IMSSA: Deploying modern state-space models on memristive in-memory compute hardware

Title: YAD: Leveraging T5 for Improved Automatic Diacritization of Yorùbá Text

Title: LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning

Title: How To Think About End-To-End Encryption and AI: Training, Processing, Disclosure, and Consent

Title: Recommender Engine Driven Client Selection in Federated Brain Tumor Segmentation

Title: ComparisonQA: Evaluating Factuality Robustness of LLMs Through Knowledge Frequency Control and Uncertainty

Title: Election of Collaborators via Reinforcement Learning for Federated Brain Tumor Segmentation

Title: An Anomaly Detection System Based on Generative Classifiers for Controller Area Network

Title: Scoring with Large Language Models: A Study on Measuring Empathy of Responses in Dialogues

Title: TeLU Activation Function for Fast and Stable Deep Learning

Title: Transformer-Based Contrastive Meta-Learning For Low-Resource Generalizable Activity Recognition

Title: An analytic theory of creativity in convolutional diffusion models

Title: An experimental study on fairness-aware machine learning for credit scoring problem

Title: EXAdam: The Power of Adaptive Cross-Moments

Title: Understanding the Impact of Confidence in Retrieval Augmented Generation: A Case Study in the Medical Domain

Title: Motion Transfer-Driven intra-class data augmentation for Finger Vein Recognition

Title: Dual-Level Precision Edges Guided Multi-View Stereo with Accurate Planarization

Title: Asynchronous Federated Clustering with Unknown Number of Clusters

Title: Deep Learning in Image Classification: Evaluating VGG19's Performance on Complex Visual Data

Title: HindiLLM: Large Language Model for Hindi

Title: Differential Evolution Integrated Hybrid Deep Learning Model for Object Detection in Pre-made Dishes

Title: LLM2: Let Large Language Models Harness System 2 Reasoning

Title: FairDiffusion: Enhancing Equity in Latent Diffusion Models via Fair Bayesian Perturbation

Title: Impact of Data Distribution on Fairness Guarantees in Equitable Deep Learning

Title: Tri-Ergon: Fine-grained Video-to-Audio Generation with Multi-modal Conditions and LUFS Control

Title: Protégé: Learn and Generate Basic Makeup Styles with Generative Adversarial Networks (GANs)

Title: Natural Language Fine-Tuning

Title: Defending Multimodal Backdoored Models by Repulsive Visual Prompt Tuning

Title: Open-Sora: Democratizing Efficient Video Production for All

Title: A Multidisciplinary Approach to Telegram Data Analysis

Title: Multi-Objective Large Language Model Unlearning

Title: EraseAnything: Enabling Concept Erasure in Rectified Flow Transformers

Title: Comparative Performance of Advanced NLP Models and LLMs in Multilingual Geo-Entity Detection

Title: Diff4MMLiTS: Advanced Multimodal Liver Tumor Segmentation via Diffusion-Based Image Synthesis and Alignment

Title: Bringing Objects to Life: 4D generation from 3D objects

Title: Integrating Natural Language Processing Techniques of Text Mining Into Financial System: Applications and Limitations

Title: Image Augmentation Agent for Weakly Supervised Semantic Segmentation

Title: Enhancing Entertainment Translation for Indian Languages using Adaptive Context, Style and LLMs

Title: Sub-optimal Learning in Meta-Classifier Attacks: A Study of Membership Inference on Differentially Private Location Aggregates

Title: Single-image reflection removal via self-supervised diffusion models

Title: Utilizing Multimodal Data for Edge Case Robust Call-sign Recognition and Understanding

Title: JADE: Joint-aware Latent Diffusion for 3D Human Generative Modeling

Title: Cut the Deadwood Out: Post-Training Model Purification with Selective Module Substitution

Title: MR-Occ: Efficient Camera-LiDAR 3D Semantic Occupancy Prediction Using Hierarchical Multi-Resolution Voxel Representation

Title: Multimodal Variational Autoencoder: a Barycentric View

Title: A Multiparty Homomorphic Encryption Approach to Confidential Federated Kaplan Meier Survival Analysis

Title: ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding

Title: DPBridge: Latent Diffusion Bridge for Dense Prediction

Title: Dive into Time-Series Anomaly Detection: A Decade Review

Title: Goal-Conditioned Data Augmentation for Offline Reinforcement Learning

Title: Attacks on the neural network and defense methods

Title: KVC-onGoing: Keystroke Verification Challenge

Title: SAFE-MEME: Structured Reasoning Framework for Robust Hate Speech Detection in Memes

Title: Counterfactual Samples Constructing and Training for Commonsense Statements Estimation

Title: Towards Neural No-Resource Language Translation: A Comparative Evaluation of Approaches

Title: Controlling Out-of-Domain Gaps in LLMs for Genre Classification and Generated Text Detection

Title: Zero-Shot Image Restoration Using Few-Step Guidance of Consistency Models (and Beyond)

Title: MATEY: multiscale adaptive foundation models for spatiotemporal physical systems

Title: NLP-based Regulatory Compliance -- Using GPT 4.0 to Decode Regulatory Documents

Title: Privacy-Preserving Identity and Access Management in Multiple Cloud Environments: Models, Issues, and Solutions

Title: Do Current Video LLMs Have Strong OCR Abilities? A Preliminary Study

Title: FreqMixFormerV2: Lightweight Frequency-aware Mixed Transformer for Human Skeleton Action Recognition

Title: HALLUCINOGEN: A Benchmark for Evaluating Object Hallucination in Large Visual-Language Models

Title: NetFlowGen: Leveraging Generative Pre-training for Network Traffic Dynamics

Title: Knowledge Editing for Large Language Model with Knowledge Neuronal Ensemble

Title: SafeSynthDP: Leveraging Large Language Models for Privacy-Preserving Synthetic Data Generation Using Differential Privacy

Title: Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis

Title: Overcoming Class Imbalance: Unified GNN Learning with Structural and Semantic Connectivity Representations

Title: Diffgrasp: Whole-Body Grasping Synthesis Guided by Object Motion Using a Diffusion Model

Title: Enhancing Table Recognition with Vision LLMs: A Benchmark and Neighbor-Guided Toolchain Reasoner

Title: Prototypical Distillation and Debiased Tuning for Black-box Unsupervised Domain Adaptation

Title: Align Attention Heads Before Merging Them: An Effective Way for Converting MHA to GQA

Title: Learning to Rank Pre-trained Vision-Language Models for Downstream Tasks

Title: HFI: A unified framework for training-free detection and implicit watermarking of latent diffusion model generated images

Title: M$^3$oralBench: A MultiModal Moral Benchmark for LVLMs

Title: Dialogue Director: Bridging the Gap in Dialogue Visualization for Multimodal Storytelling

Title: AverageLinear: Enhance Long-Term Time series forecasting with simple averaging

Title: Towards nation-wide analytical healthcare infrastructures: A privacy-preserving augmented knee rehabilitation case study

Title: UniRS: Unifying Multi-temporal Remote Sensing Tasks through Vision Language Models

Title: Advancing Parkinson's Disease Progression Prediction: Comparing Long Short-Term Memory Networks and Kolmogorov-Arnold Networks

Title: Solar Filaments Detection using Active Contours Without Edges

Title: Attributing Culture-Conditioned Generations to Pretraining Corpora

Title: Sample Correlation for Fingerprinting Deep Face Recognition

Title: Accelerating Energy-Efficient Federated Learning in Cell-Free Networks with Adaptive Quantization

Title: SecBench: A Comprehensive Multi-Dimensional Benchmarking Dataset for LLMs in Cybersecurity

Title: A Tale of Two Imperatives: Privacy and Explainability

Title: VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control

Title: Frequency-aware Event Cloud Network

Title: Two Heads Are Better Than One: Averaging along Fine-Tuning to Improve Targeted Transferability

Title: Length-Aware DETR for Robust Moment Retrieval

Title: Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment

Title: Dual-Space Augmented Intrinsic-LoRA for Wind Turbine Segmentation

Title: Are LLMs Really Not Knowledgable? Mining the Submerged Knowledge in LLMs' Memory

Title: Enhancing Annotated Bibliography Generation with LLM Ensembles

Title: SoftPatch+: Fully Unsupervised Anomaly Classification and Segmentation

Title: Attention Is All You Need For Mixture-of-Depths Routing

Title: LiDAR-Camera Fusion for Video Panoptic Segmentation without Video Training

Title: DoTA: Weight-Decomposed Tensor Adaptation for Large Language Models

Title: Towards Compatible Fine-tuning for Vision-Language Model Updates

Title: DDIM sampling for Generative AIBIM, a faster intelligent structural design framework

Title: ILDiff: Generate Transparent Animated Stickers by Implicit Layout Distillation

Title: WalkVLM: Aid Visually Impaired People Walking by Vision Language Model

Title: Low-Light Image Enhancement via Generative Perceptual Priors

Title: HisynSeg: Weakly-Supervised Histopathological Image Segmentation via Image-Mixing Synthesis and Consistency Regularization

Title: Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering

Title: Generalizing in Net-Zero Microgrids: A Study with Federated PPO and TRPO

Title: GASLITEing the Retrieval: Exploring Vulnerabilities in Dense Embedding-based Search

Title: Conservation-informed Graph Learning for Spatiotemporal Dynamics Prediction

Title: AlignAb: Pareto-Optimal Energy Alignment for Designing Nature-Like Antibodies

Title: RobustBlack: Challenging Black-Box Adversarial Attacks on State-of-the-Art Defenses

Title: Efficiently Serving LLM Reasoning Programs with Certaindex

Title: KARPA: A Training-free Method of Adapting Knowledge Graph as References for Large Language Model's Reasoning Path Aggregation

Title: Plug-and-Play Training Framework for Preference Optimization

Title: Verbosity-Aware Rationale Reduction: Effective Reduction of Redundant Rationale via Principled Criteria

Title: Towards Identity-Aware Cross-Modal Retrieval: a Dataset and a Baseline

Title: MapQaTor: A System for Efficient Annotation of Map Query Datasets

Title: Text Classification: Neural Networks VS Machine Learning Models VS Pre-trained Models

Title: Improving Location-based Thermal Emission Side-Channel Analysis Using Iterative Transfer Learning

Title: GePBench: Evaluating Fundamental Geometric Perception for Multimodal Large Language Models

Title: Visual Style Prompt Learning Using Diffusion Models for Blind Face Restoration

Title: E2EDiff: Direct Mapping from Noise to Data for Enhanced Diffusion Models

Title: Learning Epidemiological Dynamics via the Finite Expression Method

Title: Toward Intelligent and Secure Cloud: Large Language Model Empowered Proactive Defense

Title: Towards Effective Discrimination Testing for Generative AI

Title: BridgePure: Revealing the Fragility of Black-box Data Protection

Title: Varformer: Adapting VAR's Generative Prior for Image Restoration

Title: Efficient Multi-Task Inferencing with a Shared Backbone and Lightweight Task-Specific Adapters for Automatic Scoring

Title: Edicho: Consistent Image Editing in the Wild

Title: Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model

Title: On the Generalizability of Machine Learning-based Ransomware Detection in Block Storage

Title: Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation

Title: ExpShield: Safeguarding Web Text from Unauthorized Crawling and Language Modeling Exploitation

Title: Facilitating large language model Russian adaptation with Learned Embedding Propagation

Title: PyG-SSL: A Graph Self-Supervised Learning Toolkit

Title: Unified dimensionality reduction techniques in chronic liver disease detection

Title: A Large-Scale Study on Video Action Dataset Condensation

Title: PERSE: Personalized 3D Generative Avatars from A Single Portrait