2024-12-30

Title: Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Title: Investigating the Feasibility of Mitigating Potential Copyright Infringement via Large Language Model Unlearning

Title: Why Do Large Language Models (LLMs) Struggle to Count Letters?

Title: KRAIL: A Knowledge-Driven Framework for Base Human Reliability Analysis Integrating IDHEAS and Large Language Models

Title: Edge-AI for Agriculture: Lightweight Vision Models for Disease Detection in Resource-Limited Settings

Title: ZenSVI: An Open-Source Software for the Integrated Acquisition, Processing and Analysis of Street View Imagery Towards Scalable Urban Science

Title: DynaGRAG: Improving Language Understanding and Generation through Dynamic Subgraph Representation in Graph Retrieval-Augmented Generation

Title: Dissecting CLIP: Decomposition with a Schur Complement-based Approach

Title: Comparing analytic and data-driven approaches to parameter identifiability: A power systems case study

Title: From Hallucinations to Facts: Enhancing Language Models with Curated Knowledge Graphs

Title: TAB: Transformer Attention Bottlenecks enable User Intervention and Debugging in Vision-Language Models

Title: Video Is Worth a Thousand Images: Exploring the Latest Trends in Long Video Generation

Title: AgreeMate: Teaching LLMs to Haggle

Title: Diverse and Effective Red Teaming with Auto-generated Rewards and Multi-step Reinforcement Learning

Title: CypherBench: Towards Precise Retrieval over Full-scale Modern Knowledge Graphs in the LLM Era

Title: SurvAttack: Black-Box Attack On Survival Models through Ontology-Informed EHR Perturbation

Title: Design and Evaluation of Privacy-Preserving Protocols for Agent-Facilitated Mobile Money Services in Kenya

Title: Evaluating the Adversarial Robustness of Detection Transformers

Title: Using Large Language Models for Automated Grading of Student Writing about Science

Title: Optimizing Large Language Models with an Enhanced LoRA Fine-Tuning Algorithm for Efficiency and Robustness in NLP Tasks

Title: Elucidating Flow Matching ODE Dynamics with respect to Data Geometries

Title: HELPNet: Hierarchical Perturbations Consistency and Entropy-guided Ensemble for Scribble Supervised Medical Image Segmentation

Title: Successes and Limitations of Object-centric Models at Compositional Generalisation

Title: Hierarchical Multi-Graphs Learning for Robust Group Re-Identification

Title: ObitoNet: Multimodal High-Resolution Point Cloud Reconstruction

Title: Unified Local and Global Attention Interaction Modeling for Vision Transformers

Title: Torque-Aware Momentum

Title: Protective Perturbations against Unauthorized Data Usage in Diffusion-based Image Generation

Title: DRDM: A Disentangled Representations Diffusion Model for Synthesizing Realistic Person Images

Title: Ister: Inverted Seasonal-Trend Decomposition Transformer for Explainable Multivariate Time Series Forecasting

Title: Improving Generated and Retrieved Knowledge Combination Through Zero-shot Generation

Title: DebiasDiff: Debiasing Text-to-image Diffusion Models with Self-discovering Latent Attribute Directions

Title: DCIS: Efficient Length Extrapolation of LLMs via Divide-and-Conquer Scaling Factor Search

Title: Distortion-Aware Adversarial Attacks on Bounding Boxes of Object Detectors

Title: CausalTAD: Causal Implicit Generative Model for Debiased Online Trajectory Anomaly Detection

Title: RapGuard: Safeguarding Multimodal Large Language Models via Rationale-aware Defensive Prompting

Title: Cryptanalysis of authentication and key establishment protocol in Mobile Edge Computing Environment

Title: Federated Learning with Partially Labeled Data: A Conditional Distillation Approach

Title: DiFiC: Your Diffusion Model Holds the Secret to Fine-Grained Clustering

Title: Improving Integrated Gradient-based Transferable Adversarial Examples by Refining the Integration Path

Title: Enhancing Federated Graph Learning via Adaptive Fusion of Structural and Node Characteristics

Title: SWAG: Long-term Surgical Workflow Prediction with Generative-based Anticipation

Title: Few-shot Metric Domain Adaptation: Practical Learning Strategies for an Automated Plant Disease Diagnosis

Title: Whose Morality Do They Speak? Unraveling Cultural Bias in Multilingual Language Models

Title: Cross-PCR: A Robust Cross-Source Point Cloud Registration Framework

Title: IUST_PersonReId: A New Domain in Person Re-Identification Datasets

Title: Adversarial Training for Graph Neural Networks via Graph Subspace Energy Optimization

Title: FedCFA: Alleviating Simpson's Paradox in Model Aggregation with Counterfactual Federated Learning

Title: Accelerating Diffusion Transformers with Dual Feature Caching

Title: Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model

Title: An Attentive Dual-Encoder Framework Leveraging Multimodal Visual and Semantic Information for Automatic OSAHS Diagnosis

Title: Generative Face Parsing Map Guided 3D Face Reconstruction Under Occluded Scenes

Title: HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Title: Exemplar-condensed Federated Class-incremental Learning

Title: UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation

Title: Malware Classification using a Hybrid Hidden Markov Model-Convolutional Neural Network

Title: Dovetail: A CPU/GPU Heterogeneous Speculative Decoding for LLM inference

Title: Single Trajectory Distillation for Accelerating Image and Video Style Transfer

Title: MedHallBench: A New Benchmark for Assessing Hallucination in Medical Large Language Models

Title: TopoBDA: Towards Bezier Deformable Attention for Road Topology Understanding

Title: Bridging Interpretability and Robustness Using LIME-Guided Model Refinement

Title: ModelGrow: Continual Text-to-Video Pre-training with Model Expansion and Language Understanding Enhancement

Title: Adopting Trustworthy AI for Sleep Disorder Prediction: Deep Time Series Analysis with Temporal Attention Mechanism and Counterfactual Explanations

Title: Injecting Bias into Text Classification Models using Backdoor Attacks

Title: CGCOD: Class-Guided Camouflaged Object Detection

Title: HAND: Hierarchical Attention Network for Multi-Scale Handwritten Document Recognition and Layout Analysis

Title: MTCAE-DFER: Multi-Task Cascaded Autoencoder for Dynamic Facial Expression Recognition

Title: Detection and classification of DDoS flooding attacks by machine learning method

Title: Geospatial Data Fusion: Combining Lidar, SAR, and Optical Imagery with AI for Enhanced Urban Mapping

Title: MiTREE: Multi-input Transformer Ecoregion Encoder for Species Distribution Modelling

Title: MGAN-CRCM: A Novel Multiple Generative Adversarial Network and Coarse-Refinement Based Cognizant Method for Image Inpainting

Title: FACEMUG: A Multimodal Generative and Fusion Framework for Local Facial Editing

Title: Imperceptible Adversarial Attacks on Point Clouds Guided by Point-to-Surface Field

Title: Let the Rule Speak: Enhancing In-context Learning Debiasing with Interpretability

Title: Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation

Title: CL-attack: Textual Backdoor Attacks via Cross-Lingual Triggers

Title: Revealing the Self: Brainwave-Based Human Trait Identification

Title: SpectralKD: Understanding and Optimizing Vision Transformer Distillation through Spectral Analysis

Title: DAPoinTr: Domain Adaptive Point Transformer for Point Cloud Completion

Title: Effective and secure federated online learning to rank

Title: Advancing LLM detection in the ALTA 2024 Shared Task: Techniques and Analysis

Title: Mask Factory: Towards High-quality Synthetic Data Generation for Dichotomous Image Segmentation

Title: Investigating the Temporal Dynamics of Cyber Threat Intelligence

Title: Integrating Artificial Open Generative Artificial Intelligence into Software Supply Chain Security

Title: Humans as a Calibration Pattern: Dynamic 3D Scene Reconstruction from Unsynchronized and Uncalibrated Videos

Title: Reconstruction Target Matters in Masked Image Modeling for Cross-Domain Few-Shot Learning

Title: "I've Heard of You!": Generate Spoken Named Entity Recognition Data for Unseen Entities

Title: Improving Generative Pre-Training: An In-depth Study of Masked Image Modeling and Denoising Models

Title: Spectral Enhancement and Pseudo-Anchor Guidance for Infrared-Visible Person Re-Identification

Title: SketchFill: Sketch-Guided Code Generation for Imputing Derived Missing Values

Title: Discrete vs. Continuous Trade-offs for Generative Models

Title: Evaluating Self-Supervised Learning in Medical Imaging: A Benchmark for Robustness, Generalizability, and Multi-Domain Impact

Title: Advanced Knowledge Transfer: Refined Feature Distillation for Zero-Shot Quantization in Edge Computing

Title: PlanLLM: Video Procedure Planning with Refinable Large Language Models

Title: SILC-EFSA: Self-aware In-context Learning Correction for Entity-level Financial Sentiment Analysis

Title: CLIP-GS: Unifying Vision-Language Representation with 3D Gaussian Splatting

Title: Impact of color and mixing proportion of synthetic point clouds on semantic segmentation

Title: AskChart: Universal Chart Understanding through Textual Enhancement

Title: Generating Editable Head Avatars with 3D Gaussian GANs

Title: Dual Channel Multi-Attention in ViT for Biometric Authentication using Forehead Subcutaneous Vein Pattern and Periocular Pattern

Title: GFG -- Gender-Fair Generation: A CALAMITA Challenge

Title: Mask Approximation Net: Merging Feature Extraction and Distribution Learning for Remote Sensing Change Captioning

Title: An End-to-End Depth-Based Pipeline for Selfie Image Rectification

Title: Context-Aware Deep Learning for Multi Modal Depression Detection

Title: Large Language Models Meet Graph Neural Networks: A Perspective of Graph Mining

Title: Applying the maximum entropy principle to multi-species neural networks improves species distribution models

Title: Transformer-Based Wireless Capsule Endoscopy Bleeding Tissue Detection and Classification

Title: Learning Cross-Domain Representations for Transferable Drug Perturbations on Single-Cell Transcriptional Responses

Title: Virtual Nodes Can Help: Tackling Distribution Shifts in Federated Graph Learning

Title: SeaMo: A Multi-Seasonal and Multimodal Remote Sensing Foundation Model

Title: Latenrgy: Model Agnostic Latency and Energy Consumption Prediction for Binary Classifiers

Title: MEDEC: A Benchmark for Medical Error Detection and Correction in Clinical Notes

Title: PearSAN: A Machine Learning Method for Inverse Design using Pearson Correlated Surrogate Annealing

Title: Time Series Foundational Models: Their Role in Anomaly Detection and Prediction

Title: RAG with Differential Privacy

Title: When SAM2 Meets Video Shadow and Mirror Detection

Title: Manga Generation via Layout-controllable Diffusion

Title: Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries

Title: Protecting Cryptographic Libraries against Side-Channel and Code-Reuse Attacks

Title: Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment

Title: CALICO: Part-Focused Semantic Co-Segmentation with Large Vision-Language Models

Title: Semi-Supervised Learning from Small Annotated Data and Large Unlabeled Data for Fine-grained PICO Entity Recognition

Title: On the Expressiveness and Length Generalization of Selective State-Space Models on Regular Languages

Title: Federated Hybrid Training and Self-Adversarial Distillation: Towards Robust Edge Networks

Title: Dynamic Skill Adaptation for Large Language Models

Title: An In-Depth Analysis of Adversarial Discriminative Domain Adaptation for Digit Classification

Title: An Engorgio Prompt Makes Large Language Model Babble on

Title: MLLM-SUL: Multimodal Large Language Model for Semantic Scene Understanding and Localization in Traffic Scenarios

Title: MINIMA: Modality Invariant Image Matching

Title: Multi-scale Latent Point Consistency Models for 3D Shape Generation

Title: Gx2Mol: De Novo Generation of Hit-like Molecules from Gene Expression Profiles via Deep Learning

Title: Revisiting PCA for time series reduction in temporal dimension

Title: Temporal Context Consistency Above All: Enhancing Long-Term Anticipation by Learning and Enforcing Temporal Constraints

Title: Residual Feature-Reutilization Inception Network for Image Classification

Title: Feature Alignment-Based Knowledge Distillation for Efficient Compression of Large Language Models

Title: NijiGAN: Transform What You See into Anime with Contrastive Semi-Supervised Learning and Neural Ordinary Differential Equations

Title: DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes

Title: MNet-SAt: A Multiscale Network with Spatial-enhanced Attention for Segmentation of Polyps in Colonoscopy

Title: Optimizing Helmet Detection with Hybrid YOLO Pipelines: A Detailed Analysis

Title: Generative Adversarial Network on Motion-Blur Image Restoration

Title: Learning Radiance Fields from a Single Snapshot Compressive Image

Title: RAIN: Real-time Animation of Infinite Video Stream

Title: Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation

Title: Multi-P$^2$A: A Multi-perspective Benchmark on Privacy Assessment for Large Vision-Language Models

Title: MBQ: Modality-Balanced Quantization for Large Vision-Language Models

Title: Safeguard Fine-Tuned LLMs Through Pre- and Post-Tuning Model Merging

Title: Confidence v.s. Critique: A Decomposition of Self-Correction Capability for LLMs

Title: Estimation of System Parameters Including Repeated Cross-Sectional Data through Emulator-Informed Deep Generative Model

Title: Is Your Text-to-Image Model Robust to Caption Noise?

Title: P3S-Diffusion:A Selective Subject-driven Generation Framework via Point Supervision

Title: StyleRWKV: High-Quality and High-Efficiency Style Transfer with RWKV-like Architecture

Title: Diverse Rare Sample Generation with Pretrained GANs

Title: TARGA: Targeted Synthetic Data Generation for Practical Reasoning over Structured Data

Title: Unprejudiced Training Auxiliary Tasks Makes Primary Better: A Multi-Task Learning Perspective

Title: Structural Similarity in Deep Features: Image Quality Assessment Robust to Geometrically Disparate Reference

Title: Graph-attention-based Casual Discovery with Trust Region-navigated Clipping Policy Optimization

Title: A Comparative Study of Machine Unlearning Techniques for Image and Text Classification Models

Title: DAS3R: Dynamics-Aware Gaussian Splatting for Static Scene Reconstruction

Title: Ultralight Signal Classification Model for Automatic Modulation Recognition

Title: ViDTA: Enhanced Drug-Target Affinity Prediction via Virtual Graph Nodes and Attention-based Feature Fusion

Title: Let Watermarks Speak: A Robust and Unforgeable Watermark for Language Models

Title: Gradient Weight-normalized Low-rank Projection for Efficient LLM Training

Title: RecConv: Efficient Recursive Convolutions for Multi-Frequency Representations

Title: VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models

Title: Chimera: A Block-Based Neural Architecture Search Framework for Event-Based Object Detection

Title: Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP

Title: FreStega: A Plug-and-Play Method for Boosting Imperceptibility and Capacity in Generative Linguistic Steganography for Real-World Scenarios

Title: Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical Diagnosis

Title: CAD-GPT: Synthesising CAD Construction Sequence with Spatial Reasoning-Enhanced Multimodal LLMs

Title: Optimizing Local-Global Dependencies for Accurate 3D Human Pose Estimation

Title: A Large-scale Interpretable Multi-modality Benchmark for Facial Image Forgery Localization

Title: From Elements to Design: A Layered Approach for Automatic Graphic Design Composition

Title: Generative Pretrained Embedding and Hierarchical Irregular Time Series Representation for Daily Living Activity Recognition

Title: Enhancing Adversarial Robustness of Deep Neural Networks Through Supervised Contrastive Learning

Title: Generative Video Propagation

Title: Fortran2CPP: Automating Fortran-to-C++ Migration using LLMs via Multi-Turn Dialogue and Dual-Agent Integration

Title: Tensor Network Estimation of Distribution Algorithms

Title: InfAlign: Inference-aware language model alignment

Title: MVTamperBench: Evaluating Robustness of Vision-Language Models