2024-12-04

Title: Explainable Artificial Intelligence for Medical Applications: A Review

Title: Data Augmentation through Background Removal for Apple Leaf Disease Classification Using the MobileNetV2 Model

Title: Composition of Experts: A Modular Compound AI System Leveraging Large Language Models

Title: Planar Gaussian Splatting

Title: Global Average Feature Augmentation for Robust Semantic Segmentation with Transformers

Title: Enhancing Crop Segmentation in Satellite Image Time Series with Transformer Networks

Title: A Novel Generative Multi-Task Representation Learning Approach for Predicting Postoperative Complications in Cardiac Surgery Patients

Title: The use of large language models to enhance cancer clinical trial educational materials

Title: Enhancing Deep Learning Model Robustness through Metamorphic Re-Training

Title: FGATT: A Robust Framework for Wireless Data Imputation Using Fuzzy Graph Attention Networks and Transformer Encoders

Title: Smart Parking with Pixel-Wise ROI Selection for Vehicle Detection Using YOLOv8, YOLOv9, YOLOv10, and YOLOv11

Title: ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions

Title: Generalized EXTRA stochastic gradient Langevin dynamics

Title: Unveiling Interpretability in Self-Supervised Speech Representations for Parkinson's Diagnosis

Title: Explore Reinforced: Equilibrium Approximation with Reinforcement Learning

Title: NitroFusion: High-Fidelity Single-Step Diffusion through Dynamic Adversarial Training

Title: Multi-View 3D Reconstruction using Knowledge Distillation

Title: Predicting the Impact of Scope Changes on Project Cost and Schedule Using Machine Learning Techniques

Title: Impact of Data Snooping on Deep Learning Models for Locating Vulnerabilities in Lifted Code

Title: Comparative Analysis of Multi-Agent Reinforcement Learning Policies for Crop Planning Decision Support

Title: BN-AuthProf: Benchmarking Machine Learning for Bangla Author Profiling on Social Media Texts

Title: CLERF: Contrastive LEaRning for Full Range Head Pose Estimation

Title: Performance Comparison of Deep Learning Techniques in Naira Classification

Title: Topology-Preserving Image Segmentation with Spatial-Aware Persistent Feature Matching

Title: Let's Think Var-by-Var: Large Language Models Enable Ad Hoc Probabilistic Reasoning

Title: Comparative Analysis of Black-Box and White-Box Machine Learning Model in Phishing Detection

Title: Offline Stochastic Optimization of Black-Box Objective Functions

Title: Crash Severity Risk Modeling Strategies under Data Imbalance

Title: AccDiffusion v2: Towards More Accurate Higher-Resolution Diffusion Extrapolation

Title: Explainable and Interpretable Multimodal Large Language Models: A Comprehensive Survey

Title: Retrofitting XoM for Stripped Binaries without Embedded Data Relocation

Title: OmniCreator: Self-Supervised Unified Generation with Universal Editing

Title: Streamlining Video Analysis for Efficient Violence Detection

Title: GSOT3D: Towards Generic 3D Single Object Tracking in the Wild

Title: WSI-LLaVA: A Multimodal Large Language Model for Whole Slide Image

Title: Personalized Multimodal Large Language Models: A Survey

Title: Leveraging Large Language Models for Comparative Literature Summarization with Reflective Incremental Mechanisms

Title: Revisiting the Initial Steps in Adaptive Gradient Descent Optimization

Title: CausalMob: Causal Human Mobility Prediction with LLMs-derived Human Intentions toward Public Events

Title: Jailbreak Defense in a Narrow Domain: Limitations of Existing Methods and a New Transcript-Classifier Approach

Title: Generative Photography: Scene-Consistent Camera Control for Realistic Text-to-Image Synthesis

Title: Underload: Defending against Latency Attacks for Object Detectors on Edge Devices

Title: Deep Learning, Machine Learning, Advancing Big Data Analytics and Management

Title: LayoutVLM: Differentiable Optimization of 3D Layout via Vision-Language Models

Title: Cascaded Multi-Scale Attention for Enhanced Multi-Scale Feature Extraction and Interaction with Low-Resolution Images

Title: Transformer-Metric Loss for CNN-Based Face Recognition

Title: 3D representation in 512-Byte: Variational tokenizer is the key for autoregressive 3D generation

Title: CC-OCR: A Comprehensive and Challenging OCR Benchmark for Evaluating Large Multimodal Models in Literacy

Title: An Automated Data Mining Framework Using Autoencoders for Feature Extraction and Dimensionality Reduction

Title: Recovering implicit physics model under real-world constraints

Title: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs

Title: How to Use Diffusion Priors under Sparse Views?

Title: Learning from Concealed Labels

Title: Blockchain-Enabled Device-Enhanced Multi-Access Edge Computing in Open Adversarial Environments

Title: CubeFormer: A Simple yet Effective Baseline for Lightweight Image Super-Resolution

Title: Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models

Title: Fast LiDAR Data Generation with Rectified Flows

Title: Vision Transformers for Weakly-Supervised Microorganism Enumeration

Title: Compressing KV Cache for Long-Context LLM Inference with Inter-Layer Attention Similarity

Title: ProbPose: A Probabilistic Approach to 2D Human Pose Estimation

Title: Diffusion Implicit Policy for Unpaired Scene-aware Motion Synthesis

Title: GSGTrack: Gaussian Splatting-Guided Object Pose Tracking from RGB Videos

Title: Sustainable Self-evolution Adversarial Training

Title: MediaSpin: Exploring Media Bias Through Fine-Grained Analysis of News Headlines

Title: PCIM: Learning Pixel Attributions via Pixel-wise Channel Isolation Mixing in High Content Imaging

Title: A Comprehensive Evaluation of Large Language Models on Aspect-Based Sentiment Analysis

Title: GQWformer: A Quantum-based Transformer for Graph Representation Learning

Title: Viewpoint Consistency in 3D Generation via Attention and CLIP Guidance

Title: Learn More by Using Less: Distributed Learning with Energy-Constrained Devices

Title: Enhanced Photovoltaic Power Forecasting: An iTransformer and LSTM-Based Model Integrating Temporal and Covariate Interactions

Title: Noisy Ostracods: A Fine-Grained, Imbalanced Real-World Dataset for Benchmarking Robust Machine Learning and Label Correction Methods

Title: LoCo: Low-Contrast-Enhanced Contrastive Learning for Semi-Supervised Endoscopic Image Segmentation

Title: HumanRig: Learning Automatic Rigging for Humanoid Character in a Large Scale Dataset

Title: Controlling the Latent Diffusion Model for Generative Image Shadow Removal via Residual Generation

Title: Pay Attention to the Robustness of Chinese Minority Language Models! Syllable-level Textual Adversarial Attack on Tibetan Script

Title: GRAND: Graph Reconstruction from potential partial Adjacency and Neighborhood Data

Title: SimuScope: Realistic Endoscopic Synthetic Dataset Generation through Surgical Simulation and Diffusion Models

Title: Amodal Depth Anything: Amodal Depth Estimation in the Wild

Title: Federated Analytics in Practice: Engineering for Privacy, Scalability and Practicality

Title: Multi-Granularity Tibetan Textual Adversarial Attack Method Based on Masked Language Model

Title: UniForm: A Reuse Attention Mechanism Optimized for Efficient Vision Transformers on Edge Devices

Title: CTRAPS: CTAP Client Impersonation and API Confusion on FIDO2

Title: Dual Exposure Stereo for Extended Dynamic Range 3D Imaging

Title: LoRA Diffusion: Zero-Shot LoRA Synthesis for Diffusion Model Personalization

Title: GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing

Title: Trajectory-based Road Autolabeling with Lidar-Camera Fusion in Winter Conditions

Title: TSCheater: Generating High-Quality Tibetan Adversarial Texts via Visual Similarity

Title: Active Negative Loss: A Robust Framework for Learning with Noisy Labels

Title: Who Walks With You Matters: Perceiving Social Interactions with Groups for Pedestrian Trajectory Prediction

Title: RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation

Title: VISTA: A Panoramic View of Neural Representations

Title: GerPS-Compare: Comparing NER methods for legal norm analysis

Title: Gracefully Filtering Backdoor Samples for Generative Large Language Models without Retraining

Title: DP-2Stage: Adapting Language Models as Differentially Private Tabular Data Generators

Title: OODFace: Benchmarking Robustness of Face Recognition under Common Corruptions and Appearance Variations

Title: LLMForecaster: Improving Seasonal Event Forecasts with Unstructured Textual Data

Title: Defending Against Diverse Attacks in Federated Learning Through Consensus-Based Bi-Level Optimization

Title: Automatic State Machine Inference for Binary Protocol Reverse Engineering

Title: Unveiling Concept Attribution in Diffusion Models

Title: Fractional Order Distributed Optimization

Title: Patent-CR: A Dataset for Patent Claim Revision

Title: Semantic Tokens in Retrieval Augmented Generation

Title: SJTU: Spatial judgments in multimodal models towards unified segmentation through coordinate detection

Title: Copy-Move Forgery Detection and Question Answering for Remote Sensing Image

Title: The Efficacy of Transfer-based No-box Attacks on Image Watermarking: A Pragmatic Analysis

Title: Private Linear Regression with Differential Privacy and PAC Privacy

Title: OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation

Title: CEGI: Measuring the trade-off between efficiency and carbon emissions for SLMs and VLMs

Title: Interpretable Company Similarity with Sparse Autoencoders

Title: Wasserstein Markets for Differentially-Private Data

Title: AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Title: Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback

Title: Time-Reversal Provides Unsupervised Feedback to LLMs

Title: Continual Learning of Personalized Generative Face Models with Experience Replay

Title: Sharp-It: A Multi-view to Multi-view Diffusion Model for 3D Synthesis and Manipulation

Title: Liquefaction: Privately Liquefying Blockchain Assets

Title: Robust soybean seed yield estimation using high-throughput ground robot videos

Title: A Bidirectional Long Short Term Memory Approach for Infrastructure Health Monitoring Using On-board Vibration Response

Title: Interpretable Generalized Additive Models for Datasets with Missing Values

Title: Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models

Title: AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction

Title: T-REG: Preference Optimization with Token-Level Reward Regularization

Title: SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance

Title: FoundHand: Large-Scale Domain-Specific Learning for Controllable Hand Image Generation

Title: Diffusion-based Visual Anagram as Multi-task Learning

Title: Motion Prompting: Controlling Video Generation with Motion Trajectories