2024-11-28

Title: UVCG: Leveraging Temporal Consistency for Universal Video Protection

Title: Efficient Self-Improvement in Multimodal Large Language Models: A Model-Level Judge-Free Approach

Title: OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection

Title: Symmetry Strikes Back: From Single-Image Symmetry Detection to 3D Generation

Title: Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis

Title: MTS-UNMixers: Multivariate Time Series Forecasting via Channel-Time Dual Unmixing

Title: MVBoost: Boost 3D Reconstruction with Multi-View Refinement

Title: Efficient Multi-modal Large Language Models via Visual Token Grouping

Title: Network Inversion and Its Applications

Title: Diffusion Autoencoders for Few-shot Image Generation in Hyperbolic Space

Title: DreamCache: Finetuning-Free Lightweight Personalized Image Generation via Feature Caching

Title: Geometric Point Attention Transformer for 3D Shape Reassembly

Title: Self-supervised Monocular Depth and Pose Estimation for Endoscopy with Generative Latent Priors

Title: $H^3$Fusion: Helpful, Harmless, Honest Fusion of Aligned LLMs

Title: NEMO: Can Multimodal LLMs Identify Attribute-Modified Objects?

Title: Scalable iterative pruning of large language and vision models using block coordinate descent

Title: Signs as Tokens: An Autoregressive Multilingual Sign Language Generator

Title: STAR: Synthesis of Tailored Architectures

Title: From memorization to generalization: a theoretical framework for diffusion-based generative models

Title: Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation

Title: CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos

Title: Rapid Distributed Fine-tuning of a Segmentation Model Onboard Satellites

Title: Adaptive Client Selection with Personalization for Communication Efficient Federated Learning

Title: Arabic-Nougat: Fine-Tuning Vision Transformers for Arabic OCR and Markdown Extraction

Title: OracleSage: Towards Unified Visual-Linguistic Understanding of Oracle Bone Scripts through Cross-Modal Knowledge Fusion

Title: Rock the KASBA: Blazingly Fast and Accurate Time Series Clustering

Title: LongKey: Keyphrase Extraction for Long Documents

Title: Generative Image Layer Decomposition with Visual Effects

Title: Distributed Sign Momentum with Local Steps for Training Transformers

Title: Leveraging Large Language Models and Topic Modeling for Toxicity Classification

Title: Multimodal Crash Likelihood Prediction: A Complexity-Infused Approach Integrating Semantic, Contextual, and Driving Features

Title: HOPPR Medical-Grade Platform for Medical Imaging AI

Title: Automating grapevine LAI features estimation with UAV imagery and machine learning

Title: Passive Deepfake Detection Across Multi-modalities: A Comprehensive Survey

Title: DECODE: Domain-aware Continual Domain Expansion for Motion Prediction

Title: Exploring Superpixel Segmentation Methods in the Context of Citizen Science and Deforestation Detection

Title: A Practical Approach to Formal Methods: An Eclipse Integrated Development Environment (IDE) for Security Protocols

Title: Combining Threat Intelligence with IoT Scanning to Predict Cyber Attack

Title: Neural Networks Use Distance Metrics

Title: Stealthy Multi-Task Adversarial Attacks

Title: Spatio-temporal Causal Learning for Streamflow Forecasting

Title: Evaluating Generative AI-Enhanced Content: A Conceptual Framework Using Qualitative, Quantitative, and Mixed-Methods Approaches

Title: MARVEL-40M+: Multi-Level Visual Elaboration for High-Fidelity Text-to-3D Content Creation

Title: ROICtrl: Boosting Instance Control for Visual Generation

Title: Optimization-Free Image Immunization Against Diffusion-Based Editing

Title: Adversarial Training in Low-Label Regimes with Margin-Based Interpolation

Title: Optimized Tradeoffs for Private Prediction with Majority Ensembling

Title: QuaLLM-Health: An Adaptation of an LLM-Based Framework for Quantitative Data Extraction from Online Health Discussions

Title: Improved implicit diffusion model with knowledge distillation to estimate the spatial distribution density of carbon stock in remote sensing imagery

Title: RS-vHeat: Heat Conduction Guided Efficient Remote Sensing Foundation Model

Title: Regularized Multi-LLMs Collaboration for Enhanced Score-based Causal Discovery

Title: VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interaction Format

Title: New Faithfulness-Centric Interpretability Paradigms for Natural Language Processing

Title: DRS: Deep Question Reformulation With Structured Output

Title: Exploring Visual Vulnerabilities via Multi-Loss Adversarial Search for Jailbreaking Vision-Language Models

Title: AI-Driven Smartphone Solution for Digitizing Rapid Diagnostic Test Kits and Enhancing Accessibility for the Visually Impaired

Title: Causal and Local Correlations Based Network for Multivariate Time Series Classification

Title: Manual-PA: Learning 3D Part Assembly from Instruction Diagrams

Title: Can bidirectional encoder become the ultimate winner for downstream applications of foundation models?

Title: Leveraging A New GAN-based Transformer with ECDH Crypto-system for Enhancing Energy Theft Detection in Smart Grid

Title: Privacy-preserving Robotic-based Multi-factor Authentication Scheme for Secure Automated Delivery System

Title: RL for Mitigating Cascading Failures: Targeted Exploration via Sensitivity Factors

Title: ORIS: Online Active Learning Using Reinforcement Learning-based Inclusive Sampling for Robust Streaming Analytics System

Title: Lightweight Gaze Estimation Model Via Fusion Global Information

Title: GLS: Geometry-aware 3D Language Gaussian Splatting

Title: PersonaCraft: Personalized Full-Body Image Synthesis for Multiple Identities from Single References Using 3D-Model-Conditioned Diffusion

Title: Large Scale Evaluation of Deep Learning-based Explainable Solar Flare Forecasting Models with Attribution-based Proximity Analysis

Title: Dual-Level Boost Network for Long-Tail Prohibited Items Detection in X-ray Security Inspection

Title: Training Noise Token Pruning

Title: Comprehensive Kernel Safety in the Spectre Era: Mitigations and Performance Evaluation (Extended Version)

Title: Training and Evaluating Language Models with Template-based Data Generation

Title: Training Data Synthesis with Difficulty Controlled Diffusion Model

Title: When Large Vision-Language Models Meet Person Re-Identification

Title: Spectral-Spatial Transformer with Active Transfer Learning for Hyperspectral Image Classification

Title: A Machine Learning-based Framework towards Assessment of Decision-Makers' Biases

Title: Curriculum Demonstration Selection for In-Context Learning

Title: ModeDreamer: Mode Guiding Score Distillation for Text-to-3D Generation using Reference Image Prompts

Title: Enhancing Visual Reasoning with Autonomous Imagination in Multimodal Large Language Models

Title: Harnessing Large Language Models for Seed Generation in Greybox Fuzzing

Title: MSA-ASR: Efficient Multilingual Speaker Attribution with frozen ASR Models

Title: A survey on cutting-edge relation extraction techniques based on language models

Title: Type-R: Automatically Retouching Typos for Text-to-Image Generation

Title: SentiXRL: An advanced large language Model Framework for Multilingual Fine-Grained Emotion Classification in Complex Text Environment

Title: RPEE-HEADS: A Novel Benchmark for Pedestrian Head Detection in Crowd Videos

Title: KAN See Your Face

Title: PDZSeg: Adapting the Foundation Model for Dissection Zone Segmentation with Visual Prompts in Robot-assisted Endoscopic Submucosal Dissection

Title: Machine Unlearning reveals that the Gender-based Violence Victim Condition can be detected from Speech in a Speaker-Agnostic Setting

Title: InputSnatch: Stealing Input in LLM Services via Timing Side-Channel Attacks

Title: Scalable Multi-Objective Reinforcement Learning with Fairness Guarantees using Lorenz Dominance

Title: Semantic Edge Computing and Semantic Communications in 6G Networks: A Unifying Survey and Research Challenges

Title: TimeMarker: A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability

Title: PATHS: A Hierarchical Transformer for Efficient Whole Slide Image Analysis

Title: SharpDepth: Sharpening Metric Depth Predictions Using Diffusion Distillation

Title: Thai Financial Domain Adaptation of THaLLE -- Technical Report

Title: Multimodal Integration of Longitudinal Noninvasive Diagnostics for Survival Prediction in Immunotherapy Using Deep Learning

Title: TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution

Title: Hidden Data Privacy Breaches in Federated Learning

Title: Grid-augumented vision: A simple yet effective approach for enhanced spatial understanding in multi-modal agents

Title: Visual Adversarial Attack on Vision-Language Models for Autonomous Driving

Title: Neutralizing Backdoors through Information Conflicts for Large Language Models

Title: MotionCharacter: Identity-Preserving and Motion Controllable Human Video Generation

Title: Optimizing Multispectral Object Detection: A Bag of Tricks and Comprehensive Benchmarks

Title: HiFiVFS: High Fidelity Video Face Swapping

Title: Aligning Pre-trained Models for Spoken Language Translation

Title: Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation

Title: InfiniDreamer: Arbitrarily Long Human Motion Generation via Segment Score Distillation

Title: Real-time Video Target Tracking Algorithm Utilizing Convolutional Neural Networks (CNN)

Title: RITA: Automatic Framework for Designing of Resilient IoT Applications

Title: Using Malware Detection Techniques for HPC Application Classification

Title: EventCrab: Harnessing Frame and Point Synergy for Event-based Action Recognition and Beyond

Title: Can LLMs assist with Ambiguity? A Quantitative Evaluation of various Large Language Models on Word Sense Disambiguation

Title: FreqX: What neural networks learn is what network designers say

Title: TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models

Title: ChatRex: Taming Multimodal LLM for Joint Perception and Understanding

Title: GPT as ghostwriter at the White House

Title: Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models

Title: Preserving Deep Representations In One-Shot Pruning: A Hessian-Free Second-Order Optimization Framework

Title: ChatGPT as speechwriter for the French presidents

Title: Topic Modeling and Sentiment Analysis on Japanese Online Media's Coverage of Nuclear Energy

Title: Federated Learning with Uncertainty and Personalization via Efficient Second-order Optimization

Title: Politicians vs ChatGPT. A study of presuppositions in French and Italian political communication

Title: Deep Fourier-embedded Network for Bi-modal Salient Object Detection

Title: Adaptive Blind All-in-One Image Restoration

Title: FastSwitch: Optimizing Context Switching Efficiency in Fairness-aware Large Language Model Serving

Title: Streamlining Prediction in Bayesian Deep Learning

Title: Metric-DST: Mitigating Selection Bias Through Diversity-Guided Semi-Supervised Metric Learning

Title: Is my Meeting Summary Good? Estimating Quality with a Multi-LLM Evaluator

Title: Continuous Autoregressive Models with Noise Augmentation Avoid Error Accumulation

Title: Advancements in Myocardial Infarction Detection and Classification Using Wearable Devices: A Comprehensive Review

Title: Synthetic ECG Generation for Data Augmentation and Transfer Learning in Arrhythmia Classification

Title: Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding

Title: Weakly Supervised Framework Considering Multi-temporal Information for Large-scale Cropland Mapping with Satellite Imagery

Title: Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTS

Title: SoK: Watermarking for AI-Generated Content

Title: GATE OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation

Title: LLM-ABBA: Understand time series via symbolic approximation

Title: Enhancing weed detection performance by means of GenAI-based image augmentation

Title: Emergence of Self-Identity in AI: A Mathematical Framework and Empirical Study with Generative Large Language Models

Title: AdaVLN: Towards Visual Language Navigation in Continuous Indoor Environments with Moving Humans

Title: FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion

Title: Retrofitting (Large) Language Models with Dynamic Tokenization

Title: Challenges in Adapting Multilingual LLMs to Low-Resource Languages using LoRA PEFT Tuning

Title: Exploring Depth Information for Detecting Manipulated Face Videos

Title: Automated Literature Review Using NLP Techniques and LLM-Based Retrieval-Augmented Generation

Title: Hierarchical Information Flow for Generalized Efficient Image Restoration

Title: Task Arithmetic Through The Lens Of One-Shot Federated Learning

Title: Robust Offline Reinforcement Learning with Linearly Structured $f$-Divergence Regularization

Title: CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models

Title: Diffusion Self-Distillation for Zero-Shot Customized Image Generation

Title: Leveraging Semi-Supervised Learning to Enhance Data Mining for Image Classification under Limited Labeled Data

Title: Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation

Title: GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data