2024-06-28

Title: VideoQA-SC: Adaptive Semantic Communication for Video Question Answering

Title: TexPainter: Generative Mesh Texturing with Multi-view Consistency

Title: Fully Exploiting Every Real Sample: SuperPixel Sample Gradient Model Stealing

Title: Refining 3D Point Cloud Normal Estimation via Sample Selection

Title: Generative AI Empowered LiDAR Point Cloud Generation with Multimodal Transformer

Title: A Set-based Approach for Feature Extraction of 3D CAD Models

Title: Visual Analysis of Prediction Uncertainty in Neural Networks for Deep Image Synthesis

Title: Application of Multimodal Fusion Deep Learning Model in Disease Recognition

Title: Planted: a dataset for planted forest identification from multi-satellite time series

Title: BAISeg: Boundary Assisted Weakly Supervised Instance Segmentation

Title: Memorized Images in Diffusion Models share a Subspace that can be Located and Deleted

Title: A Diagnostic Model for Acute Lymphoblastic Leukemia Using Metaheuristics and Deep Learning Methods

Title: FLOW: Fusing and Shuffling Global and Local Views for Cross-User Human Activity Recognition with IMUs

Title: UltraCortex: Submillimeter Ultra-High Field 9.4 T1 Brain MR Image Collection and Manual Cortical Segmentations

Title: Research on Driver Facial Fatigue Detection Based on Yolov8 Model

Title: Shedding Light on Large Generative Networks: Estimating Epistemic Uncertainty in Diffusion Models

Title: Canonical Consolidation Fields: Reconstructing Dynamic Shapes from Point Clouds

Title: Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT

Title: Varying Manifolds in Diffusion: From Time-varying Geometries to Visual Saliency

Title: Composition Vision-Language Understanding via Segment and Depth Anything Model

Title: Vox-UDA: Voxel-wise Unsupervised Domain Adaptation for Cryo-Electron Subtomogram Segmentation with Denoised Pseudo Labeling

Title: Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs

Title: Evaluating Copyright Takedown Methods for Language Models

Title: RouteLLM: Learning to Route LLMs with Preference Data

Title: A Zero Auxiliary Knowledge Membership Inference Attack on Aggregate Location Data

Title: Understand What LLM Needs: Dual Preference Alignment for Retrieval-Augmented Generation

Title: Few-shot Personalization of LLMs with Mis-aligned Responses

Title: CSI4Free: GAN-Augmented mmWave CSI for Improved Pose Classification

Title: Geometric Features Enhanced Human-Object Interaction Detection

Title: Learning to Correct for QA Reasoning with Black-box LLMs

Title: Jailbreaking LLMs with Arabic Transliteration and Arabizi

Title: Re-Ranking Step by Step: Investigating Pre-Filtering for Re-Ranking with Large Language Models

Title: 3D Feature Distillation with Object-Centric Priors

Title: QBI: Quantile-based Bias Initialization for Efficient Private Data Reconstruction in Federated Learning

Title: Competitive Algorithms for Online Knapsack with Succinct Predictions

Title: Categorical Syllogisms Revisited: A Review of the Logical Reasoning Abilities of LLMs for Analyzing Categorical Syllogism

Title: Conformalized Link Prediction on Graph Neural Networks

Title: ADO-LLM: Analog Design Bayesian Optimization with In-Context Learning of Large Language Models

Title: Implicit Discourse Relation Classification For Nigerian Pidgin

Title: Aligning Model Properties via Conformal Risk Control

Title: Psychological Profiling in Cybersecurity: A Look at LLMs and Psycholinguistic Features

Title: MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data

Title: Divide, Ensemble and Conquer: The Last Mile on Unsupervised Domain Adaptation for On-Board Semantic Segmentation

Title: Towards Secure Management of Edge-Cloud IoT Microservices using Policy as Code

Title: MissionGNN: Hierarchical Multimodal GNN-based Weakly Supervised Video Anomaly Recognition with Mission-Specific Knowledge Graph Generation

Title: Correspondence-Free Non-Rigid Point Set Registration Using Unsupervised Clustering Analysis

Title: OutlierTune: Efficient Channel-Wise Quantization for Large Language Models

Title: Dense Monocular Motion Segmentation Using Optical Flow and Pseudo Depth Map: A Zero-Shot Approach

Title: Revisiting Backdoor Attacks against Large Vision-Language Models

Title: Retain, Blend, and Exchange: A Quality-aware Spatial-Stereo Fusion Approach for Event Stream Recognition

Title: Dysca: A Dynamic and Scalable Benchmark for Evaluating Perception Ability of LVLMs

Title: LICO: Large Language Models for In-Context Molecular Optimization

Title: FFN: a Fine-grained Chinese-English Financial Domain Parallel Corpus

Title: Two-Pronged Human Evaluation of ChatGPT Self-Correction in Radiology Report Simplification

Title: SSP: Self-Supervised Prompting for Cross-Lingual Transfer to Low-Resource Languages using Large Language Models

Title: AlignIT: Enhancing Prompt Alignment in Customization of Text-to-Image Models

Title: Assessing the Effectiveness of LLMs in Android Application Vulnerability Analysis

Title: Can we teach language models to gloss endangered languages?

Title: Autoencoder based approach for the mitigation of spurious correlations

Title: Sonnet or Not, Bot? Poetry Evaluation for Large Models and Datasets

Title: A Universal Railway Obstacle Detection System based on Semi-supervised Segmentation And Optical Flow

Title: TrustUQA: A Trustful Framework for Unified Structured Data Question Answering

Title: Capturing Minds, Not Just Words: Enhancing Role-Playing Language Models with Personality-Indicative Data

Title: Time Matters: Scaling Laws for Any Budget

Title: RoFIR: Robust Fisheye Image Rectification Framework Impervious to Optical Center Deviation

Title: Federated Graph Semantic and Structural Learning

Title: Efficient Verifiable Differential Privacy with Input Authenticity in the Local and Shuffle Model

Title: CLIP3D-AD: Extending CLIP for 3D Few-Shot Anomaly Detection with Multi-View Images Generation

Title: Investigating and Defending Shortcut Learning in Personalized Diffusion Models

Title: AnyControl: Create Your Artwork with Versatile Control on Text-to-Image Generation

Title: UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models

Title: Structural Attention: Rethinking Transformer for Unpaired Medical Image Synthesis

Title: FedMLP: Federated Multi-Label Medical Image Classification under Task Heterogeneity

Title: VideoMambaPro: A Leap Forward for Mamba in Video Understanding

Title: Using diffusion model as constraint: Empower Image Restoration Network Training with Diffusion Model

Title: Improving Weak-to-Strong Generalization with Reliability-Aware Alignment

Title: SD-BLS: Privacy Preserving Selective Disclosure and Unlinkable Revocation of Verifiable Credentials

Title: Towards Credential-based Device Registration in DApps for DePINs with ZKPs

Title: BiCo-Fusion: Bidirectional Complementary LiDAR-Camera Fusion for Semantic- and Spatial-Aware 3D Object Detection

Title: Accuracy on the wrong line: On the pitfalls of noisy data for out-of-distribution generalisation

Title: FedMap: Iterative Magnitude-Based Pruning for Communication-Efficient Federated Learning

Title: A look under the hood of the Interactive Deep Learning Enterprise (No-IDLE)

Title: Segment Anything Model for automated image data annotation: empirical studies using text prompts from Grounding DINO

Title: STBench: Assessing the Ability of Large Language Models in Spatio-Temporal Analysis

Title: Dancing in the Shadows: Harnessing Ambiguity for Fairer Classifiers

Title: EmPO: Theory-Driven Dataset Construction for Empathetic Response Generation through Preference Optimization

Title: Dimensions underlying the representational alignment of deep neural networks with humans

Title: SubLock: Sub-Circuit Replacement based Input Dependent Key-based Logic Locking for Robust IP Protection

Title: Understanding the Security Benefits and Overheads of Emerging Industry Solutions to DRAM Read Disturbance

Title: Fairness and Bias in Multimodal AI: A Survey

Title: DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming

Title: Statements: Universal Information Extraction from Tables with Large Language Models for ESG KPIs

Title: A Teacher Is Worth A Million Instructions

Title: CHEW: A Dataset of CHanging Events in Wikipedia

Title: Towards Learning Abductive Reasoning using VSA Distributed Representations

Title: YZS-model: A Predictive Model for Organic Drug Solubility Based on Graph Convolutional Networks and Transformer-Attention

Title: BackMix: Mitigating Shortcut Learning in Echocardiography with Minimal Supervision

Title: RAVEN: Multitask Retrieval Augmented Vision-Language Learning

Title: Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion

Title: Averaging log-likelihoods in direct alignment

Title: Think Step by Step: Chain-of-Gesture Prompting for Error Detection in Robotic Surgical Videos

Title: Hack Me If You Can: Aggregating AutoEncoders for Countering Persistent Access Threats Within Highly Imbalanced Data

Title: T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings

Title: ProtoGMM: Multi-prototype Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation

Title: Simulating Classroom Education with LLM-Empowered Agents

Title: Aligning Teacher with Student Preferences for Tailored Training Data Generation

Title: Seeing Is Believing: Black-Box Membership Inference Attacks Against Retrieval Augmented Generation

Title: Revealing Fine-Grained Values and Opinions in Large Language Models

Title: NTFormer: A Composite Node Tokenized Graph Transformer for Node Classification

Title: AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation

Title: Advection Augmented Convolutional Neural Networks

Title: Enhancing Video-Language Representations with Structural Spatio-Temporal Alignment

Title: Leveraging Contrastive Learning for Enhanced Node Representations in Tokenized Graph Transformers

Title: Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding

Title: AutoPureData: Automated Filtering of Web Data for LLM Fine-tuning

Title: HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale

Title: From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data

Title: Compositional Image Decomposition with Diffusion Models

Title: Mapping Land Naturalness from Sentinel-2 using Deep Contextual and Geographical Priors

Title: Zero-Query Adversarial Attack on Black-box Automatic Speech Recognition Systems

Title: LiveBench: A Challenging, Contamination-Free LLM Benchmark

Title: Jump Starting Bandits with LLM-Generated Prior Knowledge

Title: Efficient World Models with Context-Aware Tokenization

Title: Synthetic Embedding of Hidden Information in Industrial Control System Network Protocols for Evaluation of Steganographic Malware

Title: Learning Visual Conditioning Tokens to Correct Domain Shift for Fully Test-time Adaptation

Title: IndoToxic2024: A Demographically-Enriched Dataset of Hate Speech and Toxicity Types for Indonesian Language

Title: DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice Questions

Title: The Model Arena for Cross-lingual Sentiment Analysis: A Comparative Study in the Era of Large Language Models

Title: STAL3D: Unsupervised Domain Adaptation for 3D Object Detection via Collaborating Self-Training and Adversarial Learning

Title: SimTxtSeg: Weakly-Supervised Medical Image Segmentation with Simple Text Cues

Title: Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model

Title: Emergence of Hidden Capabilities: Exploring Learning Dynamics in Concept Space

Title: TTP-Based Cyber Resilience Index: A Probabilistic Quantitative Approach to Measure Defence Effectiveness Against Cyber Attacks

Title: The Remarkable Robustness of LLMs: Stages of Inference?

Title: OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding

Title: Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads

Title: ReXTime: A Benchmark Suite for Reasoning-Across-Time in Videos

Title: Looking 3D: Anomaly Detection with 2D-3D Alignment

Title: Dataset Size Recovery from LoRA Weights