2024-03-15

Title: Veagle: Advancements in Multimodal Representation Learning

Title: Procedural terrain generation with style transfer

Title: Image-Text Out-Of-Context Detection Using Synthetic Multimodal Misinformation

Title: Verification for Object Detection -- IBP IoU

Title: Bridging Human Concepts and Computer Vision for Explainable Face Verification

Title: CoBra: Complementary Branch Fusing Class and Semantic Knowledge for Robust Weakly Supervised Semantic Segmentation

Title: Adversarially Robust Deepfake Detection via Adversarial Feature Similarity Learning

Title: Thermometer: Towards Universal Calibration for Large Language Models

Title: Diet-ODIN: A Novel Framework for Opioid Misuse Detection with Interpretable Dietary Patterns

Title: LoRA-SP: Streamlined Partial Parameter Adaptation for Resource-Efficient Fine-Tuning of Large Language Models

Title: TINA: Think, Interaction, and Action Framework for Zero-Shot Vision Language Navigation

Title: Structural Positional Encoding for knowledge integration in transformer-based medical process monitoring

Title: Predictive Clustering of Vessel Behavior Based on Hierarchical Trajectory Representation

Title: NoiseDiffusion: Correcting Noise for Image Interpolation with Diffusion Models beyond Spherical Linear Interpolation

Title: DialogGen: Multi-modal Interactive Dialogue System for Multi-turn Text-to-Image Generation

Title: ARtVista: Gateway To Empower Anyone Into Artist

Title: REFRESH: Responsible and Efficient Feature Reselection Guided by SHAP Values

Title: Federated Data Model

Title: From "um" to "yeah": Producing, predicting, and regulating information flow in human conversation

Title: Envision3D: One Image to 3D with Anchor Views Interpolation

Title: Efficiently Computing Similarities to Private Datasets

Title: Unveiling the Truth: Exploring Human Gaze Patterns in Fake Images

Title: FogGuard: guarding YOLO against fog using perceptual loss

Title: LMStyle Benchmark: Evaluating Text Style Transfer for Chatbots

Title: Usable XAI: 10 Strategies Towards Exploiting Explainability in the LLM Era

Title: Towards Efficient Risk-Sensitive Policy Gradient: An Iteration Complexity Analysis

Title: PathM3: A Multimodal Multi-Task Multiple Instance Learning Framework for Whole Slide Image Classification and Captioning

Title: Representing Anatomical Trees by Denoising Diffusion of Implicit Neural Fields

Title: AutoGuide: Automated Generation and Selection of State-Aware Guidelines for Large Language Model Agents

Title: Ethos: Rectifying Language Models in Orthogonal Parameter Space

Title: CART: Caltech Aerial RGB-Thermal Dataset in the Wild

Title: AraTrust: An Evaluation of Trustworthiness for LLMs in Arabic

Title: Semiparametric Token-Sequence Co-Supervision

Title: VDNA-PR: Using General Dataset Representations for Robust Sequential Visual Place Recognition

Title: VisionGPT: Vision-Language Understanding Agent Using Generalized Multimodal Framework

Title: rFaceNet: An End-to-End Network for Enhanced Physiological Signal Extraction through Identity-Specific Facial Contours

Title: The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?

Title: Taming Cross-Domain Representation Variance in Federated Prototype Learning with Heterogeneous Data Domains

Title: Keyformer: KV Cache Reduction through Key Tokens Selection for Efficient Generative Inference

Title: StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control

Title: LAMP: A Language Model on the Map

Title: Distribution and Depth-Aware Transformers for 3D Human Mesh Recovery

Title: When Semantic Segmentation Meets Frequency Aliasing

Title: UniCode: Learning a Unified Codebook for Multimodal Large Language Models

Title: Large Language Models are Parallel Multilingual Learners

Title: Information Extraction: An application to the domain of hyper-local financial data on developing countries

Title: Ciphertext-Only Attack on a Secure $k$-NN Computation on Cloud

Title: Meaningful Learning: Advancing Abstract Reasoning in Large Language Models via Generic Fact Guidance

Title: Learning from straggler clients in federated learning

Title: Desigen: A Pipeline for Controllable Design Template Generation

Title: AI on AI: Exploring the Utility of GPT as an Expert Annotator of AI Publications

Title: Soften to Defend: Towards Adversarial Robustness via Self-Guided Label Refinement

Title: CardioCaps: Attention-based Capsule Network for Class-Imbalanced Echocardiogram Classification

Title: Graph-Based DDoS Attack Detection in IoT Systems with Lossy Network

Title: Single Domain Generalization for Crowd Counting

Title: Rethinking Referring Object Removal

Title: ProSwitch: Knowledge-Guided Language Model Fine-Tuning to Generate Professional and Non-Professional Styled Text

Title: Metadata-Driven Federated Learning of Connectional Brain Templates in Non-IID Multi-Domain Scenarios

Title: Sculpt3D: Multi-View Consistent Text-to-3D Generation with Sparse 3D Prior

Title: Evaluating LLMs for Gender Disparities in Notable Persons

Title: Basque and Spanish Counter Narrative Generation: Data Creation and Evaluation

Title: Unveiling the Generalization Power of Fine-Tuned Large Language Models

Title: Caveat Lector: Large Language Models in Legal Practice

Title: Exploring the Comprehension of ChatGPT in Traditional Chinese Medicine Knowledge

Title: Dial-insight: Fine-tuning Large Language Models with High-Quality Domain-Specific Data Preventing Capability Collapse

Title: ADEdgeDrop: Adversarial Edge Dropping for Robust Graph Neural Networks

Title: SHAN: Object-Level Privacy Detection via Inference on Scene Heterogeneous Graph

Title: Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts

Title: Generalized Relevance Learning Grassmann Quantization

Title: Intention-aware Denoising Diffusion Model for Trajectory Prediction

Title: PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation

Title: Intention-driven Ego-to-Exo Video Generation

Title: SAM-Lightening: A Lightweight Segment Anything Model with Dilated Flash Attention to Achieve 30 times Acceleration

Title: Noise Dimension of GAN: An Image Compression Perspective

Title: Customizing Segmentation Foundation Model via Prompt Learning for Instance Segmentation

Title: On the Laplace Approximation as Model Selection Criterion for Gaussian Processes

Title: MCformer: Multivariate Time Series Forecasting with Mixed-Channels Transformer

Title: D-YOLO a robust framework for object detection in adverse weather conditions

Title: WSI-SAM: Multi-resolution Segment Anything Model (SAM) for histopathology whole-slide images

Title: CLIP-EBC: CLIP Can Count Accurately through Enhanced Blockwise Classification

Title: DA-PFL: Dynamic Affinity Aggregation for Personalized Federated Learning

Title: SELECTOR: Heterogeneous graph network with convolutional masked autoencoder for multimodal robust prediction of cancer survival

Title: Anatomical Structure-Guided Medical Vision-Language Pre-training

Title: Annotation Free Semantic Segmentation with Vision Foundation Models

Title: Knowledge Distillation in YOLOX-ViT for Side-Scan Sonar Object Detection

Title: Semi- and Weakly-Supervised Learning for Mammogram Mass Segmentation with Limited Annotations

Title: SD-Net: Symmetric-Aware Keypoint Prediction and Domain Adaptation for 6D Pose Estimation In Bin-picking Scenarios

Title: Privacy Preserving Anomaly Detection on Homomorphic Encrypted Data from IoT Sensors

Title: Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring

Title: Video Editing via Factorized Diffusion Distillation

Title: LocalMamba: Visual State Space Model with Windowed Selective Scan

Title: AVIBench: Towards Evaluating the Robustness of Large Vision-Language Model on Adversarial Visual-Instructions

Title: LDPRecover: Recovering Frequencies from Poisoning Attacks against Local Differential Privacy

Title: REPQC: Reverse Engineering and Backdooring Hardware Accelerators for Post-quantum Cryptography

Title: Komodo: A Linguistic Expedition into Indonesia's Regional Languages

Title: Sentinel-Guided Zero-Shot Learning: A Collaborative Paradigm without Real Data Exposure

Title: DF4LCZ: A SAM-Empowered Data Fusion Framework for Scene-Level Local Climate Zone Classification

Title: Impact of Synthetic Images on Morphing Attack Detection Using a Siamese Network

Title: GiT: Towards Generalist Vision Transformer through Universal Language Interface

Title: ConDiSR: Contrastive Disentanglement and Style Regularization for Single Domain Generalization

Title: XCoOp: Explainable Prompt Learning for Computer-Aided Diagnosis via Concept-guided Context Optimization

Title: OpenGraph: Open-Vocabulary Hierarchical 3D Graph Representation in Large-Scale Outdoor Environments

Title: RoDUS: Robust Decomposition of Static and Dynamic Elements in Urban Scenes

Title: Mitigating attribute amplification in counterfactual image generation

Title: Borrowing Treasures from Neighbors: In-Context Learning for Multimodal Learning with Missing Modalities and Data Scarcity

Title: Efficient Transferability Assessment for Selection of Pre-trained Detectors

Title: 3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation

Title: Adversarial Fine-tuning of Compressed Neural Networks for Joint Improvement of Robustness and Efficiency

Title: Shake to Leak: Fine-tuning Diffusion Models Can Amplify the Generative Privacy Risk

Title: Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing

Title: MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models

Title: Covert Communication for Untrusted UAV-Assisted Wireless Systems

Title: What Sketch Explainability Really Means for Downstream Tasks

Title: Rectifying Demonstration Shortcut in In-Context Learning

Title: EquiAV: Leveraging Equivariance for Audio-Visual Contrastive Learning

Title: SkateFormer: Skeletal-Temporal Transformer for Human Action Recognition

Title: AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shield Prompting

Title: Leveraging Prototypical Representations for Mitigating Social Bias without Demographic Information

Title: MT-PATCHER: Selective and Extendable Knowledge Distillation from Large Language Models for Machine Translation

Title: VisionGPT-3D: A Generalized Multimodal Agent for Enhanced 3D Vision Understanding

Title: Logits of API-Protected LLMs Leak Proprietary Information

Title: RANDAO-based RNG: Last Revealer Attacks in Ethereum 2.0 Randomness and a Potential Solution

Title: Explorations in Texture Learning

Title: Breast Cancer Classification Using Gradient Boosting Algorithms Focusing on Reducing the False Negative and SHAP for Explainability

Title: WeakSurg: Weakly supervised surgical instrument segmentation using temporal equivariance and semantic continuity

Title: Less is More: Data Value Estimation for Visual Instruction Tuning

Title: PreCurious: How Innocent Pre-Trained Language Models Turn into Privacy Traps

Title: Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation

Title: Renovating Names in Open-Vocabulary Segmentation Benchmarks

Title: Optimistic Verifiable Training by Controlling Hardware Nondeterminism

Title: Counterfactual contrastive learning: robust representations via causal image synthesis

Title: Large Language Models and Causal Inference in Collaboration: A Comprehensive Survey

Title: MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Title: Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training

Title: Explore In-Context Segmentation via Latent Diffusion Models

Title: PosSAM: Panoptic Open-vocabulary Segment Anything

Title: Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement Learning

Title: Score-Guided Diffusion for 3D Human Recovery

Title: Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation

Title: Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding

Title: Generalized Predictive Model for Autonomous Driving

Title: 3D-VLA: A 3D Vision-Language-Action Generative World Model

Title: Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models

Title: Dynamic Memory Compression: Retrofitting LLMs for Accelerated Inference

Title: SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior