2024-04-19

Title: SNP: Structured Neuron-level Pruning to Preserve Attention Scores

Title: Exploring DNN Robustness Against Adversarial Attacks Using Approximate Multipliers

Title: MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory

Title: How often are errors in natural language reasoning due to paraphrastic variability?

Title: Missed Connections: Lateral Thinking Puzzles for Large Language Models

Title: Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach

Title: Equivariant Spatio-Temporal Self-Supervision for LiDAR Object Detection

Title: Improved Generalization Bounds for Communication Efficient Federated Learning

Title: Multimodal 3D Object Detection on Unseen Domains

Title: QGen: On the Ability to Generalize in Quantization Aware Training

Title: CU-Mamba: Selective State Space Models with Channel Learning for Image Restoration

Title: REQUAL-LM: Reliability and Equity through Aggregation in Large Language Models

Title: Prompt-Driven Feature Diffusion for Open-World Semi-Supervised Learning

Title: TempBEV: Improving Learned BEV Encoders with Combined Image and BEV Space Temporal Aggregation

Title: Cross-model Mutual Learning for Exemplar-based Medical Image Segmentation

Title: AquaSonic: Acoustic Manipulation of Underwater Data Center Operations and Resource Management

Title: Tailoring Generative Adversarial Networks for Smooth Airfoil Design

Title: Utilizing Adversarial Examples for Bias Mitigation and Accuracy Enhancement

Title: AdvisorQA: Towards Helpful and Harmless Advice-seeking Question Answering with Collective Intelligence

Title: Actor-Critic Reinforcement Learning with Phased Actor

Title: Challenging Negative Gender Stereotypes: A Study on the Effectiveness of Automated Counter-Stereotypes

Title: Partial Large Kernel CNNs for Efficient Super-Resolution

Title: Progressive Multi-modal Conditional Prompt Tuning

Title: From Image to Video, what do we need in multimodal LLMs?

Title: Multi-view Graph Structural Representation Learning via Graph Coarsening

Title: Enhancing Length Extrapolation in Sequential Models with Pointer-Augmented Neural Memory

Title: Group-On: Boosting One-Shot Segmentation with Supportive Query

Title: Using a Local Surrogate Model to Interpret Temporal Shifts in Global Annual Data

Title: The Dog Walking Theory: Rethinking Convergence in Federated Learning

Title: FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models

Title: FedMID: A Data-Free Method for Using Intermediate Outputs as a Defense Mechanism Against Poisoning Attacks in Federated Learning

Title: TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding

Title: SKIP: Skill-Localized Prompt Tuning for Inference Speed Boost-Up

Title: EdgeFusion: On-Device Text-to-Image Generation

Title: CrossIn: An Efficient Instruction Tuning Approach for Cross-Lingual Knowledge Alignment

Title: LD-Pruner: Efficient Pruning of Latent Diffusion Models using Task-Agnostic Insights

Title: Trusted Multi-view Learning with Label Noise

Title: Sketch-guided Image Inpainting with Partial Discrete Diffusion Process

Title: The devil is in the object boundary: towards annotation-free instance segmentation using Foundation Models

Title: Aligning Language Models to Explicitly Handle Ambiguity

Title: EVIT: Event-Oriented Instruction Tuning for Event Reasoning

Title: Tendency-driven Mutual Exclusivity for Weakly Supervised Incremental Semantic Segmentation

Title: MultiPhys: Multi-Person Physics-aware 3D Motion Estimation

Title: Curriculum Point Prompting for Weakly-Supervised Referring Image Segmentation

Title: Token-level Direct Preference Optimization

Title: Variational Multi-Modal Hypergraph Attention Network for Multi-Modal Relation Extraction

Title: ParaFusion: A Large-Scale LLM-Driven English Paraphrase Dataset Infused with High-Quality Lexical and Syntactic Diversity

Title: Pseudo-random generators using linear feedback shift registers with output extraction

Title: Sequential Compositional Generalization in Multimodal Models

Title: Enhance Robustness of Language Models Against Variation Attack through Graph Integration

Title: What does CLIP know about peeling a banana?

Title: Look, Listen, and Answer: Overcoming Biases for Audio-Visual Question Answering

Title: Parallel Decoding via Hidden Transfer for Lossless Large Language Model Acceleration

Title: Meta-Auxiliary Learning for Micro-Expression Recognition

Title: Data-free Knowledge Distillation for Fine-grained Visual Categorization

Title: Uncovering Safety Risks in Open-source LLMs through Concept Activation Vector

Title: Can We Catch the Elephant? The Evolvement of Hallucination Evaluation on Natural Language Generation: A Survey

Title: Using Real-world Bug Bounty Programs in Secure Coding Course: Experience Report

Title: emrQA-msquad: A Medical Dataset Structured with the SQuAD V2.0 Framework, Enriched with emrQA Medical Information

Title: RAGAR, Your Falsehood RADAR: RAG-Augmented Reasoning for Political Fact-Checking using Multimodal Large Language Models

Title: MaskCD: A Remote Sensing Change Detection Network Based on Mask Classification

Title: MambaPupil: Bidirectional Selective Recurrent model for Event-based Eye tracking

Title: Harnessing Joint Rain-/Detail-aware Representations to Eliminate Intricate Rains

Title: Evaluating the Security of Merkle Trees in the Internet of Things: An Analysis of Data Falsification Probabilities

Title: Ethical-Lens: Curbing Malicious Usages of Open-Source Text-to-Image Models

Title: Fortify the Guardian, Not the Treasure: Resilient Adversarial Detectors

Title: One-Shot Sequential Federated Learning for Non-IID Data by Enhancing Local Model Diversity

Title: Omniview-Tuning: Boosting Viewpoint Invariance of Vision-Language Pre-training Models

Title: Mushroom Segmentation and 3D Pose Estimation from Point Clouds using Fully Convolutional Geometric Features and Implicit Pose Encoding

Title: From Form(s) to Meaning: Probing the Semantic Depths of Language Models Using Multisense Consistency

Title: FecTek: Enhancing Term Weight in Lexicon-Based Retrieval with Feature Context and Term-level Knowledge

Title: StyleBooth: Image Style Editing with Multimodal Instruction

Title: Real-World Efficient Blind Motion Deblurring via Blur Pixel Discretization

Title: Stance Detection on Social Media with Fine-Tuned Large Language Models

Title: How to Benchmark Vision Foundation Models for Semantic Segmentation?

Title: Gait Recognition from Highly Compressed Videos

Title: Privacy-Preserving UCB Decision Process Verification via zk-SNARKs

Title: Estimating the Hessian Matrix of Ranking Objectives for Stochastic Learning to Rank with Gradient Boosted Trees

Title: Aligning Actions and Walking to LLM-Generated Textual Descriptions

Title: The Explicit values of the UBCT, the LBCT and the DBCT of the inverse function

Title: Observation, Analysis, and Solution: Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training

Title: A Quadrature Approach for General-Purpose Batch Bayesian Optimization via Probabilistic Lifting

Title: Length Generalization of Causal Transformers without Position Encoding

Title: Neural Networks with Causal Graph Constraints: A New Approach for Treatment Effects Estimation

Title: CMNEE: A Large-Scale Document-Level Event Extraction Dataset based on Open-Source Chinese Military News

Title: Deep Gaussian mixture model for unsupervised image segmentation

Title: Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Title: DeepLocalization: Using change point detection for Temporal Action Localization

Title: Alleviating Catastrophic Forgetting in Facial Expression Recognition with Emotion-Centered Models

Title: Physics-integrated generative modeling using attentive planar normalizing flow based variational autoencoder

Title: Advancing the Robustness of Large Language Models through Self-Denoised Smoothing

Title: Enhancing Embedding Performance through Large Language Model-based Text Enrichment and Rewriting

Title: Performance Evaluation of Segment Anything Model with Variational Prompting for Application to Non-Visible Spectrum Imagery

Title: Resilience through Scene Context in Visual Referring Expression Generation

Title: Augmenting emotion features in irony detection with Large language modeling

Title: Simultaneous Interpretation Corpus Construction by Large Language Models in Distant Language Pair

Title: Proactive Software Supply Chain Risk Management Framework (P-SSCRM) Version 1

Title: iRAG: An Incremental Retrieval Augmented Generation System for Videos

Title: Guided Discrete Diffusion for Electronic Health Record Generation

Title: A Perspective on Deep Vision Performance with Standard Image and Video Codecs

Title: Customizing Text-to-Image Diffusion with Camera Viewpoint Control

Title: Measuring Feature Dependency of Neural Networks by Collapsing Feature Dimensions in the Data Manifold

Title: Large Language Models in Targeted Sentiment Analysis

Title: AniClipart: Clipart Animation with Text-to-Video Priors

Title: Point-In-Context: Understanding Point Cloud via In-Context Learning

Title: V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning

Title: Towards a Foundation Model for Partial Differential Equation: Multi-Operator Learning and Extrapolation

Title: From $r$ to $Q^*$: Your Language Model is Secretly a Q-Function

Title: Inverse Neural Rendering for Explainable Multi-Object Tracking

Title: Transformer tricks: Removing weights for skipless transformers

Title: When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes

Title: Gradient-Regularized Out-of-Distribution Detection

Title: KDk: A Defense Mechanism Against Label Inference Attacks in Vertical Federated Learning

Title: MedThink: Explaining Medical Visual Question Answering via Multimodal Decision-Making Rationale

Title: 6Img-to-3D: Few-Image Large-Scale Outdoor Driving Scene Reconstruction

Title: Lazy Diffusion Transformer for Interactive Image Editing

Title: G-HOP: Generative Hand-Object Prior for Interaction Reconstruction and Grasp Synthesis

Title: MeshLRM: Large Reconstruction Model for High-Quality Mesh

Title: SOHES: Self-supervised Open-world Hierarchical Entity Segmentation

Title: VideoGigaGAN: Towards Detail-rich Video Super-Resolution

Title: Moving Object Segmentation: All You Need Is SAM (and Flow)

Title: BLINK: Multimodal Large Language Models Can See but Not Perceive