2024-06-21

Title: Scalable Training of Graph Foundation Models for Atomistic Materials Modeling: A Case Study with HydraGNN

Title: T-JEPA: A Joint-Embedding Predictive Architecture for Trajectory Similarity Computation

Title: Semantic Graph Consistency: Going Beyond Patches for Regularizing Self-Supervised Vision Transformers

Title: Under the Hood of Tabular Data Generation Models: the Strong Impact of Hyperparameter Tuning

Title: Data Plagiarism Index: Characterizing the Privacy Risk of Data-Copying in Tabular Generative Models

Title: D2O:Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models

Title: Scale-Translation Equivariant Network for Oceanic Internal Solitary Wave Localization

Title: MaskPure: Improving Defense Against Text Adversaries with Stochastic Purification

Title: RITA: A Real-time Interactive Talking Avatars Framework

Title: Exploring and Benchmarking the Planning Capabilities of Large Language Models

Title: Sampling 3D Gaussian Scenes in Seconds with Latent Diffusion Models

Title: Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More?

Title: Learning to Generate Answers with Citations via Factual Consistency Models

Title: When Parts are Greater Than Sums: Individual LLM Components Can Outperform Full Models

Title: PathoLM: Identifying pathogenicity from the DNA sequence through the Genome Foundation Model

Title: Biomedical Visual Instruction Tuning with Clinician Preference Alignment

Title: Sparse High Rank Adapters

Title: Transferable Watermarking to Self-supervised Pre-trained Graph Encoders by Trigger Embeddings

Title: Learnable In-Context Vector for Visual Question Answering

Title: Surgical Triplet Recognition via Diffusion Model

Title: Neural Residual Diffusion Models for Deep Scalable Vision Generation

Title: AniFaceDiff: High-Fidelity Face Reenactment via Facial Parametric Conditioned Diffusion Models

Title: In-Context Learning on a Budget: A Case Study in Named Entity Recognition

Title: Media Forensics and Deepfake Systematic Survey

Title: ARDuP: Active Region Video Diffusion for Universal Policies

Title: Situational Instructions Database: Task Guidance in Dynamic Environments

Title: ZeroDL: Zero-shot Distribution Learning for Text Clustering via Large Language Models

Title: WaterMono: Teacher-Guided Anomaly Masking and Enhancement Boosting for Robust Underwater Self-Supervised Monocular Depth Estimation

Title: Transferable speech-to-text large language model alignment module

Title: PPT-GNN: A Practical Pre-Trained Spatio-Temporal Graph Neural Network for Network Security

Title: Any360D: Towards 360 Depth Anything with Unlabeled 360 Data and M\"obius Spatial Augmentation

Title: Style-NeRF2NeRF: 3D Style Transfer From Style-Aligned Multi-View Images

Title: MoreHopQA: More Than Multi-hop Reasoning

Title: Towards a multimodal framework for remote sensing image change retrieval and captioning

Title: In-Context In-Context Learning with Transformer Neural Processes

Title: 4K4DGen: Panoramic 4D Generation at 4K Resolution

Title: Image Distillation for Safe Data Sharing in Histopathology

Title: In-Context Former: Lightning-fast Compressing Context for Large Language Model

Title: Enhance the Image: Super Resolution using Artificial Intelligence in MRI

Title: Can AI be enabled to dynamical downscaling? Training a Latent Diffusion Model to mimic km-scale COSMO-CLM downscaling of ERA5 over Italy

Title: InstructRAG: Instructing Retrieval-Augmented Generation with Explicit Denoising

Title: Can Few-shot Work in Long-Context? Recycling the Context to Generate Demonstrations

Title: Improving GFlowNets with Monte Carlo Tree Search

Title: Towards Minimal Targeted Updates of Language Models with Targeted Negative Training

Title: Hitchhiker's guide on Energy-Based Models: a comprehensive review on the relation with other generative models, sampling and statistical physics

Title: Breaking News: Case Studies of Generative AI's Use in Journalism

Title: On the Utility of Domain-Adjacent Fine-Tuned Model Ensembles for Few-shot Problems

Title: Tree-Sliced Wasserstein Distance on a System of Lines

Title: StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images

Title: GenAI-Bench: Evaluating and Improving Compositional Text-to-Visual Generation

Title: AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video Understanding

Title: Liveness Detection in Computer Vision: Transformer-based Self-Supervised Learning for Face Anti-Spoofing

Title: Splatter a Video: Video Gaussian Representation for Versatile Processing

Title: Knowledge Tagging System on Math Questions via LLMs with Flexible Demonstration Retriever

Title: Open Generative Large Language Models for Galician

Title: CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets

Title: Generative AI for Enhancing Active Learning in Education: A Comparative Study of GPT-3.5 and GPT-4 in Crafting Customized Test Questions

Title: From Descriptive Richness to Bias: Unveiling the Dark Side of Generative Image Caption Enrichment

Title: Optimal deep learning of holomorphic operators between Banach spaces

Title: EnTruth: Enhancing the Traceability of Unauthorized Dataset Usage in Text-to-image Diffusion Models with Minimal and Robust Alterations

Title: Synthesizing Multimodal Electronic Health Records via Predictive Diffusion Models

Title: SSAD: Self-supervised Auxiliary Detection Framework for Panoramic X-ray based Dental Disease Diagnosis

Title: The Elusive Pursuit of Replicating PATE-GAN: Benchmarking, Auditing, Debugging

Title: Image anomaly detection and prediction scheme based on SSA optimized ResNet50-BiGRU model

Title: A note on cyclic non-MDS matrices

Title: Investigating the Pre-Training Dynamics of In-Context Learning: Task Recognition vs. Task Learning

Title: Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive Data Sharing

Title: Seamless Language Expansion: Enhancing Multilingual Mastery in Self-Supervised Models

Title: HeartBeat: Towards Controllable Echocardiography Video Synthesis with Multimodal Conditions-Guided Diffusion Models

Title: Dye4AI: Assuring Data Boundary on Generative AI Services

Title: ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning

Title: Multi-modal Transfer Learning between Biological Foundation Models

Title: In Tree Structure Should Sentence Be Generated

Title: VLBiasBench: A Comprehensive Benchmark for Evaluating Bias in Large Vision-Language Model

Title: Uncertainty and Self-Supervision in Single-View Depth

Title: Raising the Bar: Investigating the Values of Large Language Models via Generative Evolving Testing

Title: aeon: a Python toolkit for learning from time series

Title: Unleashing the Potential of Tracklets for Unsupervised Video Person Re-Identification

Title: FairX: A comprehensive benchmarking tool for model analysis using fairness, utility, and explainability

Title: Self-supervised Interpretable Concept-based Models for Text Classification

Title: Exploring Spatial Representations in the Historical Lake District Texts with LLM-based Relation Extraction

Title: Active Diffusion Subsampling

Title: ATAC-Net: Zoomed view works better for Anomaly Detection

Title: CollaFuse: Collaborative Diffusion Models

Title: Video Generation with Learned Action Prior

Title: Self-supervised Multi-actor Social Activity Understanding in Streaming Videos

Title: Data-Centric AI in the Age of Large Language Models

Title: SafeSora: Towards Safety Alignment of Text2Video Generation via a Human Preference Dataset

Title: Improving Expert Radiology Report Summarization by Prompting Large Language Models with a Layperson Summary

Title: V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data

Title: Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 Steps

Title: Unmasking Database Vulnerabilities: Zero-Knowledge Schema Inference Attacks in Text-to-SQL Systems

Title: Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data

Title: Consistency Models Made Easy

Title: Advancing Fine-Grained Classification by Structure and Subject Preserving Augmentation

Title: A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion Models