2023-12-18

language model

Title: Self-Evaluation Improves Selective Generation in Large Language Models. (arXiv:2312.09300v1 [cs.CL])

Title: ArchiGuesser -- AI Art Architecture Educational Game. (arXiv:2312.09334v1 [cs.AI])

Title: Large Language Models for Autonomous Driving: Real-World Experiments. (arXiv:2312.09397v1 [cs.AI])

Title: Clinical Text Deduplication Practices for Efficient Pretraining and Improved Clinical Tasks. (arXiv:2312.09469v1 [cs.CL])

Title: Grounding for Artificial Intelligence. (arXiv:2312.09532v1 [cs.AI])

Title: On a Functional Definition of Intelligence. (arXiv:2312.09546v1 [cs.AI])

Title: Prompting Large Language Models for Topic Modeling. (arXiv:2312.09693v1 [cs.AI])

Title: Improving Biomedical Entity Linking with Retrieval-enhanced Learning. (arXiv:2312.09806v1 [cs.CL])

Title: SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models. (arXiv:2312.09818v1 [cs.CL])

Title: Neurosymbolic Value-Inspired AI (Why, What, and How). (arXiv:2312.09928v1 [cs.AI])

Title: Distilling Large Language Models for Matching Patients to Clinical Trials. (arXiv:2312.09958v1 [cs.AI])

Title: Data and Approaches for German Text simplification -- towards an Accessibility-enhanced Communication. (arXiv:2312.09966v1 [cs.CL])

Title: Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision. (arXiv:2312.09390v1 [cs.CL])

Title: Marathon: A Race Through the Realm of Long Context with Large Language Models. (arXiv:2312.09542v1 [cs.CL])

Title: Extending Context Window of Large Language Models via Semantic Compression. (arXiv:2312.09571v1 [cs.CL])

Title: Probing Pretrained Language Models with Hierarchy Properties. (arXiv:2312.09670v1 [cs.CL])

Title: RJUA-QA: A Comprehensive QA Dataset for Urology. (arXiv:2312.09785v1 [cs.CL])

Title: ProCoT: Stimulating Critical Thinking and Writing of Students through Engagement with Large Language Models (LLMs). (arXiv:2312.09801v1 [cs.CL])

Title: Grammatical information in BERT sentence embeddings as two-dimensional arrays. (arXiv:2312.09890v1 [cs.CL])

Title: Generative Context-aware Fine-tuning of Self-supervised Speech Models. (arXiv:2312.09895v1 [cs.CL])

Title: The Art of Balancing: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment. (arXiv:2312.09979v1 [cs.CL])

Title: LLaMAntino: LLaMA 2 Models for Effective Text Generation in Italian Language. (arXiv:2312.09993v1 [cs.CL])

Title: Faithful Persona-based Conversational Dataset Generation with Large Language Models. (arXiv:2312.10007v1 [cs.CL])

gpt

Title: Arabic Mini-ClimateGPT: A Climate Change and Sustainability Tailored Arabic LLM. (arXiv:2312.09366v1 [cs.CL])

Title: GPT-4 Surpassing Human Performance in Linguistic Pragmatics. (arXiv:2312.09545v1 [cs.CL])

Title: 3DAxiesPrompts: Unleashing the 3D Spatial Task Capabilities of GPT-4V. (arXiv:2312.09738v1 [cs.AI])

Title: A Novel Dataset for Financial Education Text Simplification in Spanish. (arXiv:2312.09897v1 [cs.AI])

Title: Red AI? Inconsistent Responses from GPT3.5 Models on Political Issues in the US and China. (arXiv:2312.09917v1 [cs.CL])

llm

Title: Challenges with unsupervised LLM knowledge discovery. (arXiv:2312.10029v1 [cs.LG])

Title: ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent. (arXiv:2312.10003v1 [cs.CL])

long context

lora

Title: Situation-Dependent Causal Influence-Based Cooperative Multi-agent Reinforcement Learning. (arXiv:2312.09539v1 [cs.AI])

Title: Peer Learning: Learning Complex Policies in Groups from Scratch via Action Recommendations. (arXiv:2312.09950v1 [cs.LG])

hallucination

prompt

code

Title: OTOv3: Automatic Architecture-Agnostic Neural Network Training and Compression from Structured Pruning to Erasing Operators. (arXiv:2312.09411v1 [cs.LG])

Title: GSQA: An End-to-End Model for Generative Spoken Question Answering. (arXiv:2312.09781v1 [cs.CL])

Title: Deep Unsupervised Domain Adaptation for Time Series Classification: a Benchmark. (arXiv:2312.09857v1 [cs.LG])

Title: Distributed Learning of Mixtures of Experts. (arXiv:2312.09877v1 [cs.LG])

Title: RDR: the Recap, Deliberate, and Respond Method for Enhanced Language Understanding. (arXiv:2312.09932v1 [cs.CL])

Title: Symbolic Numeric Planning with Patterns. (arXiv:2312.09963v1 [cs.AI])

Title: SAT-Based Algorithms for Regular Graph Pattern Matching. (arXiv:2312.09995v1 [cs.AI])

Title: Leveraging Language ID to Calculate Intermediate CTC Loss for Enhanced Code-Switching Speech Recognition. (arXiv:2312.09583v1 [cs.CL])

Title: Adaptive Integration of Partial Label Learning and Negative Learning for Enhanced Noisy Label Learning. (arXiv:2312.09505v1 [cs.LG])

Title: Physics-informed Neural Network Estimation of Material Properties in Soft Tissue Nonlinear Biomechanical Models. (arXiv:2312.09787v1 [cs.LG])

Title: Calibrated One Round Federated Learning with Bayesian Inference in the Predictive Space. (arXiv:2312.09817v1 [cs.LG])

Title: Learning Distributions on Manifolds with Free-form Flows. (arXiv:2312.09852v1 [cs.LG])

Title: Automating reward function configuration for drug design. (arXiv:2312.09865v1 [cs.LG])

Title: Sketch and shift: a robust decoder for compressive clustering. (arXiv:2312.09940v1 [cs.LG])

Title: Modeling Unknown Stochastic Dynamical System via Autoencoder. (arXiv:2312.10001v1 [cs.LG])

Title: Symplectic Autoencoders for Model Reduction of Hamiltonian Systems. (arXiv:2312.10004v1 [cs.LG])

chat

retrieval augmented generation

rag

Title: Distributional Latent Variable Models with an Application in Active Cognitive Testing. (arXiv:2312.09316v1 [cs.AI])

Title: Prediction of rare events in the operation of household equipment using co-evolving time series. (arXiv:2312.09410v1 [cs.LG])

Title: Entropy Causal Graphs for Multivariate Time Series Anomaly Detection. (arXiv:2312.09478v1 [cs.LG])

Title: Multiple Instance Learning for Uplift Modeling. (arXiv:2312.09639v1 [cs.LG])

Title: Diagnosing and Rectifying Fake OOD Invariance: A Restructured Causal Approach. (arXiv:2312.09758v1 [cs.LG])

Title: Small Dataset, Big Gains: Enhancing Reinforcement Learning by Offline Pre-Training with Model Based Augmentation. (arXiv:2312.09844v1 [cs.LG])

Title: MANTIS at #SMM4H 2023: Leveraging Hybrid and Ensemble Models for Detection of Social Anxiety Disorder on Reddit. (arXiv:2312.09451v1 [cs.CL])

Title: Discovering Highly Influential Shortcut Reasoning: An Automated Template-Free Approach. (arXiv:2312.09718v1 [cs.CL])

Title: Optimal Regret Bounds for Collaborative Learning in Bandits. (arXiv:2312.09674v1 [cs.LG])

Title: Urban Region Embedding via Multi-View Contrastive Prediction. (arXiv:2312.09681v1 [cs.LG])

Title: A Comparative Evaluation of Additive Separability Tests for Physics-Informed Machine Learning. (arXiv:2312.09775v1 [cs.LG])

Title: Hypergraph-MLP: Learning on Hypergraphs without Message Passing. (arXiv:2312.09778v1 [cs.LG])

Title: End-to-End Training of Neural Networks for Automotive Radar Interference Mitigation. (arXiv:2312.09790v1 [cs.LG])

Title: Fragility, Robustness and Antifragility in Deep Learning. (arXiv:2312.09821v1 [cs.LG])

multi-run

chain-of-thought

tree-of-thought