2024-01-02

language model

Title: Turing's Test, a Beautiful Thought Experiment. (arXiv:2401.00009v1 [cs.AI])

Title: Is Knowledge All Large Language Models Needed for Causal Reasoning?. (arXiv:2401.00139v1 [cs.AI])

Title: ReasoningLM: Enabling Structural Subgraph Reasoning in Pre-trained Language Models for Question Answering over Knowledge Graph. (arXiv:2401.00158v1 [cs.CL])

Title: Open-TI: Open Traffic Intelligence with Augmented Language Model. (arXiv:2401.00211v1 [cs.AI])

Title: Red Teaming for Large Language Models At Scale: Tackling Hallucinations on Mathematics Tasks. (arXiv:2401.00290v1 [cs.CL])

Title: A Reliable Knowledge Processing Framework for Combustion Science using Foundation Models. (arXiv:2401.00544v1 [cs.AI])

Title: AllSpark: a multimodal spatiotemporal general model. (arXiv:2401.00546v1 [cs.AI])

Title: Exploring the Effectiveness of Instruction Tuning in Biomedical Language Processing. (arXiv:2401.00579v1 [cs.CL])

Title: Fairness in Serving Large Language Models. (arXiv:2401.00588v1 [cs.AI])

Title: Large language model for Bible sentiment analysis: Sermon on the Mount. (arXiv:2401.00689v1 [cs.CL])

Title: Large Language Models aren't all that you need. (arXiv:2401.00698v1 [cs.CL])

Title: ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios. (arXiv:2401.00741v1 [cs.CL])

Title: Temporal Validity Change Prediction. (arXiv:2401.00779v1 [cs.CL])

Title: Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models. (arXiv:2401.00788v1 [cs.CL])

Title: Taking the Next Step with Generative Artificial Intelligence: The Transformative Role of Multimodal Large Language Models in Science Education. (arXiv:2401.00832v1 [cs.AI])

Title: The Problem of Alignment. (arXiv:2401.00210v1 [cs.CL])

Title: Boosting Large Language Model for Speech Synthesis: An Empirical Study. (arXiv:2401.00246v1 [cs.CL])

Title: Evaluation is all you need. Prompting Generative Large Language Models for Annotation Tasks in the Social Sciences. A Primer using Open Models. (arXiv:2401.00284v1 [cs.CL])

Title: Improving Text Embeddings with Large Language Models. (arXiv:2401.00368v1 [cs.CL])

Title: FusionMind -- Improving question and answering with external context fusion. (arXiv:2401.00388v1 [cs.CL])

Title: RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models. (arXiv:2401.00396v1 [cs.CL])

Title: SDIF-DA: A Shallow-to-Deep Interaction Framework with Data Augmentation for Multi-modal Intent Detection. (arXiv:2401.00424v1 [cs.CL])

Title: GeoGalactica: A Scientific Large Language Model in Geoscience. (arXiv:2401.00434v1 [cs.CL])

Title: BatchEval: Towards Human-like Text Evaluation. (arXiv:2401.00437v1 [cs.CL])

Title: Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws. (arXiv:2401.00448v1 [cs.LG])

Title: HSC-GPT: A Large Language Model for Human Settlements Construction. (arXiv:2401.00504v1 [cs.CL])

Title: Neural Networks Against (and For) Self-Training: Classification with Small Labeled and Large Unlabeled Sets. (arXiv:2401.00575v1 [cs.CL])

Title: An Analysis of Embedding Layers and Similarity Scores using Siamese Neural Networks. (arXiv:2401.00582v1 [cs.CL])

Title: Predicting Anti-microbial Resistance using Large Language Models. (arXiv:2401.00642v1 [cs.CL])

Title: Benchmarking Large Language Models on Controllable Generation under Diversified Instructions. (arXiv:2401.00690v1 [cs.CL])

Title: SecFormer: Towards Fast and Accurate Privacy-Preserving Inference for Large Language Models. (arXiv:2401.00793v1 [cs.LG])

Title: If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents. (arXiv:2401.00812v1 [cs.CL])

Title: Beyond Efficiency: A Systematic Survey of Resource-Efficient Large Language Models. (arXiv:2401.00625v1 [cs.LG])

gpt

Title: GraphGPT: Graph Learning with Generative Pre-trained Transformers. (arXiv:2401.00529v1 [cs.LG])

llm

Title: LLM-Assist: Enhancing Closed-Loop Planning with Language-Based Reasoning. (arXiv:2401.00125v1 [cs.AI])

Title: keqing: knowledge-based question answering is a nature chain-of-thought mentor of LLM. (arXiv:2401.00426v1 [cs.CL])

Title: The Art of Defending: A Systematic Evaluation and Analysis of LLM Defense Strategies on Safety and Over-Defensiveness. (arXiv:2401.00287v1 [cs.CL])

Title: State of What Art? A Call for Multi-Prompt LLM Evaluation. (arXiv:2401.00595v1 [cs.CL])

Title: A Computational Framework for Behavioral Assessment of LLM Therapists. (arXiv:2401.00820v1 [cs.CL])

Title: KAXAI: An Integrated Environment for Knowledge Analysis and Explainable AI. (arXiv:2401.00193v1 [cs.LG])

long context

lora

Title: Modeling arousal potential of epistemic emotions using Bayesian information gain: Inquiry cycle driven by free energy fluctuations. (arXiv:2401.00007v1 [cs.AI])

Title: Policy Optimization with Smooth Guidance Rewards Learned from Sparse-Reward Demonstrations. (arXiv:2401.00162v1 [cs.LG])

Title: Uncertainty-Penalized Reinforcement Learning from Human Feedback with Diverse Reward LoRA Ensembles. (arXiv:2401.00243v1 [cs.LG])

Title: Client-wise Modality Selection for Balanced Multi-modal Federated Learning. (arXiv:2401.00403v1 [cs.LG])

Title: Viz: A QLoRA-based Copyright Marketplace for Legally Compliant Generative AI. (arXiv:2401.00503v1 [cs.LG])

hallucination

prompt

code

Title: Consciousness as a logically consistent and prognostic model of reality. (arXiv:2401.00005v1 [cs.AI])

Title: HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes. (arXiv:2401.00365v1 [cs.LG])

Title: Graph-Convolutional Autoencoder Ensembles for the Humanities, Illustrated with a Study of the American Slave Trade. (arXiv:2401.00824v1 [cs.LG])

Title: Enabling Smart Retrofitting and Performance Anomaly Detection for a Sensorized Vessel: A Maritime Industry Experience. (arXiv:2401.00112v1 [cs.LG])

Title: Saliency-Aware Regularized Graph Neural Network. (arXiv:2401.00755v1 [cs.LG])

chat

Title: A Survey of Personality, Persona, and Profile in Conversational Agents and Chatbots. (arXiv:2401.00609v1 [cs.CL])

retrieval augmented generation

retrieval-augmented generation

rag

Title: Distributional Reinforcement Learning-based Energy Arbitrage Strategies in Imbalance Settlement Mechanism. (arXiv:2401.00015v1 [cs.LG])

Title: Semantic Computing for Organizational Effectiveness: From Organization Theory to Practice through Semantics-Based Modelling. (arXiv:2401.00062v1 [cs.AI])

Title: Causal State Distillation for Explainable Reinforcement Learning. (arXiv:2401.00104v1 [cs.LG])

Title: DiffHybrid-UQ: Uncertainty Quantification for Differentiable Hybrid Neural Modeling. (arXiv:2401.00161v1 [cs.LG])

Title: Transformer Multivariate Forecasting: Less is More?. (arXiv:2401.00230v1 [cs.LG])

Title: Multi-spatial Multi-temporal Air Quality Forecasting with Integrated Monitoring and Reanalysis Data. (arXiv:2401.00521v1 [cs.LG])

Title: Unsupervised Outlier Detection using Random Subspace and Subsampling Ensembles of Dirichlet Process Mixtures. (arXiv:2401.00773v1 [cs.LG])

Title: Automatic Essay Scoring in a Brazilian Scenario. (arXiv:2401.00095v1 [cs.CL])

Title: Machine Translation Testing via Syntactic Tree Pruning. (arXiv:2401.00751v1 [cs.CL])

Title: Online Algorithmic Recourse by Collective Action. (arXiv:2401.00055v1 [cs.LG])

Title: Fairness-Enhancing Vehicle Rebalancing in the Ride-hailing System. (arXiv:2401.00093v1 [cs.LG])

Title: Deep Generative Symbolic Regression. (arXiv:2401.00282v1 [cs.LG])

Title: Tight Finite Time Bounds of Two-Time-Scale Linear Stochastic Approximation with Markovian Noise. (arXiv:2401.00364v1 [cs.LG])

Title: Real-Time FJ/MAC PDE Solvers via Tensorized, Back-Propagation-Free Optical PINN Training. (arXiv:2401.00413v1 [cs.LG])

Title: MSGNet: Learning Multi-Scale Inter-Series Correlations for Multivariate Time Series Forecasting. (arXiv:2401.00423v1 [cs.LG])

Title: Financial Time-Series Forecasting: Towards Synergizing Performance And Interpretability Within a Hybrid Machine Learning Approach. (arXiv:2401.00534v1 [cs.LG])

Title: Federated Class-Incremental Learning with New-Class Augmented Self-Distillation. (arXiv:2401.00622v1 [cs.LG])

Title: Adversarially Trained Actor Critic for offline CMDPs. (arXiv:2401.00629v1 [cs.LG])

multi-run

chain-of-thought

tree-of-thought

agent

Title: Building Open-Ended Embodied Agent via Language-Policy Bidirectional Adaptation. (arXiv:2401.00006v1 [cs.AI])

Title: Principal-Agent Reward Shaping in MDPs. (arXiv:2401.00298v1 [cs.AI])

Title: Bidirectional Temporal Plan Graph: Enabling Switchable Passing Orders for More Efficient Multi-Agent Path Finding Plan Execution. (arXiv:2401.00315v1 [cs.AI])

Title: Efficient Two-Phase Offline Deep Reinforcement Learning from Preference Feedback. (arXiv:2401.00330v1 [cs.LG])