language model

Title: LDM$^2$: A Large Decision Model Imitating Human Cognition with Dynamic Memory Enhancement. (arXiv:2312.08402v1 [cs.LG])

Title: Contractive error feedback for gradient compression. (arXiv:2312.08538v1 [cs.LG])

Title: Learning adaptive planning representations with natural language guidance. (arXiv:2312.08566v1 [cs.AI])

Title: Multi-modal Latent Space Learning for Chain-of-Thought Reasoning in Language Models. (arXiv:2312.08762v1 [cs.AI])

Title: Evaluating Large Language Models for Health-related Queries with Presuppositions. (arXiv:2312.08800v1 [cs.CL])

Title: Modeling Complex Mathematical Reasoning via Large Language Model based MathAgent. (arXiv:2312.08926v1 [cs.AI])

Title: LiFT: Unsupervised Reinforcement Learning with Foundation Models as Teachers. (arXiv:2312.08958v1 [cs.LG])

Title: Unbiased organism-agnostic and highly sensitive signal peptide predictor with deep protein language model. (arXiv:2312.08987v1 [cs.AI])

Title: Identifying Planetary Names in Astronomy Papers: A Multi-Step Approach. (arXiv:2312.08579v1 [cs.CL])

Title: Zebra: Extending Context Window with Layerwise Grouped Local-Global Attention. (arXiv:2312.08618v1 [cs.CL])

Title: Dissecting vocabulary biases datasets through statistical testing and automated data augmentation for artifact mitigation in Natural Language Inference. (arXiv:2312.08747v1 [cs.CL])

Title: Context-PEFT: Efficient Multi-Modal, Multi-Task Fine-Tuning. (arXiv:2312.08900v1 [cs.LG])

gpt

Title: Heterogeneous Graph Neural Architecture Search with GPT-4. (arXiv:2312.08680v1 [cs.AI])

Title: Detecting value-expressive text posts in Russian social media. (arXiv:2312.08968v1 [cs.CL])

Title: Beyond Accuracy: Automated De-Identification of Large Real-World Clinical Text Datasets. (arXiv:2312.08495v1 [cs.CL])

llm

Title: Beyond English: Evaluating LLMs for Arabic Grammatical Error Correction. (arXiv:2312.08400v1 [cs.CL])

Title: ChatSOS: LLM-based knowledge Q&A system for safety engineering. (arXiv:2312.08629v1 [cs.AI])

Title: TigerBot: An Open Multilingual Multitask LLM. (arXiv:2312.08688v1 [cs.CL])

Title: Rational Sensibility: LLM Enhanced Empathetic Response Generation Guided by Self-presentation Theory. (arXiv:2312.08702v1 [cs.AI])

Title: Forbidden Facts: An Investigation of Competing Objectives in Llama-2. (arXiv:2312.08793v1 [cs.LG])

Title: Boosting LLM Reasoning: Push the Limits of Few-shot Learning with Reinforced In-Context Pruning. (arXiv:2312.08901v1 [cs.CL])

Title: Math-Shepherd: A Label-Free Step-by-Step Verifier for LLMs in Mathematical Reasoning. (arXiv:2312.08935v1 [cs.AI])

Title: ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks. (arXiv:2312.08583v1 [cs.CL])

Title: A Comparative Analysis of Fine-Tuned LLMs and Few-Shot Learning of LLMs for Financial Sentiment Analysis. (arXiv:2312.08725v1 [cs.LG])

long context

lora

Title: Artificial Intelligence and Human Geography. (arXiv:2312.08827v1 [cs.AI])

Title: Fair Active Learning in Low-Data Regimes. (arXiv:2312.08559v1 [cs.LG])

Title: A Cyber-Physical Architecture for Microgrids based on Deep learning and LORA Technology. (arXiv:2312.08818v1 [cs.LG])

hallucination

prompt

Title: Metacognition-Enhanced Few-Shot Prompting With Positive Reinforcement. (arXiv:2312.08642v1 [cs.CL])

Title: Labels Need Prompts Too Mask Matching for Natural Language Understanding Tasks. (arXiv:2312.08726v1 [cs.CL])

Title: MotherNet: A Foundational Hypernetwork for Tabular Classification. (arXiv:2312.08598v1 [cs.LG])

code

Title: ALGNet: Attention Light Graph Memory Network for Medical Recommendation System. (arXiv:2312.08377v1 [cs.AI])

Title: Earthfarseer: Versatile Spatio-Temporal Dynamical Systems Modeling in One Model. (arXiv:2312.08403v1 [cs.AI])

Title: Harmonics of Learning: Universal Fourier Features Emerge in Invariant Networks. (arXiv:2312.08550v1 [cs.LG])

Title: CAT: A Causally Graph Attention Network for Trimming Heterophilic Graph. (arXiv:2312.08672v1 [cs.LG])

Title: Gradient Informed Proximal Policy Optimization. (arXiv:2312.08710v1 [cs.LG])

Title: Automated Process Planning Based on a Semantic Capability Model and SMT. (arXiv:2312.08801v1 [cs.AI])

Title: JPIS: A Joint Model for Profile-based Intent Detection and Slot Filling with Slot-to-Intent Attention. (arXiv:2312.08737v1 [cs.CL])

Title: Accelerating Meta-Learning by Sharing Gradients. (arXiv:2312.08398v1 [cs.LG])

Title: ERASE: Error-Resilient Representation Learning on Graphs for Label Noise Tolerance. (arXiv:2312.08852v1 [cs.LG])

Title: Global Rewards in Multi-Agent Deep Reinforcement Learning for Autonomous Mobility on Demand Systems. (arXiv:2312.08884v1 [cs.LG])

Title: BiPFT: Binary Pre-trained Foundation Transformer with Low-rank Estimation of Binarization Residual Polynomials. (arXiv:2312.08937v1 [cs.LG])

Title: EAT: Towards Long-Tailed Out-of-Distribution Detection. (arXiv:2312.08939v1 [cs.LG])

Title: Uncertainty in GNN Learning Evaluations: A Comparison Between Measures for Quantifying Randomness in GNN Community Detection. (arXiv:2312.09015v1 [cs.LG])

chat

retrieval augmented generation

rag

Title: Personalized Decision Supports based on Theory of Mind Modeling and Explainable Reinforcement Learning. (arXiv:2312.08397v1 [cs.LG])

Title: How much can change in a year? Revisiting Evaluation in Multi-Agent Reinforcement Learning. (arXiv:2312.08463v1 [cs.AI])

Title: On Diagnostics for Understanding Agent Training Behaviour in Cooperative MARL. (arXiv:2312.08468v1 [cs.AI])

Title: Revisiting Recommendation Loss Functions through Contrastive Learning (Technical Report). (arXiv:2312.08520v1 [cs.AI])

Title: World Models via Policy-Guided Trajectory Diffusion. (arXiv:2312.08533v1 [cs.LG])

Title: Adaptive Shortcut Debiasing for Online Continual Learning. (arXiv:2312.08677v1 [cs.LG])

Title: Learning Safety Constraints From Demonstration Using One-Class Decision Trees. (arXiv:2312.08837v1 [cs.LG])

Title: Diffusion-C: Unveiling the Generative Challenges of Diffusion Models through Corrupted Data. (arXiv:2312.08843v1 [cs.LG])

Title: Knowledge-Driven Modulation of Neural Networks with Attention Mechanism for Next Activity Prediction. (arXiv:2312.08847v1 [cs.AI])

Title: Weighted Ensemble Models Are Strong Continual Learners. (arXiv:2312.08977v1 [cs.LG])

Title: PROPRES: Investigating the Projectivity of Presupposition with Various Triggers and Environments. (arXiv:2312.08755v1 [cs.CL])

Title: Simplicial Representation Learning with Neural $k$-forms. (arXiv:2312.08515v1 [cs.LG])

Title: Occupancy Detection Based on Electricity Consumption. (arXiv:2312.08535v1 [cs.LG])

Title: Estimating calibration error under label shift without labels. (arXiv:2312.08586v1 [cs.LG])

Title: Automated detection of Zika and dengue in Aedes aegypti using neural spiking analysis. (arXiv:2312.08654v1 [cs.LG])

Title: Read Between the Layers: Leveraging Intra-Layer Representations for Rehearsal-Free Continual Learning with Pre-Trained Models. (arXiv:2312.08888v1 [cs.LG])

multi-run

chain-of-thought

tree-of-thought