2023-12-25

language model

Title: Large Language Models in Medical Term Classification and Unexpected Misalignment Between Response and Reasoning. (arXiv:2312.14184v1 [cs.CL])

Title: Enhancing Neural Theorem Proving through Data Augmentation and Dynamic Sampling Method. (arXiv:2312.14188v1 [cs.AI])

Title: Benchmarking and Defending Against Indirect Prompt Injection Attacks on Large Language Models. (arXiv:2312.14197v1 [cs.CL])

Title: Illuminating the Black Box: A Psychometric Investigation into the Multifaceted Nature of Large Language Models. (arXiv:2312.14202v1 [cs.CL])

Title: Experimenting with Large Language Models and vector embeddings in NASA SciX. (arXiv:2312.14211v1 [cs.CL])

Title: SimLM: Can Language Models Infer Parameters of Physical Systems?. (arXiv:2312.14215v1 [cs.CL])

Title: Deep de Finetti: Recovering Topic Distributions from Large Language Models. (arXiv:2312.14226v1 [cs.CL])

Title: Don't Believe Everything You Read: Enhancing Summarization Interpretability through Automatic Identification of Hallucinations in Large Language Models. (arXiv:2312.14346v1 [cs.CL])

Title: A Unified Industrial Large Knowledge Model Framework in Smart Manufacturing. (arXiv:2312.14428v1 [cs.LG])

Title: Language Model is a Branch Predictor for Simultaneous Machine Translation. (arXiv:2312.14488v1 [cs.CL])

Title: Large Language Model (LLM) Bias Index -- LLMBI. (arXiv:2312.14769v1 [cs.CL])

Title: YAYI 2: Multilingual Open-Source Large Language Models. (arXiv:2312.14862v1 [cs.CL])

Title: Robust Knowledge Extraction from Large Language Models using Social Choice Theory. (arXiv:2312.14877v1 [cs.CL])

Title: NPHardEval: Dynamic Benchmark on Reasoning Ability of Large Language Models via Complexity Classes. (arXiv:2312.14890v1 [cs.AI])

Title: Dynamic Topic Language Model on Heterogeneous Children's Mental Health Clinical Notes. (arXiv:2312.14180v1 [cs.CL])

Title: Efficacy of Machine-Generated Instructions. (arXiv:2312.14423v1 [cs.CL])

Title: Reasons to Reject? Aligning Language Models with Judgments. (arXiv:2312.14591v1 [cs.CL])

Title: Numerical Reasoning for Financial Reports. (arXiv:2312.14870v1 [cs.CL])

Title: A Survey of Reinforcement Learning from Human Feedback. (arXiv:2312.14925v1 [cs.LG])

gpt

Title: Generative Pretraining at Scale: Transformer-Based Encoding of Transactional Behavior for Fraud Detection. (arXiv:2312.14406v1 [cs.LG])

llm

Title: Logic-Scaffolding: Personalized Aspect-Instructed Recommendation Explanation Generation using LLMs. (arXiv:2312.14345v1 [cs.AI])

Title: Zero-shot Causal Graph Extrapolation from Text via LLMs. (arXiv:2312.14670v1 [cs.AI])

Title: Parameter Efficient Tuning Allows Scalable Personalization of LLMs for Text Entry: A Case Study on Abbreviation Expansion. (arXiv:2312.14327v1 [cs.CL])

long context

lora

Title: SEOpinion: Summarization and Exploration Opinion of E-Commerce Websites. (arXiv:2312.14171v1 [cs.CL])

Title: Not All Tasks Are Equally Difficult: Multi-Task Reinforcement Learning with Dynamic Depth Routing. (arXiv:2312.14472v1 [cs.AI])

Title: Safe Reinforcement Learning with Instantaneous Constraints: The Role of Aggressive Exploration. (arXiv:2312.14470v1 [cs.LG])

Title: Progressing from Anomaly Detection to Automated Log Labeling and Pioneering Root Cause Analysis. (arXiv:2312.14748v1 [cs.LG])

hallucination

Title: On Early Detection of Hallucinations in Factual Question Answering. (arXiv:2312.14183v1 [cs.CL])

Title: Context-aware Decoding Reduces Hallucination in Query-focused Summarization. (arXiv:2312.14335v1 [cs.CL])

Title: Theory of Hallucinations based on Equivariance. (arXiv:2312.14504v1 [cs.CL])

prompt

Title: SIG: Speaker Identification in Literature via Prompt-Based Generation. (arXiv:2312.14590v1 [cs.CL])

Title: Asymmetric Bias in Text-to-Image Generation with Adversarial Attacks. (arXiv:2312.14440v1 [cs.LG])

Title: Fast-NTK: Parameter-Efficient Unlearning for Large-Scale Models. (arXiv:2312.14923v1 [cs.LG])

code

Title: WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation. (arXiv:2312.14187v1 [cs.CL])

Title: Hierarchical Topology Isomorphism Expertise Embedded Graph Contrastive Learning. (arXiv:2312.14222v1 [cs.LG])

Title: ADA-GAD: Anomaly-Denoised Autoencoders for Graph Anomaly Detection. (arXiv:2312.14535v1 [cs.LG])

Title: The Rate-Distortion-Perception-Classification Tradeoff: Joint Source Coding and Modulation via Inverse-Domain GANs. (arXiv:2312.14792v1 [cs.LG])

Title: TACO: Topics in Algorithmic COde generation dataset. (arXiv:2312.14852v1 [cs.AI])

Title: Balancing the Style-Content Trade-Off in Sentiment Transfer Using Polarity-Aware Denoising. (arXiv:2312.14708v1 [cs.CL])

Title: Semantic Parsing for Complex Data Retrieval: Targeting Query Plans vs. SQL for No-Code Access to Relational Databases. (arXiv:2312.14798v1 [cs.CL])

Title: Invariant Anomaly Detection under Distribution Shifts: A Causal Perspective. (arXiv:2312.14329v1 [cs.LG])

Title: PUMA: Efficient Continual Graph Learning with Graph Condensation. (arXiv:2312.14439v1 [cs.LG])

Title: SAVAE: Leveraging the variational Bayes autoencoder for survival analysis. (arXiv:2312.14651v1 [cs.LG])

Title: Spatiotemporal-Linear: Towards Universal Multivariate Time Series Forecasting. (arXiv:2312.14869v1 [cs.LG])

chat

Title: Aurora:Activating Chinese chat capability for Mistral-8x7B sparse Mixture-of-Experts through Instruction-Tuning. (arXiv:2312.14557v1 [cs.CL])

retrieval augmented generation

rag

Title: Auto311: A Confidence-guided Automated System for Non-emergency Call. (arXiv:2312.14185v1 [cs.CL])

Title: Real-time Neural Network Inference on Extremely Weak Devices: Agile Offloading with Explainable AI. (arXiv:2312.14229v1 [cs.LG])

Title: Adaptive Reconvergence-driven AIG Rewriting via Strategy Learning. (arXiv:2312.14536v1 [cs.AI])

Title: Collaborative Synthesis of Patient Records through Multi-Visit Health State Inference. (arXiv:2312.14646v1 [cs.AI])

Title: Automatic Data Retrieval for Cross Lingual Summarization. (arXiv:2312.14542v1 [cs.CL])

Title: Clustering and Uncertainty Analysis to Improve the Machine Learning-based Predictions of SAFARI-1 Control Follower Assembly Axial Neutron Flux Profiles. (arXiv:2312.14193v1 [cs.LG])

Title: Fine-grained Forecasting Models Via Gaussian Process Blurring Effect. (arXiv:2312.14280v1 [cs.LG])

Title: Federated Quantum Long Short-term Memory (FedQLSTM). (arXiv:2312.14309v1 [cs.LG])

Title: Training Neural Networks with Internal State, Unconstrained Connectivity, and Discrete Activations. (arXiv:2312.14359v1 [cs.LG])

Title: Graph Attention-Based Symmetry Constraint Extraction for Analog Circuits. (arXiv:2312.14405v1 [cs.LG])

Title: Room Occupancy Prediction: Exploring the Power of Machine Learning and Temporal Insights. (arXiv:2312.14426v1 [cs.LG])

Title: How to Overcome Curse-of-Dimensionality for Out-of-Distribution Detection?. (arXiv:2312.14452v1 [cs.LG])

Title: Non-Denoising Forward-Time Diffusions. (arXiv:2312.14589v1 [cs.LG])

multi-run

chain-of-thought

tree-of-thought

agent

Title: Benchmarking Multi-Agent Preference-based Reinforcement Learning for Human-AI Teaming. (arXiv:2312.14292v1 [cs.AI])

Title: AdapTraj: A Multi-Source Domain Generalization Framework for Multi-Agent Trajectory Prediction. (arXiv:2312.14394v1 [cs.AI])

Title: The Fairness Fair: Bringing Human Perception into Collective Decision-Making. (arXiv:2312.14402v1 [cs.AI])

Title: Hierarchical Multi-Agent Reinforcement Learning for Assessing False-Data Injection Attacks on Transportation Networks. (arXiv:2312.14625v1 [cs.AI])

Title: Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning. (arXiv:2312.14878v1 [cs.AI])

Title: Multi-Agent Bandit Learning through Heterogeneous Action Erasure Channels. (arXiv:2312.14259v1 [cs.LG])