2023-12-22

language model

Title: DSPy Assertions: Computational Constraints for Self-Refining Language Model Pipelines. (arXiv:2312.13382v1 [cs.CL])

Title: The Truth is in There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction. (arXiv:2312.13558v1 [cs.LG])

Title: On Task Performance and Model Calibration with Supervised and Self-Ensembled In-Context Learning. (arXiv:2312.13772v1 [cs.CL])

Title: Typhoon: Thai Large Language Models. (arXiv:2312.13951v1 [cs.CL])

Title: Time is Encoded in the Weights of Finetuned Language Models. (arXiv:2312.13401v1 [cs.CL])

Title: Developing Interactive Tourism Planning: A Dialogue Robot System Powered by a Large Language Model. (arXiv:2312.13545v1 [cs.CL])

Title: How to Prune Your Language Model: Recovering Accuracy on the "Sparsity May Cry" Benchmark. (arXiv:2312.13547v1 [cs.CL])

Title: Speech Translation with Large Language Models: An Industrial Practice. (arXiv:2312.13585v1 [cs.CL])

Title: Text2Analysis: A Benchmark of Table Question Answering with Advanced Data Analysis and Unclear Queries. (arXiv:2312.13671v1 [cs.CL])

Title: Exploiting Contextual Target Attributes for Target Sentiment Classification. (arXiv:2312.13766v1 [cs.CL])

Title: Capture the Flag: Uncovering Data Insights with Large Language Models. (arXiv:2312.13876v1 [cs.LG])

Title: Diversifying Knowledge Enhancement of Biomedical Language Models using Adapter Modules and Knowledge Graphs. (arXiv:2312.13881v1 [cs.CL])

Title: Structured Probabilistic Coding. (arXiv:2312.13933v1 [cs.CL])

Title: T-Eval: Evaluating the Tool Utilization Capability Step by Step. (arXiv:2312.14033v1 [cs.CL])

gpt

Title: Argue with Me Tersely: Towards Sentence-Level Counter-Argument Generation. (arXiv:2312.13608v1 [cs.CL])

Title: ChatGPT as a commenter to the news: can LLMs generate human-like opinions? (arXiv:2312.13961v1 [cs.CL])

llm

Title: In-Context Reinforcement Learning for Variable Action Spaces. (arXiv:2312.13327v1 [cs.LG])

long context

lora

Title: Domain Adaptive Graph Classification. (arXiv:2312.13536v1 [cs.LG])

Title: Risk-Sensitive Stochastic Optimal Control as Rao-Blackwellized Markovian Score Climbing. (arXiv:2312.14000v1 [cs.LG])

Title: Diffusion Reward: Learning Rewards via Conditional Video Diffusion. (arXiv:2312.14134v1 [cs.LG])

hallucination

prompt

code

Title: Multimodal Federated Learning with Missing Modality via Prototype Mask and Contrast. (arXiv:2312.13508v1 [cs.LG])

Title: Automated Clinical Coding for Outpatient Departments. (arXiv:2312.13533v1 [cs.CL])

Title: EmphAssess: a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models. (arXiv:2312.14069v1 [cs.CL])

Title: Unlocking Deep Learning: A BP-Free Approach for Parallel Block-Wise Training of Neural Networks. (arXiv:2312.13311v1 [cs.LG])

Title: MixEHR-SurG: a joint proportional hazard and guided topic model for inferring mortality-associated topics from electronic health records. (arXiv:2312.13454v1 [cs.LG])

Title: CR-SAM: Curvature Regularized Sharpness-Aware Minimization. (arXiv:2312.13555v1 [cs.LG])

Title: Adapt & Align: Continual Learning with Generative Models Latent Space Alignment. (arXiv:2312.13699v1 [cs.LG])

chat

retrieval augmented generation

Title: RealGen: Retrieval Augmented Generation for Controllable Traffic Scenarios. (arXiv:2312.13303v1 [cs.LG])

rag

Title: Fine-tuning Graph Neural Networks by Preserving Graph Generative Patterns. (arXiv:2312.13583v1 [cs.LG])

Title: Navigating the Structured What-If Spaces: Counterfactual Generation via Structured Diffusion. (arXiv:2312.13616v1 [cs.LG])

Title: ProvFL: Client-Driven Interpretability of Global Model Predictions in Federated Learning. (arXiv:2312.13632v1 [cs.LG])

Title: Critic-Guided Decision Transformer for Offline Reinforcement Learning. (arXiv:2312.13716v1 [cs.LG])

Title: Solving Long-run Average Reward Robust MDPs via Stochastic Games. (arXiv:2312.13912v1 [cs.AI])

Title: Structure-Aware Path Inference for Neural Finite State Transducers. (arXiv:2312.13614v1 [cs.LG])

Title: Fed-QSSL: A Framework for Personalized Federated Learning under Bitwidth and Data Heterogeneity. (arXiv:2312.13380v1 [cs.LG])

Title: InvertibleNetworks.jl: A Julia package for scalable normalizing flows. (arXiv:2312.13480v1 [cs.LG])

multi-run

chain-of-thought

tree-of-thought

agent

Title: Towards Fair Graph Federated Learning via Incentive Mechanisms. (arXiv:2312.13306v1 [cs.LG])

Title: Adversarial Markov Games: On Adaptive Decision-Based Attacks and Defenses. (arXiv:2312.13435v1 [cs.AI])

Title: Understanding and Estimating Domain Complexity Across Domains. (arXiv:2312.13487v1 [cs.AI])

Title: Team Flow at DRC2023: Building Common Ground and Text-based Turn-taking in a Travel Agent Spoken Dialogue System. (arXiv:2312.13816v1 [cs.CL])

Title: Learning Human-like Representations to Enable Learning Human Values. (arXiv:2312.14106v1 [cs.AI])

Title: Manipulating Trajectory Prediction with Backdoors. (arXiv:2312.13863v1 [cs.LG])