language model

Title: Steering Llama 2 via Contrastive Activation Addition. (arXiv:2312.06681v1 [cs.CL])

Title: Enhanced E-Commerce Attribute Extraction: Innovating with Decorative Relation Correction and LLAMA 2.0-Based Annotation. (arXiv:2312.06684v1 [cs.AI])

Title: Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models. (arXiv:2312.06685v1 [cs.AI])

Title: Privacy Issues in Large Language Models: A Survey. (arXiv:2312.06717v1 [cs.AI])

Title: Building Trustworthy NeuroSymbolic AI Systems: Consistency, Reliability, Explainability, and Safety. (arXiv:2312.06798v1 [cs.AI])

Title: LLF-Bench: Benchmark for Interactive Learning from Language Feedback. (arXiv:2312.06853v1 [cs.AI])

Title: SM70: A Large Language Model for Medical Devices. (arXiv:2312.06974v1 [cs.CL])

Title: Alignment for Honesty. (arXiv:2312.07000v1 [cs.CL])

Title: Dynamic Corrective Self-Distillation for Better Fine-Tuning of Pretrained Models. (arXiv:2312.07028v1 [cs.CL])

Title: HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts. (arXiv:2312.07035v1 [cs.LG])

Title: Context Matter: Data-Efficient Augmentation of Large Language Models for Scientific Applications. (arXiv:2312.07069v1 [cs.CL])

Title: Efficiently Programming Large Language Models using SGLang. (arXiv:2312.07104v1 [cs.AI])

Title: Neural Machine Translation of Clinical Text: An Empirical Investigation into Multilingual Pre-Trained Language Models and Transfer-Learning. (arXiv:2312.07250v1 [cs.CL])

Title: LLMEval: A Preliminary Study on How to Evaluate Large Language Models. (arXiv:2312.07398v1 [cs.AI])

Title: Large Language Models are Clinical Reasoners: Reasoning-Aware Diagnosis Framework with Prompt-Generated Rationales. (arXiv:2312.07399v1 [cs.CL])

Title: On Diverse Preferences for Large Language Model Alignment. (arXiv:2312.07401v1 [cs.AI])

Title: Comparable Demonstrations are Important in In-Context Learning: A Novel Perspective on Demonstration Selection. (arXiv:2312.07476v1 [cs.CL])

Title: SocialStigmaQA: A Benchmark to Uncover Stigma Amplification in Generative Language Models. (arXiv:2312.07492v1 [cs.CL])

Title: Rethinking Compression: Reduced Order Modelling of Latent Features in Large Language Models. (arXiv:2312.07046v1 [cs.LG])

Title: Improving Factual Error Correction by Learning to Inject Factual Errors. (arXiv:2312.07049v1 [cs.CL])

Title: Multilingual large language models leak human stereotypes across language boundaries. (arXiv:2312.07141v1 [cs.CL])

Title: Classifying complex documents: comparing bespoke solutions to large language models. (arXiv:2312.07182v1 [cs.CL])

Title: The GUA-Speech System Description for CNVSRC Challenge 2023. (arXiv:2312.07254v1 [cs.CL])

Title: ICL Markup: Structuring In-Context Learning using Soft-Token Tags. (arXiv:2312.07405v1 [cs.CL])

Title: Humans vs Large Language Models: Judgmental Forecasting in an Era of Advanced AI. (arXiv:2312.06941v1 [cs.LG])

Title: AI Control: Improving Safety Despite Intentional Subversion. (arXiv:2312.06942v1 [cs.LG])

gpt

Title: How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary Investigation. (arXiv:2312.07424v1 [cs.LG])

Title: Safety Alignment in NLP Tasks: Weakly Aligned Summarization as an In-Context Attack. (arXiv:2312.06924v1 [cs.CL])

Title: Perseus: Removing Energy Bloat from Large Model Training. (arXiv:2312.06902v1 [cs.LG])

llm

Title: Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations. (arXiv:2312.06674v1 [cs.CL])

Title: Intelligent Virtual Assistants with LLM-based Process Automation. (arXiv:2312.06677v1 [cs.LG])

Title: Extracting Self-Consistent Causal Insights from Users Feedback with LLMs and In-context Learning. (arXiv:2312.06820v1 [cs.AI])

Title: User Friendly and Adaptable Discriminative AI: Using the Lessons from the Success of LLMs and Image Generation Models. (arXiv:2312.06826v1 [cs.AI])

Title: Divide-and-Conquer Attack: Harnessing the Power of LLM to Bypass the Censorship of Text-to-Image Generation Model. (arXiv:2312.07130v1 [cs.AI])

Title: Sequential Planning in Large Partially Observable Environments guided by LLMs. (arXiv:2312.07368v1 [cs.AI])

Title: LLMs Perform Poorly at Concept Extraction in Cyber-security Research Literature. (arXiv:2312.07110v1 [cs.CL])

Title: FairSISA: Ensemble Post-Processing to Improve Fairness of Unlearning in LLMs. (arXiv:2312.07420v1 [cs.LG])

long context

Title: SCCA: Shifted Cross Chunk Attention for long contextual semantic expansion. (arXiv:2312.07305v1 [cs.CL])

lora

Title: Perceiving University Student's Opinions from Google App Reviews. (arXiv:2312.06705v1 [cs.CL])

Title: Forced Exploration in Bandit Problems. (arXiv:2312.07285v1 [cs.LG])

Title: Complex Recurrent Spectral Network. (arXiv:2312.07296v1 [cs.LG])

hallucination

prompt

Title: AI capabilities can be significantly improved without expensive retraining. (arXiv:2312.07413v1 [cs.AI])

Title: Get an A in Math: Progressive Rectification Prompting. (arXiv:2312.06867v1 [cs.CL])

code

Title: Evolving Reservoirs for Meta Reinforcement Learning. (arXiv:2312.06695v1 [cs.LG])

Title: A method for recovery of multidimensional time series based on the detection of behavioral patterns and the use of autoencoders. (arXiv:2312.06727v1 [cs.AI])

Title: Unsupervised Extractive Summarization with Learnable Length Control Strategies. (arXiv:2312.06901v1 [cs.AI])

Title: Patch-MI: Enhancing Model Inversion Attacks via Patch-Based Reconstruction. (arXiv:2312.07040v1 [cs.AI])

Title: Toward Robustness in Multi-label Classification: A Data Augmentation Strategy against Imbalance and Noise. (arXiv:2312.07087v1 [cs.LG])

Title: BED: Bi-Encoder-Decoder Model for Canonical Relation Extraction. (arXiv:2312.07088v1 [cs.CL])

Title: Neural Reasoning About Agents' Goals, Preferences, and Actions. (arXiv:2312.07122v1 [cs.AI])

Title: Dozerformer: Sequence Adaptive Sparse Transformer for Multivariate Time Series Forecasting. (arXiv:2312.06874v1 [cs.LG])

Title: DiffuVST: Narrating Fictional Scenes with Global-History-Guided Denoising Models. (arXiv:2312.07066v1 [cs.CL])

Title: GIST: Improving Parameter Efficient Fine Tuning via Knowledge Interaction. (arXiv:2312.07255v1 [cs.CL])

Title: Predictive variational autoencoder for learning robust representations of time-series data. (arXiv:2312.06932v1 [cs.LG])

chat

retrieval augmented generation

rag

Title: Adversarial Estimation of Topological Dimension with Harmonic Score Maps. (arXiv:2312.06869v1 [cs.LG])

Title: Understanding and Leveraging the Learning Phases of Neural Networks. (arXiv:2312.06887v1 [cs.LG])

Title: Noise Distribution Decomposition based Multi-Agent Distributional Reinforcement Learning. (arXiv:2312.07025v1 [cs.AI])

Title: Meta-survey on outlier and anomaly detection. (arXiv:2312.07101v1 [cs.AI])

Title: Equivariant Flow Matching with Hybrid Probability Transport. (arXiv:2312.07168v1 [cs.LG])

Title: Verbreitungsmechanismen sch\"adigender Sprache im Netz: Anatomie zweier Shitstorms. (arXiv:2312.07194v1 [cs.CL])

Title: A Novel Differentiable Loss Function for Unsupervised Graph Neural Networks in Graph Partitioning. (arXiv:2312.06877v1 [cs.LG])

Title: Feature Norm Regularized Federated Learning: Transforming Skewed Distributions into Global Insights. (arXiv:2312.06951v1 [cs.LG])

Title: General Tail Bounds for Non-Smooth Stochastic Mirror Descent. (arXiv:2312.07142v1 [cs.LG])

Title: Coupled Confusion Correction: Learning from Crowds with Sparse Annotations. (arXiv:2312.07331v1 [cs.LG])

multi-run

chain-of-thought

tree-of-thought