2024-10-10

Title: Output Scouting: Auditing Large Language Models for Catastrophic Responses

Title: Falcon Mamba: The First Competitive Attention-free 7B Language Model

Title: LLMs Are In-Context Reinforcement Learners

Title: Post-hoc Study of Climate Microtargeting on Social Media Ads with LLMs: Thematic Insights and Fairness Evaluation

Title: Neural machine translation system for Lezgian, Russian and Azerbaijani languages

Title: Self-rationalization improves LLM as a fine-grained judge

Title: On Instruction-Finetuning Neural Machine Translation Models

Title: Narrative-of-Thought: Improving Temporal Reasoning of Large Language Models via Recounted Narratives

Title: Attribute Controlled Fine-tuning for Large Language Models: A Case Study on Detoxification

Title: Rational Metareasoning for Large Language Models

Title: ClaimBrush: A Novel Framework for Automated Patent Claim Refinement Based on Large Language Models

Title: Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve?

Title: ParallelSpec: Parallel Drafter for Efficient Speculative Decoding

Title: Bridging Modalities: Enhancing Cross-Modality Hate Speech Detection with Few-Shot In-Context Learning

Title: Multimodal Large Language Models and Tunings: Vision, Language, Sensors, Audio, and Beyond

Title: Stereotype or Personalization? User Identity Biases Chatbot Recommendations

Title: Vector-ICL: In-context Learning with Continuous Vector Representations

Title: DecorateLM: Data Engineering through Corpus Rating, Tagging, and Editing with Language Models

Title: Unlocking the Boundaries of Thought: A Reasoning Granularity Framework to Quantify and Optimize Chain-of-Thought

Title: Efficient Few-shot Learning for Multi-label Classification of Scientific Documents with Many Classes

Title: CodeCipher: Learning to Obfuscate Source Code Against LLMs

Title: Retrieving, Rethinking and Revising: The Chain-of-Verification Can Improve Retrieval Augmented Generation

Title: Gradual Learning: Optimizing Fine-Tuning with Partially Mastered Knowledge in Large Language Models

Title: Probing Language Models on Their Knowledge Source

Title: A Zero-Shot approach to the Conversational Tree Search Task

Title: Multi-Session Client-Centered Treatment Outcome Evaluation in Psychotherapy

Title: From Tokens to Words: on the inner lexicon of LLMs

Title: MEXA: Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment

Title: Automatic Summarization of Long Documents

Title: Give me a hint: Can LLMs take a hint to solve math problems?

Title: Long-Context LLMs Meet RAG: Overcoming Challenges for Long Inputs in RAG

Title: Can Language Models Induce Grammatical Knowledge from Indirect Evidence?

Title: Training-free LLM-generated Text Detection by Mining Token Probability Sequences

Title: TOWER: Tree Organized Weighting for Evaluating Complex Instructions

Title: Listen to the Patient: Enhancing Medical Dialogue Generation with Patient Hallucination Detection and Mitigation

Title: Decoding Decoded: Understanding Hyperparameter Effects in Open-Ended Text Generation

Title: Less is More: Making Smaller Language Models Competent Subgraph Retrievers for Multi-hop KGQA

Title: AgentSquare: Automatic LLM Agent Search in Modular Design Space

Title: Manual Verbalizer Enrichment for Few-Shot Text Classification

Title: Entering Real Social World! Benchmarking the Theory of Mind and Socialization Capabilities of LLMs from a First-person Perspective

Title: Integrating Planning into Single-Turn Long-Form Text Generation

Title: Round and Round We Go! What makes Rotary Positional Encodings useful?

Title: DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback

Title: Probing the Robustness of Theory of Mind in Large Language Models

Title: The Mystery of Compositional Generalization in Graph-based Generative Commonsense Reasoning

Title: Fine-grained Hallucination Detection and Mitigation in Language Model Mathematical Reasoning

Title: Auto-Evolve: Enhancing Large Language Model's Performance via Self-Reasoning Framework

Title: Locate-then-edit for Multi-hop Factual Recall under Knowledge Editing

Title: Are Large Language Models State-of-the-art Quality Estimators for Machine Translation of User-generated Content?

Title: Counterfactual Causal Inference in Natural Language with Large Language Models

Title: MLissard: Multilingual Long and Simple Sequential Reasoning Benchmarks

Title: ERVQA: A Dataset to Benchmark the Readiness of Large Vision Language Models in Hospital Environments

Title: LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple Constraints

Title: LLM Compression with Neural Architecture Search

Title: On the Similarity of Circuits across Languages: a Case Study on the Subject-verb Agreement Task

Title: TorchTitan: One-stop PyTorch native solution for production ready LLM pre-training

Title: SEGMENT+: Long Text Processing with Short-Context Language Models

Title: A Novel LLM-based Two-stage Summarization Approach for Long Dialogues

Title: Do great minds think alike? Investigating Human-AI Complementarity in Question Answering with CAIMIRA

Title: Chip-Tuning: Classify Before Language Models Say

Title: TuringQ: Benchmarking AI Comprehension in Theory of Computation

Title: Investigating Cost-Efficiency of LLM-Generated Training Data for Conversational Semantic Frame Analysis

Title: The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Language Models

Title: ING-VP: MLLMs cannot Play Easy Vision-based Games Yet

Title: Detecting Bias and Enhancing Diagnostic Accuracy in Large Language Models for Healthcare

Title: Rodimus*: Breaking the Accuracy-Efficiency Trade-Off with Efficient Attentions

Title: Dissecting Fine-Tuning Unlearning in Large Language Models

Title: $\beta$-calibration of Language Model Confidence Scores for Generative QA

Title: Learning Evolving Tools for Large Language Models

Title: Tree of Problems: Improving structured problem solving with compositionality

Title: Subtle Errors Matter: Preference Learning via Error-injected Self-editing

Title: Large Language Models as Code Executors: An Exploratory Study

Title: Towards Universality: Studying Mechanistic Similarity Across Language Model Architectures

Title: PII-Scope: A Benchmark for Training Data PII Leakage Assessment in LLMs

Title: Calibrating Verbalized Probabilities for Large Language Models

Title: Guaranteed Generation from Large Language Models

Title: Scaling Laws for Mixed quantization in Large Language Models

Title: Weak-eval-Strong: Evaluating and Eliciting Lateral Thinking of LLMs with Situation Puzzles

Title: Which Programming Language and What Features at Pre-training Stage Affect Downstream Logical Inference Performance?

Title: CoBa: Convergence Balancer for Multitask Finetuning of Large Language Models

Title: To Preserve or To Compress: An In-Depth Study of Connector Selection in Multimodal Large Language Models

Title: From Pixels to Tokens: Revisiting Object Hallucinations in Large Vision-Language Models

Title: Root Defence Strategies: Ensuring Safety of LLM at the Decoding Level

Title: MentalArena: Self-play Training of Language Models for Diagnosis and Treatment of Mental Health Disorders

Title: Joint Fine-tuning and Conversion of Pretrained Speech and Language Models towards Linear Complexity

Title: FltLM: An Intergrated Long-Context Large Language Model for Effective Context Filtering and Understanding

Title: Generative Model for Less-Resourced Language with 1 billion parameters

Title: Utilize the Flow before Stepping into the Same River Twice: Certainty Represented Knowledge Flow for Refusal-Aware Instruction Tuning

Title: SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration

Title: CSSL: Contrastive Self-Supervised Learning for Dependency Parsing on Relatively Free Word Ordered and Morphologically Rich Low Resource Languages

Title: Self-Boosting Large Language Models with Synthetic Preference Data

Title: Uncovering Factor Level Preferences to Improve Human-Model Alignment

Title: Personal Intelligence System UniLM: Hybrid On-Device Small Language Model and Server-Based Large Language Model for Malay Nusantara

Title: CursorCore: Assist Programming through Aligning Anything

Title: Pap2Pat: Towards Automated Paper-to-Patent Drafting using Chunk-based Outline-guided Generation

Title: PositionID: LLMs can Control Lengths, Copy and Paste with Explicit Positional Awareness

Title: Mitigating the Language Mismatch and Repetition Issues in LLM-based Machine Translation via Model Editing

Title: Data Selection via Optimal Control for Language Models

Title: ReIFE: Re-evaluating Instruction-Following Evaluation

Title: MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses

Title: Stanceformer: Target-Aware Transformer for Stance Detection

Title: MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering

Title: Unleashing Multi-Hop Reasoning Potential in Large Language Models through Repetition of Misordered Context

Title: I Want to Break Free! Anti-Social Behavior and Persuasion Ability of LLMs in Multi-Agent Settings with Social Hierarchy

Title: Exploring the Readiness of Prominent Small Language Models for the Democratization of Financial Literacy

Title: Mental Disorders Detection in the Era of Large Language Models

Title: Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates

Title: Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling

Title: Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning

Title: Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making

Title: Sylber: Syllabic Embedding Representation of Speech from Raw Audio

Title: Do better language models have crisper vision?

Title: Astute RAG: Overcoming Imperfect Retrieval Augmentation and Knowledge Conflicts for Large Language Models