2025-02-20

Title: Private Text Generation by Seeding Large Language Model Prompts

Title: Thinking Outside the (Gray) Box: A Context-Based Score for Assessing Value and Originality in Neural Text Generation

Title: SearchRAG: Can Search Engines Be Helpful for LLM-based Medical Question Answering?

Title: When People are Floods: Analyzing Dehumanizing Metaphors in Immigration Discourse with Large Language Models

Title: Grounding LLM Reasoning with Knowledge Graphs

Title: Neural Attention Search

Title: Multilingual Language Model Pretraining using Machine-translated Data

Title: HumT DumT: Measuring and controlling human-like language in LLMs

Title: Stepwise Perplexity-Guided Refinement for Efficient Chain-of-Thought Reasoning in Large Language Models

Title: REALTALK: A 21-Day Real-World Dataset for Long-Term Conversation

Title: Understanding and Tackling Label Errors in Individual-Level Nature Language Understanding

Title: Improving Multi-turn Task Completion in Task-Oriented Dialog Systems via Prompt Chaining and Fine-Grained Feedback

Title: Evaluating and Enhancing Out-of-Domain Generalization of Task-Oriented Dialog Systems for Task Completion without Turn-level Dialog Annotations

Title: Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors

Title: Elucidating Mechanisms of Demographic Bias in LLMs for Healthcare

Title: Language Models Can Predict Their Own Behavior

Title: Language Models are Few-Shot Graders

Title: Craw4LLM: Efficient Web Crawling for LLM Pretraining

Title: Event Segmentation Applications in Large Language Model Enabled Automated Recall Assessments

Title: Bridging the Editing Gap in LLMs: FineEdit for Precise and Targeted Text Modifications

Title: RGAR: Recurrence Generation-augmented Retrieval for Factual-aware Medical Question Answering

Title: Reducing Hallucinations in Language Model-based SPARQL Query Generation Using Post-Generation Memory Retrieval

Title: Task-agnostic Prompt Compression with Context-aware Sentence Embedding and Reward-guided Task Descriptor

Title: MM-Verify: Enhancing Multimodal Reasoning with Chain-of-Thought Verification

Title: Prompting a Weighting Mechanism into LLM-as-a-Judge in Two-Step: A Case Study

Title: Detecting LLM Fact-conflicting Hallucinations Enhanced by Temporal-logic-based Reasoning

Title: RLTHF: Targeted Human Feedback for LLM Alignment

Title: TabSD: Large Free-Form Table Question Answering with SQL-Based Table Decomposition

Title: MCTS-KBQA: Monte Carlo Tree Search for Knowledge Base Question Answering

Title: The Self-Improvement Paradox: Can Language Models Bootstrap Reasoning Capabilities without External Scaffolding?

Title: TreeCut: A Synthetic Unanswerable Math Word Problem Dataset for LLM Hallucination Evaluation

Title: ThinkGuard: Deliberative Slow Thinking Leads to Cautious Guardrails

Title: Estimating Commonsense Plausibility through Semantic Shifts

Title: Towards Lightweight, Adaptive and Attribute-Aware Multi-Aspect Controllable Text Generation with Large Language Models

Title: LLM should think and action as a human

Title: Transferring Textual Preferences to Vision-Language Understanding through Model Merging

Title: What are Models Thinking about? Understanding Large Language Model Hallucinations "Psychology" through Model Inner State Analysis

Title: Towards Geo-Culturally Grounded LLM Generations

Title: PLDR-LLMs Learn A Generalizable Tensor Operator That Can Replace Its Own Deep Neural Net At Inference

Title: Unlocking Multimodal Integration in EHRs: A Prompt Learning Framework for Language and Time Series Fusion

Title: Shall Your Data Strategy Work? Perform a Swift Study

Title: Activation-aware Probe-Query: Effective Key-Value Retrieval for Long-Context LLMs Inference

Title: From Sub-Ability Diagnosis to Human-Aligned Generation: Bridging the Gap for Text Length Control via MARKERGEN

Title: Detecting Linguistic Bias in Government Documents Using Large Language Models

Title: STaR-SQL: Self-Taught Reasoner for Text-to-SQL

Title: PRIV-QA: Privacy-Preserving Question Answering for Cloud Large Language Models

Title: Extracting Social Connections from Finnish Karelian Refugee Interviews Using LLMs

Title: Don't Stop the Multi-Party! On Generating Synthetic Multi-Party Conversations with Constraints

Title: MMTEB: Massive Multilingual Text Embedding Benchmark

Title: Efficient Safety Retrofitting Against Jailbreaking for LLMs

Title: BeamLoRA: Beam-Constraint Low-Rank Adaptation

Title: Complex Ontology Matching with Large Language Model Embeddings

Title: REFIND: Retrieval-Augmented Factuality Hallucination Detection in Large Language Models

Title: Qorgau: Evaluating LLM Safety in Kazakh-Russian Bilingual Contexts

Title: D.Va: Validate Your Demonstration First Before You Use It

Title: Instruction Tuning on Public Government and Cultural Data for Low-Resource Language: a Case Study in Kazakh

Title: Reliability Across Parametric and External Knowledge: Understanding Knowledge Handling in LLMs

Title: C2T: A Classifier-Based Tree Construction Method in Speculative Decoding

Title: Refining Sentence Embedding Model through Ranking Sentences Generation with Large Language Models

Title: SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text Generation

Title: Is This Collection Worth My LLM's Time? Automatically Measuring Information Potential in Text Corpora

Title: Direct Value Optimization: Improving Chain-of-Thought Reasoning in LLMs with Refined Values

Title: Adapting Large Language Models for Time Series Modeling via a Novel Parameter-efficient Adaptation Method

Title: Enhancing Input-Label Mapping in In-Context Learning with Contrastive Decoding

Title: SCALAR: Scientific Citation-based Live Assessment of Long-context Academic Reasoning

Title: GIMMICK -- Globally Inclusive Multimodal Multitask Cultural Knowledge Benchmarking

Title: VITAL: A New Dataset for Benchmarking Pluralistic Alignment in Healthcare

Title: EHOP: A Dataset of Everyday NP-Hard Optimization Problems

Title: Translation in the Hands of Many: Centering Lay Users in Machine Translation Interactions

Title: From Tools to Teammates: Evaluating LLMs in Multi-Session Coding Interactions

Title: Inner Thinking Transformer: Leveraging Dynamic Depth Scaling to Foster Adaptive Internal Thinking

Title: DH-RAG: A Dynamic Historical Context-Powered Retrieval-Augmented Generation Method for Multi-Turn Dialogue

Title: DataSciBench: An LLM Agent Benchmark for Data Science

Title: How Do LLMs Perform Two-Hop Reasoning in Context?

Title: TESS 2: A Large-Scale Generalist Diffusion Language Model

Title: LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization

Title: Beyond Single Frames: Can LMMs Comprehend Temporal and Contextual Narratives in Image Sequences?

Title: Why Safeguarded Ships Run Aground? Aligned Large Language Models' Safety Mechanisms Tend to Be Anchored in The Template Region

Title: RAG-Gym: Optimizing Reasoning and Search Agents with Process Supervision

Title: LIDDIA: Language-based Intelligent Drug Discovery Agent

Title: Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering

Title: MuDAF: Long-Context Multi-Document Attention Focusing through Contrastive Learning on Attention Heads