2025-01-22

Title: ArxEval: Evaluating Retrieval and Generation in Language Models for Scientific Literature

Title: Tabular-TX: Theme-Explanation Structure-based Table Summarization via In-Context Learning

Title: The Geometry of Tokens in Internal Representations of Large Language Models

Title: Adapting Large Language Models for Character-based Augmentative and Alternative Communication

Title: Iterative Tree Analysis for Medical Critics

Title: DNA 1.0 Technical Report

Title: Harnessing the Potential of Large Language Models in Modern Marketing Management: Applications, Future Directions, and Strategic Recommendations

Title: Development of Application-Specific Large Language Models to Facilitate Research Ethics Review

Title: BAP v2: An Enhanced Task Framework for Instruction Following in Minecraft Dialogues

Title: Zero-shot and Few-shot Learning with Instruction-following LLMs for Claim Matching in Automated Fact-checking

Title: Generating Structured Outputs from Language Models: Benchmark and Studies

Title: LegalGuardian: A Privacy-Preserving Framework for Secure Integration of Large Language Models in Legal Practice

Title: Leveraging Chain of Thought towards Empathetic Spoken Dialogue without Corresponding Question-Answering Data

Title: InsQABench: Benchmarking Chinese Insurance Domain Question Answering with Large Language Models

Title: The Alternative Annotator Test for LLM-as-a-Judge: How to Statistically Justify Replacing Human Annotators with LLMs

Title: From Arabic Text to Puzzles: LLM-Driven Development of Arabic Educational Crosswords

Title: LF-Steering: Latent Feature Activation Steering for Enhancing Semantic Consistency in Large Language Models

Title: Enhancing Semantic Consistency of Large Language Models through Model Editing: An Interpretability-Oriented Approach

Title: IntellAgent: A Multi-Agent Framework for Evaluating Conversational AI Systems

Title: Chain-of-Reasoning: Towards Unified Mathematical Reasoning in Large Language Models via a Multi-Paradigm Perspective

Title: Clinical trial cohort selection using Large Language Models on n2c2 Challenges

Title: Tell me about yourself: LLMs are aware of their learned behaviors

Title: A Collection of Question Answering Datasets for Norwegian

Title: Embedding-Driven Diversity Sampling to Improve Few-Shot Synthetic Data Generation

Title: Irony in Emojis: A Comparative Study of Human and LLM Interpretation

Title: Can xLLMs Understand the Structure of Dialog? Exploring Multilingual Response Generation in Complex Scenarios

Title: Multi-round, Chain-of-thought Post-editing for Unfaithful Summaries

Title: Question-to-Question Retrieval for Hallucination-Free Knowledge Access: An Approach for Wikipedia and Wikidata Question Answering

Title: Few-shot Policy (de)composition in Conversational Question Answering

Title: Verifying Cross-modal Entity Consistency in News using Vision-language Models

Title: Neural Contextual Reinforcement Framework for Logical Structure Language Generation

Title: RACCOON: A Retrieval-Augmented Generation Approach for Location Coordinate Capture from News Articles

Title: Curiosity-Driven Reinforcement Learning from Human Feedback

Title: Graph-defined Language Learning with LLMs

Title: Generative AI and Large Language Models in Language Preservation: Opportunities and Challenges

Title: Whose Boat Does it Float? Improving Personalization in Preference Tuning via Inferred User Personas

Title: PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation

Title: Conversation Routines: A Prompt Engineering Framework for Task-Oriented Dialog Systems

Title: Trojan Detection Through Pattern Recognition for Large Language Models

Title: YouLeQD: Decoding the Cognitive Complexity of Questions and Engagement in Online Educational Videos from Learners' Perspectives

Title: Explain-Query-Test: Self-Evaluating LLMs Via Explanation and Comprehension Discrepancy

Title: Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks

Title: Optimizing Pretraining Data Mixtures with LLM-Estimated Utility

Title: The Value of Nothing: Multimodal Extraction of Human Values Expressed by TikTok Influencers

Title: Synthetic Data Can Mislead Evaluations: Membership Inference as Machine Text Detection

Title: Benchmarking Large Language Models via Random Variables

Title: Fact-Preserved Personalized News Headline Generation

Title: Is your LLM trapped in a Mental Set? Investigative study on how mental sets affect the reasoning capabilities of LLMs

Title: Network-informed Prompt Engineering against Organized Astroturf Campaigns under Extreme Class Imbalance

Title: Cross-Entropy Attacks to Language Models via Rare Event Simulation

Title: From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning

Title: Med-R$^2$: Crafting Trustworthy LLM Physicians through Retrieval and Reasoning of Evidence-Based Medicine

Title: Panoramic Interests: Stylistic-Content Aware Personalized Headline Generation

Title: HERITAGE: An End-to-End Web Platform for Processing Korean Historical Documents in Hanja

Title: Proverbs Run in Pairs: Evaluating Proverb Translation Capability of Large Language Model

Title: TAD-Bench: A Comprehensive Benchmark for Embedding-Based Text Anomaly Detection

Title: A Hybrid Attention Framework for Fake News Detection with Large Language Models

Title: Leveraging Graph Structures and Large Language Models for End-to-End Synthetic Task-Oriented Dialogues

Title: MedS$^3$: Towards Medical Small Language Models with Self-Evolved Slow Thinking

Title: Can open source large language models be used for tumor documentation in Germany? -- An evaluation on urological doctors' notes

Title: Improving Influence-based Instruction Tuning Data Selection for Balanced Learning of Diverse Capabilities

Title: AdaServe: SLO-Customized LLM Serving with Fine-Grained Speculative Decoding

Title: Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement

Title: Automatic Labelling with Open-source LLMs using Dynamic Label Schema Integration