2025-01-30

Title: Tuning LLM Judges Hyperparameters

Title: LLM Evaluation Based on Aerospace Manufacturing Expertise: Automated Generation and Multi-Model Question Answering

Title: Visualizing Uncertainty in Translation Tasks: An Evaluation of LLM Performance and Confidence Metrics

Title: A Comprehensive Study on Fine-Tuning Large Language Models for Medical Question Answering Using Classification Models and Comparative Analysis

Title: Aspect-Aware Decomposition for Opinion Summarization

Title: Atla Selene Mini: A General Purpose Evaluation Model

Title: Improving LLM Leaderboards with Psychometrical Methodology

Title: NUS-Emo at SemEval-2024 Task 3: Instruction-Tuning LLM for Multimodal Emotion-Cause Analysis in Conversations

Title: Tailored Truths: Optimizing LLM Persuasion with Personalization and Fabricated Statistics

Title: Mitigating Hallucinated Translations in Large Language Models with Hallucination-focused Preference Optimization

Title: Memorize and Rank: Elevating Large Language Models for Clinical Diagnosis Prediction

Title: Inferring from Logits: Exploring Best Practices for Decoding-Free Generative Candidate Selection

Title: Context-Aware Semantic Recomposition Mechanism for Large Language Models

Title: Leveraging In-Context Learning and Retrieval-Augmented Generation for Automatic Question Generation in Educational Domains

Title: MultiChallenge: A Realistic Multi-Turn Conversation Evaluation Benchmark Challenging to Frontier LLMs

Title: Actions Speak Louder than Words: Agent Decisions Reveal Implicit Biases in Language Models

Title: Cross-Language Approach for Quranic QA

Title: DINT Transformer

Title: Query-Aware Learnable Graph Pooling Tokens as Prompt for Large Language Models

Title: A linguistically-motivated evaluation methodology for unraveling model's abilities in reading comprehension tasks

Title: CSEval: Towards Automated, Multi-Dimensional, and Reference-Free Counterspeech Evaluation using Auto-Calibrated LLMs

Title: Semantic Consistency Regularization with Large Language Models for Semi-supervised Sentiment Analysis

Title: Structured Context Recomposition for Large Language Models Using Probabilistic Layer Realignment

Title: In-Context Meta LoRA Generation

Title: Tonguescape: Exploring Language Models Understanding of Vowel Articulation

Title: Exploring Vision Language Models for Multimodal and Multilingual Stance Detection

Title: Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Title: RICoTA: Red-teaming of In-the-wild Conversation with Test Attempts

Title: Hybrid Graphs for Table-and-Text based Question Answering using LLMs

Title: 2SSP: A Two-Stage Framework for Structured Pruning of LLMs

Title: Reasoning Over the Glyphs: Evaluation of LLM's Decipherment of Rare Scripts

Title: BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation -- Challenges and Insights

Title: Learning Beyond the Surface: How Far Can Continual Pre-Training with LoRA Enhance LLMs' Domain-Specific Insight Learning?

Title: Improving Your Model Ranking on Chatbot Arena by Vote Rigging

Title: Dialogue is Better Than Monologue: Instructing Medical LLMs via Strategical Conversations