2025-03-12

Title: Towards Large Language Models that Benefit for All: Benchmarking Group Fairness in Reward Models

Title: Training Domain Draft Models for Speculative Decoding: Best Practices and Insights

Title: Magnet: Multi-turn Tool-use Data Synthesis and Distillation via Graph Translation

Title: Modern Models, Medieval Texts: A POS Tagging Study of Old Occitan

Title: HalluVerse25: Fine-grained Multilingual Benchmark Dataset for LLM Hallucinations

Title: MapQA: Open-domain Geospatial Question Answering on Map Data

Title: Datasets, Documents, and Repetitions: The Practicalities of Unequal Data Quality

Title: Gemini Embedding: Generalizable Embeddings from Gemini

Title: Can Memory-Augmented Language Models Generalize on Reasoning-in-a-Haystack Tasks?

Title: EFPC: Towards Efficient and Flexible Prompt Compression

Title: LabelCoRank: Revolutionizing Long Tail Multi-Label Classification with Co-Occurrence Reranking

Title: Enhancing Multilingual Language Models for Code-Switched Input Data

Title: In Prospect and Retrospect: Reflective Memory Management for Long-term Personalized Dialogue Agents

Title: Learning to Search Effective Example Sequences for In-Context Learning

Title: Group Preference Alignment: Customized LLM Response Generation from In-Situ Conversations

Title: A General Framework to Evaluate Methods for Assessing Dimensions of Lexical Semantic Change Using LLM-Generated Synthetic Data

Title: Odysseus Navigates the Sirens' Song: Dynamic Focus Decoding for Factual and Diverse Open-Ended Text Generation

Title: Context-aware Biases for Length Extrapolation

Title: OASIS: Order-Augmented Strategy for Improved Code Search

Title: RigoChat 2: an adapted language model to Spanish using a bounded dataset and reduced hardware

Title: Automating Violence Detection and Categorization from Ancient Texts

Title: Dialogue Injection Attack: Jailbreaking LLMs through Context Manipulation

Title: DeepRAG: Building a Custom Hindi Embedding Model for Retrieval Augmented Generation from Scratch

Title: Large Language Models for Outpatient Referral: Problem Definition, Benchmarking and Challenges

Title: Towards Scalable and Cross-Lingual Specialist Language Models for Oncology

Title: OpenRAG: Optimizing RAG End-to-End via In-Context Retrieval Learning

Title: Fact-checking with Generative AI: A Systematic Cross-Topic Examination of LLMs Capacity to Detect Veracity of Political Information

Title: Enhancing Multi-Hop Fact Verification with Structured Knowledge-Augmented Large Language Models

Title: ReviewAgents: Bridging the Gap Between Human and AI-Generated Paper Reviews

Title: Position-Aware Depth Decay Decoding ($D^3$): Boosting Large Language Model Inference Efficiency

Title: DAFE: LLM-Based Evaluation Through Dynamic Arbitration for Free-Form Question-Answering

Title: Transferring Extreme Subword Style Using Ngram Model-Based Logit Scaling

Title: DeepReview: Improving LLM-based Paper Review with Human-like Deep Thinking Process

Title: BiasEdit: Debiasing Stereotyped Language Models via Model Editing

Title: NSF-SciFy: Mining the NSF Awards Database for Scientific Claims

Title: Exploiting Instruction-Following Retrievers for Malicious Information Retrieval

Title: Exploring the Word Sense Disambiguation Capabilities of Large Language Models

Title: AgentOrca: A Dual-System Framework to Evaluate Language Agents on Operational Routine and Constraint Adherence

Title: Self-Taught Self-Correction for Small Language Models

Title: Perplexity Trap: PLM-Based Retrievers Overrate Low Perplexity Documents