2025-02-26

Title: Towards Conditioning Clinical Text Generation for User Control

Title: End-to-End Chart Summarization via Visual Chain-of-Thought in Vision-Language Models

Title: Proactive Privacy Amnesia for Large Language Models: Safeguarding PII with Negligible Impact on Model Utility

Title: MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference

Title: PICASO: Permutation-Invariant Context Composition with State Space Models

Title: Evaluating the Effect of Retrieval Augmentation on Social Biases

Title: Towards Typologically Aware Rescoring to Mitigate Unfaithfulness in Lower-Resource Languages

Title: Towards Human Cognition: Visual Context Guides Syntactic Priming in Fusion-Encoded Models

Title: Bridging Information Gaps with Comprehensive Answers: Improving the Diversity and Informativeness of Follow-Up Questions

Title: Knowledge Distillation with Training Wheels

Title: Spontaneous Giving and Calculated Greed in Language Models

Title: LLM Inference Acceleration via Efficient Operation Fusion

Title: FoREST: Frame of Reference Evaluation in Spatial Reasoning Tasks

Title: Exploring the Potential of Large Language Models for Estimating the Reading Comprehension Question Difficulty

Title: AIR: Complex Instruction Generation via Automatic Iterative Refinement

Title: Enhancing Human Evaluation in Machine Translation with Comparative Judgment

Title: Your Language Model May Think Too Rigidly: Achieving Reasoning Consistency with Symmetry-Enhanced Training

Title: URO-Bench: A Comprehensive Benchmark for End-to-End Spoken Dialogue Models

Title: Can Multimodal LLMs Perform Time Series Anomaly Detection?

Title: Predicting Through Generation: Why Generation Is Better for Prediction

Title: Say Less, Mean More: Leveraging Pragmatics in Retrieval-Augmented Generation

Title: LR${}^{2}$Bench: Evaluating Long-chain Reflective Reasoning Capabilities of Large Language Models via Constraint Satisfaction Problems

Title: SYNTHEMPATHY: A Scalable Empathy Corpus Generated Using LLMs Without Any Crowdsourcing

Title: Towards Enhanced Immersion and Agency for LLM-based Interactive Drama

Title: RankCoT: Refining Knowledge for Retrieval-Augmented Generation through Ranking Chain-of-Thoughts

Title: Can Large Language Models Identify Implicit Suicidal Ideation? An Empirical Evaluation

Title: Scaling LLM Pre-training with Vocabulary Curriculum

Title: FACT-AUDIT: An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models

Title: Advantage-Guided Distillation for Preference Alignment in Small Language Models

Title: CaseGen: A Benchmark for Multi-Stage Legal Case Documents Generation

Title: Assessing Large Language Models in Agentic Multilingual National Bias

Title: DeepSeek-R1 Outperforms Gemini 2.0 Pro, OpenAI o1, and o3-mini in Bilingual Complex Ophthalmology Reasoning

Title: Language Models' Factuality Depends on the Language of Inquiry

Title: Towards Better Understanding of Program-of-Thought Reasoning in Cross-Lingual and Multilingual Environments

Title: On Synthetic Data Strategies for Domain-Specific Generative Retrieval

Title: Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning

Title: Verdict: A Library for Scaling Judge-Time Compute

Title: AfroXLMR-Comet: Multilingual Knowledge Distillation with Attention Matching for Low-Resource Languages

Title: Detecting Knowledge Boundary of Vision Large Language Models by Sampling-Based Inference

Title: Harnessing Multiple Large Language Models: A Survey on LLM Ensemble

Title: Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning

Title: Uncertainty Quantification in Retrieval Augmented Question Answering

Title: LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented Searchers

Title: NusaAksara: A Multimodal and Multilingual Benchmark for Preserving Indonesian Indigenous Scripts

Title: Can LLMs Explain Themselves Counterfactually?

Title: SECURA: Sigmoid-Enhanced CUR Decomposition with Uninterrupted Retention and Low-Rank Adaptation in Large Language Models

Title: Problem Solved? Information Extraction Design Space for Layout-Rich Documents using LLMs

Title: Large Language Models: From Word Prediction to Understanding? (Grandes modelos de lenguaje: de la predicción de palabras a la comprensión?)

Title: LAG: LLM agents for Leaderboard Auto Generation on Demanding

Title: Debt Collection Negotiations with Large Language Models: An Evaluation System and Optimizing Decision Making with Multi-Agent

Title: Beyond In-Distribution Success: Scaling Curves of CoT Granularity for Language Model Generalization

Title: Better Aligned with Survey Respondents or Training Data? Unveiling Political Leanings of LLMs on U.S. Supreme Court Cases

Title: RefuteBench 2.0 -- Agentic Benchmark for Dynamic Evaluation of LLM Responses to Refutation Instruction

Title: WiCkeD: A Simple Method to Make Multiple Choice Benchmarks More Challenging

Title: Mapping of Subjective Accounts into Interpreted Clusters (MOSAIC): Topic Modelling and LLM applied to Stroboscopic Phenomenology

Title: BottleHumor: Self-Informed Humor Explanation using the Information Bottleneck Principle

Title: Correlating and Predicting Human Evaluations of Language Models from Natural Language Processing Benchmarks

Title: BRIDO: Bringing Democratic Order to Abstractive Summarization

Title: DBR: Divergence-Based Regularization for Debiasing Natural Language Understanding Models

Title: Monte Carlo Temperature: a robust sampling strategy for LLM's uncertainty quantification methods

Title: KiRAG: Knowledge-Driven Iterative Retriever for Enhancing Retrieval-Augmented Generation

Title: AgentRM: Enhancing Agent Generalization with Reward Modeling

Title: GLEAN: Generalized Category Discovery with Diverse and Quality-Enhanced LLM Feedback

Title: Compressing Language Models for Specialized Domains

Title: TextGames: Learning to Self-Play Text-Based Puzzle Games via Language Model Reasoning

Title: Reversal Blessing: Thinking Backward May Outpace Thinking Forward in Multi-choice Questions

Title: olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models

Title: Disambiguate First Parse Later: Generating Interpretations for Ambiguity Resolution in Semantic Parsing

Title: FRIDA to the Rescue! Analyzing Synthetic Data Effectiveness in Object-Based Common Sense Reasoning for Disaster Response

Title: DRAMA: Diverse Augmentation from Large Language Models to Smaller Dense Retrievers