2025-07-02

Title: Table Understanding and (Multimodal) LLMs: A Cross-Domain Case Study on Scientific vs. Non-Scientific Data

Title: Prompting as Scientific Inquiry

Title: LineRetriever: Planning-Aware Observation Reduction for Web Agents

Title: Two-Stage Reasoning-Infused Learning: Improving Classification with LLM-Generated Reasoning

Title: Towards Style Alignment in Cross-Cultural Translation

Title: Linearly Decoding Refused Knowledge in Aligned Language Models

Title: EfficientXLang: Towards Improving Token Efficiency Through Cross-Lingual Reasoning

Title: Impact of Fine-Tuning Methods on Memorization in Large Language Models

Title: Natural language processing for African languages

Title: Failure by Interference: Language Models Make Balanced Parentheses Errors When Faulty Mechanisms Overshadow Sound Ones

Title: Modeling Data Diversity for Joint Instance and Verbalizer Selection in Cold-Start Scenarios

Title: Question Decomposition for Retrieval-Augmented Generation

Title: Gregorian melody, modality, and memory: Segmenting chant with Bayesian nonparametrics

Title: Causal Prompting for Implicit Sentiment Analysis with Large Language Models

Title: Beyond Sociodemographic Prompting: Using Supervision to Align LLMs with Human Response Distributions

Title: Pitfalls of Evaluating Language Models with Open Benchmarks

Title: TeamCMU at Touché: Adversarial Co-Evolution for Advertisement Integration and Detection in Conversational Search

Title: TUM-MiKaNi at SemEval-2025 Task 3: Towards Multilingual and Knowledge-Aware Non-factual Hallucination Identification

Title: Transferable Modeling Strategies for Low-Resource LLM Tasks: A Prompt and Alignment-Based

Title: Mixture of Reasonings: Teach Large Language Models to Reason with Adaptive Strategies

Title: SAFER: Probing Safety in Reward Models with Sparse Autoencoder

Title: Contrasting Cognitive Styles in Vision-Language Models: Holistic Attention in Japanese Versus Analytical Focus in English

Title: AI Analyst: Framework and Comprehensive Evaluation of Large Language Models for Financial Time Series Report Generation

Title: LitBench: A Benchmark and Dataset for Reliable Evaluation of Creative Writing

Title: Many LLMs Are More Utilitarian Than One

Title: ProxAnn: Use-Oriented Evaluations of Topic Models and Document Clustering

Title: Stylometry recognizes human and LLM-generated texts in short samples

Title: TransLaw: Benchmarking Large Language Models in Multi-Agent Simulation of the Collaborative Translation

Title: Mathematics Isn't Culture-Free: Probing Cultural Gaps via Entity and Scenario Perturbations

Title: MemeCMD: An Automatically Generated Chinese Multi-turn Dialogue Dataset with Contextually Retrieved Memes

Title: Discourse Heuristics For Paradoxically Moral Self-Correction

Title: Should We Still Pretrain Encoders with Masked Language Modeling?

Title: La Leaderboard: A Large Language Model Leaderboard for Spanish Varieties and Languages of Spain and Latin America

Title: SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks