2025-03-07

Title: Vision-Language Models Struggle to Align Entities across Modalities

Title: Not-Just-Scaling Laws: Towards a Better Understanding of the Downstream Impact of Language Model Design Decisions

Title: AI for Scaling Legal Reform: Mapping and Redacting Racial Covenants in Santa Clara County

Title: Tec-Habilidad: Skill Classification for Bridging Education and Employment

Title: Performance Comparison of Large Language Models on Advanced Calculus Problems

Title: On the Acquisition of Shared Grammatical Representations in Bilingual Language Models

Title: ReasonGraph: Visualisation of Reasoning Paths

Title: Benchmarking Large Language Models on Multiple Tasks in Bioinformatics NLP with Prompting

Title: Uncovering inequalities in new knowledge learning by large language models across different languages

Title: Chart-HQA: A Benchmark for Hypothetical Question Answering in Charts

Title: Disparities in LLM Reasoning Accuracy and Explanations: A Case Study on African American English

Title: LLMs Can Generate a Better Answer by Aggregating Their Own Responses

Title: Uncovering Gaps in How Humans and LLMs Interpret Subjective Language

Title: Biological Sequence with Language Model Prompting: A Survey

Title: Ticktack : Long Span Temporal Alignment of Large Language Models Leveraging Sexagenary Cycle Time Expression

Title: BPQA Dataset: Evaluating How Well Language Models Leverage Blood Pressures to Answer Biomedical Questions

Title: Measuring temporal effects of agent knowledge by date-controlled tool use

Title: Knowledge-Decoupled Synergetic Learning: An MLLM based Collaborative Approach to Few-shot Multimodal Dialogue Intention Recognition

Title: FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion

Title: Tgea: An error-annotated dataset and benchmark tasks for text generation from pretrained language models

Title: DiffPO: Diffusion-styled Preference Optimization for Efficient Inference-Time Alignment of Large Language Models

Title: On Fact and Frequency: LLM Responses to Misinformation Expressed with Uncertainty

Title: Dual-Class Prompt Generation: Enhancing Indonesian Gender-Based Hate Speech Detection through Data Augmentation

Title: Solving Word-Sense Disambiguation and Word-Sense Induction with Dictionary Examples

Title: Adding Alignment Control to Language Models

Title: Layer-Specific Scaling of Positional Encodings for Superior Long-Context Modeling

Title: Exploring the Multilingual NLG Evaluation Abilities of LLM-Based Evaluators

Title: Lost in Literalism: How Supervised Training Shapes Translationese in LLMs

Title: Dedicated Feedback and Edit Models Empower Inference-Time Scaling for Open-Ended General-Domain Tasks

Title: TRACT: Regression-Aware Fine-tuning Meets Chain-of-Thought Reasoning for LLM-as-a-Judge

Title: More Documents, Same Length: Isolating the Challenge of Multiple Documents in RAG

Title: Shaping Shared Languages: Human and Large Language Models' Inductive Biases in Emergent Communication

Title: TableLoRA: Low-rank Adaptation on Table Structure Understanding for Large Language Models

Title: Comparative Study of Zero-Shot Cross-Lingual Transfer for Bodo POS and NER Tagging Using Gemini 2.0 Flash Thinking Experimental Model

Title: Can Large Language Models Predict Antimicrobial Resistance Gene?

Title: Revisiting the Othello World Model Hypothesis

Title: A Dataset for Analysing News Framing in Chinese Media

Title: Guiding LLMs to Generate High-Fidelity and High-Quality Counterfactual Explanations for Text Classification

Title: Generalized Interpolating Discrete Diffusion

Title: Large Language Models in Bioinformatics: A Survey

Title: Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model

Title: Compositional Translation: A Novel LLM-based Approach for Low-resource Machine Translation

Title: Compositional Causal Reasoning Evaluation in Language Models

Title: HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization

Title: Towards Data-Efficient Language Models: A Child-Inspired Approach to Language Learning

Title: HalluCounter: Reference-free LLM Hallucination Detection in the Wild!

Title: Better Process Supervision with Bi-directional Rewarding Signals

Title: SynGraph: A Dynamic Graph-LLM Synthesis Framework for Sparse Streaming User Sentiment Modeling

Title: START: Self-taught Reasoner with Tools

Title: SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automated Survey Writing

Title: Mark Your LLM: Detecting the Misuse of Open-Source Large Language Models via Watermarking

Title: IFIR: A Comprehensive Benchmark for Evaluating Instruction-Following in Expert-Domain Information Retrieval

Title: Implicit Cross-Lingual Rewarding for Efficient Multilingual Preference Alignment

Title: An Information-theoretic Multi-task Representation Learning Framework for Natural Language Understanding

Title: LLM-guided Plan and Retrieval: A Strategic Alignment for Interpretable User Satisfaction Estimation in Dialogue

Title: DIMSUM: Discourse in Mathematical Reasoning as a Supervision Module

Title: Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases

Title: UIPE: Enhancing LLM Unlearning by Removing Knowledge Related to Forgetting Targets

Title: L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning

Title: Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities

Title: Enough Coin Flips Can Make LLMs Act Bayesian

Title: Shifting Long-Context LLMs Research from Input to Output

Title: LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Title: L$^2$M: Mutual Information Scaling Law for Long-Context Language Modeling