2025-02-19

Title: Leveraging large language models for structured information extraction from pathology reports

Title: Large Language Models for Extrapolative Modeling of Manufacturing Processes

Title: Hallucinations are inevitable but statistically negligible

Title: AI and the Law: Evaluating ChatGPT's Performance in Legal Classification

Title: A Closer Look at System Prompt Robustness

Title: Efficient and Effective Prompt Tuning via Prompt Decomposition and Compressed Outer Product

Title: BoT: Breaking Long Thought Processes of o1-like Large Language Models through Backdoor Attack

Title: Enhancing Frame Detection with Retrieval Augmented Generation

Title: Zero Token-Driven Deep Thinking in LLMs: Unlocking the Full Potential of Existing Parameters via Cyclic Refinement

Title: InfoQuest: Evaluating Multi-Turn Dialogue Agents for Open-Ended Conversations with Hidden Context

Title: Story Grammar Semantic Matching for Literary Study

Title: Evaluating Step-by-step Reasoning Traces: A Survey

Title: SMOL: Professionally translated parallel data for 115 under-represented languages

Title: Can Language Models Learn Typologically Implausible Languages?

Title: From Dense to Dynamic: Token-Difficulty Driven MoEfication of Pre-Trained LLMs

Title: LM Agents for Coordinating Multi-User Information Gathering

Title: ConFit v2: Improving Resume-Job Matching using Hypothetical Resume Embedding and Runner-Up Hard-Negative Mining

Title: Classifiers of Data Sharing Statements in Clinical Trial Records

Title: Factual Inconsistency in Data-to-Text Generation Scales Exponentially with LLM Size: A Statistical Validation

Title: UltraGen: Extremely Fine-grained Controllable Generation via Attribute Reconstruction and Global Preference Optimization

Title: Pragmatics in the Era of Large Language Models: A Survey on Datasets, Evaluation, Opportunities and Challenges

Title: WMT24++: Expanding the Language Coverage of WMT24 to 55 Languages & Dialects

Title: Gradient Co-occurrence Analysis for Detecting Unsafe Prompts in Large Language Models

Title: Lost in Transcription, Found in Distribution Shift: Demystifying Hallucination in Speech Foundation Models

Title: Sens-Merging: Sensitivity-Guided Parameter Balancing for Merging Large Language Models

Title: Wi-Chat: Large Language Model Powered Wi-Fi Sensing

Title: Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL

Title: Multi-Attribute Steering of Language Models via Targeted Intervention

Title: DSMoE: Matrix-Partitioned Experts with Dynamic Routing for Computation-Efficient Dense LLMs

Title: An Empirical Evaluation of Encoder Architectures for Fast Real-Time Long Conversational Understanding

Title: Stress Testing Generalization: How Minor Modifications Undermine Large Language Model Performance

Title: Emulating Retrieval Augmented Generation via Prompt Engineering for Enhanced Long Context Comprehension in LLMs

Title: SafeRoute: Adaptive Model Selection for Efficient and Accurate Safety Guardrails in Large Language Models

Title: Reasoning on a Spectrum: Aligning LLMs to System 1 and System 2 Thinking

Title: CoCo-CoLa: Evaluating Language Adherence in Multilingual LLMs

Title: Savaal: Scalable Concept-Driven Question Generation to Enhance Human Learning

Title: MSE-Adapter: A Lightweight Plugin Endowing LLMs with the Capability to Perform Multimodal Sentiment Analysis and Emotion Recognition

Title: The Knowledge Microscope: Features as Better Analytical Lenses than Neurons

Title: Safe at the Margins: A General Approach to Safety Alignment in Low-Resource English Languages -- A Singlish Case Study

Title: EPO: Explicit Policy Optimization for Strategic Reasoning in LLMs via Reinforcement Learning

Title: Crowd Comparative Reasoning: Unlocking Comprehensive Evaluations for LLM-as-a-Judge

Title: Efficient OpAmp Adaptation for Zoom Attention to Golden Contexts

Title: LegalCore: A Dataset for Legal Documents Event Coreference Resolution

Title: Aspect-Guided Multi-Level Perturbation Analysis of Large Language Models in Automated Peer Review

Title: Can LLMs Extract Frame-Semantic Arguments?

Title: Policy-to-Language: Train LLMs to Explain Decisions with Flow-Matching Generated Rewards

Title: How does a Language-Specific Tokenizer affect LLMs?

Title: SEA: Low-Resource Safety Alignment for Multimodal Large Language Models via Synthetic Embeddings

Title: Evaluating Language Models on Grooming Risk Estimation Using Fuzzy Theory

Title: Self Iterative Label Refinement via Robust Unlabeled Learning

Title: A Cognitive Writing Perspective for Constrained Long-Form Text Generation

Title: A Fuzzy Evaluation of Sentence Encoders on Grooming Risk Classification

Title: LongFaith: Enhancing Long-Context Reasoning in LLMs with Faithful Synthetic Data

Title: PASER: Post-Training Data Selection for Efficient Pruned Large Language Model Recovery

Title: Bring Your Own Knowledge: A Survey of Methods for LLM Knowledge Expansion

Title: COPU: Conformal Prediction for Uncertainty Quantification in Natural Language Generation

Title: Who Writes What: Unveiling the Impact of Author Roles on AI-generated Text Detection

Title: Improving Chain-of-Thought Reasoning via Quasi-Symbolic Abstractions

Title: \textit{One Size doesn't Fit All}: A Personalized Conversational Tutoring Agent for Mathematics Instruction

Title: R.R.: Unveiling LLM Training Privacy through Recollection and Ranking

Title: Demystifying Multilingual Chain-of-Thought in Process Reward Modeling

Title: A$^2$ATS: Retrieval-Based KV Cache Reduction via Windowed Rotary Position Embedding and Query-Aware Vector Quantization

Title: Evaluation of Best-of-N Sampling Strategies for Language Model Alignment

Title: Baichuan-M1: Pushing the Medical Capability of Large Language Models

Title: Multi-Novelty: Improve the Diversity and Novelty of Contents Generated by Large Language Models via inference-time Multi-Views Brainstorming

Title: "I know myself better, but not really greatly": Using LLMs to Detect and Explain LLM-Generated Texts

Title: Self-Enhanced Reasoning Training: Activating Latent Reasoning in Small Models for Enhanced Reasoning Distillation

Title: MediaMind: Revolutionizing Media Monitoring using Agentification

Title: Efficient Machine Translation Corpus Generation: Integrating Human-in-the-Loop Post-Editing with Large Language Models

Title: R2-KG: General-Purpose Dual-Agent Framework for Reliable Reasoning on Knowledge Graphs

Title: How Much Do LLMs Hallucinate across Languages? On Multilingual Estimation of LLM Hallucination in the Wild

Title: Mind the Gap: Aligning the Brain with Language Models Requires a Nonlinear and Multimodal Approach

Title: Commonsense Reasoning in Arab Culture

Title: Towards Text-Image Interleaved Retrieval

Title: Simulating User Diversity in Task-Oriented Dialogue Systems using Large Language Models

Title: Pitfalls of Scale: Investigating the Inverse Task of Redefinition in Large Language Models

Title: Reasoning and the Trusting Behavior of DeepSeek and GPT: An Experiment Revealing Hidden Fault Lines in Large Language Models

Title: KazMMLU: Evaluating Language Models on Kazakh, Russian, and Regional Knowledge of Kazakhstan

Title: Subword models struggle with word learning, but surprisal hides it

Title: An LLM-Powered Agent for Physiological Data Analysis: A Case Study on PPG-based Heart Rate Estimation

Title: MeMo: Towards Language Models with Associative Memory Mechanisms

Title: MVL-SIB: A Massively Multilingual Vision-Language Benchmark for Cross-Modal Topical Matching

Title: S$^2$R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning

Title: Rejected Dialects: Biases Against African American Language in Reward Models

Title: PAFT: Prompt-Agnostic Fine-Tuning

Title: How desirable is alignment between LLMs and linguistically diverse human users?

Title: Are Multilingual Language Models an Off-ramp for Under-resourced Languages? Will we arrive at Digital Language Equality in Europe in 2030?

Title: H-CoT: Hijacking the Chain-of-Thought Safety Reasoning Mechanism to Jailbreak Large Reasoning Models, Including OpenAI o1/o3, DeepSeek-R1, and Gemini 2.0 Flash Thinking

Title: Multilingual European Language Models: Benchmarking Approaches and Challenges

Title: None of the Others: a General Technique to Distinguish Reasoning from Memorization in Multiple-Choice LLM Evaluation Benchmarks

Title: Soundwave: Less is More for Speech-Text Alignment in LLMs

Title: Fraud-R1 : A Multi-Round Benchmark for Assessing the Robustness of LLM Against Augmented Fraud and Phishing Inducements

Title: Knapsack Optimization-based Schema Linking for LLM-based Text-to-SQL Generation

Title: Q-STRUM Debate: Query-Driven Contrastive Summarization for Recommendation Comparison

Title: On-Device LLMs for Home Assistant: Dual Role in Intent Detection and Response Generation

Title: Conditioning LLMs to Generate Code-Switched Text: A Methodology Grounded in Naturally Occurring Data

Title: SEFL: Harnessing Large Language Model Agents to Improve Educational Feedback Systems

Title: Finedeep: Mitigating Sparse Activation in Dense LLMs via Multi-Layer Fine-Grained Experts

Title: Synthetic Data Generation for Culturally Nuanced Commonsense Reasoning in Low-Resource Languages

Title: LLMPopcorn: An Empirical Study of LLMs as Assistants for Popular Micro-video Generation

Title: Every Expert Matters: Towards Effective Knowledge Distillation for Mixture-of-Experts Language Models

Title: Task-Informed Anti-Curriculum by Masking Improves Downstream Performance on Text

Title: AlignFreeze: Navigating the Impact of Realignment on the Layers of Multilingual Models Across Diverse Languages

Title: Infinite Retrieval: Attention Enhanced LLMs in Long-Context Processing

Title: Trust Me, I'm Wrong: High-Certainty Hallucinations in LLMs

Title: Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking

Title: Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Title: Beyond Profile: From Surface-Level Facts to Deep Persona Simulation in LLMs

Title: B-cos LM: Efficiently Transforming Pre-trained Language Models for Improved Explainability

Title: Adaptive Knowledge Graphs Enhance Medical Question Answering: Bridging the Gap Between LLMs and Evolving Medical Knowledge

Title: Oreo: A Plug-in Context Reconstructor to Enhance Retrieval-Augmented Generation

Title: HPSS: Heuristic Prompting Strategy Search for LLM Evaluators

Title: Do we still need Human Annotators? Prompting Large Language Models for Aspect Sentiment Quad Prediction

Title: AEIA-MN: Evaluating the Robustness of Multimodal LLM-Powered Mobile Agents Against Active Environmental Injection Attacks

Title: SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models

Title: Improved Fine-Tuning of Large Multimodal Models for Hateful Meme Detection

Title: Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity

Title: KAPPA: A Generic Patent Analysis Framework with Keyphrase-Based Portraits

Title: Text2World: Benchmarking Large Language Models for Symbolic World Model Generation

Title: STEER-ME: Assessing the Microeconomic Reasoning of Large Language Models

Title: Adapting Psycholinguistic Research for LLMs: Gender-inclusive Language in a Coreference Context

Title: RuozhiBench: Evaluating LLMs with Logical Fallacies and Misleading Premises

Title: Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning

Title: UniGuardian: A Unified Defense for Detecting Prompt Injection, Backdoor Attacks and Adversarial Attacks in Large Language Models