2025-04-02

Title: Medical Reasoning in LLMs: An In-Depth Analysis of DeepSeek R1

Title: ObscuraCoder: Powering Efficient Code LM Pre-Training Via Obfuscation Grounding

Title: Generalization Bias in Large Language Model Summarization of Scientific Research

Title: Opioid Named Entity Recognition (ONER-2025) from Reddit

Title: Token-Driven GammaTune: Adaptive Calibration for Enhanced Speculative Decoding

Title: Beyond the Reported Cutoff: Where Large Language Models Fall Short on Financial Knowledge

Title: CrossWordBench: Evaluating the Reasoning Capabilities of LLMs and LVLMs with Controllable Puzzle Generation

Title: Multi-Stakeholder Disaster Insights from Social Media Using Large Language Models

Title: Distill-C: Enhanced NL2SQL via Distilled Customization with LLMs

Title: JudgeLRM: Large Reasoning Models as a Judge

Title: Integrating Large Language Models with Human Expertise for Disease Detection in Electronic Health Records

Title: Evaluating the Feasibility and Accuracy of Large Language Models for Medical History-Taking in Obstetrics and Gynecology

Title: Contextualize-then-Aggregate: Circuits for In-Context Learning in Gemma-2 2B

Title: Does "Reasoning" with Large Language Models Improve Recognizing, Generating, and Reframing Unhelpful Thoughts?

Title: Contradiction Detection in RAG Systems: Evaluating LLMs as Context Validators for Improved Information Consistency

Title: Insight-RAG: Enhancing LLMs with Insight-Driven Augmentation

Title: Synthesizing Public Opinions with LLMs: Role Creation, Impacts, and the Future to eDemocracy

Title: SciReplicate-Bench: Benchmarking LLMs in Agent-driven Algorithmic Reproduction from Research Papers

Title: Text Chunking for Document Classification for Urban System Management using Large Language Models

Title: Do Large Language Models Exhibit Spontaneous Rational Deception?

Title: Do Chinese models speak Chinese languages?

Title: Detecting and Mitigating Bias in LLMs through Knowledge Graph-Augmented Training

Title: VNJPTranslate: A comprehensive pipeline for Vietnamese-Japanese translation

Title: Leveraging Large Language Models for Automated Definition Extraction with TaxoMatic: A Case Study on Media Bias

Title: When Persuasion Overrides Truth in Multi-Agent LLM Debates: Introducing a Confidence-Weighted Persuasion Override Rate (CW-POR)

Title: VerifiAgent: a Unified Verification Agent in Language Model Reasoning

Title: Semantic Mastery: Enhancing LLMs with Advanced Natural Language Understanding

Title: Multimodal LLMs for OCR, OCR Post-Correction, and Named Entity Recognition in Historical Documents

Title: Memorizing is Not Enough: Deep Knowledge Injection Through Reasoning

Title: Making Large Language Models Better Reasoners with Orchestrated Streaming Experiences

Title: Training a Utility-based Retriever Through Shared Context Attribution for Retrieval-Augmented Language Models

Title: Enhancing Negation Awareness in Universal Text Embeddings: A Data-efficient and Computational-efficient Approach

Title: Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Title: On the Consistency of Multilingual Context Utilization in Retrieval-Augmented Generation

Title: Efficient Construction of Model Family through Progressive Training Using Model Expansion

Title: DynMoLE: Boosting Mixture of LoRA Experts Fine-Tuning with a Hybrid Routing Mechanism

Title: Do LLMs Surpass Encoders for Biomedical NER?

Title: GLiNER-biomed: A Suite of Efficient Models for Open Biomedical Named Entity Recognition

Title: ToReMi: Topic-Aware Data Reweighting for Dynamic Pre-Training Data Selection

Title: Command A: An Enterprise-Ready Large Language Model

Title: Aplicação de Large Language Models na Análise e Síntese de Documentos Jurídicos: Uma Revisão de Literatura

Title: IHC-LLMiner: Automated extraction of tumour immunohistochemical profiles from PubMed abstracts using large language models

Title: LLMs4SchemaDiscovery: A Human-in-the-Loop Workflow for Scientific Schema Mining with Large Language Models

Title: RECKON: Large-scale Reference-based Efficient Knowledge Evaluation for Large Language Model

Title: Digitally Supported Analysis of Spontaneous Speech (DigiSpon): Benchmarking NLP-Supported Language Sample Analysis of Swiss Children's Speech

Title: Z1: Efficient Test-time Scaling with Code

Title: ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations

Title: How Difficulty-Aware Staged Reinforcement Learning Enhances LLMs' Reasoning Capabilities: A Preliminary Experimental Study

Title: m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning with Large Language Models

Title: GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning

Title: On the Robustness of Agentic Function Calling

Title: Multi-Token Attention

Title: InformGen: An AI Copilot for Accurate and Compliant Clinical Research Consent Document Generation

Title: Experiential Semantic Information and Brain Alignment: Are Multimodal Models Better than Language Models?

Title: SentenceKV: Efficient LLM Inference via Sentence-Level Semantic KV Caching

Title: Chinese Grammatical Error Correction: A Survey

Title: MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs

Title: Zero-shot Benchmarking: A Framework for Flexible and Scalable Automatic Evaluation of Language Models

Title: Token embeddings violate the manifold hypothesis

Title: When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoning

Title: Self-Routing RAG: Binding Selective Retrieval with Knowledge Verbalization