2025-10-28

Title: Policy Optimization Prefers The Path of Least Resistance

Title: Language Ranker: A Lightweight Ranking framework for LLM Decoding

Title: Framework for Machine Evaluation of Reasoning Completeness in Large Language Models For Classification Tasks

Title: Preventing Catastrophic Forgetting: Behavior-Aware Sampling for Safer Language Model Fine-Tuning

Title: Embedding Trust: Semantic Isotropy Predicts Nonfactuality in Long-Form Text Generation

Title: Understanding Network Behaviors through Natural Language Question-Answering

Title: Deep Literature Survey Automation with an Iterative Workflow

Title: Model-Aware Tokenizer Transfer

Title: A Stylometric Application of Large Language Models

Title: Uncovering the Persuasive Fingerprint of LLMs in Jailbreaking Attacks

Title: Toward Understanding the Transferability of Adversarial Suffixes in Large Language Models

Title: Penalizing Length: Uncovering Systematic Bias in Quality Estimation Metrics

Title: Emotions Where Art Thou: Understanding and Characterizing the Emotional Latent Space of Large Language Models

Title: Compositional Bias Control in Large Language Models: Preference Learning Fails, Supervision Succeeds

Title: Generalization or Memorization: Dynamic Decoding for Mode Steering

Title: Gradual Forgetting: Logarithmic Compression for Extending Transformer Context Windows

Title: OlaMind: Towards Human-Like and Hallucination-Safe Customer Service for Retrieval-Augmented Dialogue

Title: DETECT: Determining Ease and Textual Clarity of German Text Simplifications

Title: Estimating the Error of Large Language Models at Pairwise Text Comparison

Title: You Don't Need Prompt Engineering Anymore: The Prompting Inversion

Title: SteerX: Disentangled Steering for LLM Personalization

Title: From Slides to Chatbots: Enhancing Large Language Models with University Course Materials

Title: Supervised Fine-Tuning or In-Context Learning? Evaluating LLMs for Clinical NER

Title: Memory-based Language Models: An Efficient, Explainable, and Eco-friendly Approach to Large Language Modeling

Title: FAIR-RAG: Faithful Adaptive Iterative Refinement for Retrieval-Augmented Generation

Title: Irony Detection in Urdu Text: A Comparative Study Using Machine Learning Models and Large Language Models

Title: GigaEmbeddings: Efficient Russian Language Embedding Model

Title: VisJudge-Bench: Aesthetics and Quality Assessment of Visualizations

Title: Confabulations from ACL Publications (CAP): A Dataset for Scientific Hallucination Detection

Title: CHOIR: Collaborative Harmonization fOr Inference Robustness

Title: Frustratingly Easy Task-aware Pruning for Large Language Models

Title: Text to Trust: Evaluating Fine-Tuning and LoRA Trade-offs in Language Models for Unfair Terms of Service Detection

Title: LooGLE v2: Are LLMs Ready for Real World Long Dependency Challenges?

Title: SABlock: Semantic-Aware KV Cache Eviction with Adaptive Compression Block Size

Title: A Closed-Loop Personalized Learning Agent Integrating Neural Cognitive Diagnosis, Bounded-Ability Adaptive Testing, and LLM-Driven Feedback

Title: Pedagogy-driven Evaluation of Generative AI-powered Intelligent Tutoring Systems

Title: AutoBench: Automating LLM Evaluation through Reciprocal Peer Assessment

Title: Personal Care Utility (PCU): Building the Health Infrastructure for Everyday Insight and Guidance

Title: Integrating Linguistics and AI: Morphological Analysis and Corpus development of Endangered Toto Language of West Bengal

Title: Rule-Based Explanations for Retrieval-Augmented LLM Systems

Title: SALSA: Single-pass Autoregressive LLM Structured Classification

Title: $\text{E}^2\text{Rank}$: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker

Title: Low-Resource Dialect Adaptation of Large Language Models: A French Dialect Case-Study

Title: Beyond Semantics: How Temporal Biases Shape Retrieval in Transformer and State-Space Models

Title: EchoMind: An Interrelated Multi-level Benchmark for Evaluating Empathetic Speech Language Models

Title: Iterative Layer Pruning for Efficient Translation Inference

Title: MMPersuade: A Dataset and Evaluation Framework for Multimodal Persuasion

Title: Scalable Supervising Software Agents with Patch Reasoner

Title: VEHME: A Vision-Language Model For Evaluating Handwritten Mathematics Expressions

Title: Cross-Lingual Stability and Bias in Instruction-Tuned Language Models for Humanitarian NLP

Title: Exploration of Summarization by Generative Language Models for Automated Scoring of Long Essays

Title: Leveraging Large Language Models to Identify Conversation Threads in Collaborative Learning

Title: Once Upon an Input: Reasoning via Per-Instance Program Synthesis

Title: Far from the Shallow: Brain-Predictive Reasoning Embedding through Residual Disentanglement

Title: Interpreting and Mitigating Unwanted Uncertainty in LLMs

Title: A Comprehensive Dataset for Human vs. AI Generated Text Detection

Title: Batch Speculative Decoding Done Right

Title: Language Server CLI Empowers Language Agents with Process Rewards

Title: Artificial Hivemind: The Open-Ended Homogeneity of Language Models (and Beyond)

Title: Tagging-Augmented Generation: Assisting Language Models in Finding Intricate Knowledge In Long Contexts

Title: MAD-Fact: A Multi-Agent Debate Framework for Long-Form Factuality Evaluation in LLMs

Title: Measuring Teaching with LLMs

Title: Understanding In-Context Learning Beyond Transformers: An Investigation of State Space and Hybrid Architectures

Title: LangLingual: A Personalised, Exercise-oriented English Language Learning Tool Leveraging Large Language Models

Title: Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning

Title: Knocking-Heads Attention

Title: Quality-Aware Translation Tagging in Multilingual RAG system

Title: A Survey on LLM Mid-training

Title: MAP4TS: A Multi-Aspect Prompting Framework for Time-Series Forecasting with Large Language Models

Title: Leveraging Hierarchical Organization for Medical Multi-document Summarization

Title: Beyond Higher Rank: Token-wise Input-Output Projections for Efficient Low-Rank Adaptation

Title: ENTP: Enhancing Low-Quality SFT Data via Neural-Symbolic Text Purge-Mix

Title: Beyond Direct Generation: A Decomposed Approach to Well-Crafted Screenwriting with LLMs

Title: SI-Bench: Benchmarking Social Intelligence of Large Language Models in Human-to-Human Conversations

Title: DREaM: Drug-Drug Relation Extraction via Transfer Learning Method

Title: Process Reward Models for Sentence-Level Verification of LVLM Radiology Reports

Title: Mubeen AI: A Specialized Arabic Language Model for Heritage Preservation and User Intent Understanding

Title: Code Aesthetics with Agentic Reward Feedback

Title: A Cocktail-Party Benchmark: Multi-Modal dataset and Comparative Evaluation Results

Title: DCMM-SQL: Automated Data-Centric Pipeline and Multi-Model Collaboration Training for Text-to-SQL Model

Title: Adaptive Blockwise Search: Inference-Time Alignment for Large Language Models

Title: BaZi-Based Character Simulation Benchmark: Evaluating AI on Temporal and Persona Reasoning

Title: LightKGG: Simple and Efficient Knowledge Graph Generation from Textual Data

Title: How AI Forecasts AI Jobs: Benchmarking LLM Predictions of Labor Market Changes

Title: Detecting Religious Language in Climate Discourse

Title: EMTSF:Extraordinary Mixture of SOTA Models for Time Series Forecasting

Title: BrowseConf: Confidence-Guided Test-Time Scaling for Web Agents

Title: Evaluating Large Language Models for Stance Detection on Financial Targets from SEC Filing Reports and Earnings Call Transcripts

Title: MMTutorBench: The First Multimodal Benchmark for AI Math Tutoring

Title: IPQA: A Benchmark for Core Intent Identification in Personalized Question Answering

Title: LimRank: Less is More for Reasoning-Intensive Information Reranking

Title: Hope Speech Detection in Social Media English Corpora: Performance of Traditional and Transformer Models

Title: Think Twice: Branch-and-Rethink Reasoning Reward Model