2025-12-30

Title: Open-Source Multimodal Moxin Models with Moxin-VLM and Moxin-VLA

Title: Hierarchical Geometry of Cognitive States in Transformer Embedding Spaces

Title: SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents

Title: Towards Efficient Post-Training via Fourier-Driven Adapter Architectures

Title: LLM-Guided Exemplar Selection for Few-Shot Wearable-Sensor Human Activity Recognition

Title: Hallucination Detection and Evaluation of Large Language Model

Title: HiFi-RAG: Hierarchical Content Filtering and Two-Pass Generation for Open-Domain RAG

Title: Exploring the Vertical-Domain Reasoning Capabilities of Large Language Models

Title: Learning When Not to Attend Globally

Title: Structured Prompting and LLM Ensembling for Multimodal Conversational Aspect-based Sentiment Analysis

Title: Chain-of-thought Reviewing and Correction for Time Series Question Answering

Title: M2G-Eval: Enhancing and Evaluating Multi-granularity Multilingual Code Generation

Title: On the Role of Discreteness in Diffusion LLMs

Title: Evaluating GRPO and DPO for Faithful Chain-of-Thought Reasoning in LLMs

Title: Conformal Prediction Sets for Next-Token Prediction in Large Language Models: Balancing Coverage Guarantees with Set Efficiency

Title: Beg to Differ: Understanding Reasoning-Answer Misalignment Across Languages

Title: Mitigating Social Desirability Bias in Random Silicon Sampling

Title: WeDLM: Reconciling Diffusion Language Models with Standard Causal Attention for Fast Inference

Title: Harnessing Large Language Models for Biomedical Named Entity Recognition

Title: Text-Routed Sparse Mixture-of-Experts Model with Explanation and Temporal Alignment for Multi-Modal Sentiment Analysis

Title: Fake News Classification in Urdu: A Domain Adaptation Approach for a Low-Resource Language

Title: CNSight: Evaluation of Clinical Note Segmentation Tools

Title: AutoForge: Automated Environment Synthesis for Agentic Reinforcement Learning

Title: Diversity or Precision? A Deep Dive into Next Token Prediction

Title: Prompt engineering does not universally improve Large Language Model performance across clinical decision-making tasks

Title: Improving Generalization in LLM Structured Pruning via Function-Aware Neuron Grouping

Title: LENS: LLM-Enabled Narrative Synthesis for Mental Health by Aligning Multimodal Sensing with Language Models

Title: Is Chain-of-Thought Really Not Explainability? Chain-of-Thought Can Be Faithful without Hint Verbalization

Title: Accelerating Language Model Workflows with Prompt Choreography

Title: Reservoir Computing inspired Matrix Multiplication-free Language Model

Title: Not too long do read: Evaluating LLM-generated extreme scientific summaries

Title: Scoring, Reasoning, and Selecting the Best! Ensembling Large Language Models via a Peer-Review Process

Title: Anka: A Domain-Specific Language for Reliable LLM Code Generation

Title: Interpretable Safety Alignment via SAE-Constructed Low-Rank Subspace Adaptation

Title: Chinese Morph Resolution in E-commerce Live Streaming Scenarios

Title: AI4Reading: Chinese Audiobook Interpretation System Based on Multi-Agent Collaboration

Title: AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents

Title: A Stepwise-Enhanced Reasoning Framework for Large Language Models Based on External Subgraph Generation

Title: Entropy-Guided Token Dropout: Training Autoregressive Language Models with Limited Domain Data

Title: C2PO: Diagnosing and Disentangling Bias Shortcuts in LLMs

Title: ClinDEF: A Dynamic Evaluation Framework for Large Language Models in Clinical Reasoning

Title: Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss

Title: Semantic Tree Inference on Text Corpa using a Nested Density Approach together with Large Language Model Embeddings

Title: Single LLM Debate, MoLaCE: Mixture of Latent Concept Experts Against Confirmation Bias

Title: Lie to Me: Knowledge Graphs for Robust Hallucination Self-Detection in LLMs

Title: Instruction-Following Evaluation of Large Vision-Language Models

Title: Style Amnesia: Investigating Speaking Style Degradation and Mitigation in Multi-Turn Spoken Language Models

Title: Close the Loop: Synthesizing Infinite Tool-Use Data via Multi-Agent Role-Playing

Title: Nested Browser-Use Learning for Agentic Information Seeking

Title: Less is more: Probabilistic reduction is best explained by small-scale predictability measures

Title: Multilingual Hidden Prompt Injection Attacks on LLM-Based Academic Reviewing

Title: PROFASR-BENCH: A Benchmark for Context-Conditioned ASR in High-Stakes Professional Speech

Title: Fine-Tuning LLMs with Fine-Grained Human Feedback on Text Spans

Title: Eliciting Behaviors in Multi-Turn Conversations