2025-01-29

Title: Deception in LLMs: Self-Preservation and Autonomous Goals in Large Language Models

Title: How well can LLMs Grade Essays in Arabic?

Title: Programming by Examples Meets Historical Linguistics: A Large Language Model Based Approach to Sound Law Induction

Title: A comparison of data filtering techniques for English-Polish LLM-based machine translation in the biomedical domain

Title: Few-Shot Optimized Framework for Hallucination Detection in Resource-Limited NLP Systems

Title: CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs

Title: Why Do We Laugh? Annotation and Taxonomy Generation for Laughable Contexts in Spontaneous Text Conversation

Title: An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue

Title: DOCS: Quantifying Weight Similarity for Deeper Insights into Large Language Models

Title: Large Language Model Critics for Execution-Free Evaluation of Code Changes

Title: Contextual Reinforcement in Multimodal Token Compression for Large Language Models

Title: Auto-Differentiating Any LLM Workflow: A Farewell to Manual Prompting

Title: MME-Industry: A Cross-Industry Multimodal Evaluation Benchmark

Title: 3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow

Title: xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking

Title: Through the Prism of Culture: Evaluating LLMs' Understanding of Indian Subcultures and Traditions

Title: A Stochastic Dynamical Theory of LLM Self-Adversariality: Modeling Severity Drift as a Critical Process

Title: Misspellings in Natural Language Processing: A survey

Title: JRE-L: Journalist, Reader, and Editor LLMs in the Loop for Science Journalism for the General Audience

Title: Irony Detection, Reasoning and Understanding in Zero-shot Learning

Title: Detecting harassment and defamation in cyberbullying with emotion-adaptive training

Title: Multiple Abstraction Level Retrieve Augment Generation

Title: Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling

Title: How Linguistics Learned to Stop Worrying and Love the Language Models

Title: COS(M+O)S: Curiosity and RL-Enhanced MCTS for Exploring Story Space via Language Models

Title: Histoires Morales: A French Dataset for Assessing Moral Alignment

Title: FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data

Title: AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders