2025-05-27

Title: InjectLab: A Tactical Framework for Adversarial Threat Modeling Against Large Language Models

Title: A Blockchain-Based Approach for Secure and Transparent e-Faktur Issuance in Indonesia's VAT Reporting System

Title: Model-Distributed Inference for Large Language Models at the Edge

Title: Constrained Edge AI Deployment: Fine-Tuning vs Distillation for LLM Compression

Title: Emotion Knowledge Enhancement for Vision Large Language Models: A Self-Verification Approach for High-Quality Emotion Instruction Data Generation

Title: Interpretable Multi-Task PINN for Emotion Recognition and EDA Prediction

Title: Robust Knowledge Graph Embedding via Denoising

Title: GenAI Security: Outsmarting the Bots with a Proactive Testing Framework

Title: FedGRec: Dynamic Spatio-Temporal Federated Graph Learning for Secure and Efficient Cross-Border Recommendations

Title: GAIA: A Foundation Model for Operational Atmospheric Dynamics

Title: 2DNMRGym: An Annotated Experimental Dataset for Atom-Level Molecular Representation Learning in 2D NMR via Surrogate Supervision

Title: Riemannian Flow Matching for Brain Connectivity Matrices via Pullback Geometry

Title: Quantum-Resilient Blockchain for Secure Transactions in UAV-Assisted Smart Agriculture Networks

Title: CoMet: Metaphor-Driven Covert Communication for Multi-Agent Language Games

Title: IDA-Bench: Evaluating LLMs on Interactive Guided Data Analysis

Title: Token Reduction Should Go Beyond Efficiency in Generative Models -- From Vision, Language to Multimodality

Title: Follow the Energy, Find the Path: Riemannian Metrics from Energy-Based Models

Title: NSNQuant: A Double Normalization Approach for Calibration-Free Low-Bit Vector Quantization of KV Cache

Title: ELDeR: Getting Efficient LLMs through Data-Driven Regularized Layer-wise Pruning

Title: POSTER: A Multi-Signal Model for Detecting Evasive Smishing

Title: A Robust PPO-optimized Tabular Transformer Framework for Intrusion Detection in Industrial IoT Systems

Title: The Origins of Representation Manifolds in Large Language Models

Title: Think or Not? Exploring Thinking Efficiency in Large Reasoning Models via an Information-Theoretic Lens

Title: Taming LLMs with Negative Samples: A Reference-Free Framework to Evaluate Presentation Content with Actionable Feedback

Title: Privacy-Preserving Bathroom Monitoring for Elderly Emergencies Using PIR and LiDAR Sensors

Title: Multi-Scale Probabilistic Generation Theory: A Hierarchical Framework for Interpreting Large Language Models

Title: Decomposition of Water Demand Patterns Using Skewed Gaussian Distributions for Behavioral Insights and Operational Planning

Title: MetaGen Blended RAG: Higher Accuracy for Domain-Specific Q&A Without Fine-Tuning

Title: Uncovering a Universal Abstract Algorithm for Modular Addition in Neural Networks

Title: TAGS: A Test-Time Generalist-Specialist Framework with Retrieval-Augmented Reasoning and Verification

Title: Convexified Message-Passing Graph Neural Networks

Title: InstructPart: Task-Oriented Part Segmentation with Instruction Reasoning

Title: Thinking Fast and Right: Balancing Accuracy and Reasoning Length with Adaptive Rewards

Title: PLUMAGE: Probabilistic Low rank Unbiased Min Variance Gradient Estimator for Efficient Large Model Training

Title: COLORA: Efficient Fine-Tuning for Convolutional Models with a Study Case on Optical Coherence Tomography Image Classification

Title: Is It Bad to Work All the Time? Cross-Cultural Evaluation of Social Norm Biases in GPT-4

Title: Architectural Backdoors for Within-Batch Data Stealing and Model Inference Manipulation

Title: PerMedCQA: Benchmarking Large Language Models on Medical Consumer Question Answering in Persian Language

Title: An Attack to Break Permutation-Based Private Third-Party Inference Schemes for LLMs

Title: A Critical Evaluation of Defenses against Prompt Injection Attacks

Title: Model Editing with Graph-Based External Memory

Title: Sample Complexity of Diffusion Model Training Without Empirical Risk Minimizer Access

Title: Diffusion Self-Weighted Guidance for Offline Reinforcement Learning

Title: Task Specific Pruning with LLM-Sieve: How Many Parameters Does Your Task Really Need?

Title: The Unreasonable Effectiveness of Model Merging for Cross-Lingual Transfer in LLMs

Title: CONCORD: Concept-Informed Diffusion for Dataset Distillation

Title: SchemaGraphSQL: Efficient Schema Linking with Pathfinding Graph Algorithms for Text-to-SQL on Large-Scale Databases

Title: Weakly-supervised Mamba-Based Mastoidectomy Shape Prediction for Cochlear Implant Surgery Using 3D T-Distribution Loss

Title: Small Models, Smarter Learning: The Power of Joint Task Training

Title: NileChat: Towards Linguistically Diverse and Culturally Aware LLMs for Local Communities

Title: Dynamic Risk Assessments for Offensive Cybersecurity Agents

Title: Modeling interdependent privacy threats

Title: Applications of Modular Co-Design for De Novo 3D Molecule Generation

Title: Towards Anonymous Neural Network Inference

Title: Taming Diffusion for Dataset Distillation with High Representativeness

Title: AI/ML for 5G and Beyond Cybersecurity

Title: Thought calibration: Efficient and confident test-time scaling

Title: RaDeR: Reasoning-aware Dense Retrieval Models

Title: KL-regularization Itself is Differentially Private in Bandits and RLHF

Title: DanmakuTPPBench: A Multi-modal Benchmark for Temporal Point Process Modeling and Understanding

Title: Rehabilitation Exercise Quality Assessment and Feedback Generation Using Large Language Models with Prompt Engineering

Title: LatentLLM: Attention-Aware Joint Tensor Compression

Title: A Dual Basis Approach for Structured Robust Euclidean Distance Geometry

Title: CENet: Context Enhancement Network for Medical Image Segmentation

Title: Retrieval Augmented Generation-based Large Language Models for Bridging Transportation Cybersecurity Legal Knowledge Gaps

Title: TNG-CLIP:Training-Time Negation Data Generation for Negation Awareness of CLIP

Title: DB-KSVD: Scalable Alternating Optimization for Disentangling High-Dimensional Embedding Spaces

Title: OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data

Title: Mitigating Context Bias in Domain Adaptation for Object Detection using Mask Pooling

Title: Pessimism Principle Can Be Effective: Towards a Framework for Zero-Shot Transfer Reinforcement Learning

Title: BRIT: Bidirectional Retrieval over Unified Image-Text Graph

Title: MedScore: Factuality Evaluation of Free-Form Medical Answers

Title: Hybrid Latent Reasoning via Reinforcement Learning

Title: Anchored Diffusion Language Model

Title: Measuring South Asian Biases in Large Language Models

Title: HonestFace: Towards Honest Face Restoration with One-Step Diffusion Model

Title: Invisible Tokens, Visible Bills: The Urgent Need to Audit Hidden Operations in Opaque LLM Services

Title: Using Large Language Models to Tackle Fundamental Challenges in Graph Learning: A Comprehensive Survey

Title: Syn3DTxt: Embedding 3D Cues for Scene Text Generation

Title: The Prompt is Mightier than the Example

Title: Investigating AI Rater Effects of Large Language Models: GPT, Claude, Gemini, and DeepSeek

Title: Synthesizing and Adapting Error Correction Data for Mobile Large Language Model Applications

Title: FedHL: Federated Learning for Heterogeneous Low-Rank Adaptation via Unbiased Aggregation

Title: Beyond Masked and Unmasked: Discrete Diffusion Models via Partial Masking

Title: The Pragmatic Mind of Machines: Tracing the Emergence of Pragmatic Competence in Large Language Models

Title: G1: Teaching LLMs to Reason on Graphs with Reinforcement Learning

Title: Enhancing Training Data Attribution with Representational Optimization

Title: A Study of Semi-Fungible Token based Wi-Fi Access Control

Title: Improved Immiscible Diffusion: Accelerate Diffusion Training by Reducing Its Miscibility

Title: How Does Sequence Modeling Architecture Influence Base Capabilities of Pre-trained Language Models? Exploring Key Architecture Design Principles to Avoid Base Capabilities Degradation

Title: metaTextGrad: Automatically optimizing language model optimizers

Title: TK-Mamba: Marrying KAN with Mamba for Text-Driven 3D Medical Image Segmentation

Title: CLaDMoP: Learning Transferrable Models from Successful Clinical Trials via LLMs

Title: Preserving AUC Fairness in Learning with Noisy Protected Groups

Title: Reinforcement Fine-Tuning Powers Reasoning Capability of Multimodal Large Language Models

Title: Business as \textit{Rule}sual: A Benchmark and Framework for Business Rule Flow Modeling with LLMs

Title: Benchmarking Poisoning Attacks against Retrieval-Augmented Generation

Title: B-score: Detecting biases in large language models using response history

Title: Composable Cross-prompt Essay Scoring by Merging Models

Title: MSA at BEA 2025 Shared Task: Disagreement-Aware Instruction Tuning for Multi-Dimensional Evaluation of LLMs as Math Tutors

Title: LAMDA: A Longitudinal Android Malware Benchmark for Concept Drift Analysis

Title: Unraveling Misinformation Propagation in LLM Reasoning

Title: Exploring the Vulnerability of the Content Moderation Guardrail in Large Language Models via Intent Manipulation

Title: TAG-INSTRUCT: Controlled Instruction Complexity Enhancement through Structure-based Augmentation

Title: Joint-stochastic-approximation Autoencoders with Application to Semi-supervised Learning

Title: ThinkVideo: High-Quality Reasoning Video Segmentation with Chain of Thoughts

Title: From Word to World: Evaluate and Mitigate Culture Bias via Word Association Test

Title: Learning without Isolation: Pathway Protection for Continual Learning

Title: Enhancing Efficiency and Exploration in Reinforcement Learning for LLMs

Title: Removal of Hallucination on Hallucination: Debate-Augmented RAG

Title: On Denoising Walking Videos for Gait Recognition

Title: Unleashing Diffusion Transformers for Visual Correspondence by Modulating Massive Activations

Title: Guiding the Experts: Semantic Priors for Efficient and Focused MoE Routing

Title: HyperFake: Hyperspectral Reconstruction and Attention-Guided Analysis for Advanced Deepfake Detection

Title: Safety Alignment via Constrained Knowledge Unlearning

Title: EvdCLIP: Improving Vision-Language Retrieval with Entity Visual Descriptions from Large Language Models

Title: MisoDICE: Multi-Agent Imitation from Unlabeled Mixed-Quality Demonstrations

Title: Debate-to-Detect: Reformulating Misinformation Detection as a Real-World Debate with Large Language Models

Title: Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment

Title: Flex-Judge: Think Once, Judge Anywhere

Title: Rethinking Causal Mask Attention for Vision-Language Inference

Title: Spiking Transformers Need High Frequency Information

Title: PM-KVQ: Progressive Mixed-precision KV Cache Quantization for Long-CoT LLMs

Title: Mod-Adapter: Tuning-Free and Versatile Multi-concept Personalization via Modulation Adapter

Title: MLRan: A Behavioural Dataset for Ransomware Analysis and Detection

Title: Anonymity-washing

Title: Think Before You Accept: Semantic Reflective Verification for Faster Speculative Decoding

Title: DDO: Dual-Decision Optimization via Multi-Agent Collaboration for LLM-Based Medical Consultation

Title: SerendibCoins: Exploring The Sri Lankan Coins Dataset

Title: Multilingual Question Answering in Low-Resource Settings: A Dzongkha-English Benchmark for Foundation Models

Title: ThanoRA: Task Heterogeneity-Aware Multi-Task Low-Rank Adaptation

Title: Skip-Thinking: Chunk-wise Chain-of-Thought Distillation Enable Smaller Language Models to Reason Better and Faster

Title: Flow Matching for Geometric Trajectory Simulation

Title: ProphetDWM: A Driving World Model for Rolling Out Future Actions and Videos

Title: On the Emergence of Linear Analogies in Word Embeddings

Title: Why Not Replace? Sustaining Long-Term Visual Localization via Handcrafted-Learned Feature Collaboration on CPU

Title: Climate-Eval: A Comprehensive Benchmark for NLP Tasks Related to Climate Change

Title: LLM-QFL: Distilling Large Language Model for Quantum Federated Learning

Title: Robustness in Large Language Models: A Survey of Mitigation Strategies and Evaluation Metrics

Title: So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection

Title: DVD-Quant: Data-free Video Diffusion Transformers Quantization

Title: Does Representation Intervention Really Identify Desired Concepts and Elicit Alignment?

Title: Cross-Lingual Pitfalls: Automatic Probing Cross-Lingual Weakness of Multilingual Large Language Models

Title: Restoring Real-World Images with an Internal Detail Enhancement Diffusion Model

Title: Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps

Title: $PD^3F$: A Pluggable and Dynamic DoS-Defense Framework Against Resource Consumption Attacks Targeting Large Language Models

Title: TULUN: Transparent and Adaptable Low-resource Machine Translation

Title: From Generation to Detection: A Multimodal Multi-Task Dataset for Benchmarking Health Misinformation

Title: WeakMCN: Multi-task Collaborative Network for Weakly Supervised Referring Expression Comprehension and Segmentation

Title: Large Language Models in the Task of Automatic Validation of Text Classifier Predictions

Title: Benchmarking and Rethinking Knowledge Editing for Large Language Models

Title: Can LLMs Alleviate Catastrophic Forgetting in Graph Continual Learning? A Systematic Study

Title: MonarchAttention: Zero-Shot Conversion to Fast, Hardware-Aware Structured Attention

Title: GRE Suite: Geo-localization Inference via Fine-Tuned Vision-Language Models and Enhanced Reasoning Chains

Title: Towards Semantic Integration of Opinions: Unified Opinion Concepts Ontology and Extraction Task

Title: Neural Parameter Search for Slimmer Fine-Tuned Models and Better Transfer

Title: Optimal Transport-Based Token Weighting scheme for Enhanced Preference Optimization

Title: LoTA-QAF: Lossless Ternary Adaptation for Quantization-Aware Fine-Tuning

Title: FusionTrack: End-to-End Multi-Object Tracking in Arbitrary Multi-View Environment

Title: Align Beyond Prompts: Evaluating World Knowledge Alignment in Text-to-Image Generation

Title: Rethinking Direct Preference Optimization in Diffusion Models

Title: AuroRA: Breaking Low-Rank Bottleneck of LoRA with Nonlinear Mapping

Title: LogicCat: A Chain-of-Thought Text-to-SQL Benchmark for Multi-Domain Reasoning Challenges

Title: Unifying Attention Heads and Task Vectors via Hidden State Geometry in In-Context Learning

Title: Few-Shot Optimization for Sensor Data Using Large Language Models: A Case Study on Fatigue Detection

Title: Smart Energy Guardian: A Hybrid Deep Learning Model for Detecting Fraudulent PV Generation

Title: ToDRE: Visual Token Pruning via Diversity and Task Awareness for Efficient Large Vision-Language Models

Title: ARMS: A Vision for Actor Reputation Metric Systems in the Open-Source Software Supply Chain

Title: How Is LLM Reasoning Distracted by Irrelevant Context? An Analysis Using a Controlled Benchmark

Title: GenPO: Generative Diffusion Models Meet On-Policy Reinforcement Learning

Title: Multiple Wasserstein Gradient Descent Algorithm for Multi-Objective Distributional Optimization

Title: StyleGuard: Preventing Text-to-Image-Model-based Style Mimicry Attacks by Style Perturbations

Title: Dual-Path Stable Soft Prompt Generation for Domain Generalization

Title: Strong Membership Inference Attacks on Massive Datasets and (Moderately) Large Language Models

Title: Disentangling Knowledge Representations for Large Language Model Editing

Title: OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ Tasks

Title: HD-PiSSA: High-Rank Distributed Orthogonal Adaptation

Title: Geometry Aware Operator Transformer as an Efficient and Accurate Neural Surrogate for PDEs on Arbitrary Domains

Title: Soft Weighted Machine Unlearning

Title: Leveraging Per-Instance Privacy for Machine Unlearning

Title: ALPS: Attention Localization and Pruning Strategy for Efficient Alignment of Large Language Models

Title: Mal-D2GAN: Double-Detector based GAN for Malware Generation

Title: VORTA: Efficient Video Diffusion via Routing Sparse Attention

Title: SAMA: Towards Multi-Turn Referential Grounded Video Chat with Large Language Models

Title: Usability of Token-based and Remote Electronic Signatures: A User Experience Study

Title: Reasoning Segmentation for Images and Videos: A Survey

Title: MSLAU-Net: A Hybird CNN-Transformer Network for Medical Image Segmentation

Title: How to build a consistency model: Learning flow maps via self-distillation

Title: On the Effect of Negative Gradient in Group Relative Deep Reinforcement Optimization

Title: Localizing Knowledge in Diffusion Transformers

Title: Don't Look Only Once: Towards Multimodal Interactive Reasoning with Selective Visual Revisitation

Title: Multi-Party Conversational Agents: A Survey

Title: LLM-Driven APT Detection for 6G Wireless Networks: A Systematic Review and Taxonomy

Title: Smoothie: Smoothing Diffusion on Token Embeddings for Text Generation

Title: Writing Like the Best: Exemplar-Based Expository Text Generation

Title: Securing Credit Inquiries: The Role of Real-Time User Approval in Preventing SSN Identity Theft

Title: Audio Jailbreak Attacks: Exposing Vulnerabilities in SpeechGPT in a White-Box Framework

Title: Distribution-Aware Mobility-Assisted Decentralized Federated Learning

Title: Sci-LoRA: Mixture of Scientific LoRAs for Cross-Domain Lay Paraphrasing

Title: Eye-See-You: Reverse Pass-Through VR and Head Avatars

Title: Understanding the Relationship Between Personal Data Privacy Literacy and Data Privacy Information Sharing by University Students

Title: Zero Trust Cybersecurity: Procedures and Considerations in Context

Title: Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation

Title: RefLoRA: Refactored Low-Rank Adaptation for Efficient Fine-Tuning of Large Models

Title: CRMArena-Pro: Holistic Assessment of LLM Agents Across Diverse Business Scenarios and Interactions

Title: REGen: Multimodal Retrieval-Embedded Generation for Long-to-Short Video Editing

Title: Partition Generative Modeling: Masked Modeling Without Masks

Title: LORE: Lagrangian-Optimized Robust Embeddings for Visual Encoders

Title: KerZOO: Kernel Function Informed Zeroth-Order Optimization for Accurate and Accelerated LLM Fine-Tuning

Title: Security Concerns for Large Language Models: A Survey

Title: Conformal Prediction for Uncertainty Estimation in Drug-Target Interaction Prediction

Title: Beyond Domain Randomization: Event-Inspired Perception for Visually Robust Adversarial Imitation from Videos

Title: PromptWise: Online Learning for Cost-Aware Prompt Assignment in Generative Models

Title: Federated Retrieval-Augmented Generation: A Systematic Mapping Study

Title: Behavior Injection: Preparing Language Models for Reinforcement Learning

Title: Graph-Based Operator Learning from Limited Data on Irregular Domains

Title: LLM-Guided Taxonomy and Hierarchical Uncertainty for 3D Point CLoud Active Learning

Title: Words as Geometric Features: Estimating Homography using Optical Character Recognition as Compressed Image Representation

Title: Hybrid Neural-MPM for Interactive Fluid Simulations in Real-Time

Title: Benchmarking Large Language Models for Cyberbullying Detection in Real-World YouTube Comments

Title: MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems

Title: Exemplifying Emerging Phishing: QR-based Browser-in-The-Browser (BiTB) Attack

Title: Echo Planning for Autonomous Driving: From Current Observations to Future Trajectories and Back

Title: OpenHOI: Open-World Hand-Object Interaction Synthesis with Multimodal Large Language Model

Title: Exact Expressive Power of Transformers with Padding

Title: The Price of Format: Diversity Collapse in LLMs

Title: BnMMLU: Measuring Massive Multitask Language Understanding in Bengali

Title: Online Knowledge Distillation with Reward Guidance

Title: How Do Images Align and Complement LiDAR? Towards a Harmonized Multi-modal 3D Panoptic Segmentation

Title: CDPDNet: Integrating Text Guidance with Hybrid Vision Encoders for Medical Image Segmentation

Title: System-1.5 Reasoning: Traversal in Language and Latent Spaces with Dynamic Shortcuts

Title: MGD$^3$: Mode-Guided Dataset Distillation using Diffusion Models

Title: Protein Design with Dynamic Protein Vocabulary

Title: Learning to Explain: Prototype-Based Surrogate Models for LLM Classification

Title: Is Architectural Complexity Overrated? Competitive and Interpretable Knowledge Graph Completion with RelatE

Title: Hierarchical Mamba Meets Hyperbolic Geometry: A New Paradigm for Structured Language Embeddings

Title: AI4Math: A Native Spanish Benchmark for University-Level Mathematical Reasoning in Large Language Models

Title: GhostPrompt: Jailbreaking Text-to-image Generative Models based on Dynamic Optimization

Title: FedSKC: Federated Learning with Non-IID Data via Structural Knowledge Collaboration

Title: AmorLIP: Efficient Language-Image Pretraining via Amortization

Title: STRICT: Stress Test of Rendering Images Containing Text

Title: SPARS: Self-Play Adversarial Reinforcement Learning for Segmentation of Liver Tumours

Title: Kernel Space Diffusion Model for Efficient Remote Sensing Pansharpening

Title: VPGS-SLAM: Voxel-based Progressive 3D Gaussian SLAM in Large-Scale Scenes

Title: FiLLM -- A Filipino-optimized Large Language Model based on Southeast Asia Large Language Model (SEALLM)

Title: Automatic and Structure-Aware Sparsification of Hybrid Neural ODEs

Title: VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Gudied Iterative Policy Optimization

Title: Secure IVSHMEM: End-to-End Shared-Memory Protocol with Hypervisor-CA Handshake and In-Kernel Access Control

Title: A quantitative notion of economic security for smart contract compositions

Title: Co-AttenDWG: Co-Attentive Dimension-Wise Gating and Expert Fusion for Multi-Modal Offensive Content Detection

Title: Faithful Group Shapley Value

Title: Tokenizing Electron Cloud in Protein-Ligand Interaction Learning

Title: Can Multimodal Large Language Models Understand Spatial Relations?

Title: CrosGrpsABS: Cross-Attention over Syntactic and Semantic Graphs for Aspect-Based Sentiment Analysis in a Low-Resource Language

Title: Querying Kernel Methods Suffices for Reconstructing their Training Data

Title: A Smart Healthcare System for Monkeypox Skin Lesion Detection and Tracking

Title: InfoChartQA: A Benchmark for Multimodal Question Answering on Infographic Charts

Title: A Systematic Classification of Vulnerabilities in MoveEVM Smart Contracts (MWC)

Title: Efficient Data Selection at Scale via Influence Distillation

Title: An Embarrassingly Simple Defense Against LLM Abliteration Attacks

Title: Distributionally Robust Deep Q-Learning

Title: UNCERTAINTY-LINE: Length-Invariant Estimation of Uncertainty for Large Language Models

Title: Training-free Stylized Text-to-Image Generation with Fast Inference

Title: Towards Harmonized Uncertainty Estimation for Large Language Models

Title: ChartSketcher: Reasoning with Multimodal Feedback and Reflection for Chart Understanding

Title: Towards Generalized Proactive Defense against Face Swappingwith Contour-Hybrid Watermark

Title: Jodi: Unification of Visual Generation and Understanding via Joint Modeling

Title: Plug-and-Play Context Feature Reuse for Efficient Masked Generation

Title: CMoS: Rethinking Time Series Prediction Through the Lens of Chunk-wise Spatial Correlations

Title: Towards Robust Influence Functions with Flat Validation Minima

Title: ASPO: Adaptive Sentence-Level Preference Optimization for Fine-Grained Multimodal Reasoning

Title: Optimization-Inspired Few-Shot Adaptation for Large Language Models

Title: CCHall: A Novel Benchmark for Joint Cross-Lingual and Cross-Modal Hallucinations Detection in Large Language Models

Title: An Interpretable Representation Learning Approach for Diffusion Tensor Imaging

Title: Self-Critique Guided Iterative Reasoning for Multi-hop Question Answering

Title: CreatiDesign: A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design

Title: FP4 All the Way: Fully Quantized Training of LLMs

Title: Controlling Language Confusion in Multilingual LLMs

Title: Freqformer: Image-Demoiréing Transformer via Efficient Frequency Decomposition

Title: Delving into Multilingual Ethical Bias: The MSQAD with Statistical Hypothesis Tests for Large Language Models

Title: Exploring Magnitude Preservation and Rotation Modulation in Diffusion Transformers

Title: MMATH: A Multilingual Benchmark for Mathematical Reasoning

Title: RetrieveAll: A Multilingual Named Entity Recognition Framework with Large Language Models

Title: Fast and Accurate Power Load Data Completion via Regularization-optimized Low-Rank Factorization

Title: Veta-GS: View-dependent deformable 3D Gaussian Splatting for thermal infrared Novel-view Synthesis

Title: The Eye of Sherlock Holmes: Uncovering User Private Attribute Profiling via Vision-Language Model Agentic Framework

Title: Shifting AI Efficiency From Model-Centric to Data-Centric Compression

Title: MIND-Edit: MLLM Insight-Driven Editing via Language-Vision Projection

Title: FHGS: Feature-Homogenized Gaussian Splatting

Title: Sparse-to-Dense: A Free Lunch for Lossless Acceleration of Video Understanding in LLMs

Title: A Joint Learning Framework with Feature Reconstruction and Prediction for Incomplete Satellite Image Time Series in Agricultural Semantic Segmentation

Title: SpokenNativQA: Multilingual Everyday Spoken Queries for LLMs

Title: JEDI: The Force of Jensen-Shannon Divergence in Disentangling Diffusion Models

Title: EventEgoHands: Event-based Egocentric 3D Hand Mesh Reconstruction

Title: Penetration Testing for System Security: Methods and Practical Approaches

Title: Assistant-Guided Mitigation of Teacher Preference Bias in LLM-as-a-Judge

Title: Federated Learning: From Theory to Practice

Title: Two LLMs debate, both are certain they've won

Title: PosePilot: An Edge-AI Solution for Posture Correction in Physical Exercises

Title: LIMOPro: Reasoning Refinement for Efficient and Effective Test-time Scaling

Title: I2MoE: Interpretable Multimodal Interaction-aware Mixture-of-Experts

Title: Misleading through Inconsistency: A Benchmark for Political Inconsistencies Detection

Title: Interpretable Graph Learning Over Sets of Temporally-Sparse Data

Title: Curvature Dynamic Black-box Attack: revisiting adversarial robustness via dynamic curvature estimation

Title: Step-level Reward for Free in RL-based T2I Diffusion Model Fine-tuning

Title: DREAM: Drafting with Refined Target Features and Entropy-Adaptive Cross-Attention Fusion for Multimodal Speculative Decoding

Title: OptiMindTune: A Multi-Agent Framework for Intelligent Hyperparameter Optimization

Title: SpeakStream: Streaming Text-to-Speech with Interleaved Data

Title: Domain and Task-Focused Example Selection for Data-Efficient Contrastive Medical Image Segmentation

Title: MOOSE-Chem2: Exploring LLM Limits in Fine-Grained Scientific Hypothesis Discovery via Hierarchical Search

Title: Towards Understanding the Mechanisms of Classifier-Free Guidance

Title: When Ethics and Payoffs Diverge: LLM Agents in Morally Charged Social Dilemmas

Title: The Overthinker's DIET: Cutting Token Calories with DIfficulty-AwarE Training

Title: LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models

Title: Scaling Laws for Gradient Descent and Sign Descent for Linear Bigram Models under Zipf's Law

Title: RAISE: Realness Assessment for Image Synthesis and Evaluation

Title: Evaluating Text Creativity across Diverse Domains: A Dataset and Large Language Model Evaluator

Title: Efficient Policy Optimization in Robust Constrained MDPs with Iteration Complexity Guarantees

Title: DriveX: Omni Scene Modeling for Learning Generalizable World Knowledge in Autonomous Driving

Title: LLLMs: A Data-Driven Survey of Evolving Research on Limitations of Large Language Models

Title: ActiveDPO: Active Direct Preference Optimization for Sample-Efficient Alignment

Title: Deformable Attentive Visual Enhancement for Referring Segmentation Using Vision-Language Model

Title: To CoT or To Loop? A Formal Comparison Between Chain-of-Thought and Looped Transformers

Title: Improving Value Estimation Critically Enhances Vanilla Policy Gradient

Title: Unveiling Dual Quality in Product Reviews: An NLP-Based Approach

Title: VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use

Title: PolyPose: Localizing Deformable Anatomy in 3D from Sparse 2D X-ray Images using Polyrigid Transforms

Title: Towards Large Reasoning Models for Agriculture

Title: ALRPHFS: Adversarially Learned Risk Patterns with Hierarchical Fast \& Slow Reasoning for Robust Agent Defense

Title: Enhancing Text-to-Image Diffusion Transformer via Split-Text Conditioning

Title: Cellular Traffic Prediction via Byzantine-robust Asynchronous Federated Learning

Title: Improving Novel view synthesis of 360$^\circ$ Scenes in Extremely Sparse Views by Jointly Training Hemisphere Sampled Synthetic Images

Title: A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning

Title: BSAGIoT: A Bayesian Security Aspect Graph for Internet of Things (IoT)

Title: A Graph Perspective to Probe Structural Patterns of Knowledge in Large Language Models

Title: Hypercube-RAG: Hypercube-Based Retrieval-Augmented Generation for In-domain Scientific Question-Answering

Title: TextDiffuser-RL: Efficient and Robust Text Layout Optimization for High-Fidelity Text-to-Image Synthesis

Title: Alchemist: Turning Public Text-to-Image Data into Generative Gold

Title: A Necessary Step toward Faithfulness: Measuring and Improving Consistency in Free-Text Explanations

Title: SituatedThinker: Grounding LLM Reasoning with Real-World through Situated Thinking

Title: A Novel Zero-Trust Identity Framework for Agentic AI: Decentralized Authentication and Fine-Grained Access Control

Title: Concept Reachability in Diffusion Models: Beyond Dataset Constraints

Title: Likert or Not: LLM Absolute Relevance Judgments on Fine-Grained Ordinal Scales

Title: Prompting Decision Transformers for Zero-Shot Reach-Avoid Policies

Title: Communication-Efficient Multi-Device Inference Acceleration for Transformer Models

Title: PatentScore: Multi-dimensional Evaluation of LLM-Generated Patent Claims

Title: Beyond Editing Pairs: Fine-Grained Instructional Image Editing via Multi-Scale Learnable Regions

Title: GC-KBVQA: A New Four-Stage Framework for Enhancing Knowledge Based Visual Question Answering Performance

Title: ChartLens: Fine-grained Visual Attribution in Charts

Title: RADEP: A Resilient Adaptive Defense Framework Against Model Extraction Attacks

Title: SETransformer: A Hybrid Attention-Based Architecture for Robust Human Activity Recognition

Title: DiSa: Directional Saliency-Aware Prompt Learning for Generalizable Vision-Language Models

Title: Belief Attribution as Mental Explanation: The Role of Accuracy, Informativity, and Causality

Title: Absolute Coordinates Make Motion Generation Easy

Title: GSA-TTS : Toward Zero-Shot Speech Synthesis based on Gradual Style Adaptor

Title: Advancing Limited-Angle CT Reconstruction Through Diffusion-Based Sinogram Completion

Title: Alignment of large language models with constrained learning

Title: gec-metrics: A Unified Library for Grammatical Error Correction Evaluation

Title: VADER: A Human-Evaluated Benchmark for Vulnerability Assessment, Detection, Explanation, and Remediation

Title: Are Time-Series Foundation Models Deployment-Ready? A Systematic Study of Adversarial Robustness Across Domains

Title: Erasing Concepts, Steering Generations: A Comprehensive Survey of Concept Suppression

Title: Exploring the Possibility of TypiClust for Low-Budget Federated Active Learning

Title: CoTGuard: Using Chain-of-Thought Triggering for Copyright Protection in Multi-Agent LLM Systems

Title: Self-Reflective Planning with Knowledge Graphs: Enhancing LLM Reasoning Reliability for Question Answering

Title: ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting

Title: LlamaSeg: Image Segmentation via Autoregressive Mask Generation

Title: Surrogate-Assisted Evolutionary Reinforcement Learning Based on Autoencoder and Hyperbolic Neural Network

Title: Structure Disruption: Subverting Malicious Diffusion-Based Inpainting via Self-Attention Query Perturbation

Title: The Role of Diversity in In-Context Learning for Large Language Models

Title: WINA: Weight Informed Neuron Activation for Accelerating Large Language Model Inference

Title: Deriving Strategic Market Insights with Large Language Models: A Benchmark for Forward Counterfactual Generation

Title: Importance Weighted Score Matching for Diffusion Samplers with Enhanced Mode Coverage

Title: Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression

Title: Surrogate Signals from Format and Length: Reinforcement Learning for Solving Mathematical Problems without Ground Truth Answers

Title: The Birth of Knowledge: Emergent Features across Time, Space, and Scale in Large Language Models

Title: MetaGMT: Improving Actionable Interpretability of Graph Multilinear Networks via Meta-Learning Filtration

Title: An Empirical Study of JavaScript Inclusion Security Issues in Chrome Extensions

Title: Your Classifier Can Do More: Towards Bridging the Gaps in Classification, Robustness, and Generation

Title: Residual Cross-Attention Transformer-Based Multi-User CSI Feedback with Deep Joint Source-Channel Coding

Title: Diversity-Driven Generative Dataset Distillation Based on Diffusion Model with Self-Adaptive Memory

Title: Balancing Computation Load and Representation Expressivity in Parallel Hybrid Neural Networks

Title: Continuous Self-Improvement of Large Language Models by Test-time Training with Verifier-Driven Sample Selection

Title: Win Fast or Lose Slow: Balancing Speed and Accuracy in Latency-Sensitive Decisions of LLMs

Title: Language of Network: A Generative Pre-trained Model for Encrypted Traffic Comprehension

Title: CulFiT: A Fine-grained Cultural-aware LLM Training Paradigm via Multilingual Critique Data Synthesis

Title: Understanding Transformer from the Perspective of Associative Memory

Title: ViewCraft3D: High-Fidelity and View-Consistent 3D Vector Graphics Synthesis

Title: The Role of Video Generation in Enhancing Data-Limited Action Understanding

Title: DOGe: Defensive Output Generation for LLM Protection Against Knowledge Distillation

Title: LLM Meets Scene Graph: Can Large Language Models Understand and Generate Scene Graphs? A Benchmark and Empirical Study

Title: Causal Distillation: Transferring Structured Explanations from Large to Compact Language Models

Title: SIPDO: Closed-Loop Prompt Optimization via Synthetic Data Feedback

Title: Toward Patient-specific Partial Point Cloud to Surface Completion for Pre- to Intra-operative Registration in Image-guided Liver Interventions

Title: Regularized Personalization of Text-to-Image Diffusion Models without Distributional Drift

Title: Applications and Effect Evaluation of Generative Adversarial Networks in Semi-Supervised Learning

Title: Navigating loss manifolds via rigid body dynamics: A promising avenue for robustness and generalisation

Title: AmpleHate: Amplifying the Attention for Versatile Implicit Hate Detection

Title: Minimalist Softmax Attention Provably Learns Constrained Boolean Functions

Title: Fox in the Henhouse: Supply-Chain Backdoor Attacks Against Reinforcement Learning

Title: ExAnte: A Benchmark for Ex-Ante Inference in Large Language Models

Title: TDVE-Assessor: Benchmarking and Evaluating the Quality of Text-Driven Video Editing with LMMs

Title: SMART-PC: Skeletal Model Adaptation for Robust Test-Time Training in Point Clouds

Title: STRAP: Spatio-Temporal Pattern Retrieval for Out-of-Distribution Generalization

Title: How Syntax Specialization Emerges in Language Models

Title: Towards Multi-Granularity Memory Association and Selection for Long-Term Conversational Agents

Title: On scalable and efficient training of diffusion samplers

Title: Aggregated Structural Representation with Large Language Models for Human-Centric Layout Generation

Title: Few-Shot Class-Incremental Learning For Efficient SAR Automatic Target Recognition

Title: What You Perceive Is What You Conceive: A Cognition-Inspired Framework for Open Vocabulary Image Segmentation

Title: DocMEdit: Towards Document-Level Model Editing

Title: Guard Me If You Know Me: Protecting Specific Face-Identity from Deepfakes

Title: Beyond Segmentation: Confidence-Aware and Debiased Estimation of Ratio-based Biomarkers

Title: TailorKV: A Hybrid Framework for Long-Context Inference via Tailored KV Cache Optimization

Title: WQLCP: Weighted Adaptive Conformal Prediction for Robust Uncertainty Quantification Under Distribution Shifts

Title: Model Agnostic Differentially Private Causal Inference

Title: Learning to Reason without External Rewards

Title: Multi-Agent Collaboration via Evolving Orchestration

Title: Evaluating Robustness of Large Audio Language Models to Audio Injection: An Empirical Study

Title: Preference Optimization by Estimating the Ratio of the Data Distribution

Title: Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression

Title: Rep3D: Re-parameterize Large 3D Kernels with Low-Rank Receptive Modeling for Medical Imaging

Title: Kuramoto-FedAvg: Using Synchronization Dynamics to Improve Federated Learning Optimization under Statistical Heterogeneity

Title: Energy-based Preference Optimization for Test-time Adaptation

Title: Skrull: Towards Efficient Long Context Fine-tuning through Dynamic Data Scheduling

Title: JailBound: Jailbreaking Internal Safety Boundaries of Vision-Language Models

Title: TESSER: Transfer-Enhancing Adversarial Attacks from Vision Transformers via Spectral and Semantic Regularization

Title: Diagnosing and Mitigating Modality Interference in Multimodal Large Language Models

Title: SESaMo: Symmetry-Enforcing Stochastic Modulation for Normalizing Flows

Title: Decoupling Spatio-Temporal Prediction: When Lightweight Large Models Meet Adaptive Hypergraphs

Title: HomeBench: Evaluating LLMs in Smart Homes with Valid and Invalid Instructions Across Single and Multiple Devices

Title: DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue

Title: Segment First or Comprehend First? Explore the Limit of Unsupervised Word Segmentation with Large Language Models

Title: Weak-Jamming Detection in IEEE 802.11 Networks: Techniques, Scenarios and Mobility

Title: Faster and Better LLMs via Latency-Aware Test-Time Scaling

Title: Interleaved Reasoning for Large Language Models via Reinforcement Learning

Title: MoESD: Unveil Speculative Decoding's Potential for Accelerating Sparse MoE

Title: Energy-based generator matching: A neural sampler for general state space

Title: Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval

Title: ReDDiT: Rehashing Noise for Discrete Visual Generation

Title: LangDAug: Langevin Data Augmentation for Multi-Source Domain Generalization in Medical Image Segmentation

Title: GenKI: Enhancing Open-Domain Question Answering with Knowledge Integration and Controllable Generation in Large Language Models

Title: LeCoDe: A Benchmark Dataset for Interactive Legal Consultation Dialogue Evaluation

Title: Burst Image Super-Resolution via Multi-Cross Attention Encoding and Multi-Scan State-Space Decoding

Title: Reshaping Representation Space to Balance the Safety and Over-rejection in Large Audio Language Models

Title: Comparing Moral Values in Western English-speaking societies and LLMs with Word Associations

Title: Calibrating Pre-trained Language Classifiers on LLM-generated Noisy Labels via Iterative Refinement

Title: Deep Actor-Critics with Tight Risk Certificates

Title: VisCRA: A Visual Chain Reasoning Attack for Jailbreaking Multimodal Large Language Models

Title: Graph Guided Diffusion: Unified Guidance for Conditional Graph Generation

Title: DriveCamSim: Generalizable Camera Simulation via Explicit Camera Modeling for Autonomous Driving

Title: Knowledge-Aligned Counterfactual-Enhancement Diffusion Perception for Unsupervised Cross-Domain Visual Emotion Recognition

Title: Modeling Beyond MOS: Quality Assessment Models Must Integrate Context, Reasoning, and Multimodality

Title: JEDI: Latent End-to-end Diffusion Mitigates Agent-Human Performance Asymmetry in Model-Based Reinforcement Learning

Title: Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed Environments

Title: Leveraging Importance Sampling to Detach Alignment Modules from Large Language Models

Title: Point-RFT: Improving Multimodal Reasoning with Visually Grounded Reinforcement Finetuning

Title: Error Typing for Smarter Rewards: Improving Process Reward Models with Error-Aware Hierarchical Supervision

Title: MLLM-Guided VLM Fine-Tuning with Joint Inference for Zero-Shot Composed Image Retrieval

Title: On the Relation between Rectified Flows and Optimal Transport

Title: MT$^{3}$: Scaling MLLM-based Text Image Machine Translation via Multi-Task Reinforcement Learning

Title: Graceful Forgetting in Generative Language Models

Title: Distilling Closed-Source LLM's Knowledge for Locally Stable and Economic Biomedical Entity Linking

Title: Cross-Sequence Semi-Supervised Learning for Multi-Parametric MRI-Based Visual Pathway Delineation

Title: Machine Learning Algorithm for Noise Reduction and Disease-Causing Gene Feature Extraction in Gene Sequencing Data

Title: HAODiff: Human-Aware One-Step Diffusion via Dual-Prompt Guidance

Title: Token-level Accept or Reject: A Micro Alignment Approach for Large Language Models

Title: Improving Heart Rejection Detection in XPCI Images Using Synthetic Data Augmentation

Title: SuperAD: A Training-free Anomaly Classification and Segmentation Method for CVPR 2025 VAND 3.0 Workshop Challenge Track 1: Adapt & Detect

Title: SAIL: Self-supervised Albedo Estimation from Real Images with a Latent Diffusion Model

Title: Discrete Markov Bridge

Title: NeuSym-RAG: Hybrid Neural Symbolic Retrieval with Multiview Structuring for PDF Question Answering

Title: Agentic Predictor: Performance Prediction for Agentic Workflows via Multi-View Encoding

Title: SGM: A Framework for Building Specification-Guided Moderation Filters

Title: What Really Matters in Many-Shot Attacks? An Empirical Study of Long-Context Vulnerabilities in LLMs

Title: Analyzing Political Bias in LLMs via Target-Oriented Sentiment Classification

Title: What Can RL Bring to VLA Generalization? An Empirical Study

Title: The Missing Point in Vision Transformers for Universal Image Segmentation

Title: The Avengers: A Simple Recipe for Uniting Smaller Language Models to Challenge Proprietary Giants

Title: MOLE: Metadata Extraction and Validation in Scientific Papers Using LLMs

Title: GraphAU-Pain: Graph-based Action Unit Representation for Pain Intensity Estimation

Title: Compliance-to-Code: Enhancing Financial Compliance Checking via Code Generation

Title: Exploring Consciousness in LLMs: A Systematic Survey of Theories, Implementations, and Frontier Risks

Title: Density Ratio-Free Doubly Robust Proxy Causal Learning

Title: Efficient Multi-modal Long Context Learning for Training-free Adaptation

Title: GoLF-NRT: Integrating Global Context and Local Geometry for Few-Shot View Synthesis

Title: Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective

Title: InfoCons: Identifying Interpretable Critical Concepts in Point Clouds via Information Theory

Title: Poison in the Well: Feature Embedding Disruption in Backdoor Attacks

Title: LAPA-based Dynamic Privacy Optimization for Wireless Federated Learning in Heterogeneous Environments

Title: Foundation Models for Tabular Data within Systemic Contexts Need Grounding

Title: FoodTaxo: Generating Food Taxonomies with Large Language Models

Title: One Surrogate to Fool Them All: Universal, Transferable, and Targeted Adversarial Attacks with CLIP

Title: Zero-Shot Pseudo Labels Generation Using SAM and CLIP for Semi-Supervised Semantic Segmentation

Title: Improving Multilingual Math Reasoning for African Languages

Title: Beyond Specialization: Benchmarking LLMs for Transliteration of Indian Languages

Title: Editing as Unlearning: Are Knowledge Editing Methods Strong Baselines for Large Language Model Unlearning?

Title: CPA-RAG:Covert Poisoning Attacks on Retrieval-Augmented Generation in Large Language Models

Title: Deep Active Inference Agents for Delayed and Long-Horizon Environments

Title: Harnessing the Power of Training-Free Techniques in Text-to-2D Generation for Text-to-3D Generation via Score Distillation Sampling

Title: Deep Spectral Prior

Title: StyleAR: Customizing Multimodal Autoregressive Model for Style-Aligned Text-to-Image Generation

Title: Vad-R1: Towards Video Anomaly Reasoning via Perception-to-Cognition Chain-of-Thought

Title: Generalized and Personalized Federated Learning with Foundation Models via Orthogonal Transformations

Title: OmniFall: A Unified Staged-to-Wild Benchmark for Human Fall Detection

Title: ESLM: Risk-Averse Selective Language Modeling for Efficient Pretraining

Title: Underwater Diffusion Attention Network with Contrastive Language-Image Joint Learning for Underwater Image Enhancement

Title: Dynamic-I2V: Exploring Image-to-Video Generaion Models via Multimodal LLM

Title: Attention! You Vision Language Model Could Be Maliciously Manipulated

Title: APE: A Data-Centric Benchmark for Efficient LLM Adaptation in Text Summarization

Title: Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles

Title: A Responsible Face Recognition Approach for Small and Mid-Scale Systems Through Personalized Neural Networks

Title: CA3D: Convolutional-Attentional 3D Nets for Efficient Video Activity Recognition on the Edge

Title: Logic Gate Neural Networks are Good for Verification

Title: ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs

Title: Multi-Timescale Motion-Decoupled Spiking Transformer for Audio-Visual Zero-Shot Learning

Title: Task-Oriented Low-Label Semantic Communication With Self-Supervised Learning

Title: SaSi: A Self-augmented and Self-interpreted Deep Learning Approach for Few-shot Cryo-ET Particle Detection

Title: Which Data Attributes Stimulate Math and Code Reasoning? An Investigation via Influence Functions

Title: Multimodal Reasoning Agent for Zero-Shot Composed Image Retrieval

Title: An Explainable Diagnostic Framework for Neurodegenerative Dementias via Reinforcement-Optimized LLM Reasoning

Title: UltraVSR: Achieving Ultra-Realistic Video Super-Resolution with Efficient One-Step Diffusion Space

Title: MiniLongBench: The Low-cost Long Context Understanding Benchmark for Large Language Models

Title: The Limits of Preference Data for Post-Training

Title: Learning to Select In-Context Demonstration Preferred by Large Language Model

Title: Differential Privacy Analysis of Decentralized Gossip Averaging under Varying Threat Models

Title: CP-Router: An Uncertainty-Aware Router Between LLM and LRM

Title: Conversational Lexicography: Querying Lexicographic Data on Knowledge Graphs with SPARQL through Natural Language

Title: DFIR-Metric: A Benchmark Dataset for Evaluating Large Language Models in Digital Forensics and Incident Response

Title: Rethinking Probabilistic Circuit Parameter Learning

Title: Structured Initialization for Vision Transformers

Title: How Well Do Large Reasoning Models Translate? A Comprehensive Evaluation for Multi-Domain Machine Translation

Title: Embracing Imperfection: Simulating Students with Diverse Cognitive Levels Using LLM-based Agents

Title: NEXT: Multi-Grained Mixture of Experts via Text-Modulation for Multi-Modal Object Re-ID

Title: TabPFN: One Model to Rule Them All?

Title: Mixture of LoRA Experts for Low-Resourced Multi-Accent Automatic Speech Recognition

Title: WebCoT: Enhancing Web Agent Reasoning by Reconstructing Chain-of-Thought in Reflection, Branching, and Rollback

Title: Does Rationale Quality Matter? Enhancing Mental Disorder Detection via Selective Reasoning Distillation

Title: Ontology- and LLM-based Data Harmonization for Federated Learning in Healthcare

Title: Training LLM-Based Agents with Synthetic Self-Reflected Trajectories and Partial Masking

Title: ReasonPlan: Unified Scene Prediction and Decision Reasoning for Closed-loop Autonomous Driving

Title: Gradient Inversion Transcript: Leveraging Robust Generative Priors to Reconstruct Training Data from Gradient Leakage

Title: ViTaPEs: Visuotactile Position Encodings for Cross-Modal Alignment in Multimodal Transformers

Title: EmoNet-Face: An Expert-Annotated Benchmark for Synthetic Emotion Recognition

Title: Graph Wave Networks

Title: Beyond Simple Concatenation: Fairly Assessing PLM Architectures for Multi-Chain Protein-Protein Interactions Prediction

Title: Uncertainty-Aware Attention Heads: Efficient Unsupervised Uncertainty Quantification for LLMs

Title: Grammars of Formal Uncertainty: When to Trust LLMs in Automated Reasoning Tasks

Title: Synthetic Time Series Forecasting with Transformer Architectures: Extensive Simulation Benchmarks

Title: Data-Free Class-Incremental Gesture Recognition with Prototype-Guided Pseudo Feature Replay

Title: Catoni-Style Change Point Detection for Regret Minimization in Non-Stationary Heavy-Tailed Bandits

Title: Ankh3: Multi-Task Pretraining with Sequence Denoising and Completion Enhances Protein Representations

Title: Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion

Title: PAMD: Plausibility-Aware Motion Diffusion Model for Long Dance Generation

Title: SafeDPO: A Simple Approach to Direct Preference Optimization with Enhanced Safety

Title: Incentivizing Reasoning from Weak Supervision

Title: An Out-Of-Distribution Membership Inference Attack Approach for Cross-Domain Graph Attacks

Title: Grokking ExPLAIND: Unifying Model, Data, and Training Attribution to Study Model Behavior

Title: Inference-time Alignment in Continuous Space

Title: Multi-Domain Explainability of Preferences

Title: Spurious Privacy Leakage in Neural Networks

Title: MA-RAG: Multi-Agent Retrieval-Augmented Generation via Collaborative Chain-of-Thought Reasoning

Title: S2LPP: Small-to-Large Prompt Prediction across LLMs

Title: Transformer in Protein: A Survey

Title: Large Language Models Meet Knowledge Graphs for Question Answering: Synthesis and Opportunities

Title: AdaTP: Attention-Debiased Token Pruning for Video Large Language Models

Title: Adaptive Deep Reasoning: Triggering Deep Thinking When Needed

Title: From Data to Modeling: Fully Open-vocabulary Scene Graph Generation

Title: Refining Few-Step Text-to-Multiview Diffusion via Reinforcement Learning

Title: Language-Agnostic Suicidal Risk Detection Using Large Language Models

Title: Proxy-Free GFlowNet

Title: ResSVD: Residual Compensated SVD for Large Language Model Compression

Title: Named Entity Recognition in Historical Italian: The Case of Giacomo Leopardi's Zibaldone

Title: TrojanStego: Your Language Model Can Secretly Be A Steganographic Privacy Leaking Agent

Title: Understanding Generalization in Diffusion Models via Probability Flow Distance

Title: TUNA: Comprehensive Fine-grained Temporal Understanding Evaluation on Dense Dynamic Videos

Title: Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers

Title: MolEditRL: Structure-Preserving Molecular Editing via Discrete Diffusion and Reinforcement Learning

Title: Tensorization is a powerful but underexplored tool for compression and interpretability of neural networks

Title: SeMe: Training-Free Language Model Merging via Semantic Alignment

Title: FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities

Title: UORA: Uniform Orthogonal Reinitialization Adaptation in Parameter-Efficient Fine-Tuning of Large Models

Title: Pangu Light: Weight Re-Initialization for Pruning and Accelerating LLMs

Title: HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters

Title: Exploring Generative Error Correction for Dysarthric Speech Recognition

Title: Visual Abstract Thinking Empowers Multimodal Reasoning

Title: Long-Context State-Space Video World Models

Title: "KAN you hear me?" Exploring Kolmogorov-Arnold Networks for Spoken Language Understanding

Title: THiNK: Can Large Language Models Think-aloud?

Title: Eradicating the Unseen: Detecting, Exploiting, and Remediating a Path Traversal Vulnerability across GitHub

Title: FunReason: Enhancing Large Language Models' Function Calling via Self-Refinement Multiscale Loss and Automated Data Refinement

Title: Adaptive Classifier-Free Guidance via Dynamic Low-Confidence Masking

Title: Reasoning Is Not All You Need: Examining LLMs for Multi-Turn Mental Health Conversations

Title: PathBench: A comprehensive comparison benchmark for pathology foundation models towards precision oncology

Title: How to Improve the Robustness of Closed-Source Models on NLI

Title: Parameter-Efficient Fine-Tuning with Column Space Projection

Title: Dependency Parsing is More Parameter-Efficient with Normalization

Title: Fine-grained List-wise Alignment for Generative Medication Recommendation

Title: Gradient Flow Matching for Learning Update Dynamics in Neural Network Training

Title: FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models

Title: From What to How: Attributing CLIP's Latent Components Reveals Unexpected Semantic Reliance

Title: Multimodal Federated Learning With Missing Modalities through Feature Imputation Network

Title: Seeing is Believing, but How Much? A Comprehensive Analysis of Verbalized Calibration in Vision-Language Models

Title: DreamPRM: Domain-Reweighted Process Reward Model for Multimodal Reasoning

Title: RedAHD: Reduction-Based End-to-End Automatic Heuristic Design with Large Language Models

Title: It's High Time: A Survey of Temporal Information Retrieval and Question Answering

Title: KnowTrace: Bootstrapping Iterative Retrieval-Augmented Generation with Structured Knowledge Tracing

Title: WXImpactBench: A Disruptive Weather Impact Understanding Benchmark for Evaluating Large Language Models

Title: Position: Mechanistic Interpretability Should Prioritize Feature Consistency in SAEs

Title: AniCrafter: Customizing Realistic Human-Centric Animation via Avatar-Background Conditioning in Video Diffusion Models

Title: Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration

Title: Lifelong Safety Alignment for Language Models

Title: HaloGS: Loose Coupling of Compact Geometry and Gaussian Splats for 3D Scenes

Title: In-Context Brush: Zero-shot Customized Subject Insertion with Context-Aware Latent Space Manipulation

Title: Ground-R1: Incentivizing Grounded Visual Reasoning via Reinforcement Learning

Title: ImgEdit: A Unified Image Editing Dataset and Benchmark

Title: Does quantization affect models' performance on long-context tasks?

Title: OmniCharacter: Towards Immersive Role-Playing Agents with Seamless Speech-Language Personality Interaction

Title: The Coverage Principle: A Framework for Understanding Compositional Generalization

Title: VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction

Title: One-shot Entropy Minimization

Title: MASKSEARCH: A Universal Pre-Training Framework to Enhance Agentic Search Capability

Title: Hierarchical Masked Autoregressive Models with Low-Resolution Token Pivots

Title: OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation

Title: Enhancing the Comprehensibility of Text Explanations via Unsupervised Concept Discovery

Title: Self-reflective Uncertainties: Do LLMs Know Their Internal Answer Distribution?

Title: Reasoning LLMs are Wandering Solution Explorers

Title: DiSA: Diffusion Step Annealing in Autoregressive Image Generation