2025-05-23

Title: Multilinear subspace learning for person re-identification based fusion of high order tensor features

Title: Adaptive Tokenization: On the Hop-Overpriority Problem in Tokenized Graph Learning Models

Title: Generative AI for Autonomous Driving: A Review

Title: How Do Large Vision-Language Models See Text in Image? Unveiling the Distinctive Role of OCR Heads

Title: SCENIR: Visual Semantic Clarity through Unsupervised Scene Graph Retrieval

Title: Satellites Reveal Mobility: A Commuting Origin-destination Flow Generator for Global Cities

Title: Decouple and Orthogonalize: A Data-Free Framework for LoRA Merging

Title: Highlighting What Matters: Promptable Embeddings for Attribute-Focused Image Retrieval

Title: GRIT: Teaching MLLMs to Think with Images

Title: Challenger: Affordable Adversarial Driving Video Generation

Title: Is (Selective) Round-To-Nearest Quantization All You Need?

Title: BR-TaxQA-R: A Dataset for Question Answering with References for Brazilian Personal Income Tax Law, including case law

Title: Extracting Probabilistic Knowledge from Large Language Models for Bayesian Network Parameterization

Title: Aligning Dialogue Agents with Global Feedback via Large Language Model Reward Decomposition

Title: AllMetrics: A Unified Python Library for Standardized Metric Evaluation and Robust Data Validation in Machine Learning

Title: MoRE-Brain: Routed Mixture of Experts for Interpretable and Generalizable Cross-Subject fMRI Visual Decoding

Title: Citation Parsing and Analysis with Language Models

Title: Training Step-Level Reasoning Verifiers with Formal Verification Tools

Title: OViP: Online Vision-Language Preference Learning

Title: Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning

Title: Analyzing Hierarchical Structure in Vision Models with Sparse Autoencoders

Title: Explaining Puzzle Solutions in Natural Language: An Exploratory Study on 6x6 Sudoku

Title: Domain Adaptive Skin Lesion Classification via Conformal Ensemble of Vision Transformers

Title: Image-to-Image Translation with Diffusion Transformers and CLIP-Based Image Conditioning

Title: Causal Interventions Reveal Shared Structure Across English Filler-Gap Constructions

Title: SLMEval: Entropy-Based Calibration for Human-Aligned Evaluation of Large Language Models

Title: Interpretability Illusions with Sparse Autoencoders: Evaluating Robustness of Concept Representations

Title: LAGO: Few-shot Crosslingual Embedding Inversion Attacks via Language Similarity-Aware Graph Optimization

Title: Ranking Free RAG: Replacing Re-ranking with Selection in RAG for Sensitive Domains

Title: NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning

Title: Prototypical Human-AI Collaboration Behaviors from LLM-Assisted Writing in the Wild

Title: Toward Theoretical Insights into Diffusion Trajectory Distillation via Operator Merging

Title: CP-LLM: Context and Pixel Aware Large Language Model for Video Quality Assessment

Title: An Approach Towards Identifying Bangladeshi Leaf Diseases through Transfer Learning and XAI

Title: Equivariant Eikonal Neural Networks: Grid-Free, Scalable Travel-Time Prediction on Homogeneous Spaces

Title: OpenEthics: A Comprehensive Ethical Evaluation of Open-Source Generative Large Language Models

Title: An Exploratory Approach Towards Investigating and Explaining Vision Transformer and Transfer Learning for Brain Disease Detection

Title: Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models

Title: Mesh-free sparse identification of nonlinear dynamics

Title: Few-Shot Test-Time Optimization Without Retraining for Semiconductor Recipe Generation and Beyond

Title: Internal and External Impacts of Natural Language Processing Papers

Title: Small Language Models in the Real World: Insights from Industrial Text Classification

Title: BiasLab: Toward Explainable Political Bias Detection with Dual-Axis Annotations and Rationale Indicators

Title: Date Fragments: A Hidden Bottleneck of Tokenization for Temporal Reasoning

Title: A Survey of Large Language Models for Text-Guided Molecular Discovery: from Molecule Generation to Optimization

Title: Continually Self-Improving Language Models for Bariatric Surgery Question--Answering

Title: MPL: Multiple Programming Languages with Large Language Models for Information Extraction

Title: Extensible Post Quantum Cryptography Based Authentication

Title: Tools in the Loop: Quantifying Uncertainty of LLM Question Answering Systems That Use Tools

Title: A Generic Framework for Conformal Fairness

Title: Semiotic Reconstruction of Destination Expectation Constructs An LLM-Driven Computational Paradigm for Social Media Tourism Analytics

Title: Plan and Budget: Effective and Efficient Test-Time Scaling on Large Language Model Reasoning

Title: KoBALT: Korean Benchmark For Advanced Linguistic Tasks

Title: Robust Invariant Representation Learning by Distribution Extrapolation

Title: LLMs Are Not Scorers: Rethinking MT Evaluation with Generation-Based Methods

Title: Scalable Graph Generative Modeling via Substructure Sequences

Title: Position of Uncertainty: A Cross-Linguistic Study of Positional Bias in Large Language Models

Title: Outsourcing SAT-based Verification Computations in Network Security

Title: Multimodal Online Federated Learning with Modality Missing in Internet of Things

Title: Distilling the Implicit Multi-Branch Structure in LLMs' Reasoning via Reinforcement Learning

Title: GMatch: Geometry-Constrained Feature Matching for RGB-D Object Pose Estimation

Title: When VLMs Meet Image Classification: Test Sets Renovation via Missing Label Identification

Title: BadDepth: Backdoor Attacks Against Monocular Depth Estimation in the Physical World

Title: Breaking Complexity Barriers: High-Resolution Image Restoration with Rank Enhanced Linear Attention

Title: Why Can Accurate Models Be Learned from Inaccurate Annotations?

Title: EduBench: A Comprehensive Benchmarking Dataset for Evaluating Large Language Models in Diverse Educational Scenarios

Title: KNN-SSD: Enabling Dynamic Self-Speculative Decoding via Nearest Neighbor Layer Set Optimization

Title: Can LLMs Simulate Human Behavioral Variability? A Case Study in the Phonemic Fluency Task

Title: RE-TRIP : Reflectivity Instance Augmented Triangle Descriptor for 3D Place Recognition

Title: TRAIL: Transferable Robust Adversarial Images via Latent diffusion

Title: When Do LLMs Admit Their Mistakes? Understanding the Role of Model Belief in Retraction

Title: Automated Feedback Loops to Protect Text Simplification with Generative AI from Information Loss

Title: Erased or Dormant? Rethinking Concept Erasure Through Reversibility

Title: Understanding Fact Recall in Language Models: Why Two-Stage Training Encourages Memorization but Mixed Training Teaches Knowledge

Title: Redemption Score: An Evaluation Framework to Rank Image Captions While Redeeming Image Semantics and Language Pragmatics

Title: Understanding Generative AI Capabilities in Everyday Image Editing Tasks

Title: SAE-SSV: Supervised Steering in Sparse Representation Spaces for Reliable Control of Language Models

Title: Enhancing Federated Survival Analysis through Peer-Driven Client Reputation in Healthcare

Title: VLM-R$^3$: Region Recognition, Reasoning, and Refinement for Enhanced Multimodal Chain-of-Thought

Title: An Empirical Study on Configuring In-Context Learning Demonstrations for Unleashing MLLMs' Sentimental Perception Capability

Title: VIVID: A Novel Approach to Remediation Prioritization in Static Application Security Testing (SAST)

Title: NQKV: A KV Cache Quantization Scheme Based on Normal Distribution Characteristics

Title: Large Language Models based ASR Error Correction for Child Conversations

Title: A Scalable Hierarchical Intrusion Detection System for Internet of Vehicles

Title: Memorization or Reasoning? Exploring the Idiom Understanding of LLMs

Title: Don't Judge Code by Its Cover: Exploring Biases in LLM Judges for Code Evaluation

Title: Realistic Evaluation of TabPFN v2 in Open Environments

Title: MuseRAG: Idea Originality Scoring At Scale

Title: LIFEBench: Evaluating Length Instruction Following in Large Language Models

Title: Align-GRAG: Reasoning-Guided Dual Alignment for Graph Retrieval-Augmented Generation

Title: DOVE: Efficient One-Step Diffusion Model for Real-World Video Super-Resolution

Title: Three Minds, One Legend: Jailbreak Large Reasoning Model with Adaptive Stacked Ciphers

Title: Verifying Differentially Private Median Estimation

Title: Does Localization Inform Unlearning? A Rigorous Examination of Local Parameter Attribution for Knowledge Unlearning in Language Models

Title: Swin Transformer for Robust CGI Images Detection: Intra- and Inter-Dataset Analysis across Multiple Color Spaces

Title: DualComp: End-to-End Learning of a Unified Dual-Modality Lossless Compressor

Title: Interpretable Anomaly Detection in Encrypted Traffic Using SHAP with Machine Learning Models

Title: All You Need is "Leet": Evading Hate-speech Detection AI

Title: LINEA: Fast and Accurate Line Detection Using Scalable Transformers

Title: Think-RM: Enabling Long-Horizon Reasoning in Generative Reward Models

Title: Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning

Title: Spontaneous Speech Variables for Evaluating LLMs Cognitive Plausibility

Title: DriveMoE: Mixture-of-Experts for Vision-Language-Action Model in End-to-End Autonomous Driving

Title: HiMATE: A Hierarchical Multi-Agent Framework for Machine Translation Evaluation

Title: ARPO:End-to-End Policy Optimization for GUI Agents with Experience Replay

Title: Efficient Prototype Consistency Learning in Medical Image Segmentation via Joint Uncertainty and Data Augmentation

Title: Only Large Weights (And Not Skip Connections) Can Prevent the Perils of Rank Collapse

Title: Fairness under Competition

Title: Augmenting LLM Reasoning with Dynamic Notes Writing for Complex QA

Title: ToDi: Token-wise Distillation via Fine-Grained Divergence Control

Title: Poster: Towards an Automated Security Testing Framework for Industrial UEs

Title: INFERENCEDYNAMICS: Efficient Routing Across LLMs through Structured Capability and Knowledge Profiling

Title: SAMba-UNet: Synergizing SAM2 and Mamba in UNet with Heterogeneous Aggregation for Cardiac MRI Segmentation

Title: PMPO: Probabilistic Metric Prompt Optimization for Small and Large Language Models

Title: CAIFormer: A Causal Informed Transformer for Multivariate Time Series Forecasting

Title: Paired and Unpaired Image to Image Translation using Generative Adversarial Networks

Title: Accelerating Targeted Hard-Label Adversarial Attacks in Low-Query Black-Box Settings

Title: NTIRE 2025 challenge on Text to Image Generation Model Quality Assessment

Title: SuperPure: Efficient Purification of Localized and Distributed Adversarial Patches via Super-Resolution GAN Models

Title: FreshRetailNet-50K: A Stockout-Annotated Censored Demand Dataset for Latent Demand Recovery and Forecasting in Fresh Retail

Title: Efficient Motion Prompt Learning for Robust Visual Tracking

Title: TensorAR: Refinement is All You Need in Autoregressive Image Generation

Title: CLEAR: A Clinically-Grounded Tabular Framework for Radiology Report Evaluation

Title: ChemMLLM: Chemical Multimodal Large Language Model

Title: SC4ANM: Identifying Optimal Section Combinations for Automated Novelty Prediction in Academic Papers

Title: Understanding Differential Transformer Unchains Pretrained Self-Attentions

Title: Panoptic Captioning: Seeking An Equivalency Bridge for Image and Text

Title: FPQVAR: Floating Point Quantization for Visual Autoregressive Model with FPGA Hardware Co-design

Title: Fusion of Foundation and Vision Transformer Model Features for Dermatoscopic Image Classification

Title: Improving Chemical Understanding of LLMs via SMILES Parsing

Title: Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized Assistance

Title: Style Transfer with Diffusion Models for Synthetic-to-Real Domain Adaptation

Title: AdamS: Momentum Itself Can Be A Normalizer for LLM Pretraining and Post-training

Title: A collaborative constrained graph diffusion model for the generation of realistic synthetic molecules

Title: ReCopilot: Reverse Engineering Copilot in Binary Analysis

Title: SATURN: SAT-based Reinforcement Learning to Unleash Language Model Reasoning

Title: Privacy-Aware Cyberterrorism Network Analysis using Graph Neural Networks and Federated Learning

Title: Temporal and Spatial Feature Fusion Framework for Dynamic Micro Expression Recognition

Title: DeCafNet: Delegate and Conquer for Efficient Temporal Grounding in Long Videos

Title: PaTH Attention: Position Encoding via Accumulating Householder Transformations

Title: Semantic Pivots Enable Cross-Lingual Transfer in Large Language Models

Title: Omni TM-AE: A Scalable and Interpretable Embedding Model Using the Full Tsetlin Machine State Space

Title: Resource for Error Analysis in Text Simplification: New Taxonomy and Test Collection

Title: Consistent and Compatible Modelling of Cyber Intrusions and Incident Response Demonstrated in the Context of Malware Attacks on Critical Infrastructure

Title: Sketchy Bounding-box Supervision for 3D Instance Segmentation

Title: AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning

Title: Divide-Fuse-Conquer: Eliciting "Aha Moments" in Multi-Scenario Games

Title: AdvReal: Adversarial Patch Generation Framework with Application to Adversarial Safety Evaluation of Object Detection Systems

Title: Performance Guaranteed Poisoning Attacks in Federated Learning: A Sliding Mode Approach

Title: From Surveys to Narratives: Rethinking Cultural Value Adaptation in LLMs

Title: Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning

Title: Pose-invariant face recognition via feature-space pose frontalization

Title: Attributing Response to Context: A Jensen-Shannon Divergence Driven Mechanistic Study of Context Attribution in Retrieval-Augmented Generation

Title: Circle-RoPE: Cone-like Decoupled Rotary Positional Embedding for Large Vision-Language Models

Title: WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning

Title: $I^2G$: Generating Instructional Illustrations via Text-Conditioned Diffusion

Title: Beyond Static Testbeds: An Interaction-Centric Agent Simulation Platform for Dynamic Recommender Systems

Title: Password Strength Detection via Machine Learning: Analysis, Modeling, and Evaluation

Title: Implicit Jailbreak Attacks via Cross-Modal Information Concealment on Vision-Language Models

Title: TAT-VPR: Ternary Adaptive Transformer for Dynamic and Efficient Visual Place Recognition

Title: CMRINet: Joint Groupwise Registration and Segmentation for Cardiac Function Quantification from Cine-MRI

Title: MAGIC: Motion-Aware Generative Inference via Confidence-Guided LLM

Title: University of Indonesia at SemEval-2025 Task 11: Evaluating State-of-the-Art Encoders for Multi-Label Emotion Detection

Title: AnchorFormer: Differentiable Anchor Attention for Efficient Vision Transformer

Title: Reading Between the Prompts: How Stereotypes Shape LLM's Implicit Personalization

Title: Consistent World Models via Foresight Diffusion

Title: Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning

Title: LLaMAs Have Feelings Too: Unveiling Sentiment and Emotion Representations in LLaMA Models Through Probing

Title: Accuracy vs. Accuracy: Computational Tradeoffs Between Classification Rates and Utility

Title: ALTo: Adaptive-Length Tokenizer for Autoregressive Mask Generation

Title: Language-based Security and Time-inserting Supervisor

Title: Beyond Face Swapping: A Diffusion-Based Digital Human Benchmark for Multimodal Deepfake Detection

Title: Detailed Evaluation of Modern Machine Learning Approaches for Optic Plastics Sorting

Title: AppealCase: A Dataset and Benchmark for Civil Case Appeal Scenarios

Title: Computing Exact Shapley Values in Polynomial Time for Product-Kernel Methods

Title: Are the Hidden States Hiding Something? Testing the Limits of Factuality-Encoding Capabilities in LLMs

Title: Benchmarking and Pushing the Multi-Bias Elimination Boundary of LLMs via Causal Effect Estimation-guided Debiasing

Title: CodeMerge: Codebook-Guided Model Merging for Robust Test-Time Adaptation in Autonomous Driving

Title: EnSToM: Enhancing Dialogue Systems with Entropy-Scaled Steering Vectors for Topic Maintenance

Title: Joint Relational Database Generation via Graph-Conditional Diffusion Models

Title: DuFFin: A Dual-Level Fingerprinting Framework for LLMs IP Protection

Title: SHaDe: Compact and Consistent Dynamic 3D Reconstruction via Tri-Plane Deformation and Latent Diffusion

Title: Mechanistic Understanding and Mitigation of Language Confusion in English-Centric Large Language Models

Title: TextureSAM: Towards a Texture Aware Foundation Model for Segmentation

Title: Incremental Sequence Classification with Temporal Consistency

Title: Towards Coordinate- and Dimension-Agnostic Machine Learning for Partial Differential Equations

Title: Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains

Title: CTRAP: Embedding Collapse Trap to Safeguard Large Language Models from Harmful Fine-Tuning

Title: Auto-nnU-Net: Towards Automated Medical Image Segmentation

Title: A Two-Stage Data Selection Framework for Data-Efficient Model Training on Edge Devices

Title: M2SVid: End-to-End Inpainting and Refinement for Monocular-to-Stereo Video Conversion

Title: ScholarBench: A Bilingual Benchmark for Abstraction, Comprehension, and Reasoning Evaluation in Academic Contexts

Title: Finetuning-Activated Backdoors in LLMs

Title: URLs Help, Topics Guide: Understanding Metadata Utility in LLM Training

Title: EMULATE: A Multi-Agent Framework for Determining the Veracity of Atomic Claims by Emulating Human Actions

Title: Large Language Model-Empowered Interactive Load Forecasting

Title: O$^2$-Searcher: A Searching-based Agent Model for Open-Domain Open-Ended Question Answering

Title: Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering

Title: Temporal Object Captioning for Street Scene Videos from LiDAR Tracks

Title: Decoupled Geometric Parameterization and its Application in Deep Homography Estimation

Title: MEgoHand: Multimodal Egocentric Hand-Object Interaction Motion Generation

Title: From Generic Empathy to Personalized Emotional Support: A Self-Evolution Framework for User Preference Alignment

Title: Steering Large Language Models for Machine Translation Personalization

Title: Energy Consumption Framework and Analysis of Post-Quantum Key-Generation on Embedded Devices

Title: CausalDynamics: A large-scale benchmark for structural discovery of dynamical causal models

Title: Background Matters: A Cross-view Bidirectional Modeling Framework for Semi-supervised Medical Image Segmentation

Title: Towards Texture- And Shape-Independent 3D Keypoint Estimation in Birds

Title: SSR-Zero: Simple Self-Rewarding Reinforcement Learning for Machine Translation

Title: Reconsidering Fairness Through Unawareness from the Perspective of Model Multiplicity

Title: BadVLA: Towards Backdoor Attacks on Vision-Language-Action Models via Objective-Decoupled Optimization

Title: From Evaluation to Defense: Advancing Safety in Video Large Language Models

Title: Point, Detect, Count: Multi-Task Medical Image Understanding with Instruction-Tuned Vision-Language Models

Title: Collaboration among Multiple Large Language Models for Medical Question Answering

Title: Unsupervised Network Anomaly Detection with Autoencoders and Traffic Images

Title: Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding

Title: SD-MAD: Sign-Driven Few-shot Multi-Anomaly Detection in Medical Images

Title: A Japanese Language Model and Three New Evaluation Benchmarks for Pharmaceutical NLP

Title: End-to-End Framework for Predicting the Remaining Useful Life of Lithium-Ion Batteries

Title: BitHydra: Towards Bit-flip Inference Cost Attack against Large Language Models

Title: R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO

Title: Zero-Shot Anomaly Detection in Battery Thermal Images Using Visual Question Answering with Prior Knowledge

Title: Semantic Compression of 3D Objects for Open and Collaborative Virtual Worlds

Title: Learning Genomic Structure from $k$-mers

Title: One-Step Diffusion-Based Image Compression with Semantic Distillation

Title: Your Pre-trained LLM is Secretly an Unsupervised Confidence Calibrator

Title: Beyond Induction Heads: In-Context Meta Learning Induces Multi-Phase Circuit Emergence

Title: Locate-then-Merge: Neuron-Level Parameter Fusion for Mitigating Catastrophic Forgetting in Multimodal LLMs

Title: An Analysis of Concept Bottleneck Models: Measuring, Understanding, and Mitigating the Impact of Noisy Annotations

Title: KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models

Title: Training Long-Context LLMs Efficiently via Chunk-wise Optimization

Title: Breaking mBad! Supervised Fine-tuning for Cross-Lingual Detoxification

Title: Robust LLM Fingerprinting via Domain-Specific Watermarks

Title: Advancing Brainwave Modeling with a Codebook-Based Foundation Model

Title: Masked Conditioning for Deep Generative Models

Title: Forward-only Diffusion Probabilistic Models

Title: Maximum Total Correlation Reinforcement Learning

Title: Mitigating Fine-tuning Risks in LLMs via Safety-Aware Probing Optimization

Title: Robust Vision-Based Runway Detection through Conformal Prediction and Conformal mAP

Title: TRIM: Achieving Extreme Sparsity with Targeted Row-wise Iterative Metric-driven Pruning

Title: PyTupli: A Scalable Infrastructure for Collaborative Offline Reinforcement Learning Projects

Title: Representation Discrepancy Bridging Method for Remote Sensing Image-Text Retrieval

Title: When Safety Detectors Aren't Enough: A Stealthy and Effective Jailbreak Attack on LLMs via Steganographic Techniques

Title: Mitigating Overfitting in Medical Imaging: Self-Supervised Pretraining vs. ImageNet Transfer Learning for Dermatological Diagnosis

Title: IFEval-Audio: Benchmarking Instruction-Following Capability in Audio-based Large Language Models

Title: Single Domain Generalization for Few-Shot Counting via Universal Representation Matching

Title: Reasoning Beyond Language: A Comprehensive Survey on Latent Chain-of-Thought Reasoning

Title: CoTSRF: Utilize Chain of Thought as Stealthy and Robust Fingerprint of Large Language Models

Title: FlowMixer: A Constrained Neural Architecture for Interpretable Spatiotemporal Forecasting

Title: Accidental Misalignment: Fine-Tuning Language Models Induces Unexpected Vulnerability

Title: Learning Flexible Forward Trajectories for Masked Molecular Diffusion

Title: Cohort-Based Active Modality Acquisition

Title: REPA Works Until It Doesn't: Early-Stopped, Holistic Alignment Supercharges Diffusion Training

Title: REOBench: Benchmarking Robustness of Earth Observation Foundation Models

Title: V2V: Scaling Event-Based Vision through Efficient Video-to-Voxel Simulation

Title: Learning Beyond Limits: Multitask Learning and Synthetic Data for Low-Resource Canonical Morpheme Segmentation

Title: SOLVE: Synergy of Language-Vision and End-to-End Networks for Autonomous Driving

Title: Two-way Evidence self-Alignment based Dual-Gated Reasoning Enhancement

Title: Hypergraph Tversky-Aware Domain Incremental Learning for Brain Tumor Segmentation with Missing Modalities

Title: Does Synthetic Data Help Named Entity Recognition for Low-Resource Languages?

Title: Action2Dialogue: Generating Character-Centric Narratives from Scene-Level Prompts

Title: Unlearning Isn't Deletion: Investigating Reversibility of Machine Unlearning in LLMs

Title: SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis

Title: R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search

Title: LaViDa: A Large Diffusion Language Model for Multimodal Understanding

Title: ATR-Bench: A Federated Learning Benchmark for Adaptation, Trust, and Reasoning

Title: Redefining Clustered Federated Learning for System Identification: The Path of ClusterCraft

Title: Conditional Panoramic Image Generation via Masked Autoregressive Modeling

Title: Training-Free Efficient Video Generation via Dynamic Token Carving

Title: MPO: Multilingual Safety Alignment via Reward Gap Optimization

Title: A Multi-Step Comparative Framework for Anomaly Detection in IoT Data Streams

Title: T2I-ConBench: Text-to-Image Benchmark for Continual Post-training

Title: CASTILLO: Characterizing Response Length Distributions of Large Language Models

Title: CAIN: Hijacking LLM-Humans Conversations via a Two-Stage Malicious System Prompt Generation and Refining Framework

Title: Shadows in the Attention: Contextual Perturbation and Representation Drift in the Dynamics of Hallucination in LLMs

Title: Power-Law Decay Loss for Large Language Model Finetuning: Focusing on Information Sparsity to Enhance Generation Quality

Title: Unsupervised Prompting for Graph Neural Networks

Title: Backdoor Cleaning without External Guidance in MLLM Fine-tuning

Title: Scalable and Interpretable Contextual Bandits: A Literature Review and Retail Offer Prototype

Title: UNCLE: Uncertainty Expressions in Long-Form Generation

Title: LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning

Title: In-Context Watermarks for Large Language Models

Title: SPAR: Self-supervised Placement-Aware Representation Learning for Multi-Node IoT Systems

Title: FoMoH: A clinically meaningful foundation model evaluation for structured electronic health records

Title: MixAT: Combining Continuous and Discrete Adversarial Training for LLMs

Title: Bottlenecked Transformers: Periodic KV Cache Abstraction for Generalised Reasoning

Title: A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial Optimization

Title: Invisible Prompts, Visible Threats: Malicious Font Injection in External Resources for Large Language Models

Title: Bigger Isn't Always Memorizing: Early Stopping Overparameterized Diffusion Models

Title: BP-Seg: A graphical model approach to unsupervised and non-contiguous text segmentation using belief propagation

Title: UniPhy: Learning a Unified Constitutive Model for Inverse Physics Simulation

Title: OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning

Title: Creatively Upscaling Images with Global-Regional Priors

Title: Incorporating Visual Correspondence into Diffusion Model for Virtual Try-On

Title: Pursuing Temporal-Consistent Video Virtual Try-On via Dynamic Pose Interaction

Title: LLM as Effective Streaming Processor: Bridging Streaming-Batch Mismatches with Group Position Encoding

Title: UFT: Unifying Supervised and Reinforcement Fine-Tuning

Title: Extremely Simple Multimodal Outlier Synthesis for Out-of-Distribution Detection and Segmentation

Title: T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning

Title: MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent Systems

Title: Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding

Title: Native Segmentation Vision Transformers

Title: DecoupledESC: Enhancing Emotional Support Generation via Strategy-Response Decoupled Preference Optimization

Title: Do Large Language Models Excel in Complex Logical Reasoning with Formal Language?

Title: Guided Diffusion Sampling on Function Spaces with Applications to PDEs

Title: R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning

Title: CoMo: Learning Continuous Latent Motion from Internet Videos for Scalable Robot Learning

Title: Deep mineralogical segmentation of thin section images based on QEMSCAN maps

Title: Understanding Prompt Tuning and In-Context Learning via Meta-Learning

Title: Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space

Title: SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding

Title: When Are Concepts Erased From Diffusion Models?

Title: Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models

Title: Interactive Post-Training for Vision-Language-Action Models

Title: Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO

Title: SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward

Title: Let Androids Dream of Electric Sheep: A Human-like Image Implication Understanding and Reasoning Framework

Title: CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms