2025-05-30

Title: SlimLLM: Accurate Structured Pruning for Large Language Models

Title: MoRE: A Mixture of Low-Rank Experts for Adaptive Multi-Task Learning

Title: LLM-ODDR: A Large Language Model Framework for Joint Order Dispatching and Driver Repositioning

Title: When Does Neuroevolution Outcompete Reinforcement Learning in Transfer Learning Tasks?

Title: Update Your Transformer to the Latest Release: Re-Basin of Task Vectors

Title: Private Rate-Constrained Optimization with Applications to Fair Learning

Title: Training Language Models to Generate Quality Code with Program Analysis Feedback

Title: HiDream-I1: A High-Efficient Image Generative Foundation Model with Sparse Diffusion Transformer

Title: TensorShield: Safeguarding On-Device Inference by Shielding Critical DNN Tensors with TEE

Title: Climate Finance Bench

Title: Pre-Training Curriculum for Multi-Token Prediction in Language Models

Title: FlashFormer: Whole-Model Kernels for Efficient Low-Batch Inference

Title: FAMA: The First Large-Scale Open-Science Speech Foundation Model for English and Italian

Title: MIAS-SAM: Medical Image Anomaly Segmentation without thresholding

Title: Automated Essay Scoring Incorporating Annotations from Automated Feedback Systems

Title: Machine Learning Models Have a Supply Chain Problem

Title: Can Large Language Models Match the Conclusions of Systematic Reviews?

Title: Rhetorical Text-to-Image Generation via Two-layer Diffusion Policy Optimization

Title: Efficient Preimage Approximation for Neural Network Certification

Title: Towards a More Generalized Approach in Open Relation Extraction

Title: Preference Learning with Response Time

Title: Self-Critique and Refinement for Faithful Natural Language Explanations

Title: PGLearn -- An Open-Source Learning Toolkit for Optimal Power Flow

Title: What Has Been Lost with Synthetic Evaluation?

Title: How Do Diffusion Models Improve Adversarial Robustness?

Title: Development and Validation of SXI++ LNM Algorithm for Sepsis Prediction

Title: Kernel-Smoothed Scores for Denoising Diffusion: A Bias-Variance Study

Title: Bayesian Attention Mechanism: A Probabilistic Framework for Positional Encoding and Context Length Extrapolation

Title: Security Benefits and Side Effects of Labeling AI-Generated Images

Title: RocqStar: Leveraging Similarity-driven Retrieval and Agentic Systems for Rocq generation

Title: Improving Contrastive Learning for Referring Expression Counting

Title: Operationalizing CaMeL: Strengthening LLM Defenses for Enterprise Deployment

Title: CLIPGaussian: Universal and Multimodal Style Transfer Based on Gaussian Splatting

Title: IRS: Incremental Relationship-guided Segmentation for Digital Pathology

Title: A Probabilistic Jump-Diffusion Framework for Open-World Egocentric Activity Recognition

Title: Permissioned LLMs: Enforcing Access Control in Large Language Models

Title: Scaling Offline RL via Efficient and Expressive Shortcut Models

Title: GateNLP at SemEval-2025 Task 10: Hierarchical Three-Step Prompting for Multilingual Narrative Classification

Title: CFP-Gen: Combinatorial Functional Protein Generation via Diffusion Language Models

Title: BugWhisperer: Fine-Tuning LLMs for SoC Hardware Vulnerability Detection

Title: Smart Surrogate Losses for Contextual Stochastic Linear Optimization with Robust Constraints

Title: VIGNETTE: Socially Grounded Bias Evaluation for Vision-Language Models

Title: Talent or Luck? Evaluating Attribution Bias in Large Language Models

Title: cadrille: Multi-modal CAD Reconstruction with Online Reinforcement Learning

Title: Re-ttention: Ultra Sparse Visual Generation via Attention Statistical Reshape

Title: ER-REASON: A Benchmark Dataset for LLM-Based Clinical Reasoning in the Emergency Room

Title: Structured Memory Mechanisms for Stable Context Representation in Large Language Models

Title: Scalable Parameter and Memory Efficient Pretraining for LLM: Recent Algorithmic Advances and Benchmarking

Title: Leveraging Diffusion Models for Synthetic Data Augmentation in Protein Subcellular Localization Classification

Title: Unraveling LoRA Interference: Orthogonal Subspaces for Robust Model Merging

Title: Is Noise Conditioning Necessary? A Unified Theory of Unconditional Graph Diffusion Models

Title: Improving QA Efficiency with DistilBERT: Fine-Tuning and Inference on mobile Intel CPUs

Title: Fast Isotropic Median Filtering

Title: WorkForceAgent-R1: Incentivizing Reasoning Capability in LLM-based Web Agents via Reinforcement Learning

Title: Can LLMs Deceive CLIP? Benchmarking Adversarial Compositionality of Pre-trained Multimodal Representation via Text Updates

Title: ATI: Any Trajectory Instruction for Controllable Video Generation

Title: OWL: Probing Cross-Lingual Recall of Memorized Texts via World Literature

Title: NegVQA: Can Vision Language Models Understand Negation?

Title: Directed Graph Grammars for Sequence-based Learning

Title: StrucSum: Graph-Structured Reasoning for Long Document Extractive Summarization with LLMs

Title: LLMs for Argument Mining: Detection, Extraction, and Relationship Classification of pre-defined Arguments in Online Comments

Title: LLM-based HSE Compliance Assessment: Benchmark, Performance, and Advancements

Title: ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind

Title: Exploring Scaling Laws for EHR Foundation Models

Title: MermaidFlow: Redefining Agentic Workflow Generation via Safety-Constrained Evolutionary Programming

Title: EquiReg: Equivariance Regularized Diffusion for Inverse Problems

Title: HyperMotion: DiT-Based Pose-Guided Human Image Animation of Complex Motions

Title: Pose-free 3D Gaussian splatting via shape-ray estimation

Title: MOVi: Training-free Text-conditioned Multi-Object Video Generation

Title: A Computational Approach to Improving Fairness in K-means Clustering

Title: Verify-in-the-Graph: Entity Disambiguation Enhancement for Complex Claim Verification with Interactive Graph Representation

Title: LLM Agents for Bargaining with Utility-based Feedback

Title: DyePack: Provably Flagging Test Set Contamination in LLMs Using Backdoors

Title: Hybrid Cross-domain Robust Reinforcement Learning

Title: QLIP: A Dynamic Quadtree Vision Prior Enhances MLLM Performance Without Retraining

Title: A Practical Approach for Building Production-Grade Conversational Agents with Workflow Graphs

Title: EmergentTTS-Eval: Evaluating TTS Models on Complex Prosodic, Expressiveness, and Linguistic Challenges Using Model-as-a-Judge

Title: SeG-SR: Integrating Semantic Knowledge into Remote Sensing Image Super-Resolution via Vision-Language Model

Title: Scalable Complexity Control Facilitates Reasoning Ability of LLMs

Title: Hyperbolic-PDE GNN: Spectral Graph Neural Networks in the Perspective of A System of Hyperbolic Partial Differential Equations

Title: Detecting Stealthy Backdoor Samples based on Intra-class Distance for Large Language Models

Title: $K^2$VAE: A Koopman-Kalman Enhanced Variational AutoEncoder for Probabilistic Time Series Forecasting

Title: AgentAlign: Navigating Safety Alignment in the Shift from Informative to Agentic Large Language Models

Title: SCORPIO: Serving the Right Requests at the Right Time for Heterogeneous SLOs in LLM Inference

Title: An Empirical Study of Federated Prompt Learning for Vision Language Model

Title: Context Robust Knowledge Editing for Language Models

Title: Diverse Prototypical Ensembles Improve Robustness to Subpopulation Shift

Title: Can Modern NLP Systems Reliably Annotate Chest Radiography Exams? A Pre-Purchase Evaluation and Comparative Study of Solutions from AWS, Google, Azure, John Snow Labs, and Open-Source Models on an Independent Pediatric Dataset

Title: Towards Privacy-Preserving Fine-Grained Visual Classification via Hierarchical Learning from Label Proportions

Title: Improving Multilingual Social Media Insights: Aspect-based Comment Analysis

Title: EL4NER: Ensemble Learning for Named Entity Recognition via Multiple Small-Parameter Large Language Models

Title: Deep Modeling and Optimization of Medical Image Classification

Title: From Theory to Application: Fine-Tuning Large EEG Model with Real-World Stress Data

Title: ProDiff: Prototype-Guided Diffusion for Minimal Information Trajectory Imputation

Title: DenoiseRotator: Enhance Pruning Robustness for LLMs via Importance Concentration

Title: Query Routing for Retrieval-Augmented Language Models

Title: Zero-P-to-3: Zero-Shot Partial-View Images to 3D Object

Title: CDR-Agent: Intelligent Selection and Execution of Clinical Decision Rules Using Large Language Model Agents

Title: DINGO: Constrained Inference for Diffusion LLMs

Title: Loss-Guided Model Sharing and Local Learning Correction in Decentralized Federated Learning for Crop Disease Classification

Title: SNS-Bench-VL: Benchmarking Multimodal Large Language Models in Social Networking Services

Title: GeoMan: Temporally Consistent Human Geometry Estimation using Image-to-Video Diffusion

Title: Equivariant Spherical Transformer for Efficient Molecular Modeling

Title: LeMoRe: Learn More Details for Lightweight Semantic Segmentation

Title: MAP: Revisiting Weight Decomposition for Low-Rank Adaptation

Title: Learning to Search for Vehicle Routing with Multiple Time Windows

Title: EAD: An EEG Adapter for Automated Classification

Title: Generating Diverse Training Samples for Relation Extraction with Large Language Models

Title: Dataset Cartography for Large Language Model Alignment: Mapping and Diagnosing Preference Data

Title: Diffusion-Based Generative Models for 3D Occupancy Prediction in Autonomous Driving

Title: Decom-Renorm-Merge: Model Merging on the Right Space Improves Multitasking

Title: Elicit and Enhance: Advancing Multimodal Reasoning in Medical Scenarios

Title: TextSR: Diffusion Super-Resolution with Multilingual OCR Guidance

Title: MMGT: Motion Mask Guided Two-Stage Network for Co-Speech Gesture Video Generation

Title: ContextQFormer: A New Context Modeling Method for Multi-Turn Multi-Modal Conversations

Title: PBEBench: A Multi-Step Programming by Examples Reasoning Benchmark inspired by Historical Linguistics

Title: HMAD: Advancing E2E Driving with Anchored Offset Proposals and Simulation-Supervised Multi-target Scoring

Title: Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editing

Title: VERINA: Benchmarking Verifiable Code Generation

Title: Enhancing Large Language Models'Machine Translation via Dynamic Focus Anchoring

Title: FlowAlign: Trajectory-Regularized, Inversion-Free Flow-based Image Editing

Title: Cross-Domain Bilingual Lexicon Induction via Pretrained Language Models

Title: Bigger, Regularized, Categorical: High-Capacity Value Functions are Efficient Multi-Task Learners

Title: PreFM: Online Audio-Visual Event Parsing via Predictive Future Modeling

Title: Implicit Inversion turns CLIP into a Decoder

Title: Best Arm Identification with Possibly Biased Offline Data

Title: Tell, Don't Show: Leveraging Language Models' Abstractive Retellings to Model Literary Themes

Title: RoboTransfer: Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer

Title: Map&Make: Schema Guided Text to Table Generation

Title: The Panaceas for Improving Low-Rank Decomposition in Communication-Efficient Federated Learning

Title: Infinite-Instruct: Synthesizing Scaling Code instruction Data with Bidirectional Synthesis and Static Verification

Title: DIP-R1: Deep Inspection and Perception with RL Looking Through and Understanding Complex Scenes

Title: FreRA: A Frequency-Refined Augmentation for Contrastive Learning on Time Series Classification

Title: FSL-SAGE: Accelerating Federated Split Learning via Smashed Activation Gradient Estimation

Title: Unsupervised Word-level Quality Estimation for Machine Translation Through the Lens of Annotators (Dis)agreement

Title: Two Is Better Than One: Rotations Scale LoRAs

Title: HiGarment: Cross-modal Harmony Based Diffusion Model for Flat Sketch to Realistic Garment Image

Title: Cross-Task Experiential Learning on LLM-based Multi-Agent Collaboration

Title: ExpeTrans: LLMs Are Experiential Transfer Learners

Title: Fooling the Watchers: Breaking AIGC Detectors via Semantic Prompt Attacks

Title: Language-guided Learning for Object Detection Tackling Multiple Variations in Aerial Images

Title: Beyond Zero Initialization: Investigating the Impact of Non-Zero Initialization on LoRA Fine-Tuning Dynamics

Title: WTEFNet: Real-Time Low-Light Object Detection for Advanced Driver-Assistance Systems

Title: HyperPointFormer: Multimodal Fusion in 3D Space with Dual-Branch Cross-Attention Transformers

Title: Navigating the Accuracy-Size Trade-Off with Flexible Model Merging

Title: Daunce: Data Attribution through Uncertainty Estimation

Title: MMBoundary: Advancing MLLM Knowledge Boundary Awareness through Reasoning Step Confidence Calibration

Title: Generalizability vs. Counterfactual Explainability Trade-Off

Title: MCTSr-Zero: Self-Reflective Psychological Counseling Dialogues Generation via Principles and Adaptive Exploration

Title: ChartMind: A Comprehensive Benchmark for Complex Real-world Multimodal Chart Question Answering

Title: Measuring Participant Contributions in Decentralized Federated Learning

Title: Accelerating RLHF Training with Reward Variance Increase

Title: Advancing Image Super-resolution Techniques in Remote Sensing: A Comprehensive Survey

Title: UniTEX: Universal High Fidelity Generative Texturing for 3D Shapes

Title: Efficiently Access Diffusion Fisher: Within the Outer Product Span Space

Title: Image Aesthetic Reasoning: A New Benchmark for Medical Image Screening with MLLMs

Title: Disrupting Vision-Language Model-Driven Navigation Services via Adversarial Object Fusion

Title: Does Machine Unlearning Truly Remove Model Knowledge? A Framework for Auditing Unlearning in LLMs

Title: The Arabic AI Fingerprint: Stylometric Analysis and Detection of Large Language Models Text

Title: Sentinel: Attention Probing of Proxy Models for LLM Context Compression with an Understanding Perspective

Title: RSFAKE-1M: A Large-Scale Dataset for Detecting Diffusion-Generated Remote Sensing Forgeries

Title: Comparative Analysis of the Land Use and Land Cover Changes in Different Governorates of Oman using Spatiotemporal Multi-spectral Satellite Data

Title: GenCAD-Self-Repairing: Feasibility Enhancement for 3D CAD Generation

Title: Federated Unsupervised Semantic Segmentation

Title: How Does Response Length Affect Long-Form Factuality

Title: EmoBench-UA: A Benchmark Dataset for Emotion Detection in Ukrainian

Title: Data-efficient Meta-models for Evaluation of Context-based Questions and Answers in LLMs

Title: Generalized Category Discovery in Event-Centric Contexts: Latent Pattern Mining with LLMs

Title: Score-based Generative Modeling for Conditional Independence Testing

Title: TRACE: Trajectory-Constrained Concept Erasure in Diffusion Models

Title: Adversarial Semantic and Label Perturbation Attack for Pedestrian Attribute Recognition

Title: Proximalized Preference Optimization for Diverse Feedback Types: A Decomposed Perspective on DPO

Title: Dimension-Reduction Attack! Video Generative Models are Experts on Controllable Image Synthesis

Title: Fine-Tuning Next-Scale Visual Autoregressive Models with Group Relative Policy Optimization

Title: DSAGL: Dual-Stream Attention-Guided Learning for Weakly Supervised Whole Slide Image Classification

Title: Diffusion Sampling Path Tells More: An Efficient Plug-and-Play Strategy for Sample Filtering

Title: Towards Reward Fairness in RLHF: From a Resource Allocation Perspective

Title: Grower-in-the-Loop Interactive Reinforcement Learning for Greenhouse Climate Control

Title: Joint Data Hiding and Partial Encryption of Compressive Sensed Streams

Title: VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?

Title: Discriminative Policy Optimization for Token-Level Reward Models

Title: PAN-Crafter: Learning Modality-Consistent Alignment for PAN-Sharpening

Title: Threading the Needle: Reweaving Chain-of-Thought Reasoning to Explain Human Label Variation

Title: Dynamic Spectral Backpropagation for Efficient Neural Network Training

Title: Meta-Learning Approaches for Speaker-Dependent Voice Fatigue Models

Title: UniRL: Self-Improving Unified Multimodal Models via Supervised and Reinforcement Learning

Title: Automated Modeling Method for Pathloss Model Discovery

Title: Robust and Annotation-Free Wound Segmentation on Noisy Real-World Pressure Ulcer Images: Towards Automated DESIGN-R\textsuperscript{\textregistered} Assessment

Title: Bridging Geometric and Semantic Foundation Models for Generalized Monocular Depth Estimation

Title: Adaptive Jailbreaking Strategies Based on the Semantic Understanding Capabilities of Large Language Models

Title: From Parameters to Prompts: Understanding and Mitigating the Factuality Gap between Fine-Tuned LLMs

Title: Buffer-free Class-Incremental Learning with Out-of-Distribution Detection

Title: Bidirectional predictive coding

Title: The Warmup Dilemma: How Learning Rate Strategies Impact Speech-to-Text Model Convergence

Title: OTPTO: Joint Product Selection and Inventory Optimization in Fresh E-commerce Front-End Warehouses

Title: Enhanced DACER Algorithm with High Diffusion Efficiency

Title: Diversity-Aware Policy Optimization for Large Language Model Reasoning

Title: UrbanCraft: Urban View Extrapolation via Hierarchical Sem-Geometric Priors

Title: Adaptive Spatial Augmentation for Semi-supervised Semantic Segmentation

Title: VITON-DRR: Details Retention Virtual Try-on via Non-rigid Registration

Title: CryoCCD: Conditional Cycle-consistent Diffusion with Biophysical Modeling for Cryo-EM Synthesis

Title: Diffusion Guidance Is a Controllable Policy Improvement Operator

Title: On Global Convergence Rates for Federated Policy Gradient under Heterogeneous Environment

Title: LAFR: Efficient Diffusion-based Blind Face Restoration via Latent Codebook Alignment Adapter

Title: A Divide-and-Conquer Approach for Global Orientation of Non-Watertight Scene-Level Point Clouds Using 0-1 Integer Optimization

Title: TimePoint: Accelerated Time Series Alignment via Self-Supervised Keypoint and Descriptor Learning

Title: Evaluating the performance and fragility of large language models on the self-assessment for neurological surgeons

Title: Revisiting Overthinking in Long Chain-of-Thought from the Perspective of Self-Doubt

Title: VCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality Evaluation

Title: R2I-Bench: Benchmarking Reasoning-Driven Text-to-Image Generation

Title: Can Large Language Models Challenge CNNS in Medical Image Analysis?

Title: VAU-R1: Advancing Video Anomaly Understanding via Reinforcement Fine-Tuning

Title: AnchorAttention: Difference-Aware Sparse Attention with Stripe Granularity

Title: Hallo4: High-Fidelity Dynamic Portrait Animation via Direct Preference Optimization and Temporal Motion Modulation

Title: Normalizing Flows are Capable Models for RL

Title: Comparative assessment of fairness definitions and bias mitigation strategies in machine learning-based diagnosis of Alzheimer's disease from MR images

Title: Subgraph Gaussian Embedding Contrast for Self-Supervised Graph Representation Learning

Title: Domain-Aware Tensor Network Structure Search

Title: CLaC at SemEval-2025 Task 6: A Multi-Architecture Approach for Corporate Environmental Promise Verification

Title: Probability-Consistent Preference Optimization for Enhanced LLM Reasoning

Title: Position Paper: Metadata Enrichment Model: Integrating Neural Networks and Semantic Knowledge Graphs for Cultural Heritage Applications

Title: Translation in the Wild

Title: Adaptive Federated LoRA in Heterogeneous Wireless Networks with Independent Sampling

Title: Merge Hijacking: Backdoor Attacks to Model Merging of Large Language Models

Title: Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models

Title: DRO: A Python Library for Distributionally Robust Optimization in Machine Learning

Title: Maximum Likelihood Learning of Latent Dynamics Without Reconstruction

Title: Evaluating AI capabilities in detecting conspiracy theories on YouTube

Title: BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model

Title: On-Policy RL with Optimal Reward Baseline

Title: Weakly-supervised Localization of Manipulated Image Regions Using Multi-resolution Learned Features

Title: PCA for Enhanced Cross-Dataset Generalizability in Breast Ultrasound Tumor Segmentation

Title: Accelerated Training of Federated Learning via Second-Order Methods

Title: Jigsaw-R1: A Study of Rule-based Visual Reinforcement Learning with Jigsaw Puzzles

Title: Position: Federated Foundation Language Model Post-Training Should Focus on Open-Source Models

Title: DeepChest: Dynamic Gradient-Free Task Weighting for Effective Multi-Task Learning in Chest X-ray Classification

Title: Bridging Classical and Modern Computer Vision: PerceptiveNet for Tree Crown Semantic Segmentation

Title: LLM Performance for Code Generation on Noisy Tasks

Title: A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis

Title: Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model

Title: Inference-time Scaling of Diffusion Models through Classical Search

Title: Learning Interpretable Differentiable Logic Networks for Tabular Regression

Title: One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory

Title: Characterizing the Expressivity of Transformer Language Models

Title: AutoSchemaKG: Autonomous Knowledge Graph Construction through Dynamic Schema Induction from Web-Scale Corpora

Title: MCP Safety Training: Learning to Refuse Falsely Benign MCP Exploits using Improved Preference Alignment

Title: Comparing the Effects of Persistence Barcodes Aggregation and Feature Concatenation on Medical Imaging

Title: Securing AI Agents with Information-Flow Control

Title: Continuous Chain of Thought Enables Parallel Exploration and Reasoning

Title: How does Transformer Learn Implicit Reasoning?

Title: ARC: Argument Representation and Coverage Analysis for Zero-Shot Long Document Summarization with Instruction Following LLMs

Title: Keyed Chaotic Tensor Transformations for Secure And Attributable Neural Inference

Title: VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models

Title: Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation

Title: Bayesian Perspective on Memorization and Reconstruction

Title: D-AR: Diffusion via Autoregressive Models

Title: OpenUni: A Simple Baseline for Unified Multimodal Understanding and Generation

Title: ToolHaystack: Stress-Testing Tool-Augmented Language Models in Realistic Long-Term Interactions

Title: LoLA: Low-Rank Linear Attention With Sparse Caching

Title: ImmunoDiff: A Diffusion Model for Immunotherapy Response Prediction in Lung Cancer

Title: Learning Compositional Functions with Transformers from Easy-to-Hard Data

Title: Automatic classification of stop realisation with wav2vec2.0

Title: DA-VPT: Semantic-Guided Visual Prompt Tuning for Vision Transformers

Title: Computational Algebra with Attention: Transformer Oracles for Border Basis Algorithms

Title: DiCoFlex: Model-agnostic diverse counterfactuals with flexible control

Title: Can LLMs Reason Abstractly Over Math Word Problems Without CoT? Disentangling Abstract Formulation From Arithmetic Computation

Title: CLDTracker: A Comprehensive Language Description for Visual Tracking

Title: Knowledge Insulating Vision-Language-Action Models: Train Fast, Run Fast, Generalize Better

Title: SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models

Title: SenWiCh: Sense-Annotation of Low-Resource Languages for WiC using Hybrid Methods

Title: Don't Take the Premise for Granted: Evaluating the Premise Critique Ability of Large Language Models

Title: TiRex: Zero-Shot Forecasting Across Long and Short Horizons with Enhanced In-Context Learning

Title: DiffER: Categorical Diffusion for Chemical Retrosynthesis

Title: Label-Guided In-Context Learning for Named Entity Recognition

Title: ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering

Title: SC-LoRA: Balancing Efficient Fine-tuning and Knowledge Preservation via Subspace-Constrained LoRA

Title: MuLoCo: Muon is a practical inner optimizer for DiLoCo

Title: FMG-Det: Foundation Model Guided Robust Object Detection

Title: PixelThink: Towards Efficient Chain-of-Pixel Reasoning

Title: Bounded Rationality for LLMs: Satisficing Alignment at Inference-Time

Title: ZPressor: Bottleneck-Aware Compression for Scalable Feed-Forward 3DGS

Title: ATLAS: Learning to Optimally Memorize the Context at Test Time

Title: How Animals Dance (When You're Not Looking)

Title: LayerPeeler: Autoregressive Peeling for Layer-wise Image Vectorization

Title: MAGREF: Masked Guidance for Any-Reference Video Generation

Title: DarkDiff: Advancing Low-Light Raw Enhancement by Retasking Diffusion Models for Camera ISP

Title: Boosting Domain Incremental Learning: Selecting the Optimal Parameters is All You Need

Title: Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence

Title: Distortion of AI Alignment: Does Preference Optimization Optimize for Preferences?

Title: REOrdering Patches Improves Vision Models

Title: ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks

Title: DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning

Title: LoRAShop: Training-Free Multi-Concept Image Generation and Editing with Rectified Flow Transformers

Title: MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence

Title: From Chat Logs to Collective Insights: Aggregative Question Answering

Title: Argus: Vision-Centric Reasoning with Grounded Chain-of-Thought

Title: TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models