2025-11-26

Title: PuzzlePoles: Cylindrical Fiducial Markers Based on the PuzzleBoard Pattern

Title: Personalized Reward Modeling for Text-to-Image Generation

Title: PrefixGPT: Prefix Adder Optimization by a Generative Pre-trained Transformer

Title: WavefrontDiffusion: Dynamic Decoding Schedule or Improved Reasoning

Title: Tracking and Segmenting Anything in Any Modality

Title: Exploiting the Experts: Unauthorized Compression in MoE-LLMs

Title: Quality analysis and evaluation prediction of RAG retrieval based on machine learning algorithms

Title: OmniTFT: Omni Target Forecasting for Vital Signs and Laboratory Result Trajectories in Multi Center ICU Data

Title: Efficient Inference Using Large Language Models with Limited Human Data: Fine-Tuning then Rectification

Title: Generative Model-Aided Continual Learning for CSI Feedback in FDD mMIMO-OFDM Systems

Title: A Systematic Study of Compression Ordering for Large Language Models

Title: Xmodel-2.5: 1.3B Data-Efficient Reasoning SLM

Title: PeriodNet: Boosting the Potential of Attention Mechanism for Time Series Forecasting

Title: Hierarchical Dual-Strategy Unlearning for Biomedical and Healthcare Intelligence Using Imperfect and Privacy-Sensitive Medical Data

Title: Beyond Binary Classification: A Semi-supervised Approach to Generalized AI-generated Image Detection

Title: Position: The Complexity of Perfect AI Alignment -- Formalizing the RLHF Trilemma

Title: TouchFormer: A Robust Transformer-based Framework for Multimodal Material Perception

Title: Connecting the Dots: Training-Free Visual Grounding via Agentic Reasoning

Title: Automating Deception: Scalable Multi-Turn LLM Jailbreaks

Title: Blinking Beyond EAR: A Stable Eyelid Angle Metric for Driver Drowsiness Detection and Data Augmentation

Title: EAGER: Edge-Aligned LLM Defense for Robust, Efficient, and Accurate Cybersecurity Question Answering

Title: VideoChat-M1: Collaborative Policy Planning for Video Understanding via Multi-Agent Reinforcement Learning

Title: Shortcut Invariance: Targeted Jacobian Regularization in Disentangled Latent Space

Title: AttackPilot: Autonomous Inference Attacks Against ML Services With LLM-Based Agents

Title: Cross-Domain Generalization of Multimodal LLMs for Global Photovoltaic Assessment

Title: Studying Maps at Scale: A Digital Investigation of Cartography and the Evolution of Figuration

Title: Think First, Assign Next (ThiFAN-VQA): A Two-stage Chain-of-Thought Framework for Post-Disaster Damage Assessment

Title: SPQR: A Standardized Benchmark for Modern Safety Alignment Methods in Text-to-Image Diffusion Models

Title: ModHiFi: Identifying High Fidelity predictive components for Model Modification

Title: An Invariant Latent Space Perspective on Language Model Inversion

Title: HunyuanOCR Technical Report

Title: Leveraging Unlabeled Scans for NCCT Image Segmentation in Early Stroke Diagnosis: A Semi-Supervised GAN Approach

Title: Multiscale Vector-Quantized Variational Autoencoder for Endoscopic Image Synthesis

Title: IRSDA: An Agent-Orchestrated Framework for Enterprise Intrusion Response

Title: Efficient Multi-Hop Question Answering over Knowledge Graphs via LLM Planning and Embedding-Guided Search

Title: Synthetic Data: AI's New Weapon Against Android Malware

Title: Accuracy and Efficiency Trade-Offs in LLM-Based Malware Detection and Explanation: A Comparative Study of Parameter Tuning vs. Full Fine-Tuning

Title: Structured Noise Modeling for Enhanced Time-Series Forecasting

Title: Demystifying Diffusion Objectives: Reweighted Losses are Better Variational Bounds

Title: OncoVision: Integrating Mammography and Clinical Data through Attention-Driven Multimodal AI for Enhanced Breast Cancer Diagnosis

Title: BASICS: Binary Analysis and Stack Integrity Checker System for Buffer Overflow Mitigation

Title: CountXplain: Interpretable Cell Counting with Prototype-Based Density Map Estimation

Title: TREASURE: A Transformer-Based Foundation Model for High-Volume Transaction Understanding

Title: TiCT: A Synthetically Pre-Trained Foundation Model for Time Series Classification

Title: RADSeg: Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglomerative Models

Title: CafeQ: Calibration-free Quantization via Learned Transformations and Adaptive Rounding

Title: Rethinking Vision Transformer Depth via Structural Reparameterization

Title: Can LLMs Faithfully Explain Themselves in Low-Resource Languages? A Case Study on Emotion Detection in Persian

Title: Prompt Fencing: A Cryptographic Approach to Establishing Security Boundaries in Large Language Model Prompts

Title: Training-Free Active Learning Framework in Materials Science with Large Language Models

Title: Comparative Analysis of LoRA-Adapted Embedding Models for Clinical Cardiology Text Representation

Title: Efficient Transferable Optimal Transport via Min-Sliced Transport Plans

Title: DISCO: A Browser-Based Privacy-Preserving Framework for Distributed Collaborative Learning

Title: What You See is (Usually) What You Get: Multimodal Prototype Networks that Abstain from Expensive Modalities

Title: Vision--Language Enhanced Foundation Model for Semi-supervised Medical Image Segmentation

Title: A Storage-Efficient Feature for 3D Concrete Defect Segmentation to Replace Normal Vector

Title: Lightweight Transformer Framework for Weakly Supervised Semantic Segmentation

Title: One Attention, One Scale: Phase-Aligned Rotary Positional Embeddings for Mixed-Resolution Diffusion Transformer

Title: Gender Bias in Emotion Recognition by Large Language Models

Title: Terminal Velocity Matching

Title: Scalable Data Attribution via Forward-Only Test-Time Inference

Title: Training-Free Generation of Diverse and High-Fidelity Images via Prompt Semantic Space Optimization

Title: Mosaic Pruning: A Hierarchical Framework for Generalizable Pruning of Mixture-of-Experts Models

Title: Large Language Model Aided Birt-Hogg-Dube Syndrome Diagnosis with Multimodal Retrieval-Augmented Generation

Title: Rectified SpaAttn: Revisiting Attention Sparsity for Efficient Video Generation

Title: SX-GeoTree: Self-eXplaining Geospatial Regression Tree Incorporating the Spatial Similarity of Feature Attributions

Title: DOGE: Differentiable Bezier Graph Optimization for Road Network Extraction

Title: Accelerating Wireless Distributed Learning via Hybrid Split and Federated Learning Optimization

Title: Profile-LLM: Dynamic Profile Optimization for Realistic Personality Expression in LLMs

Title: A Systematic Analysis of Large Language Models with RAG-enabled Dynamic Prompting for Medical Error Detection and Correction

Title: GigaWorld-0: World Models as Data Engine to Empower Embodied AI

Title: MAPS: Preserving Vision-Language Representations via Module-Wise Proximity Scheduling for Better Vision-Language-Action Generalization

Title: ChessMamba: Structure-Aware Interleaving of State Spaces for Change Detection in Remote Sensing Images

Title: Frequency Bias Matters: Diving into Robust and Generalized Deep Image Forgery Detection

Title: LiMT: A Multi-task Liver Image Benchmark Dataset

Title: Frailty-Aware Transformer for Recurrent Survival Modeling of Driver Retention in Ride-Hailing Platforms

Title: MHB: Multimodal Handshape-aware Boundary Detection for Continuous Sign Language Recognition

Title: Motion Marionette: Rethinking Rigid Motion Transfer via Prior Guidance

Title: Reasoning-VLA: A Fast and General Vision-Language-Action Reasoning Model for Autonomous Driving

Title: Coupled Physics-Gated Adaptation: Spatially Decoding Volumetric Photochemical Conversion in Complex 3D-Printed Objects

Title: Scale Where It Matters: Training-Free Localized Scaling for Diffusion Models

Title: HybriDLA: Hybrid Generation for Document Layout Analysis

Title: Intelligent Image Search Algorithms Fusing Visual Large Models

Title: CounterVQA: Evaluating and Improving Counterfactual Reasoning in Vision-Language Models for Video Understanding

Title: Context-Aware Token Pruning and Discriminative Selective Attention for Transformer Tracking

Title: EfficientXpert: Efficient Domain Adaptation for Large Language Models via Propagation-Aware Pruning

Title: Image Diffusion Models Exhibit Emergent Temporal Propagation in Videos

Title: Optimize Flip Angle Schedules In MR Fingerprinting Using Reinforcement Learning

Title: Differential Smoothing Mitigates Sharpening and Improves LLM Reasoning

Title: Hierarchical Spatio-Temporal Attention Network with Adaptive Risk-Aware Decision for Forward Collision Warning in Complex Scenarios

Title: Supervise Less, See More: Training-free Nuclear Instance Segmentation with Prototype-Guided Prompting

Title: Prompt Fairness: Sub-group Disparities in LLMs

Title: AppSelectBench: Application-Level Tool Selection Benchmark

Title: GFT-GCN: Privacy-Preserving 3D Face Mesh Recognition with Spectral Diffusion

Title: ParaBlock: Communication-Computation Parallel Block Coordinate Federated Learning for Large Language Models

Title: MambaEye: A Size-Agnostic Visual Encoder with Causal Sequential Processing

Title: HiCoGen: Hierarchical Compositional Text-to-Image Generation in Diffusion Models via Reinforcement Learning

Title: Stragglers Can Contribute More: Uncertainty-Aware Distillation for Asynchronous Federated Learning

Title: VGGT4D: Mining Motion Cues in Visual Geometry Transformers for 4D Scene Reconstruction

Title: EmoFeedback2: Reinforcement of Continuous Emotional Image Generation via LVLM-based Reward and Textual Feedback

Title: Rethinking Message Passing Neural Networks with Diffusion Distance-guided Stress Majorization

Title: $\text{R}^2\text{R}$: A Route-to-Rerank Post-Training Framework for Multi-Domain Decoder-Only Rerankers

Title: OmniRefiner: Reinforcement-Guided Local Diffusion Refinement

Title: Directional Optimization Asymmetry in Transformers: A Synthetic Stress Test

Title: A Machine Learning Approach for Detection of Mental Health Conditions and Cyberbullying from Social Media

Title: On the Feasibility of Hijacking MLLMs' Decision Chain via One Perturbation

Title: Pedestrian Crossing Intention Prediction Using Multimodal Fusion Network

Title: Multi-Context Fusion Transformer for Pedestrian Crossing Intention Prediction in Urban Environments

Title: iRadioDiff: Physics-Informed Diffusion Model for Indoor Radio Map Construction and Localization

Title: ACIT: Attention-Guided Cross-Modal Interaction Transformer for Pedestrian Crossing Intention Prediction

Title: WaymoQA: A Multi-View Visual Question Answering Dataset for Safety-Critical Reasoning in Autonomous Driving

Title: SAM-MI: A Mask-Injected Framework for Enhancing Open-Vocabulary Semantic Segmentation with SAM

Title: Cross-Contrastive Clustering for Multimodal Attributed Graphs with Dual Graph Filtering

Title: MFM-point: Multi-scale Flow Matching for Point Cloud Generation

Title: RED-F: Reconstruction-Elimination based Dual-stream Contrastive Forecasting for Multivariate Time Series Anomaly Prediction

Title: PRADA: Probability-Ratio-Based Attribution and Detection of Autoregressive-Generated Images

Title: MTA: A Merge-then-Adapt Framework for Personalized Large Language Model

Title: Learning Procedural-aware Video Representations through State-Grounded Hierarchy Unfolding

Title: More Bias, Less Bias: BiasPrompting for Enhanced Multiple-Choice Question Answering

Title: Explainable Visual Anomaly Detection via Concept Bottleneck Models

Title: Exploring State-of-the-art models for Early Detection of Forest Fires

Title: QiMeng-CRUX: Narrowing the Gap between Natural Language and Verilog via Core Refined Understanding eXpression

Title: SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space

Title: The Devil in the Details: Emergent Misalignment, Format and Coherence in Open-Weights LLMs

Title: EM2LDL: A Multilingual Speech Corpus for Mixed Emotion Recognition through Label Distribution Learning

Title: CLIMATEAGENT: Multi-Agent Orchestration for Complex Climate Data Science Workflows

Title: LungEvaty: A Scalable, Open-Source Transformer-based Deep Learning Model for Lung Cancer Risk Prediction in LDCT Screening

Title: "When Data is Scarce, Prompt Smarter"... Approaches to Grammatical Error Correction in Low-Resource Settings

Title: UltraViCo: Breaking Extrapolation Limits in Video Diffusion Transformers

Title: IDAP++: Advancing Divergence-Based Pruning via Filter-Level and Layer-Level Optimization

Title: SEDA: A Self-Adapted Entity-Centric Data Augmentation for Boosting Gird-based Discontinuous NER Models

Title: Hybrid Convolution and Frequency State Space Network for Image Compression

Title: Restora-Flow: Mask-Guided Image Restoration with Flow Matching

Title: Alzheimers Disease Progression Prediction Based on Manifold Mapping of Irregularly Sampled Longitudinal Data

Title: SKEL-CF: Coarse-to-Fine Biomechanical Skeleton and Surface Mesh Recovery

Title: Harmonious Parameter Adaptation in Continual Visual Instruction Tuning for Safety-Aligned MLLMs

Title: On the Limits of Momentum in Decentralized and Federated Optimization

Title: Realizing Fully-Integrated, Low-Power, Event-Based Pupil Tracking with Neuromorphic Hardware

Title: Exo2EgoSyn: Unlocking Foundation Video Generation Models for Exocentric-to-Egocentric Video Synthesis

Title: In-Context Compositional Learning via Sparse Coding Transformer

Title: GHR-VQA: Graph-guided Hierarchical Relational Reasoning for Video Question Answering

Title: Robust 3D Brain MRI Inpainting with Random Masking Augmentation

Title: OmniAlpha: A Sequence-to-Sequence Framework for Unified Multi-Task RGBA Generation

Title: Text-guided Controllable Diffusion for Realistic Camouflage Images Generation

Title: Communication-Efficient Learning for Satellite Constellations

Title: Patch-Level Glioblastoma Subregion Classification with a Contrastive Learning-Based Encoder

Title: V-Attack: Targeting Disentangled Value Features for Controllable Adversarial Attacks on LVLMs

Title: Improving the Identification of Real-world Malware's DNS Covert Channels Using Locality Sensitive Hashing

Title: REFLEX: Self-Refining Explainable Fact-Checking via Disentangling Truth into Style and Substance

Title: HistoSpeckle-Net: Mutual Information-Guided Deep Learning for high-fidelity reconstruction of complex OrganAMNIST images via perturbed Multimode Fibers

Title: Uplifting Table Tennis: A Robust, Real-World Application for 3D Trajectory and Spin Estimation

Title: Hey there! You are using WhatsApp: Enumerating Three Billion Accounts for Security and Privacy

Title: XiCAD: Camera Activation Detection in the Da Vinci Xi User Interface

Title: Interpretable Air Pollution Forecasting by Physics-Guided Spatiotemporal Decoupling

Title: Modality-Balanced Collaborative Distillation for Multi-Modal Domain Generalization

Title: Advancing Image Classification with Discrete Diffusion Classification Modeling

Title: DRL-Guided Neural Batch Sampling for Semi-Supervised Pixel-Level Anomaly Detection

Title: VKnowU: Evaluating Visual Knowledge Understanding in Multimodal LLMs

Title: Beyond Components: Singular Vector-Based Interpretability of Transformer Circuits

Title: ScenarioCLIP: Pretrained Transferable Visual Language Models and Action-Genome Dataset for Natural Scene Analysis

Title: HVAdam: A Full-Dimension Adaptive Optimizer

Title: DAPointMamba: Domain Adaptive Point Mamba for Point Cloud Completion

Title: SelfMOTR: Revisiting MOTR with Self-Generating Detection Priors

Title: Bootstrapping Physics-Grounded Video Generation through VLM-Guided Iterative Self-Refinement

Title: Can LLMs Make (Personalized) Access Control Decisions?

Title: APT-CGLP: Advanced Persistent Threat Hunting via Contrastive Graph-Language Pre-Training

Title: CrossEarth-Gate: Fisher-Guided Adaptive Tuning Engine for Efficient Adaptation of Cross-Domain Remote Sensing Semantic Segmentation

Title: TReFT: Taming Rectified Flow Models For One-Step Image Translation

Title: A Reality Check on SBOM-based Vulnerability Management: An Empirical Study and A Path Forward

Title: Geometry of Decision Making in Language Models

Title: IrisNet: Infrared Image Status Awareness Meta Decoder for Infrared Small Targets Detection

Title: ShelfRectNet: Single View Shelf Image Rectification with Homography Estimation

Title: The Curious Case of Analogies: Investigating Analogical Reasoning in Large Language Models

Title: Soft Adaptive Policy Optimization

Title: From Passive Perception to Active Memory: A Weakly Supervised Image Manipulation Localization Framework Driven by Coarse-Grained Annotations

Title: MoRE: Batch-Robust Multi-Omics Representations from Frozen Pre-trained Transformers

Title: FREE: Uncertainty-Aware Autoregression for Parallel Diffusion Transformers

Title: BengaliFig: A Low-Resource Challenge for Figurative and Culturally Grounded Reasoning in Bengali

Title: Short-Range Oversquashing

Title: Image-Free Timestep Distillation via Continuous-Time Consistency with Trajectory-Sampled Pairs

Title: BRIC: Bridging Kinematic Plans and Physical Control at Test Time

Title: Object-Centric Vision Token Pruning for Vision Language Models

Title: Diffusion for Fusion: Designing Stellarators with Generative AI

Title: Learning to Generate Human-Human-Object Interactions from Textual Descriptions

Title: Towards Trustworthy Wi-Fi Sensing: Systematic Evaluation of Deep Learning Model Robustness to Adversarial Attacks

Title: Generation, Evaluation, and Explanation of Novelists' Styles with Single-Token Prompts

Title: STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flow

Title: Dance Style Classification using Laban-Inspired and Frequency-Domain Motion Features

Title: NVIDIA Nemotron Parse 1.1

Title: Ranking-Enhanced Anomaly Detection Using Active Learning-Assisted Attention Adversarial Dual AutoEncoders

Title: MTBBench: A Multimodal Sequential Clinical Decision-Making Benchmark in Oncology

Title: Adversarial Confusion Attack: Disrupting Multimodal Large Language Models

Title: From One Attack Domain to Another: Contrastive Transfer Learning with Siamese Networks for APT Detection

Title: A Physics-Informed Loss Function for Boundary-Consistent and Robust Artery Segmentation in DSA Sequences

Title: A Single-Root, Multi-Curve, Context-Isolated, PQC-Pluggable Cryptographic Identity Primitive with Stateless Secret Rotation

Title: The Text Aphasia Battery (TAB): A Clinically-Grounded Benchmark for Aphasia-Like Deficits in Language Models

Title: DP-MicroAdam: Private and Frugal Algorithm for Training and Fine-tuning

Title: DesignPref: Capturing Personal Preferences in Visual Design Generation

Title: HBridge: H-Shape Bridging of Heterogeneous Experts for Unified Multimodal Understanding and Generation

Title: Engel p-adic Isogeny-based Cryptography over Laurent Series: Foundations, Security, and an ESP32 Implementation

Title: Bridging the Language Gap: Synthetic Voice Diversity via Latent Mixup for Equitable Speech Recognition

Title: Automated Monitoring of Cultural Heritage Artifacts Using Semantic Segmentation

Title: From Words to Wisdom: Discourse Annotation and Baseline Models for Student Dialogue Understanding

Title: Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning

Title: Effective Command-line Interface Fuzzing with Path-Aware Large Language Model Orchestration

Title: Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward

Title: A Reason-then-Describe Instruction Interpreter for Controllable Video Generation

Title: DINO-Tok: Adapting DINO for Visual Tokenizers

Title: MSTN: Fast and Efficient Multivariate Time Series Model

Title: Anatomica: Localized Control over Geometric and Topological Properties for Anatomical Diffusion Models

Title: Latent Diffusion Inversion Requires Understanding the Latent Space

Title: BrowseSafe: Understanding and Preventing Prompt Injection Within AI Browser Agents

Title: On Evaluating LLM Alignment by Evaluating LLMs as Judges

Title: How to Purchase Labels? A Cost-Effective Approach Using Active Learning Markets

Title: Adaptive Hopfield Network: Rethinking Similarities in Associative Memory

Title: Can Vibe Coding Beat Graduate CS Students? An LLM vs. Human Coding Tournament on Market-driven Strategic Planning

Title: Evaluating the Performance of Deep Learning Models in Whole-body Dynamic 3D Posture Prediction During Load-reaching Activities

Title: Wanderland: Geometrically Grounded Simulation for Open-World Embodied AI

Title: ShapeGen: Towards High-Quality 3D Shape Synthesis

Title: ROOT: Robust Orthogonalized Optimizer for Neural Network Training

Title: MapReduce LoRA: Advancing the Pareto Front in Multi-Preference Optimization for Generative Models

Title: Quantum-Resistant Authentication Scheme for RFID Systems Using Lattice-Based Cryptography

Title: Image2Gcode: Image-to-G-code Generation for Additive Manufacturing Using Diffusion-Transformer Model

Title: Latent Collaboration in Multi-Agent Systems

Title: MotionV2V: Editing Motion in a Video

Title: PixelDiT: Pixel Diffusion Transformers for Image Generation

Title: 3D-Aware Multi-Task Learning with Cross-View Correlations for Dense Scene Understanding

Title: Diverse Video Generation with Determinantal Point Process-Guided Policy Optimization

Title: LocateAnything3D: Vision-Language 3D Detection with Chain-of-Sight

Title: Infinity-RoPE: Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout

Title: RubricRL: Simple Generalizable Rewards for Text-to-Image Generation