2025-04-15

Title: Can AI Master Construction Management (CM)? Benchmarking State-of-the-Art Large Language Models on CM Certification Exams

Title: Efficient Evaluation of Large Language Models via Collaborative Filtering

Title: Embedding Hidden Adversarial Capabilities in Pre-Trained Diffusion Models

Title: Enhancing NER Performance in Low-Resource Pakistani Languages using Cross-Lingual Data Augmentation

Title: Exploring Gradient-Guided Masked Language Model to Detect Textual Adversarial Attacks

Title: Learnable Multi-Scale Wavelet Transformer: A Novel Alternative to Self-Attention

Title: InfoGain Wavelets: Furthering the Design of Diffusion Wavelets for Graph-Structured Data

Title: A temporal scale transformer framework for precise remaining useful life prediction in fuel cells

Title: Generative AI in Live Operations: Evidence of Productivity Gains in Cybersecurity and Endpoint Management

Title: Exploring the Effectiveness and Interpretability of Texts in LLM-based Time Series Models

Title: Decoupling Contrastive Decoding: Robust Hallucination Mitigation in Multimodal Large Language Models

Title: PriM: Principle-Inspired Material Discovery through Multi-Agent Collaboration

Title: Analogical Learning for Cross-Scenario Generalization: Framework and Application to Intelligent Localization

Title: SafeMLRM: Demystifying Safety in Multi-modal Large Reasoning Models

Title: From Text to Time? Rethinking the Effectiveness of the Large Language Model for Time Series Forecasting

Title: CAReDiO: Cultural Alignment of LLM via Representativeness and Distinctiveness Guided Data Optimization

Title: Probabilistic QoS Metric Forecasting in Delay-Tolerant Networks Using Conditional Diffusion Models on Latent Dynamics

Title: FM-LoRA: Factorized Low-Rank Meta-Prompting for Continual Learning

Title: PatchTrAD: A Patch-Based Transformer focusing on Patch-Wise Reconstruction Error for Time Series Anomaly Detection

Title: Datum-wise Transformer for Synthetic Tabular Data Detection in the Wild

Title: SD$^2$: Self-Distilled Sparse Drafters

Title: Adaptive Shrinkage Estimation For Personalized Deep Kernel Regression In Modeling Brain Trajectories

Title: Towards Combinatorial Interpretability of Neural Computation

Title: X-Guard: Multilingual Guard Agent for Content Moderation

Title: Mimic In-Context Learning for Multimodal Tasks

Title: ML For Hardware Design Interpretability: Challenges and Opportunities

Title: Hardware Design and Security Needs Attention: From Survey to Path Forward

Title: On Transfer-based Universal Attacks in Pure Black-box Setting

Title: An LLM Framework For Cryptography Over Chat Channels

Title: Personalizing Federated Learning for Hierarchical Edge Networks with Non-IID Data

Title: Distilling and exploiting quantitative insights from Large Language Models for enhanced Bayesian optimization of chemical reactions

Title: Knowledge Graph-extended Retrieval Augmented Generation for Question Answering

Title: Position: Beyond Euclidean -- Foundation Models Should Embrace Non-Euclidean Geometries

Title: Toward Spiking Neural Network Local Learning Modules Resistant to Adversarial Attacks

Title: LookingGlass: Generative Anamorphoses via Laplacian Pyramid Warping

Title: Robust SAM: On the Adversarial Robustness of Vision Foundation Models

Title: HyperCore: The Core Framework for Building Hyperbolic Foundation Models with Comprehensive Modules

Title: An Adaptive Vector Index Partitioning Scheme for Low-Latency RAG Pipeline

Title: Long Context In-Context Compression by Getting to the Gist of Gisting

Title: Generating Planning Feedback for Open-Ended Programming Exercises with LLMs

Title: MotionDreamer: One-to-Many Motion Synthesis with Localized Generative Masked Transformer

Title: A Fully Automated Pipeline for Conversational Discourse Annotation: Tree Scheme Generation and Labeling with Large Language Models

Title: Bidirectional Linear Recurrent Models for Sequence-Level Multisource Fusion

Title: RAG-Based Fuzzing of Cross-Architecture Compilers

Title: Robust Steganography from Large Language Models

Title: AGENT: An Aerial Vehicle Generation and Design Tool Using Large Language Models

Title: Adaptive Additive Parameter Updates of Vision Transformers for Few-Shot Continual Learning

Title: MCP Bridge: A Lightweight, LLM-Agnostic RESTful Proxy for Model Context Protocol Servers

Title: Detecting Instruction Fine-tuning Attack on Language Models with Influence Function

Title: Associating transportation planning-related measures with Mild Cognitive Impairment

Title: Sculpting Memory: Multi-Concept Forgetting in Diffusion Models via Dynamic Mask and Concept-Aware Optimization

Title: From Punchlines to Predictions: A Metric to Assess LLM Performance in Identifying Humor in Stand-Up Comedy

Title: Multimodal 3D Genome Pre-training

Title: Hyperlocal disaster damage assessment using bi-temporal street-view imagery and pre-trained vision models

Title: Exploring Synergistic Ensemble Learning: Uniting CNNs, MLP-Mixers, and Vision Transformers to Enhance Image Classification

Title: A Visual Self-attention Mechanism Facial Expression Recognition Network beyond Convnext

Title: crowd-hpo: Realistic Hyperparameter Optimization and Benchmarking for Learning from Crowds with Noisy Labels

Title: Privacy Preservation in Gen AI Applications

Title: BIGS: Bimanual Category-agnostic Interaction Reconstruction from Monocular Videos via 3D Gaussian Splatting

Title: Synthetic Aircraft Trajectory Generation Using Time-Based VQ-VAE

Title: Shrinkage Initialization for Smooth Learning of Neural Networks

Title: Probability Distribution Alignment and Low-Rank Weight Decomposition for Source-Free Domain Adaptive Brain Decoding

Title: Deploying Large AI Models on Resource-Limited Devices with Split Federated Learning

Title: CAShift: Benchmarking Log-Based Cloud Attack Detection under Normality Shift

Title: Self-Supervised Autoencoder Network for Robust Heart Rate Extraction from Noisy Photoplethysmogram: Applying Blind Source Separation to Biosignal Analysis

Title: Efficient and Asymptotically Unbiased Constrained Decoding for Large Language Models

Title: Kernel-Based Enhanced Oversampling Method for Imbalanced Classification

Title: MatWheel: Addressing Data Scarcity in Materials Science Through Synthetic Data

Title: Secure Physical Layer Communications for Low-Altitude Economy Networking: A Survey

Title: Evolved Hierarchical Masking for Self-Supervised Learning

Title: LEREL: Lipschitz Continuity-Constrained Emotion Recognition Ensemble Learning For Electroencephalography

Title: Can postgraduate translation students identify machine-generated text?

Title: Langformers: Unified NLP Pipelines for Language Models

Title: A Confounding Factors-Inhibition Adversarial Learning Framework for Multi-site fMRI Mental Disorder Identification

Title: A Multi-Layered Security Analysis of Blockchain Systems: From Attack Vectors to Defense and System Hardening

Title: Feature-Aware Malicious Output Detection and Mitigation

Title: Towards More Efficient, Robust, Instance-adaptive, and Generalizable Online Learning

Title: ReferGPT: Towards Zero-Shot Referring Multi-Object Tracking

Title: RT-DATR:Real-time Unsupervised Domain Adaptive Detection Transformer with Adversarial Feature Learning

Title: Illusion Worlds: Deceptive UI Attacks in Social VR

Title: From Visual Explanations to Counterfactual Explanations with Latent Diffusion

Title: AerOSeg: Harnessing SAM for Open-Vocabulary Segmentation in Remote Sensing Images

Title: Query-based Knowledge Transfer for Heterogeneous Learning Environments

Title: FairACE: Achieving Degree Fairness in Graph Neural Networks via Contrastive and Adversarial Group-Balanced Training

Title: Accurate Diagnosis of Respiratory Viruses Using an Explainable Machine Learning with Mid-Infrared Biomolecular Fingerprinting of Nasopharyngeal Secretions

Title: Multi-scale Activation, Refinement, and Aggregation: Exploring Diverse Cues for Fine-Grained Bird Recognition

Title: DL-QAT: Weight-Decomposed Low-Rank Quantization-Aware Training for Large Language Models

Title: Learning Occlusion-Robust Vision Transformers for Real-Time UAV Tracking

Title: Type-Constrained Code Generation with Language Models

Title: Head-Aware KV Cache Compression for Efficient Visual Autoregressive Modeling

Title: Mixture of Group Experts for Learning Invariant Representations

Title: VideoAds for Fast-Paced Video Understanding: Where Opensource Foundation Models Beat GPT-4o & Gemini-1.5 Pro

Title: A Lightweight Moment Retrieval System with Global Re-Ranking and Robust Adaptive Bidirectional Temporal Search

Title: Enhancing Contrastive Demonstration Selection with Semantic Diversity for Robust In-Context Machine Translation

Title: Improving the Accuracy and Efficiency of Legal Document Tagging with Large Language Models and Instruction Prompts

Title: SmartShift: A Secure and Efficient Approach to Smart Contract Migration

Title: CrossLink: A Decentralized Framework for Secure Cross-Chain Smart Contract Execution

Title: MedIL: Implicit Latent Spaces for Generating Heterogeneous Medical Images at Arbitrary Resolutions

Title: Text To 3D Object Generation For Scalable Room Assembly

Title: Efficient Implementation of Reinforcement Learning over Homomorphic Encryption

Title: Towards Optimal Differentially Private Regret Bounds in Linear MDPs

Title: Context-Aware Adaptive Sampling for Intelligent Data Acquisition Systems Using DQN

Title: REMEMBER: Retrieval-based Explainable Multimodal Evidence-guided Modeling for Brain Evaluation and Reasoning in Zero- and Few-shot Neurodegenerative Diagnosis

Title: PapMOT: Exploring Adversarial Patch Attack against Multiple Object Tracking

Title: Machine Learning-Based Cyberattack Detection and Identification for Automatic Generation Control Systems Considering Nonlinearities

Title: QUDsim: Quantifying Discourse Similarities in LLM-Generated Text

Title: Beyond Degradation Conditions: All-in-One Image Restoration via HOG Transformers

Title: Can you map it to English? The Role of Cross-Lingual Alignment in Multilingual Performance of LLMs

Title: Contour Flow Constraint: Preserving Global Shape Similarity for Deep Learning based Image Segmentation

Title: Beyond Memorization: Mapping the Originality-Quality Frontier of Language Models

Title: Vision Transformers Exhibit Human-Like Biases: Evidence of Orientation and Color Selectivity, Categorical Perception, and Phase Transitions

Title: Adaptive Insurance Reserving with CVaR-Constrained Reinforcement Learning under Macroeconomic Regimes

Title: Question Tokens Deserve More Attention: Enhancing Large Language Models without Training through Step-by-Step Reading and Question Attention Recalibration

Title: UXAgent: A System for Simulating Usability Testing of Web Design with LLM Agents

Title: Nash Equilibrium Between Consumer Electronic Devices and DoS Attacker for Distributed IoT-enabled RSE Systems

Title: SaRO: Enhancing LLM Safety through Reasoning-based Alignment

Title: ClinicalGPT-R1: Pushing reasoning capability of generalist disease diagnosis with large language model

Title: BabyVLM: Data-Efficient Pretraining of VLMs Inspired by Infant Learning

Title: Ensemble-Enhanced Graph Autoencoder with GAT and Transformer-Based Encoders for Robust Fault Diagnosis

Title: Constants of motion network revisited

Title: PLS-Assisted Offloading for Edge Computing-Enabled Post-Quantum Security in Resource-Constrained Devices

Title: Structure-Accurate Medical Image Translation based on Dynamic Frequency Balance and Knowledge Guidance

Title: aweSOM: a CPU/GPU-accelerated Self-organizing Map and Statistically Combined Ensemble Framework for Machine-learning Clustering Analysis

Title: FractalForensics: Proactive Deepfake Detection and Localization via Fractal Watermarks

Title: D$^2$iT: Dynamic Diffusion Transformer for Accurate Image Generation

Title: Measuring Leakage in Concept-Based Methods: An Information Theoretic Approach

Title: AdaSteer: Your Aligned LLM is Inherently an Adaptive Jailbreak Defender

Title: CamMimic: Zero-Shot Image To Camera Motion Personalized Video Generation Using Diffusion Models

Title: Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation

Title: HalluShift: Measuring Distribution Shifts towards Hallucination Detection in LLMs

Title: An overview of condensation phenomenon in deep learning

Title: GenEDA: Unleashing Generative Reasoning on Netlist via Multimodal Encoder-Decoder Aligned Foundation Model

Title: Kongzi: A Historical Large Language Model with Fact Enhancement

Title: Federated Prototype Graph Learning

Title: EasyREG: Easy Depth-Based Markerless Registration and Tracking using Augmented Reality Device for Surgical Guidance

Title: PCM-SAR: Physics-Driven Contrastive Mutual Learning for SAR Classification

Title: MADLLM: Multivariate Anomaly Detection via Pre-trained LLMs

Title: FVOS for MOSE Track of 4th PVUW Challenge: 3rd Place Solution

Title: DiffuMural: Restoring Dunhuang Murals with Multi-scale Diffusion

Title: How new data permeates LLM knowledge and how to dilute it

Title: A Secure Communication Protocol for Remote Keyless Entry System with Adaptive Adjustment of Transmission Parameters

Title: AeroLite: Tag-Guided Lightweight Generation of Aerial Image Captions

Title: Trajectory-guided Motion Perception for Facial Expression Quality Assessment in Neurological Disorders

Title: SD-ReID: View-aware Stable Diffusion for Aerial-Ground Person Re-Identification

Title: Mitigating Long-tail Distribution in Oracle Bone Inscriptions: Dataset, Model, and Benchmark

Title: Syzygy of Thoughts: Improving LLM CoT with the Minimal Free Resolution

Title: LLMs Can Achieve High-quality Simultaneous Machine Translation as Efficiently as Offline

Title: Eccfrog512ck2: An Enhanced 512-bit Weierstrass Elliptic Curve

Title: Short-Path Prompting in LLMs: Analyzing Reasoning Instability and Solutions for Robust Performance

Title: TextSplat: Text-Guided Semantic Fusion for Generalizable Gaussian Splatting

Title: ControlNET: A Firewall for RAG-based LLM System

Title: Mixture-of-Shape-Experts (MoSE): End-to-End Shape Dictionary Framework to Prompt SAM for Generalizable Medical Segmentation

Title: Mitigating Many-Shot Jailbreaking

Title: Early-Bird Diffusion: Investigating and Leveraging Timestep-Aware Early-Bird Tickets in Diffusion Models for Efficient Training

Title: Tokenize Image Patches: Global Context Fusion for Effective Haze Removal in Large Images

Title: Quantization Error Propagation: Revisiting Layer-Wise Post-Training Quantization

Title: Leveraging Reasoning Model Answers to Enhance Non-Reasoning Model Capability

Title: Iterative Self-Training for Code Generation via Reinforced Re-Ranking

Title: SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model

Title: Myanmar XNLI: Building a Dataset and Exploring Low-resource Approaches to Natural Language Inference with Myanmar

Title: RANSAC Revisited: An Improved Algorithm for Robust Subspace Recovery under Adversarial and Noisy Corruptions

Title: Bridging Immutability with Flexibility: A Scheme for Secure and Efficient Smart Contract Upgrades

Title: KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation

Title: Ordinary Least Squares as an Attention Mechanism

Title: Adapting to the Unknown: Robust Meta-Learning for Zero-Shot Financial Time Series Forecasting

Title: CLEAR-KGQA: Clarification-Enhanced Ambiguity Resolution for Knowledge Graph Question Answering

Title: Can LLMs Revolutionize the Design of Explainable and Efficient TinyML Models?

Title: Computer-Aided Layout Generation for Building Design: A Review

Title: GRPO-LEAD: A Difficulty-Aware Reinforcement Learning Approach for Concise Mathematical Reasoning in Language Models

Title: ToolTipNet: A Segmentation-Driven Deep Learning Baseline for Surgical Instrument Tip Detection

Title: Transformer-Based Representation Learning for Robust Gene Expression Modeling and Cancer Prognosis

Title: DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM Post-training

Title: The Structural Safety Generalization Problem

Title: Evaluating the Quality of Benchmark Datasets for Low-Resource Languages: A Case Study on Turkish

Title: Automatic Detection of Intro and Credits in Video using CLIP and Multihead Attention

Title: Improving Multilingual Capabilities with Cultural and Local Knowledge in Large Language Models While Enhancing Native Performance

Title: Alleviating the Fear of Losing Alignment in LLM Fine-tuning

Title: Enhancing Classifier Evaluation: A Fairer Benchmarking Strategy Based on Ability and Robustness

Title: Dynamical symmetries in the fluctuation-driven regime: an application of Noether's theorem to noisy dynamical systems

Title: Executable Functional Abstractions: Inferring Generative Programs for Advanced Math Problems

Title: Socratic Chart: Cooperating Multiple Agents for Robust SVG Chart Understanding

Title: An Investigation of Large Language Models and Their Vulnerabilities in Spam Detection

Title: Reasoning without Regret

Title: Reasoning Court: Combining Reasoning, Action, and Judgment for Multi-Hop Reasoning

Title: EquiVDM: Equivariant Video Diffusion Models with Temporally Consistent Noise

Title: IGL-DT: Iterative Global-Local Feature Learning with Dual-Teacher Semantic Segmentation Framework under Limited Annotation Scheme

Title: Multi-task Federated Learning with Encoder-Decoder Structure: Enabling Collaborative Learning Across Different Tasks

Title: Training Small Reasoning LLMs with Cognitive Preference Alignment

Title: Efficient Multi-Task Modeling through Automated Fusion of Trained Models

Title: DUDA: Distilled Unsupervised Domain Adaptation for Lightweight Semantic Segmentation

Title: Transferable text data distillation by trajectory matching

Title: Density-based Object Detection in Crowded Scenes

Title: StruPhantom: Evolutionary Injection Attacks on Black-Box Tabular Agents Powered by Large Language Models

Title: Accelerating Differentially Private Federated Learning via Adaptive Extrapolation

Title: GFT: Gradient Focal Transformer

Title: RadarLLM: Empowering Large Language Models to Understand Human Motion from Millimeter-wave Point Cloud Sequence

Title: HDC: Hierarchical Distillation for Multi-level Noisy Consistency in Semi-Supervised Fetal Ultrasound Segmentation

Title: Revisiting the attacker's knowledge in inference attacks against Searchable Symmetric Encryption

Title: Focus on Local: Finding Reliable Discriminative Regions for Visual Place Recognition

Title: Investigating Syntactic Biases in Multilingual Transformers with RC Attachment Ambiguities in Italian and English

Title: Enhanced Semantic Extraction and Guidance for UGC Image Super Resolution

Title: Learning from Reference Answers: Versatile Language Model Alignment without Binary Human Preference Data

Title: TWSSenti: A Novel Hybrid Framework for Topic-Wise Sentiment Analysis on Social Media Using Transformer Models

Title: TAMP: Token-Adaptive Layerwise Pruning in Multimodal Large Language Models

Title: Refining Financial Consumer Complaints through Multi-Scale Model Interaction

Title: Learning to Erase Private Knowledge from Multi-Documents for Retrieval-Augmented Large Language Models

Title: Guiding Reasoning in Small Language Models with LLM Assistance

Title: FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding

Title: KeepKV: Eliminating Output Perturbation in KV Cache Compression for Efficient LLMs Inference

Title: FedRecon: Missing Modality Reconstruction in Distributed Heterogeneous Environments

Title: Omni-Dish: Photorealistic and Faithful Image Generation and Editing for Arbitrary Chinese Dishes

Title: C-MTCSD: A Chinese Multi-Turn Conversational Stance Detection Dataset

Title: Dual-Path Enhancements in Event-Based Eye Tracking: Augmented Robustness and Adaptive Temporal Modeling

Title: Towards Unbiased Federated Graph Learning: Label and Topology Perspectives

Title: Enhancing Multi-task Learning Capability of Medical Generalist Foundation Model via Image-centric Multi-annotation Data

Title: Proofs of Useful Work from Arbitrary Matrix Multiplication

Title: EthCluster: An Unsupervised Static Analysis Method for Ethereum Smart Contract

Title: Turn-taking annotation for quantitative and qualitative analyses of conversation

Title: Correlative and Discriminative Label Grouping for Multi-Label Visual Prompt Tuning

Title: Metric-Guided Synthesis of Class Activation Mapping

Title: Do We Really Need Curated Malicious Data for Safety Alignment in Multi-modal Large Language Models?

Title: GaussVideoDreamer: 3D Scene Generation with Video Diffusion and Inconsistency-Aware Gaussian Splatting

Title: Improving Controller Generalization with Dimensionless Markov Decision Processes

Title: Air Quality Prediction with A Meteorology-Guided Modality-Decoupled Spatio-Temporal Network

Title: Quantifying Privacy Leakage in Split Inference via Fisher-Approximated Shannon Information Analysis

Title: The Mirage of Performance Gains: Why Contrastive Decoding Fails to Address Multimodal Hallucination

Title: Masked Autoencoder Self Pre-Training for Defect Detection in Microelectronics

Title: DataMosaic: Explainable and Verifiable Multi-Modal Data Analytics through Extract-Reason-Verify

Title: Investigating the Role of Bilateral Symmetry for Inpainting Brain MRI

Title: Multi-Object Grounding via Hierarchical Contrastive Siamese Transformers

Title: Hallucination Detection in LLMs via Topological Divergence on Attention Graphs

Title: A Computational Cognitive Model for Processing Repetitions of Hierarchical Relations

Title: Undermining Federated Learning Accuracy in EdgeIoT via Variational Graph Auto-Encoders

Title: Mavors: Multi-granularity Video Representation for Multimodal Large Language Model

Title: Learning to Harmonize Cross-vendor X-ray Images by Non-linear Image Dynamics Correction

Title: CameraBench: Benchmarking Visual Reasoning in MLLMs via Photography

Title: STaRFormer: Semi-Supervised Task-Informed Representation Learning via Dynamic Attention-Based Regional Masking for Sequential Data

Title: Global and Local Mamba Network for Multi-Modality Medical Image Super-Resolution

Title: Benchmarking Practices in LLM-driven Offensive Security: Testbeds, Metrics, and Experiment Design

Title: Universally Composable Commitments with Communicating Malicious Physically Uncloneable Functions

Title: M2S-RoAD: Multi-Modal Semantic Segmentation for Road Damage Using Camera and LiDAR Data

Title: Hierarchical and Step-Layer-Wise Tuning of Attention Specialty for Multi-Instance Synthesis in Diffusion Transformers

Title: SocioVerse: A World Model for Social Simulation Powered by LLM Agents and A Pool of 10 Million Real-World Users

Title: COUNTS: Benchmarking Object Detectors and Multimodal Large Language Models under Distribution Shifts

Title: MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning

Title: WildLive: Near Real-time Visual Wildlife Tracking onboard UAVs

Title: C-FAITH: A Chinese Fine-Grained Benchmark for Automated Hallucination Evaluation

Title: HalluSearch at SemEval-2025 Task 3: A Search-Enhanced RAG Pipeline for Hallucination Detection

Title: Challenges in interpretability of additive models

Title: LLM Unlearning Reveals a Stronger-Than-Expected Coreset Effect in Current Benchmarks

Title: Efficient Generative Model Training via Embedded Representation Warmup

Title: Differentially Private 2D Human Pose Estimation

Title: Localized Cultural Knowledge is Conserved and Controllable in Large Language Models

Title: DioR: Adaptive Cognitive Detection and Contextual Retrieval Optimization for Dynamic Retrieval-Augmented Generation

Title: VibrantLeaves: A principled parametric image generator for training deep restoration models

Title: Balancing Stability and Plasticity in Pretrained Detector: A Dual-Path Framework for Incremental Object Detection

Title: Probing then Editing Response Personality of Large Language Models

Title: ROSFD: Robust Online Streaming Fraud Detection with Resilience to Concept Drift in Data Streams

Title: A Model Zoo of Vision Transformers

Title: CAT: A Conditional Adaptation Tailor for Efficient and Effective Instance-Specific Pansharpening on Real-World Data

Title: MASSeg : 2nd Technical Report for 4th PVUW MOSE Track

Title: XY-Cut++: Advanced Layout Ordering via Hierarchical Mask Mechanism on a Novel Benchmark

Title: Trade-offs in Privacy-Preserving Eye Tracking through Iris Obfuscation: A Benchmarking Study

Title: LMFormer: Lane based Motion Prediction Transformer

Title: DiffMOD: Progressive Diffusion Point Denoising for Moving Object Detection in Remote Sensing

Title: $α$-Flow: A Unified Framework for Continuous-State Discrete Flow Matching Models

Title: ESCT3D: Efficient and Selectively Controllable Text-Driven 3D Content Generation with Gaussian Splatting

Title: Analysis of Attention in Video Diffusion Transformers

Title: Shield Bash: Abusing Defensive Coherence State Retrieval to Break Timing Obfuscation

Title: SlowFastVAD: Video Anomaly Detection via Integrating Simple Detector and RAG-Enhanced Vision-Language Model

Title: LL-Gaussian: Low-Light Scene Reconstruction and Enhancement via Gaussian Splatting for Novel View Synthesis

Title: MorphTok: Morphologically Grounded Tokenization for Indian Languages

Title: Forecasting from Clinical Textual Time Series: Adaptations of the Encoder and Decoder Language Model Families

Title: VisualPuzzles: Decoupling Multimodal Reasoning Evaluation from Domain Knowledge

Title: Benchmarking 3D Human Pose Estimation Models Under Occlusions

Title: Multimodal Representation Learning Techniques for Comprehensive Facial State Analysis

Title: DICE: A Framework for Dimensional and Contextual Evaluation of Language Models

Title: DUE: A Deep Learning Framework and Library for Modeling Unknown Equations

Title: Ctrl-Z: Controlling AI Agents via Resampling

Title: Towards Low-Latency Event-based Obstacle Avoidance on a FPGA-Drone

Title: Satellite Federated Fine-Tuning for Foundation Models in Space Computing Power Networks

Title: Performance of Large Language Models in Supporting Medical Diagnosis and Treatment

Title: LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models

Title: CliniChat: A Multi-Source Knowledge-Driven Framework for Clinical Interview Dialogue Reconstruction and Evaluation

Title: Unchecked and Overlooked: Addressing the Checkbox Blind Spot in Large Language Models with CheckboxQA

Title: Can We Edit LLMs for Long-Tail Biomedical Knowledge?

Title: LLM Can be a Dangerous Persuader: Empirical Study of Persuasion Safety in Large Language Models

Title: MonoDiff9D: Monocular Category-Level 9D Object Pose Estimation via Diffusion Model

Title: Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing

Title: Multimodal Long Video Modeling Based on Temporal Dynamic Context

Title: M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models

Title: Integrating Vision and Location with Transformers: A Multimodal Deep Learning Framework for Medical Wound Analysis

Title: GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents

Title: The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer

Title: Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding

Title: Art3D: Training-Free 3D Generation from Flat-Colored Illustration

Title: MIEB: Massive Image Embedding Benchmark

Title: InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Title: REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers

Title: Decoupled Diffusion Sparks Adaptive Scene Generation

Title: FLOSS: Free Lunch in Open-vocabulary Semantic Segmentation