2025-11-26

Title: SG-OIF: A Stability-Guided Online Influence Framework for Reliable Vision Data

Title: PrefixGPT: Prefix Adder Optimization by a Generative Pre-trained Transformer

Title: WavefrontDiffusion: Dynamic Decoding Schedule for Improved Reasoning

Title: Pistachio: Towards Synthetic, Balanced, and Long-Form Video Anomaly Benchmarks

Title: Generative Model-Aided Continual Learning for CSI Feedback in FDD mMIMO-OFDM Systems

Title: PeriodNet: Boosting the Potential of Attention Mechanism for Time Series Forecasting

Title: Beyond Binary Classification: A Semi-supervised Approach to Generalized AI-generated Image Detection

Title: Perceptual Taxonomy: Evaluating and Guiding Hierarchical Scene Reasoning in Vision-Language Models

Title: Studying Maps at Scale: A Digital Investigation of Cartography and the Evolution of Figuration

Title: Think First, Assign Next (ThiFAN-VQA): A Two-stage Chain-of-Thought Framework for Post-Disaster Damage Assessment

Title: SPQR: A Standardized Benchmark for Modern Safety Alignment Methods in Text-to-Image Diffusion Models

Title: Leveraging Unlabeled Scans for NCCT Image Segmentation in Early Stroke Diagnosis: A Semi-Supervised GAN Approach

Title: Multiscale Vector-Quantized Variational Autoencoder for Endoscopic Image Synthesis

Title: Learning Massively Multitask World Models for Continuous Control

Title: On the Utility of Foundation Models for Fast MRI: Vision-Language-Guided Image Reconstruction

Title: Synthetic Data: AI's New Weapon Against Android Malware

Title: Navigating Gigapixel Pathology Images with Large Multimodal Models

Title: Demystifying Diffusion Objectives: Reweighted Losses are Better Variational Bounds

Title: TREASURE: A Transformer-Based Foundation Model for High-Volume Transaction Understanding

Title: TiCT: A Synthetically Pre-Trained Foundation Model for Time Series Classification

Title: RADSeg: Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglomerative Models

Title: Efficient Transferable Optimal Transport via Min-Sliced Transport Plans

Title: Leveraging Foundation Models for Histological Grading in Cutaneous Squamous Cell Carcinoma using PathFMTools

Title: Vision-Language Enhanced Foundation Model for Semi-supervised Medical Image Segmentation

Title: One Attention, One Scale: Phase-Aligned Rotary Positional Embeddings for Mixed-Resolution Diffusion Transformer

Title: Terminal Velocity Matching

Title: Training-Free Generation of Diverse and High-Fidelity Images via Prompt Semantic Space Optimization

Title: Rectified SpaAttn: Revisiting Attention Sparsity for Efficient Video Generation

Title: Cisco Time Series Model Technical Report

Title: Face, Whole-Person, and Object Classification in a Unified Space Via The Interleaved Multi-Domain Identity Curriculum

Title: Temporal-Visual Semantic Alignment: A Unified Architecture for Transferring Spatial Priors from Vision Models to Zero-Shot Temporal Tasks

Title: GigaWorld-0: World Models as Data Engine to Empower Embodied AI

Title: Frequency Bias Matters: Diving into Robust and Generalized Deep Image Forgery Detection

Title: Motion Marionette: Rethinking Rigid Motion Transfer via Prior Guidance

Title: Coupled Physics-Gated Adaptation: Spatially Decoding Volumetric Photochemical Conversion in Complex 3D-Printed Objects

Title: Scale Where It Matters: Training-Free Localized Scaling for Diffusion Models

Title: HybriDLA: Hybrid Generation for Document Layout Analysis

Title: Image Diffusion Models Exhibit Emergent Temporal Propagation in Videos

Title: Supervise Less, See More: Training-free Nuclear Instance Segmentation with Prototype-Guided Prompting

Title: GFT-GCN: Privacy-Preserving 3D Face Mesh Recognition with Spectral Diffusion

Title: MambaEye: A Size-Agnostic Visual Encoder with Causal Sequential Processing

Title: HiCoGen: Hierarchical Compositional Text-to-Image Generation in Diffusion Models via Reinforcement Learning

Title: VGGT4D: Mining Motion Cues in Visual Geometry Transformers for 4D Scene Reconstruction

Title: Rethinking Semi-Supervised Node Classification with Self-Supervised Graph Clustering

Title: Operator Learning at Machine Precision

Title: EmoFeedback2: Reinforcement of Continuous Emotional Image Generation via LVLM-based Reward and Textual Feedback

Title: Rethinking Message Passing Neural Networks with Diffusion Distance-guided Stress Majorization

Title: OmniRefiner: Reinforcement-Guided Local Diffusion Refinement

Title: Zero-Shot Transfer Capabilities of the Sundial Foundation Model for Leaf Area Index Forecasting

Title: iRadioDiff: Physics-Informed Diffusion Model for Indoor Radio Map Construction and Localization

Title: MFM-point: Multi-scale Flow Matching for Point Cloud Generation

Title: RED-F: Reconstruction-Elimination based Dual-stream Contrastive Forecasting for Multivariate Time Series Anomaly Prediction

Title: DeLightMono: Enhancing Self-Supervised Monocular Depth Estimation in Endoscopy by Decoupling Uneven Illumination

Title: Explainable Visual Anomaly Detection via Concept Bottleneck Models

Title: EM2LDL: A Multilingual Speech Corpus for Mixed Emotion Recognition through Label Distribution Learning

Title: UltraViCo: Breaking Extrapolation Limits in Video Diffusion Transformers

Title: Restora-Flow: Mask-Guided Image Restoration with Flow Matching

Title: ADNet: A Large-Scale and Extensible Multi-Domain Benchmark for Anomaly Detection Across 380 Real-World Categories

Title: Exo2EgoSyn: Unlocking Foundation Video Generation Models for Exocentric-to-Egocentric Video Synthesis

Title: In-Context Compositional Learning via Sparse Coding Transformer

Title: OmniAlpha: A Sequence-to-Sequence Framework for Unified Multi-Task RGBA Generation

Title: Text-guided Controllable Diffusion for Realistic Camouflage Images Generation

Title: Zoo3D: Zero-Shot 3D Object Detection at Scene Level

Title: The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation

Title: Advancing Image Classification with Discrete Diffusion Classification Modeling

Title: DRL-Guided Neural Batch Sampling for Semi-Supervised Pixel-Level Anomaly Detection

Title: HVAdam: A Full-Dimension Adaptive Optimizer

Title: CrossEarth-Gate: Fisher-Guided Adaptive Tuning Engine for Efficient Adaptation of Cross-Domain Remote Sensing Semantic Segmentation

Title: TReFT: Taming Rectified Flow Models For One-Step Image Translation

Title: VGGTFace: Topologically Consistent Facial Geometry Reconstruction in the Wild

Title: MoRE: Batch-Robust Multi-Omics Representations from Frozen Pre-trained Transformers

Title: FREE: Uncertainty-Aware Autoregression for Parallel Diffusion Transformers

Title: Image-Free Timestep Distillation via Continuous-Time Consistency with Trajectory-Sampled Pairs

Title: BRIC: Bridging Kinematic Plans and Physical Control at Test Time

Title: Diffusion for Fusion: Designing Stellarators with Generative AI

Title: Learning to Generate Human-Human-Object Interactions from Textual Descriptions

Title: Generation, Evaluation, and Explanation of Novelists' Styles with Single-Token Prompts

Title: Look Where It Matters: Training-Free Ultra-HR Remote Sensing VQA via Adaptive Zoom Search

Title: STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flow

Title: Ranking-Enhanced Anomaly Detection Using Active Learning-Assisted Attention Adversarial Dual AutoEncoders

Title: MTBBench: A Multimodal Sequential Clinical Decision-Making Benchmark in Oncology

Title: From One Attack Domain to Another: Contrastive Transfer Learning with Siamese Networks for APT Detection

Title: DesignPref: Capturing Personal Preferences in Visual Design Generation

Title: HBridge: H-Shape Bridging of Heterogeneous Experts for Unified Multimodal Understanding and Generation

Title: Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning

Title: Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward

Title: A Reason-then-Describe Instruction Interpreter for Controllable Video Generation

Title: E2E-GRec: An End-to-End Joint Training Framework for Graph Neural Networks and Recommender Systems

Title: DINO-Tok: Adapting DINO for Visual Tokenizers

Title: Anatomica: Localized Control over Geometric and Topological Properties for Anatomical Diffusion Models

Title: Latent Diffusion Inversion Requires Understanding the Latent Space

Title: Adaptive Hopfield Network: Rethinking Similarities in Associative Memory

Title: ShapeGen: Towards High-Quality 3D Shape Synthesis

Title: MapReduce LoRA: Advancing the Pareto Front in Multi-Preference Optimization for Generative Models

Title: Image2Gcode: Image-to-G-code Generation for Additive Manufacturing Using Diffusion-Transformer Model

Title: MotionV2V: Editing Motion in a Video

Title: PixelDiT: Pixel Diffusion Transformers for Image Generation

Title: Diverse Video Generation with Determinantal Point Process-Guided Policy Optimization

Title: Infinity-RoPE: Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout

Title: MedROV: Towards Real-Time Open-Vocabulary Detection Across Diverse Medical Imaging Modalities

Title: RubricRL: Simple Generalizable Rewards for Text-to-Image Generation