2025-10-28

Title: A Feature Engineering Approach for Business Impact-Oriented Failure Detection in Distributed Instant Payment Systems

Title: Proportion and Perspective Control for Flow-Based Image Generation

Title: H2OFlow: Grounding Human-Object Affordances with 3D Generative Models and Dense Diffused Flows

Title: Noise Aggregation Analysis Driven by Small-Noise Injection: Efficient Membership Inference for Diffusion Models

Title: Online Mixture of Experts: No-Regret Learning for Optimal Collective Decision-Making

Title: Exploring the design space of diffusion and flow models for data fusion

Title: Variance-Reduction Guidance: Sampling Trajectory Optimization for Diffusion Models

Title: 2D_3D Feature Fusion via Cross-Modal Latent Synthesis and Attention Guided Restoration for Industrial Anomaly Detection

Title: Xihe: Scalable Zero-Shot Time Series Learner Via Hierarchical Interleaved Block Attention

Title: It Takes Two to Tango: Two Parallel Samplers Improve Quality in Diffusion Models for Limited Steps

Title: SITS-DECO: A Generative Decoder Is All You Need For Multitask Satellite Image Time Series Modelling

Title: Wavelet-based GAN Fingerprint Detection using ResNet50

Title: A Flow Model with Low-Rank Transformers for Incomplete Multimodal Survival Analysis

Title: A Multimodal, Multitask System for Generating E Commerce Text Listings from Images

Title: Improving the Physics of Video Generation with VJEPA-2 Reward Signal

Title: KARIPAP: Quantum-Inspired Tensor Network Compression of Large Language Models Using Infinite Projected Entangled Pair States and Tensor Renormalization Group

Title: SynCast: Synergizing Contradictions in Precipitation Nowcasting via Diffusion Sequential Preference Optimization

Title: Poisson Flow Consistency Training

Title: The Mirror Loop: Recursive Non-Convergence in Generative Reasoning Systems

Title: Generative AI in Depth: A Survey of Recent Advances, Model Variants, and Real-World Applications

Title: The Principles of Diffusion Models

Title: AutoSciDACT: Automated Scientific Discovery through Contrastive Embedding and Hypothesis Testing

Title: Towards Low-Latency and Adaptive Ransomware Detection Using Contrastive Learning

Title: Parallel Sampling from Masked Diffusion Models via Conditional Independence Testing

Title: Sprint: Sparse-Dense Residual Fusion for Efficient Diffusion Transformers

Title: LiteDiff

Title: FlowOpt: Fast Optimization Through Whole Flow Processes for Training-Free Editing

Title: Linearized Optimal Transport for Analysis of High-Dimensional Point-Cloud and Single-Cell Data

Title: Human-Centric Anomaly Detection in Surveillance Videos Using YOLO-World and Spatio-Temporal Deep Learning

Title: Capturing Gaze Shifts for Guidance: Cross-Modal Fusion Enhancement for VLM Hallucination Mitigation

Title: MAGIC-Flow: Multiscale Adaptive Conditional Flows for Generation and Interpretable Classification

Title: Discovering Latent Graphs with GFlowNets for Diverse Conditional Image Generation

Title: GRAID: Enhancing Spatial Reasoning of VLMs Through High-Fidelity Data Generation

Title: CogStereo: Neural Stereo Matching with Implicit Spatial Cognition Embedding

Title: LOC: A General Language-Guided Framework for Open-Set 3D Occupancy Prediction

Title: Attention Residual Fusion Network with Contrast for Source-free Domain Adaptation

Title: I2-NeRF: Learning Neural Radiance Fields Under Physically-Grounded Media Interactions

Title: TPPR: APT Tactic / Technique Pattern Guided Attack Path Reasoning for Attack Investigation

Title: Scaling Non-Parametric Sampling with Representation

Title: LongCat-Video Technical Report

Title: Diffusion-Driven Two-Stage Active Learning for Low-Budget Semantic Segmentation

Title: DiffusionLane: Diffusion Model for Lane Detection

Title: LUNA: Efficient and Topology-Agnostic Foundation Model for EEG Signal Analysis

Title: Adapting Noise-Driven PUF and AI for Secure WBG ICS: A Proof-of-Concept Study

Title: Supervised Fine-Tuning or In-Context Learning? Evaluating LLMs for Clinical NER

Title: AnyECG-Lab: An Exploration Study of Fine-tuning an ECG Foundation Model to Estimate Laboratory Values from Single-Lead ECG Signals

Title: GRPO-Guard: Mitigating Implicit Over-Optimization in Flow Matching via Regulated Clipping

Title: Beyond Augmentation: Leveraging Inter-Instance Relation in Self-Supervised Representation Learning

Title: Monitoring State Transitions in Markovian Systems with Sampling Cost

Title: Moving Beyond Diffusion: Hierarchy-to-Hierarchy Autoregression for fMRI-to-Image Reconstruction

Title: GeoDiffusion: A Training-Free Framework for Accurate 3D Geometric Conditioning in Image Generation

Title: EndoSfM3D: Learning to 3D Reconstruct Any Endoscopic Surgery Scene using Self-supervised Foundation Model

Title: T2SMark: Balancing Robustness and Diversity in Noise-as-Watermark for Diffusion Models

Title: Benchmarking Egocentric Multimodal Goal Inference for Assistive Wearable Agents

Title: DynaPose4D: High-Quality 4D Dynamic Content Generation via Pose Alignment Loss

Title: Accelerating Materials Design via LLM-Guided Evolutionary Search

Title: CANDI: Hybrid Discrete-Continuous Diffusion Models

Title: SRSR: Enhancing Semantic Accuracy in Real-World Image Super-Resolution with Spatially Re-Focused Text-Conditioning

Title: FAPO: Flawed-Aware Policy Optimization for Efficient and Reliable Reasoning

Title: DDTR: Diffusion Denoising Trace Recovery

Title: From Pixels to Views: Learning Angular-Aware and Physics-Consistent Representations for Light Field Microscopy

Title: Pedagogy-driven Evaluation of Generative AI-powered Intelligent Tutoring Systems

Title: PSScreen V2: Partially Supervised Multiple Retinal Disease Screening

Title: Projection Embedded Diffusion Bridge for CT Reconstruction from Incomplete Data

Title: SWAN: Self-supervised Wavelet Neural Network for Hyperspectral Image Unmixing

Title: CLEANet: Robust and Efficient Anomaly Detection in Contaminated Multivariate Time Series

Title: DeepfakeBench-MM: A Comprehensive Benchmark for Multimodal Deepfake Detection

Title: Self-Attention Decomposition For Training Free Diffusion Editing

Title: Variational Polya Tree

Title: Learning Without Augmenting: Unsupervised Time Series Representation Learning via Frame Projections

Title: Conjugate Relation Modeling for Few-Shot Knowledge Graph Completion

Title: SARCLIP: A Vision Language Foundation Model for Semantic Understanding and Target Recognition in SAR Imagery

Title: FlowCritic: Bridging Value Estimation with Flow Matching in Reinforcement Learning

Title: VADTree: Explainable Training-Free Video Anomaly Detection via Hierarchical Granularity-Aware Tree

Title: WaveMAE: Wavelet decomposition Masked Auto-Encoder for Remote Sensing

Title: Cross-view Localization and Synthesis - Datasets, Challenges and Opportunities

Title: Beyond Semantics: How Temporal Biases Shape Retrieval in Transformer and State-Space Models

Title: Distributionally Robust Optimization via Diffusion Ambiguity Modeling

Title: A Theory of the Mechanics of Information: Generalization Through Measurement of Uncertainty (Learning is Measuring)

Title: MAGIC-Talk: Motion-aware Audio-Driven Talking Face Generation with Customizable Identity Control

Title: Exploration of Summarization by Generative Language Models for Automated Scoring of Long Essays

Title: Clustering by Denoising: Latent plug-and-play diffusion for single-cell data

Title: Semantic-Preserving Cross-Style Visual Reasoning for Robust Multi-Modal Understanding in Large Vision-Language Models

Title: Semantic Surgery: Zero-Shot Concept Erasure in Diffusion Models

Title: Encoder-Decoder Diffusion Language Models for Efficient Training and Inference

Title: A Comprehensive Dataset for Human vs. AI Generated Text Detection

Title: Limits of Generative Pre-Training in Structured EMR Trajectories with Irregular Sampling

Title: On the Anisotropy of Score-Based Generative Models

Title: Simple Denoising Diffusion Language Models

Title: Diffuse to Detect: A Generalizable Framework for Anomaly Detection with Diffusion Models Applications to UAVs and Beyond

Title: Survey of Multimodal Geospatial Foundation Models: Techniques, Applications, and Challenges

Title: VALA: Learning Latent Anchors for Training-Free and Temporally Consistent

Title: Scaling Up Occupancy-centric Driving Scene Generation: Dataset and Method

Title: Can Language Models Compose Skills In-Context?

Title: Understanding In-Context Learning Beyond Transformers: An Investigation of State Space and Hybrid Architectures

Title: M$^{3}$T2IBench: A Large-Scale Multi-Category, Multi-Instance, Multi-Relation Text-to-Image Benchmark

Title: UniAIDet: A Unified and Universal Benchmark for AI-Generated Image Content Detection and Localization

Title: Nested AutoRegressive Models

Title: A high-capacity linguistic steganography based on entropy-driven rank-token mapping

Title: LLM Meets Diffusion: A Hybrid Framework for Crystal Material Generation

Title: A Survey on LLM Mid-training

Title: Sampling from Energy distributions with Target Concrete Score Identity

Title: Residual Diffusion Bridge Model for Image Restoration

Title: Implicit Modeling for Transferability Estimation of Vision Foundation Models

Title: Finding 3D Scene Analogies with Multimodal Foundation Models

Title: Evaluation of Vision-LLMs in Surveillance Video

Title: Are ASR foundation models generalized enough to capture features of regional dialects for low-resource languages?

Title: Privacy-Preserving Semantic Communication over Wiretap Channels with Learnable Differential Privacy

Title: Adaptive Stochastic Coefficients for Accelerating Diffusion Sampling

Title: ReconViaGen: Towards Accurate Multi-view 3D Object Reconstruction via Generation

Title: Multitask Multimodal Self-Supervised Learning for Medical Images

Title: GRAD: Real-Time Gated Recurrent Anomaly Detection in Autonomous Vehicle Sensors Using Reinforced EMA and Multi-Stage Sliding Window Techniques

Title: ZeroFlood: A Geospatial Foundation Model for Data-Efficient Flood Susceptibility Mapping

Title: An Efficient Remote Sensing Super Resolution Method Exploring Diffusion Priors and Multi-Modal Constraints for Crop Type Mapping

Title: Symmetria: A Synthetic Dataset for Learning in Point Clouds

Title: Towards Generalisable Foundation Models for 3D Brain MRI

Title: CURVETE: Curriculum Learning and Progressive Self-supervised Training for Medical Image Classification

Title: Omni-Reward: Towards Generalist Omni-Modal Reward Modeling with Free-Form Preferences

Title: Adaptive Dual Prompting: Hierarchical Debiasing for Fairness-aware Graph Neural Networks

Title: T-REGS: Minimum Spanning Tree Regularization for Self-Supervised Learning

Title: Yesnt: Are Diffusion Relighting Models Ready for Capture Stage Compositing? A Hybrid Alternative to Bridge the Gap

Title: Mixed Precision Training of Neural ODEs

Title: Towards Deep Physics-Informed Kolmogorov-Arnold Networks

Title: FreeFuse: Multi-Subject LoRA Fusion via Auto Masking at Test Time

Title: More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models

Title: FARMER: Flow AutoRegressive Transformer over Pixels

Title: Think Twice: Branch-and-Rethink Reasoning Reward Model

Title: Track, Inpaint, Resplat: Subject-driven 3D and 4D Generation with Progressive Texture Infilling

Title: Variational Masked Diffusion Models

Title: Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations