2026-03-31

Title: GeoBlock: Inferring Block Granularity from Dependency Geometry in Diffusion Language Models

Title: A Multimodal Deep Learning Framework for Edema Classification Using HCT and Clinical Data

Title: Language-Conditioned World Modeling for Visual Navigation

Title: Motion Semantics Guided Normalizing Flow for Privacy-Preserving Video Anomaly Detection

Title: From Diffusion To Flow: Efficient Motion Generation In MotionGPT3

Title: Survey on Remote Sensing Scene Classification: From Traditional Methods to Large Generative AI Models

Title: Generating Synthetic Wildlife Health Data from Camera Trap Imagery: A Pipeline for Alopecia and Body Condition Training Data

Title: Physics-Aware Diffusion for LiDAR Point Cloud Densification

Title: A training-free framework for high-fidelity appearance transfer via diffusion transformers

Title: Aesthetic Assessment of Chinese Handwritings Based on Vision Language Models

Title: LogicDiff: Logic-Guided Denoising Improves Reasoning in Masked Diffusion Language Models

Title: Learning to Select Visual In-Context Demonstrations

Title: TED: Training-Free Experience Distillation for Multimodal Reasoning

Title: Can We Change the Stroke Size for Easier Diffusion?

Title: Elucidating the Design Space of Flow Matching for Cellular Microscopy

Title: Gaussian Joint Embeddings For Self-Supervised Representation Learning

Title: Epileptic Seizure Prediction Using Patient-Adaptive Transformer Networks

Title: Throughput Optimization as a Strategic Lever in Large-Scale AI Systems: Evidence from Dataloader and Memory Profiling Innovations

Title: Central-to-Local Adaptive Generative Diffusion Framework for Improving Gene Expression Prediction in Data-Limited Spatial Transcriptomics

Title: Envisioning global urban development with satellite imagery and generative AI

Title: VAN-AD: Visual Masked Autoencoder with Normalizing Flow For Time Series Anomaly Detection

Title: Beyond Textual Knowledge: Leveraging Multimodal Knowledge Bases for Enhancing Vision-and-Language Navigation

Title: LACON: Training Text-to-Image Model from Uncurated Data

Title: Property-Guided Molecular Generation and Optimization via Latent Flows

Title: Strategic Candidacy in Generative AI Arenas

Title: Leveraging Avatar Fingerprinting: A Multi-Generator Photorealistic Talking-Head Public Database and Benchmark

Title: RASPRef: Retrieval-Augmented Self-Supervised Prompt Refinement for Large Reasoning Models

Title: Generative Shape Reconstruction with Geometry-Guided Langevin Dynamics

Title: Unified Number-Free Text-to-Motion Generation Via Flow Matching

Title: Unsupervised Behavioral Compression: Learning Low-Dimensional Policy Manifolds through State-Occupancy Matching

Title: MOOZY: A Patient-First Foundation Model for Computational Pathology

Title: Liquid Networks with Mixture Density Heads for Efficient Imitation Learning

Title: ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding

Title: LightCtrl: Training-free Controllable Video Relighting

Title: SceneExpander: Expanding 3D Scenes with Free-Form Inserted Views

Title: EFlow: Fast Few-Step Video Generator Training from Scratch via Efficient Solution Flow

Title: PRUE: A Practical Recipe for Field Boundary Segmentation at Scale

Title: Semantic Interaction Information mediates compositional generalization in latent space

Title: Spectral-Aware Text-to-Time Series Generation with Billion-Scale Multimodal Meteorological Data

Title: RiskProp: Collision-Anchored Self-Supervised Risk Propagation for Early Accident Anticipation

Title: MEDIC-AD: Towards Medical Vision-Language Model's Clinical Intelligence

Title: Reasoning-Driven Anomaly Detection and Localization with Image-Level Supervision

Title: MotionRFT: Unified Reinforcement Fine-Tuning for Text-to-Motion Generation

Title: Let Triggers Control: Frequency-Aware Dropout for Effective Token Control

Title: Make It Up: Fake Images, Real Gains in Generalized Few-shot Semantic Segmentation

Title: LightMover: Generative Light Movement with Color and Intensity Controls

Title: NimbusGS: Unified 3D Scene Reconstruction under Hybrid Weather

Title: TrendGen: An Outfit Recommendation and Display System

Title: TrackMAE: Video Representation Learning via Track Mask and Predict

Title: TerraSeg: Self-Supervised Ground Segmentation for Any LiDAR

Title: Culturally Adaptive Explainable LLM Assessment for Multilingual Information Disorder: A Human-in-the-Loop Approach

Title: HMPDM: A Diffusion Model for Driving Video Prediction with Historical Motion Priors

Title: Active In-Context Learning for Tabular Foundation Models

Title: K-Means Based TinyML Anomaly Detection and Distributed Model Reuse via the Distributed Internet of Learning (DIoL)

Title: The Geometry of Harmful Intent: Training-Free Anomaly Detection via Angular Deviation in LLM Residual Streams

Title: Mind the Shape Gap: A Benchmark and Baseline for Deformation-Aware 6D Pose Estimation of Agricultural Produce

Title: GIFT: Bootstrapping Image-to-CAD Program Synthesis via Geometric Feedback

Title: LOME: Learning Human-Object Manipulation with Action-Conditioned Egocentric World Model

Title: FlowRL: A Taxonomy and Modular Framework for Reinforcement Learning with Diffusion Policies

Title: From None to All: Self-Supervised 3D Reconstruction via Novel View Synthesis

Title: Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development

Title: Transferring Physical Priors into Remote Sensing Segmentation via Large Language Models

Title: Understanding Semantic Perturbations on In-Processing Generative Image Watermarks

Title: SPROUT: A Scalable Diffusion Foundation Model for Agricultural Vision

Title: LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Title: Annotation-Free Detection of Drivable Areas and Curbs Leveraging LiDAR Point Cloud Maps

Title: PANDORA: Pixel-wise Attention Dissolution and Latent Guidance for Zero-Shot Object Removal

Title: STRIDE: When to Speak Meets Sequence Denoising for Streaming Video Understanding

Title: You Only Erase Once: Erasing Anything without Bringing Unexpected Content

Title: On the Asymptotics of Self-Supervised Pre-training: Two-Stage M-Estimation and Representation Symmetry

Title: OPRO: Orthogonal Panel-Relative Operators for Panel-Aware In-Context Image Generation

Title: OpenDPR: Open-Vocabulary Change Detection via Vision-Centric Diffusion-Guided Prototype Retrieval for Remote Sensing Imagery

Title: Test-Time Instance-Specific Parameter Composition: A New Paradigm for Adaptive Generative Modeling

Title: Gated Condition Injection without Multimodal Attention: Towards Controllable Linear-Attention Transformers

Title: CrossHGL: A Text-Free Foundation Model for Cross-Domain Heterogeneous Graph Learning

Title: LVRPO: Language-Visual Alignment with GRPO for Multimodal Understanding and Generation

Title: Can Unsupervised Segmentation Reduce Annotation Costs for Video Semantic Segmentation?

Title: Synergizing Discriminative Exemplars and Self-Refined Experience for MLLM-based In-Context Learning in Medical Diagnosis

Title: AI-Powered Facial Mask Removal Is Not Suitable For Biometric Identification

Title: When Surfaces Lie: Exploiting Wrinkle-Induced Attention Shift to Attack Vision-Language Models

Title: What-If Explanations Over Time: Counterfactuals for Time Series Classification

Title: Diversity Matters: Dataset Diversification and Dual-Branch Network for Generalized AI-Generated Image Detection

Title: Towards Context-Aware Image Anonymization with Multi-Agent Reasoning

Title: Poppy: Polarization-based Plug-and-Play Guidance for Enhancing Monocular Normal Estimation

Title: FlashSign: Pose-Free Guidance for Efficient Sign Language Video Generation

Title: Physics-Guided Transformer (PGT): Physics-Aware Attention Mechanism for PINNs

Title: Scaling Atomistic Protein Binder Design with Generative Pretraining and Test-Time Compute

Title: MathGen: Revealing the Illusion of Mathematical Competence through Text-to-Image Generation

Title: Beyond Dataset Distillation: Lossless Dataset Concentration via Diffusion-Assisted Distribution Alignment

Title: From Independent to Correlated Diffusion: Generalized Generative Modeling with Probabilistic Computers

Title: Diffusion Maps is not Dimensionality Reduction

Title: Seeing the Unseen: Rethinking Illicit Promotion Detection with In-Context Learning

Title: Drift-AR: Single-Step Visual Autoregressive Generation via Anti-Symmetric Drifting

Title: Physics-Embedded Feature Learning for AI in Medical Imaging

Title: From Vessel Trajectories to Safety-Critical Encounter Scenarios: A Generative AI Framework for Autonomous Ship Digital Testing

Title: GEMS: Agent-Native Multimodal Generation with Memory and Skills

Title: To View Transform or Not to View Transform: NeRF-based Pre-training Perspective

Title: Attention Frequency Modulation: Training-Free Spectral Modulation of Diffusion Cross-Attention

Title: SVGS: Single-View to 3D Object Editing via Gaussian Splatting

Title: RecycleLoRA: Rank-Revealing QR-Based Dual-LoRA Subspace Adaptation for Domain Generalized Semantic Segmentation

Title: ObjectMorpher: 3D-Aware Image Editing via Deformable 3DGS Models

Title: ColorFLUX: A Structure-Color Decoupling Framework for Old Photo Colorization

Title: ToLL: Topological Layout Learning with Structural Multi-view Augmentation for 3D Scene Graph Pretraining

Title: "Versteasch du mi?" Computational and Socio-Linguistic Perspectives on GenAI, LLMs, and Non-Standard Language

Title: Ghost-FWL: A Large-Scale Full-Waveform LiDAR Dataset for Ghost Detection and Removal

Title: Detecting the Unexpected: AI-Driven Anomaly Detection in Smart Bridge Monitoring

Title: DiffAttn: Diffusion-Based Drivers' Visual Attention Prediction with LLM-Enhanced Semantic Reasoning

Title: MR-ImagenTime: Multi-Resolution Time Series Generation through Dual Image Representations

Title: FI-KAN: Fractal Interpolation Kolmogorov-Arnold Networks

Title: DinoDental: Benchmarking DINOv3 as a Unified Vision Encoder for Dental Image Analysis

Title: NeiGAD: Augmenting Graph Anomaly Detection via Spectral Neighbor Information

Title: Taming the Instability: A Robust Second-Order Optimizer for Federated Learning over Non-IID Data

Title: Integrating Multimodal Large Language Model Knowledge into Amodal Completion

Title: AutoCut: End-to-end advertisement video editing based on multimodal discretization and controllable generation

Title: Rethinking Structure Preservation in Text-Guided Image Editing with Visual Autoregressive Models

Title: EdgeDiT: Hardware-Aware Diffusion Transformers for Efficient On-Device Image Generation

Title: Evolutionary Discovery of Reinforcement Learning Algorithms via Large Language Models

Title: $R_{dm}$: Re-conceptualizing Distribution Matching as a Reward for Diffusion Distillation

Title: INSID3: Training-Free In-Context Segmentation with DINOv3

Title: ConceptWeaver: Weaving Disentangled Concepts with Flow

Title: Generalizable Detection of AI Generated Images with Large Models and Fuzzy Decision Tree

Title: Detecting low left ventricular ejection fraction from ECG using an interpretable and scalable predictor-driven framework

Title: "What Did It Actually Do?": Understanding Risk Awareness and Traceability for Computer-Use Agents

Title: Unrestrained Simplex Denoising for Discrete Data. A Non-Markovian Approach Applied to Graph Generation

Title: ORSIFlow: Saliency-Guided Rectified Flow for Optical Remote Sensing Salient Object Detection

Title: Unsafe2Safe: Controllable Image Anonymization for Downstream Utility

Title: TGIF2: Extended Text-Guided Inpainting Forgery Dataset & Benchmark

Title: Interpretable Ensemble Learning for Network Traffic Anomaly Detection: A SHAP-based Explainable AI Framework for Embedded Systems Security

Title: Safeguarding LLMs Against Misuse and AI-Driven Malware Using Steganographic Canaries

Title: Industrial3D: A Terrestrial LiDAR Point Cloud Dataset and Cross-Paradigm Benchmark for Industrial Infrastructure

Title: DreamLite: A Lightweight On-Device Unified Model for Image Generation and Editing

Title: Stepwise Credit Assignment for GRPO on Flow-Matching Models

Title: See it to Place it: Evolving Macro Placements with Vision-Language Models

Title: On-the-fly Repulsion in the Contextual Space for Rich Diversity in Diffusion Transformers

Title: PoseDreamer: Scalable and Photorealistic Human Data Generation Pipeline with Diffusion Models

Title: Geometry-aware similarity metrics for neural representations on Riemannian and statistical manifolds

Title: HandX: Scaling Bimanual Motion and Interaction Generation