2025-10-28

Title: Agro-Consensus: Semantic Self-Consistency in Vision-Language Models for Crop Disease Management in Developing Countries

Title: Proportion and Perspective Control for Flow-Based Image Generation

Title: H2OFlow: Grounding Human-Object Affordances with 3D Generative Models and Dense Diffused Flows

Title: OCR-Quality: A Human-Annotated Dataset for OCR Quality Assessment

Title: Face-MakeUpV2: Facial Consistency Learning for Controllable Text-to-Image Generation

Title: Online Mixture of Experts: No-Regret Learning for Optimal Collective Decision-Making

Title: Exploring the design space of diffusion and flow models for data fusion

Title: Variance-Reduction Guidance: Sampling Trajectory Optimization for Diffusion Models

Title: 2D_3D Feature Fusion via Cross-Modal Latent Synthesis and Attention Guided Restoration for Industrial Anomaly Detection

Title: Embodied Navigation with Auxiliary Task of Action Description Prediction

Title: Comparative Analysis of Object Detection Algorithms for Surface Defect Detection

Title: SITS-DECO: A Generative Decoder Is All You Need For Multitask Satellite Image Time Series Modelling

Title: Wavelet-based GAN Fingerprint Detection using ResNet50

Title: A Flow Model with Low-Rank Transformers for Incomplete Multimodal Survival Analysis

Title: Restoring Pruned Large Language Models via Lost Component Compensation

Title: A Multimodal, Multitask System for Generating E Commerce Text Listings from Images

Title: Improving the Physics of Video Generation with VJEPA-2 Reward Signal

Title: KARIPAP: Quantum-Inspired Tensor Network Compression of Large Language Models Using Infinite Projected Entangled Pair States and Tensor Renormalization Group

Title: SynCast: Synergizing Contradictions in Precipitation Nowcasting via Diffusion Sequential Preference Optimization

Title: SCoPE VLM: Selective Context Processing for Efficient Document Navigation in Vision-Language Models

Title: Poisson Flow Consistency Training

Title: The Mirror Loop: Recursive Non-Convergence in Generative Reasoning Systems

Title: Generative AI in Depth: A Survey of Recent Advances, Model Variants, and Real-World Applications

Title: The Principles of Diffusion Models

Title: Parallel Sampling from Masked Diffusion Models via Conditional Independence Testing

Title: Sprint: Sparse-Dense Residual Fusion for Efficient Diffusion Transformers

Title: FlowOpt: Fast Optimization Through Whole Flow Processes for Training-Free Editing

Title: Linearized Optimal Transport for Analysis of High-Dimensional Point-Cloud and Single-Cell Data

Title: PF$Δ$: A Benchmark Dataset for Power Flow under Load, Generation, and Topology Variations

Title: Capturing Gaze Shifts for Guidance: Cross-Modal Fusion Enhancement for VLM Hallucination Mitigation

Title: MAGIC-Flow: Multiscale Adaptive Conditional Flows for Generation and Interpretable Classification

Title: Discovering Latent Graphs with GFlowNets for Diverse Conditional Image Generation

Title: GRAID: Enhancing Spatial Reasoning of VLMs Through High-Fidelity Data Generation

Title: I2-NeRF: Learning Neural Radiance Fields Under Physically-Grounded Media Interactions

Title: HARMONY: Hidden Activation Representations and Model Output-Aware Uncertainty Estimation for Vision-Language Models

Title: Scaling Non-Parametric Sampling with Representation

Title: MOGRAS: Human Motion with Grasping in 3D Scenes

Title: LongCat-Video Technical Report

Title: GRPO-Guard: Mitigating Implicit Over-Optimization in Flow Matching via Regulated Clipping

Title: Moving Beyond Diffusion: Hierarchy-to-Hierarchy Autoregression for fMRI-to-Image Reconstruction

Title: GeoDiffusion: A Training-Free Framework for Accurate 3D Geometric Conditioning in Image Generation

Title: T2SMark: Balancing Robustness and Diversity in Noise-as-Watermark for Diffusion Models

Title: Top-Down Semantic Refinement for Image Captioning

Title: Benchmarking Egocentric Multimodal Goal Inference for Assistive Wearable Agents

Title: DynaPose4D: High-Quality 4D Dynamic Content Generation via Pose Alignment Loss

Title: LAMP: Data-Efficient Linear Affine Weight-Space Models for Parameter-Controlled 3D Shape Generation and Extrapolation

Title: Accelerating Materials Design via LLM-Guided Evolutionary Search

Title: CANDI: Hybrid Discrete-Continuous Diffusion Models

Title: Open Multimodal Retrieval-Augmented Factual Image Generation

Title: AesCrop: Aesthetic-driven Cropping Guided by Composition

Title: SRSR: Enhancing Semantic Accuracy in Real-World Image Super-Resolution with Spatially Re-Focused Text-Conditioning

Title: FAPO: Flawed-Aware Policy Optimization for Efficient and Reliable Reasoning

Title: FastVLM: Self-Speculative Decoding for Fast Vision-Language Model Inference

Title: Variational Polya Tree

Title: RoboSVG: A Unified Framework for Interactive SVG Generation with Multi-modal Guidance

Title: FlowCritic: Bridging Value Estimation with Flow Matching in Reinforcement Learning

Title: Windsock is Dancing: Adaptive Multimodal Retrieval-Augmented Generation

Title: S-Chain: Structured Visual Chain-of-Thought For Medicine

Title: Cross-view Localization and Synthesis - Datasets, Challenges and Opportunities

Title: A Theory of the Mechanics of Information: Generalization Through Measurement of Uncertainty (Learning is Measuring)

Title: MAGIC-Talk: Motion-aware Audio-Driven Talking Face Generation with Customizable Identity Control

Title: Logical GANs: Adversarial Learning through Ehrenfeucht Fraisse Games

Title: Semantic Surgery: Zero-Shot Concept Erasure in Diffusion Models

Title: Encoder-Decoder Diffusion Language Models for Efficient Training and Inference

Title: A Review of End-to-End Precipitation Prediction Using Remote Sensing Data: from Divination to Machine Learning

Title: Seeing the Unseen: Towards Zero-Shot Inspection for Wind Turbine Blades using Knowledge-Augmented Vision Language Models

Title: Limits of Generative Pre-Training in Structured EMR Trajectories with Irregular Sampling

Title: Transforming volcanic monitoring: A dataset and benchmark for onboard volcano activity detection

Title: On the Anisotropy of Score-Based Generative Models

Title: Towards Personalized Treatment Plan: Geometrical Model-Agnostic Approach to Counterfactual Explanations

Title: Simple Denoising Diffusion Language Models

Title: Diffuse to Detect: A Generalizable Framework for Anomaly Detection with Diffusion Models Applications to UAVs and Beyond

Title: RL-AUX: Reinforcement Learning for Auxiliary Task Generation

Title: LightBagel: A Light-weighted, Double Fusion Framework for Unified Multimodal Understanding and Generation

Title: VALA: Learning Latent Anchors for Training-Free and Temporally Consistent

Title: Scaling Up Occupancy-centric Driving Scene Generation: Dataset and Method

Title: SceneDecorator: Towards Scene-Oriented Story Generation with Scene Planning and Scene Consistency

Title: CoMo: Compositional Motion Customization for Text-to-Video Generation

Title: M$^{3}$T2IBench: A Large-Scale Multi-Category, Multi-Instance, Multi-Relation Text-to-Image Benchmark

Title: UniAIDet: A Unified and Universal Benchmark for AI-Generated Image Content Detection and Localization

Title: Towards Stable and Effective Reinforcement Learning for Mixture-of-Experts

Title: Nested AutoRegressive Models

Title: LLM Meets Diffusion: A Hybrid Framework for Crystal Material Generation

Title: Residual Diffusion Bridge Model for Image Restoration

Title: Task-Agnostic Fusion of Time Series and Imagery for Earth Observation

Title: Enabling Vibration-Based Gesture Recognition on Everyday Furniture via Energy-Efficient FPGA Implementation of 1D Convolutional Networks

Title: Increasing LLM Coding Capabilities through Diverse Synthetic Coding Tasks

Title: Accelerating Eigenvalue Dataset Generation via Chebyshev Subspace Filter

Title: Through the Lens: Benchmarking Deepfake Detectors Against Moiré-Induced Distortions

Title: Autoregressive Styled Text Image Generation, but Make it Reliable

Title: A Novel Framework for Multi-Modal Protein Representation Learning

Title: Adaptive Stochastic Coefficients for Accelerating Diffusion Sampling

Title: ReconViaGen: Towards Accurate Multi-view 3D Object Reconstruction via Generation

Title: Robust Non-negative Proximal Gradient Algorithm for Inverse Problems

Title: Symbolic Neural Generation with Applications to Lead Discovery in Drug Design

Title: An Efficient Remote Sensing Super Resolution Method Exploring Diffusion Priors and Multi-Modal Constraints for Crop Type Mapping

Title: The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation

Title: Mixed Precision Training of Neural ODEs

Title: FreeFuse: Multi-Subject LoRA Fusion via Auto Masking at Test Time

Title: More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models

Title: Lookahead Anchoring: Preserving Character Identity in Audio-Driven Human Animation

Title: FARMER: Flow AutoRegressive Transformer over Pixels

Title: PRISM-Bench: A Benchmark of Puzzle-Based Visual Tasks with CoT Error Detection

Title: Track, Inpaint, Resplat: Subject-driven 3D and 4D Generation with Progressive Texture Infilling

Title: Variational Masked Diffusion Models