2025-10-31

Title: HiMAE: Hierarchical Masked Autoencoders Discover Resolution-Specific Structure in Wearable Time Series

Title: SHA-256 Infused Embedding-Driven Generative Modeling of High-Energy Molecules in Low-Data Regimes

Title: MedVLSynther: Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs

Title: MIRO: MultI-Reward cOnditioned pretraining improves T2I quality and efficiency

Title: Generative Image Restoration and Super-Resolution using Physics-Informed Synthetic Data for Scanning Tunneling Microscopy

Title: SplitFlow: Flow Decomposition for Inversion-Free Text-to-Image Editing

Title: Towards Scaling Laws for Symbolic Regression

Title: New Money: A Systematic Review of Synthetic Data Generation for Finance

Title: Security Risk of Misalignment between Text and Image in Multi-modal Model

Title: OracleAgent: A Multimodal Reasoning Agent for Oracle Bone Script Research

Title: FullPart: Generating each 3D Part at Full Resolution

Title: BasicAVSR: Arbitrary-Scale Video Super-Resolution via Image Priors and Enhanced Motion Compensation

Title: CRAG-MM: Multi-modal Multi-turn Comprehensive RAG Benchmark

Title: Sketch2PoseNet: Efficient and Generalized Sketch to 3D Human Pose Prediction

Title: OmniLayout: Enabling Coarse-to-Fine Learning with LLMs for Universal Document Layout Generation

Title: Likely Interpolants of Generative Models

Title: Revisiting Generative Infrared and Visible Image Fusion Based on Human Cognitive Laws

Title: Distributional Multi-objective Black-box Optimization for Diffusion-model Inference-time Multi-Target Generation

Title: Beyond Imitation: Constraint-Aware Trajectory Generation with Flow Matching For End-to-End Autonomous Driving

Title: GLYPH-SR: Can We Achieve Both High-Quality Image Super-Resolution and High-Fidelity Text Recovery via VLM-guided Latent Diffusion Model?

Title: Efficient Generative AI Boosts Probabilistic Forecasting of Sudden Stratospheric Warmings

Title: EEG-Driven Image Reconstruction with Saliency-Guided Diffusion Models

Title: LoCoT2V-Bench: A Benchmark for Long-Form and Complex Text-to-Video Generation

Title: Co-Evolving Latent Action World Models

Title: ReSpec: Towards Optimizing Speculative Decoding in Reinforcement Learning Systems

Title: Quantum Gated Recurrent GAN with Gaussian Uncertainty for Network Anomaly Detection

Title: Polybasic Speculative Decoding Through a Theoretical Perspective

Title: Emu3.5: Native Multimodal Models are World Learners

Title: ResMatching: Noise-Resilient Computational Super-Resolution via Guided Conditional Flow Matching

Title: All You Need for Object Detection: From Pixels, Points, and Prompts to Next-Gen Fusion and Multimodal LLMs/VLMs in Autonomous Vehicles

Title: LSM-MS2: A Foundation Model Bridging Spectral Identification and Biological Interpretation

Title: STaMP: Sequence Transformation and Mixed Precision for Low-Precision Activation Quantization

Title: Surpassing state of the art on AMD area estimation from RGB fundus images through careful selection of U-Net architectures and loss functions for class imbalance

Title: Clone Deterministic 3D Worlds with Geometrically-Regularized World Models

Title: The Quest for Generalizable Motion Generation: Data, Model, and Evaluation

Title: SEE4D: Pose-Free 4D Generation via Auto-Regressive Video Inpainting

Title: OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes

Title: Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark