2026-03-13

Title: H2LooP Spark Preview: Continual Pretraining of Large Language Models for Low-Level Embedded Systems Code

Title: Frequency-Modulated Visual Restoration for Matryoshka Large Multimodal Models

Title: On the Robustness of Langevin Dynamics to Score Function Error

Title: UniCompress: Token Compression for Unified Vision-Language Understanding and Generation

Title: UNet-AF: An alias-free UNet for image restoration

Title: Towards Trustworthy Selective Generation: Reliability-Guided Diffusion for Ultra-Low-Field to High-Field MRI Synthesis

Title: Jailbreak Scaling Laws for Large Language Models: Polynomial-Exponential Crossover

Title: ShotVerse: Advancing Cinematic Camera Control for Text-Driven Multi-Shot Video Creation

Title: Follow the Saliency: Supervised Saliency for Retrieval-augmented Dense Video Captioning

Title: OrthoEraser: Coupled-Neuron Orthogonal Projection for Concept Erasure

Title: KEPo: Knowledge Evolution Poison on Graph-based Retrieval-Augmented Generation

Title: LongFlow: Efficient KV Cache Compression for Reasoning M

Title: Gen-Fab: A Variation-Aware Generative Model for Predicting Fabrication Variations in Nanophotonic Devices

Title: MDS-VQA: Model-Informed Data Selection for Video Quality Assessment

Title: CFD-HAR: User-controllable Privacy through Conditional Feature Disentanglement

Title: Risk-Controllable Multi-View Diffusion for Driving Scenario Generation

Title: Multi-Task Anti-Causal Learning for Reconstructing Urban Events from Residents' Reports

Title: PCA-Enhanced Probabilistic U-Net for Effective Ambiguous Medical Image Segmentation

Title: MANSION: Multi-floor lANguage-to-3D Scene generatIOn for loNg-horizon tasks

Title: Enhancing Image Aesthetics with Dual-Conditioned Diffusion Models Guided by Multimodal Perception

Title: WeEdit: A Dataset, Benchmark and Glyph-Guided Framework for Text-centric Image Editing

Title: LaMoGen: Language to Motion Generation Through LLM-Guided Symbolic Inference

Title: DyWeight: Dynamic Gradient Weighting for Few-Step Diffusion Sampling

Title: Shape-of-You: Fused Gromov-Wasserstein Optimal Transport for Semantic Correspondence in-the-Wild

Title: Personalized Federated Learning via Gaussian Generative Modeling

Title: Developing Foundation Models for Universal Segmentation from 3D Whole-Body Positron Emission Tomography

Title: MV-SAM3D: Adaptive Multi-View Fusion for Layout-Aware 3D Generation

Title: Tokenization Allows Multimodal Large Language Models to Understand, Generate and Edit Architectural Floor Plans

Title: BackdoorIDS: Zero-shot Backdoor Detection for Pretrained Vision Encoder

Title: PROMO: Promptable Outfitting for Efficient High-Fidelity Virtual Try-On

Title: UCAN: Unified Convolutional Attention Network for Expansive Receptive Fields in Lightweight Super-Resolution

Title: PolyCrysDiff: Controllable Generation of Three-Dimensional Computable Polycrystalline Material Structures

Title: OSCBench: Benchmarking Object State Change in Text-to-Video Generation

Title: SoulX-LiveAct: Towards Hour-Scale Real-Time Human Animation with Neighbor Forcing and ConvKV Memory

Title: Controllable Egocentric Video Generation via Occlusion-Aware Sparse 3D Hand Joints

Title: Language Generation with Replay: A Learning-Theoretic View of Model Collapse

Title: Towards High-Fidelity CAD Generation via LLM-Driven Program Generation and Text-Based B-Rep Primitive Grounding

Title: A Decade of Generative Adversarial Networks for Porous Material Reconstruction

Title: Derain-Agent: A Plug-and-Play Agent Framework for Rainy Image Restoration

Title: Think While Watching: Online Streaming Segment-Level Memory for Multi-Turn Video Reasoning in Multimodal Large Language Models

Title: Causal Representation Learning with Optimal Compression under Complex Treatments

Title: EnTransformer: A Deep Generative Transformer for Multivariate Probabilistic Forecasting

Title: InSpatio-WorldFM: An Open-Source Real-Time Generative Frame Model

Title: MobileKernelBench: Can LLMs Write Efficient Kernels for Mobile Devices?

Title: Statistical and structural identifiability in representation learning

Title: HomeSafe-Bench: Evaluating Vision-Language Models on Unsafe Action Detection for Embodied Agents in Household Scenarios

Title: Ada3Drift: Adaptive Training-Time Drifting for One-Step 3D Visuomotor Robotic Manipulation

Title: Flowcean - Model Learning for Cyber-Physical Systems

Title: Nyxus: A Next Generation Image Feature Extraction Library for the Big Data and AI Era

Title: Efficient Generative Modeling with Unitary Matrix Product States Using Riemannian Optimization

Title: Single Pixel Image Classification using an Ultrafast Digital Light Projector

Title: Slow-Fast Inference: Training-Free Inference Acceleration via Within-Sentence Support Stability

Title: Coarse-Guided Visual Generation via Weighted h-Transform Sampling

Title: Towards Universal Computational Aberration Correction in Photographic Cameras: A Comprehensive Benchmark Analysis

Title: EvoTok: A Unified Image Tokenizer via Residual Latent Evolution for Visual Understanding and Generation

Title: Hoi3DGen: Generating High-Quality Human-Object-Interactions in 3D

Title: Automatic Generation of High-Performance RL Environments

Title: FlashMotion: Few-Step Controllable Video Generation with Trajectory Guidance

Title: GlyphBanana: Advancing Precise Text Rendering Through Agentic Workflows

Title: A Quantitative Characterization of Forgetting in Post-Training

Title: ForensicZip: More Tokens are Better but Not Necessary in Forensic Vision-Language Models

Title: Real-World Point Tracking with Verifier-Guided Pseudo-Labeling

Title: SceneAssistant: A Visual Feedback Agent for Open-Vocabulary 3D Scene Generation

Title: BiGain: Unified Token Compression for Joint Generation and Classification

Title: Separable neural architectures as a primitive for unified predictive and generative intelligence

Title: One Model, Many Budgets: Elastic Latent Interfaces for Diffusion Transformers

Title: Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation

Title: DVD: Deterministic Video Depth Estimation with Generative Priors

Title: The Latent Color Subspace: Emergent Order in High-Dimensional Chaos

Title: GRADE: Benchmarking Discipline-Informed Reasoning in Image Editing

Title: MM-CondChain: A Programmatically Verified Benchmark for Visually Grounded Deep Compositional Reasoning

Title: EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation