2025-11-26

Title: Personalized Reward Modeling for Text-to-Image Generation

Title: PrefixGPT: Prefix Adder Optimization by a Generative Pre-trained Transformer

Title: WavefrontDiffusion: Dynamic Decoding Schedule for Improved Reasoning

Title: Pistachio: Towards Synthetic, Balanced, and Long-Form Video Anomaly Benchmarks

Title: Quality analysis and evaluation prediction of RAG retrieval based on machine learning algorithms

Title: Generative Model-Aided Continual Learning for CSI Feedback in FDD mMIMO-OFDM Systems

Title: PeriodNet: Boosting the Potential of Attention Mechanism for Time Series Forecasting

Title: Beyond Binary Classification: A Semi-supervised Approach to Generalized AI-generated Image Detection

Title: Profile Generators: A Link between the Narrative and the Binary Matrix Representation

Title: Single Image to High-Quality 3D Object via Latent Features

Title: VideoChat-M1: Collaborative Policy Planning for Video Understanding via Multi-Agent Reinforcement Learning

Title: Vidi2: Large Multimodal Models for Video Understanding and Creation

Title: Learning to Solve Weighted Maximum Satisfiability with a Co-Training Architecture

Title: Think First, Assign Next (ThiFAN-VQA): A Two-stage Chain-of-Thought Framework for Post-Disaster Damage Assessment

Title: Leveraging Unlabeled Scans for NCCT Image Segmentation in Early Stroke Diagnosis: A Semi-Supervised GAN Approach

Title: Multiscale Vector-Quantized Variational Autoencoder for Endoscopic Image Synthesis

Title: Demystifying Diffusion Objectives: Reweighted Losses are Better Variational Bounds

Title: Efficient Transferable Optimal Transport via Min-Sliced Transport Plans

Title: One Attention, One Scale: Phase-Aligned Rotary Positional Embeddings for Mixed-Resolution Diffusion Transformer

Title: Terminal Velocity Matching

Title: Training-Free Generation of Diverse and High-Fidelity Images via Prompt Semantic Space Optimization

Title: Mosaic Pruning: A Hierarchical Framework for Generalizable Pruning of Mixture-of-Experts Models

Title: ReDirector: Creating Any-Length Video Retakes with Rotary Camera Encoding

Title: Large Language Model Aided Birt-Hogg-Dube Syndrome Diagnosis with Multimodal Retrieval-Augmented Generation

Title: Rectified SpaAttn: Revisiting Attention Sparsity for Efficient Video Generation

Title: 4DWorldBench: A Comprehensive Evaluation Framework for 3D/4D World Generation Models

Title: Temporal-Visual Semantic Alignment: A Unified Architecture for Transferring Spatial Priors from Vision Models to Zero-Shot Temporal Tasks

Title: GigaWorld-0: World Models as Data Engine to Empower Embodied AI

Title: Motion Marionette: Rethinking Rigid Motion Transfer via Prior Guidance

Title: Reasoning-VLA: A Fast and General Vision-Language-Action Reasoning Model for Autonomous Driving

Title: Scale Where It Matters: Training-Free Localized Scaling for Diffusion Models

Title: HybriDLA: Hybrid Generation for Document Layout Analysis

Title: Image Diffusion Models Exhibit Emergent Temporal Propagation in Videos

Title: Low-Resolution Editing is All You Need for High-Resolution Editing

Title: Prompt Fairness: Sub-group Disparities in LLMs

Title: HiCoGen: Hierarchical Compositional Text-to-Image Generation in Diffusion Models via Reinforcement Learning

Title: EmoFeedback2: Reinforcement of Continuous Emotional Image Generation via LVLM-based Reward and Textual Feedback

Title: OmniRefiner: Reinforcement-Guided Local Diffusion Refinement

Title: CREward: A Type-Specific Creativity Reward Model

Title: iRadioDiff: Physics-Informed Diffusion Model for Indoor Radio Map Construction and Localization

Title: SAM-MI: A Mask-Injected Framework for Enhancing Open-Vocabulary Semantic Segmentation with SAM

Title: Tell Model Where to Look: Mitigating Hallucinations in MLLMs by Vision-Guided Attention

Title: MFM-point: Multi-scale Flow Matching for Point Cloud Generation

Title: History-Augmented Contrastive Meta-Learning for Unsupervised Blind Super-Resolution of Planetary Remote Sensing Images

Title: PRADA: Probability-Ratio-Based Attribution and Detection of Autoregressive-Generated Images

Title: QiMeng-CRUX: Narrowing the Gap between Natural Language and Verilog via Core Refined Understanding eXpression

Title: The Devil in the Details: Emergent Misalignment, Format and Coherence in Open-Weights LLMs

Title: Vision-Language Models for Automated 3D PET/CT Report Generation

Title: Restora-Flow: Mask-Guided Image Restoration with Flow Matching

Title: Realizing Fully-Integrated, Low-Power, Event-Based Pupil Tracking with Neuromorphic Hardware

Title: Exo2EgoSyn: Unlocking Foundation Video Generation Models for Exocentric-to-Egocentric Video Synthesis

Title: OmniAlpha: A Sequence-to-Sequence Framework for Unified Multi-Task RGBA Generation

Title: Text-guided Controllable Diffusion for Realistic Camouflage Images Generation

Title: PromptMoG: Enhancing Diversity in Long-Prompt Image Generation via Prompt Embedding Mixture-of-Gaussian Sampling

Title: Zoo3D: Zero-Shot 3D Object Detection at Scene Level

Title: The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation

Title: Bootstrapping Physics-Grounded Video Generation through VLM-Guided Iterative Self-Refinement

Title: Back to the Feature: Explaining Video Classifiers with Video Counterfactual Explanations

Title: MoRE: Batch-Robust Multi-Omics Representations from Frozen Pre-trained Transformers

Title: FREE: Uncertainty-Aware Autoregression for Parallel Diffusion Transformers

Title: A Training-Free Approach for Multi-ID Customization via Attention Adjustment and Spatial Control

Title: Image-Free Timestep Distillation via Continuous-Time Consistency with Trajectory-Sampled Pairs

Title: MajutsuCity: Language-driven Aesthetic-adaptive City Generation with Controllable 3D Assets and Layouts

Title: Block Cascading: Training Free Acceleration of Block-Causal Video Models

Title: BRIC: Bridging Kinematic Plans and Physical Control at Test Time

Title: Diffusion for Fusion: Designing Stellarators with Generative AI

Title: Learning to Generate Human-Human-Object Interactions from Textual Descriptions

Title: Towards Trustworthy Wi-Fi Sensing: Systematic Evaluation of Deep Learning Model Robustness to Adversarial Attacks

Title: STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flow

Title: DesignPref: Capturing Personal Preferences in Visual Design Generation

Title: HBridge: H-Shape Bridging of Heterogeneous Experts for Unified Multimodal Understanding and Generation

Title: Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning

Title: Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward

Title: PhysChoreo: Physics-Controllable Video Generation with Part-Aware Semantic Grounding

Title: A Reason-then-Describe Instruction Interpreter for Controllable Video Generation

Title: DINO-Tok: Adapting DINO for Visual Tokenizers

Title: Anatomica: Localized Control over Geometric and Topological Properties for Anatomical Diffusion Models

Title: Latent Diffusion Inversion Requires Understanding the Latent Space

Title: Adaptive Hopfield Network: Rethinking Similarities in Associative Memory

Title: Can Vibe Coding Beat Graduate CS Students? An LLM vs. Human Coding Tournament on Market-driven Strategic Planning

Title: The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment

Title: ShapeGen: Towards High-Quality 3D Shape Synthesis

Title: MapReduce LoRA: Advancing the Pareto Front in Multi-Preference Optimization for Generative Models

Title: iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation

Title: Image2Gcode: Image-to-G-code Generation for Additive Manufacturing Using Diffusion-Transformer Model

Title: MotionV2V: Editing Motion in a Video

Title: PixelDiT: Pixel Diffusion Transformers for Image Generation

Title: Diverse Video Generation with Determinantal Point Process-Guided Policy Optimization

Title: Infinity-RoPE: Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout

Title: RubricRL: Simple Generalizable Rewards for Text-to-Image Generation