2025-05-09

Title: Lay-Your-Scene: Natural Scene Layout Generation with Diffusion Transformers

Title: When Bad Data Leads to Good Models

Title: Replay to Remember (R2R): An Efficient Uncertainty-driven Unsupervised Continual Learning Framework Using Generative Replay

Title: Guide your favorite protein sequence generative model

Title: Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers

Title: ConCISE: Confidence-guided Compression in Step-by-step Efficient Reasoning

Title: Cross-Branch Orthogonality for Improved Generalization in Face Deepfake Detection

Title: Clustering with Communication: A Variational Framework for Single Cell Representation Learning

Title: OWT: A Foundational Organ-Wise Tokenization Framework for Medical Imaging

Title: SpatialPrompting: Keyframe-driven Zero-Shot Spatial Reasoning with Off-the-Shelf Multimodal Large Language Models

Title: GlyphMastero: A Glyph Encoder for High-Fidelity Scene Text Editing

Title: Canny2Palm: Realistic and Controllable Palmprint Generation for Large-scale Pre-training

Title: Building-Guided Pseudo-Label Learning for Cross-Modal Building Damage Mapping

Title: T2VTextBench: A Human Evaluation Benchmark for Textual Control in Video Generation Models

Title: Graffe: Graph Representation Learning via Diffusion Probabilistic Models

Title: CAG-VLM: Fine-Tuning of a Large-Scale Model to Recognize Angiographic Images for Next-Generation Diagnostic Systems

Title: ReAlign: Bilingual Text-to-Motion Generation via Step-Aware Reward-Guided Alignment

Title: Generating Reliable Synthetic Clinical Trial Data: The Role of Hyperparameter Optimization and Domain Constraints

Title: Generative Models for Long Time Series: Approximately Equivariant Recurrent Network Structures for an Adjusted Training Scheme

Title: SOAP: Style-Omniscient Animatable Portraits

Title: CodeMixBench: Evaluating Large Language Models on Code Generation with Code-Mixed Prompts

Title: PIDiff: Image Customization for Personalized Identities with Diffusion Models

Title: ItDPDM: Information-Theoretic Discrete Poisson Diffusion Model

Title: EAM: Enhancing Anything with Diffusion Transformers for Blind Super-Resolution

Title: Diffusion Model Quantization: A Review

Title: GFlowNets for Active Learning Based Resource Allocation in Next Generation Wireless Networks

Title: Does CLIP perceive art the same way we do?

Title: Joint Super-Resolution and Segmentation for 1-m Impervious Surface Area Mapping in China's Yangtze River Economic Belt

Title: TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation

Title: Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding

Title: Flow-GRPO: Training Flow Matching Models via Online RL

Title: Mogao: An Omni Foundation Model for Interleaved Multi-Modal Generation

Title: 3D Scene Generation: A Survey

Title: SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation