2025-05-28

Title: Joint-stochastic-approximation Random Fields with Application to Semi-supervised Learning

Title: Decision Flow Policy Optimization

Title: FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation

Title: GraLoRA: Granular Low-Rank Adaptation for Parameter-Efficient Fine-Tuning

Title: What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models

Title: Time Series Generation Under Data Scarcity: A Unified Generative Modeling Approach

Title: DIPO: Dual-State Images Controlled Articulated Object Generation Powered by Diverse Data

Title: WeatherEdit: Controllable Weather Editing with 4D Gaussian Field

Title: ControlTac: Force- and Position-Controlled Tactile Data Augmentation with a Single Reference Image

Title: MultLFG: Training-free Multi-LoRA composition using Frequency-domain Guidance

Title: Causality and "In-the-Wild" Video-Based Person Re-ID: A Survey

Title: Ctrl-DNA: Controllable Cell-Type-Specific Regulatory DNA Design via Constrained RL

Title: ConsiStyle: Style Diversity in Training-Free Consistent T2I Generation

Title: Incorporating Flexible Image Conditioning into Text-to-Video Diffusion Models without Training

Title: Open-Det: An Efficient Learning Framework for Open-Ended Detection

Title: Scan-and-Print: Patch-level Data Summarization and Augmentation for Content-aware Layout Generation in Poster Design

Title: Photography Perspective Composition: Towards Aesthetic Perspective Recommendation

Title: Accelerating RL for LLM Reasoning with Optimal Advantage Regression

Title: Generating Hypotheses of Dynamic Causal Graphs in Neuroscience: Leveraging Generative Factor Models of Observed Time Series

Title: Hierarchical Instruction-aware Embodied Visual Tracking

Title: LeDiFlow: Learned Distribution-guided Flow Matching to Accelerate Image Generation

Title: Uni-Instruct: One-step Diffusion Model through Unified Diffusion Divergence Instruction

Title: ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval

Title: MetaSlot: Break Through the Fixed Number of Slots in Object-Centric Learning

Title: Integrating Intermediate Layer Optimization and Projected Gradient Descent for Solving Inverse Problems with Diffusion Models

Title: Rendering-Aware Reinforcement Learning for Vector Graphics Generation

Title: Not All That's Rare Is Lost: Causal Paths to Rare Concept Synthesis

Title: Frame-Level Captions for Long Video Generation with Complex Multi Scenes

Title: Exploring Timeline Control for Facial Motion Generation

Title: Generalizable Heuristic Generation Through Large Language Models with Meta-Optimization

Title: Create Anything Anywhere: Layout-Controllable Personalized Diffusion Model for Multiple Subjects

Title: Geometry-Editable and Appearance-Preserving Object Composition

Title: ISAC: Training-Free Instance-to-Semantic Attention Control for Improving Multi-Instance Generation

Title: OrienText: Surface Oriented Textual Image Generation

Title: DreamBoothDPO: Improving Personalized Generation using Direct Preference Optimization

Title: BIPNN: Learning to Solve Binary Integer Programming via Hypergraph Neural Networks

Title: Facial Attribute Based Text Guided Face Anonymization

Title: RainFusion: Adaptive Video Generation Acceleration via Multi-Dimensional Visual Redundancy

Title: Advancing high-fidelity 3D and Texture Generation with 2.5D latents

Title: Inverse Virtual Try-On: Generating Multi-Category Product-Style Images from Clothed Individuals

Title: Minute-Long Videos with Dual Parallelisms

Title: DisasterM3: A Remote Sensing Vision-Language Dataset for Disaster Damage Assessment and Response

Title: Instance Data Condensation for Image Super-Resolution

Title: Conditional Diffusion Models with Classifier-Free Gibbs-like Guidance

Title: Differentiable Solver Search for Fast Diffusion Sampling

Title: Learning Single Index Models with Diffusion Priors

Title: SageAttention2++: A More Efficient Implementation of SageAttention2

Title: Predicting Phishing Websites Using Support Vector Machine and MultiClass Classification Based on Association Rule Techniques

Title: FastFace: Tuning Identity Preservation in Distilled Diffusion via Guidance and Attention

Title: STEB: In Search of the Best Evaluation Approach for Synthetic Time Series

Title: Topological Deep Learning for Speech Data

Title: Latent label distribution grid representation for modeling uncertainty

Title: PoisonSwarm: Universal Harmful Information Synthesis via Model Crowdsourcing

Title: 3D-UIR: 3D Gaussian for Underwater 3D Scene Reconstruction via Physics-Based Appearance-Medium Decoupling

Title: Copresheaf Topological Neural Networks: A Generalized Deep Learning Framework

Title: Plenodium: UnderWater 3D Scene Reconstruction with Plenoptic Medium Representation

Title: DiMoSR: Feature Modulation via Multi-Branch Dilated Convolutions for Efficient Image Super-Resolution

Title: UGCE: User-Guided Incremental Counterfactual Exploration

Title: OVERT: A Benchmark for Over-Refusal Evaluation on Text-to-Image Models

Title: Empowering Vector Graphics with Consistently Arbitrary Viewing and View-dependent Visibility

Title: A Convergence Theory for Diffusion Language Models: An Information-Theoretic Perspective

Title: Designing Cyclic Peptides via Harmonic SDE with Atom-Bond Modeling

Title: Mitigating Hallucination in Large Vision-Language Models via Adaptive Attention Calibration

Title: DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction

Title: Policy Optimized Text-to-Image Pipeline Design

Title: MV-CoLight: Efficient Object Compositing with Consistent Lighting and Shadow Generation

Title: Be Decisive: Noise-Induced Layouts for Multi-Subject Generation

Title: Frame In-N-Out: Unbounded Controllable Image-to-Video Generation

Title: Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment

Title: Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers