2025-01-13

Title: Tuning-Free Long Video Generation via Global-Local Collaborative Diffusion

Title: FedSA: A Unified Representation Learning via Semantic Anchors for Prototype-based Federated Learning

Title: Generative Flow Networks: Theory and Applications to Structure Learning

Title: Shrink the longest: improving latent space isotropy with symplicial geometry

Title: OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?

Title: HFMF: Hierarchical Fusion Meets Multi-Stream Models for Deepfake Detection

Title: UniQ: Unified Decoder with Task-specific Queries for Efficient Scene Graph Generation

Title: EmotiCrafter: Text-to-Emotional-Image Generation based on Valence-Arousal Model

Title: Element-wise Attention Is All You Need

Title: LLVD: LSTM-based Explicit Motion Modeling in Latent Space for Blind Video Denoising

Title: StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation

Title: StructSR: Refuse Spurious Details in Real-World Image Super-Resolution

Title: Alignment without Over-optimization: Training-Free Solution for Diffusion Models

Title: Diffusion Models for Smarter UAVs: Decision-Making and Modeling

Title: PersonaHOI: Effortlessly Improving Personalized Face with Human-Object Interaction Generation

Title: Poetry in Pixels: Prompt Tuning for Poem Image Generation via Diffusion Models

Title: VideoRAG: Retrieval-Augmented Generation over Video Corpus

Title: Beyond Flat Text: Dual Self-inherited Guidance for Visual Text Generation

Title: DiffuSETS: 12-lead ECG Generation Conditioned on Clinical Text Reports and Patient-Specific Information

Title: Model Inversion in Split Learning for Personalized LLMs: New Insights from Information Bottleneck Theory

Title: Learning to generate feasible graphs using graph grammars

Title: A Holistically Point-guided Text Framework for Weakly-Supervised Camouflaged Object Detection

Title: From discrete-time policies to continuous-time diffusion samplers: Asymptotic equivalences and faster training

Title: GenMol: A Drug Discovery Generalist with Discrete Diffusion

Title: VideoAuteur: Towards Long Narrative Video Generation

Title: Multi-subject Open-set Personalization in Video Generation