2025-03-12

Title: FourierNAT: A Fourier-Mixing-Based Non-Autoregressive Transformer for Parallel Sequence Generation

Title: BrainNet-MoE: Brain-Inspired Mixture-of-Experts Learning for Neurological Disease Identification

Title: The day-ahead scenario generation method for new energy based on an improved conditional generative diffusion model

Title: TS-RAG: Retrieval-Augmented Generation based Time Series Foundation Models are Stronger Zero-Shot Forecaster

Title: MergeQuant: Accurate 4-bit Static Quantization of Large Language Models by Channel-wise Calibration

Title: Disrupting Model Merging: A Parameter-Level Defense Without Sacrificing Accuracy

Title: RayFlow: Instance-Aware Diffusion Acceleration via Adaptive Flow Trajectories

Title: Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model

Title: Blind Video Super-Resolution based on Implicit Kernels

Title: Can Generative Geospatial Diffusion Models Excel as Discriminative Geospatial Foundation Models?

Title: FunGraph: Functionality Aware 3D Scene Graphs for Language-Prompted Scene Interaction

Title: Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Title: Counterfactual Explanations for Model Ensembles Using Entropic Risk Measures

Title: CAD-VAE: Leveraging Correlation-Aware Latents for Comprehensive Fair Disentanglement

Title: STRMs: Spatial Temporal Reasoning Models for Vision-Based Localization Rivaling GPS Precision

Title: BUFFER-X: Towards Zero-Shot Point Cloud Registration in Diverse Scenes

Title: Recent Advances in Hypergraph Neural Networks

Title: 7ABAW-Compound Expression Recognition via Curriculum Learning

Title: Regulatory DNA sequence Design with Reinforcement Learning

Title: DiffEGG: Diffusion-Driven Edge Generation as a Pixel-Annotation-Free Alternative for Instance Annotation

Title: CDI3D: Cross-guided Dense-view Interpolation for 3D Reconstruction

Title: Exploring Bias in over 100 Text-to-Image Generative Models

Title: GPT-PPG: A GPT-based Foundation Model for Photoplethysmography Signals

Title: HOFAR: High-Order Augmentation of Flow Autoregressive Transformers

Title: SphOR: A Representation Learning Perspective on Open-set Recognition for Identifying Unknown Classes in Deep Learning Models

Title: Unmasking the Unknown: Facial Deepfake Detection in the Open-Set Paradigm

Title: Seeing Beyond Haze: Generative Nighttime Image Dehazing

Title: PRISM: Privacy-Preserving Improved Stochastic Masking for Federated Generative Models

Title: MegaSR: Mining Customized Semantics and Expressive Guidance for Image Super-Resolution

Title: ACE: Concept Editing in Diffusion Models without Performance Degradation

Title: Convergence Dynamics and Stabilization Strategies of Co-Evolving Generative Models

Title: Uni$\textbf{F}^2$ace: Fine-grained Face Understanding and Generation with Unified Multimodal Models

Title: Toward Stable World Models: Measuring and Addressing World Instability in Generative Environments

Title: MGHanD: Multi-modal Guidance for authentic Hand Diffusion

Title: FlowDPS: Flow-Driven Posterior Sampling for Inverse Problems

Title: FilmComposer: LLM-Driven Music Production for Silent Film Clips

Title: Few-Shot Class-Incremental Model Attribution Using Learnable Representation From CLIP-ViT Features

Title: Depth-Assisted Network for Indiscernible Marine Object Counting with Adaptive Motion-Differentiated Feature Encoding

Title: WISA: World Simulator Assistant for Physics-Aware Text-to-Video Generation

Title: Towards Large-scale Chemical Reaction Image Parsing via a Multimodal Large Language Model

Title: U-StyDiT: Ultra-high Quality Artistic Style Transfer Using Diffusion Transformers

Title: Concept-Driven Deep Learning for Enhanced Protein-Specific Molecular Generation

Title: Multimodal Generation of Animatable 3D Human Models with AvatarForge

Title: Towards Synthesized and Editable Motion In-Betweening Through Part-Wise Phase Representation

Title: A Cascading Cooperative Multi-agent Framework for On-ramp Merging Control Integrating Large Language Models

Title: ExMAG: Learning of Maximally Ancestral Graphs

Title: Aligning Text to Image in Diffusion Models is Easier Than You Think

Title: SARA: Structural and Adversarial Representation Alignment for Training-efficient Diffusion Models

Title: DexGrasp Anything: Towards Universal Robotic Dexterous Grasping with Physics Awareness

Title: Adv-CPG: A Customized Portrait Generation Framework with Facial Adversarial Attacks

Title: HERO: Human Reaction Generation from Videos

Title: OminiControl2: Efficient Conditioning for Diffusion Transformers

Title: D3PO: Preference-Based Alignment of Discrete Diffusion Models

Title: Feature Alignment with Equivariant Convolutions for Burst Image Super-Resolution

Title: $^R$FLAV: Rolling Flow matching for infinite Audio Video generation

Title: Pathology-Aware Adaptive Watermarking for Text-Driven Medical Image Synthesis

Title: Robust Latent Matters: Boosting Image Generation with Sampling Error

Title: Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens

Title: Recognition-Synergistic Scene Text Editing

Title: Controlling Latent Diffusion Using Latent CLIP

Title: Generalizable AI-Generated Image Detection Based on Fractal Self-Similarity in the Spectrum

Title: Learning to Match Unpaired Data with Minimum Entropy Coupling

Title: DISTINGUISH Workflow: A New Paradigm of Dynamic Well Placement Using Generative Machine Learning

Title: High-Quality 3D Head Reconstruction from Any Single Portrait Image

Title: 3D Point Cloud Generation via Autoregressive Up-sampling

Title: Tuning-Free Multi-Event Long Video Generation via Synchronized Coupled Sampling

Title: LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization

Title: MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention

Title: REGEN: Learning Compact Video Embedding with (Re-)Generative Decoder

Title: Understanding and Mitigating Distribution Shifts For Machine Learning Force Fields

Title: OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting

Title: OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models