2025-02-13

Title: CrossVideoMAE: Self-Supervised Image-Video Representation Learning with Masked Autoencoders

Title: Satellite Observations Guided Diffusion Model for Accurate Meteorological States at Arbitrary Resolution

Title: Pre-Trained Video Generative Models as World Simulators

Title: Preference Alignment on Diffusion Model: A Comprehensive Survey for Image Generation and Editing

Title: Captured by Captions: On Memorization and its Mitigation in CLIP Models

Title: TranSplat: Surface Embedding-guided 3D Gaussian Splatting for Transparent Object Manipulation

Title: Spread them Apart: Towards Robust Watermarking of Generated Content

Title: Technical note on calibrating vision-language models under covariate shift

Title: Understanding Classifier-Free Guidance: High-Dimensional Theory and Non-Linear Generalizations

Title: MRS: A Fast Sampler for Mean Reverting Diffusion based on ODE and SDE Solvers

Title: MAAT: Mamba Adaptive Anomaly Transformer with association discrepancy for time series

Title: TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation

Title: Elevating Legal LLM Responses: Harnessing Trainable Logical Structures and Semantic Knowledge with Legal Reasoning

Title: SurGrID: Controllable Surgical Simulation via Scene Graph to Image Diffusion

Title: Federated Self-supervised Domain Generalization for Label-efficient Polyp Segmentation

Title: Generative Risk Minimization for Out-of-Distribution Generalization on Graphs

Title: A Survey of In-Context Reinforcement Learning

Title: Towards Training One-Step Diffusion Models Without Distillation

Title: Greed is Good: Guided Generation from a Greedy Perspective

Title: The Geometry of Prompting: Unveiling Distinct Mechanisms of Task Adaptation in Language Models

Title: Franken-Adapter: Cross-Lingual Adaptation of LLMs by Embedding Surgery

Title: Out-of-Distribution Detection on Graphs: A Survey

Title: PoGDiff: Product-of-Gaussians Diffusion Models for Imbalanced Text-to-Image Generation

Title: In-Context Learning of Linear Dynamical Systems with Transformers: Error Bounds and Depth-Separation

Title: Force Matching with Relativistic Constraints: A Physics-Inspired Approach to Stable and Efficient Generative Modeling

Title: DNNs May Determine Major Properties of Their Outputs Early, with Timing Possibly Driven by Bias

Title: ActiveSSF: An Active-Learning-Guided Self-Supervised Framework for Long-Tailed Megakaryocyte Classification

Title: Equivariant Masked Position Prediction for Efficient Molecular Representation

Title: FloVD: Optical Flow Meets Video Diffusion Model for Enhanced Camera-Controlled Video Synthesis

Title: UniCoRN: Unified Commented Retrieval Network with LMMs

Title: GenIAS: Generator for Instantiating Anomalies in time Series

Title: HDT: Hierarchical Discrete Transformer for Multivariate Time Series Forecasting

Title: Screener: Self-supervised Pathology Segmentation Model for 3D Medical Images

Title: Foundation Models in Computational Pathology: A Review of Challenges, Opportunities, and Impact

Title: Top-Theta Attention: Sparsifying Transformers by Compensated Thresholding

Title: A Survey on Pre-Trained Diffusion Model Distillations

Title: One-Shot Federated Learning with Classifier-Free Diffusion Models

Title: Explanation based In-Context Demonstrations Retrieval for Multilingual Grammatical Error Correction

Title: FedMHO: Heterogeneous One-Shot Federated Learning Towards Resource-Constrained Edge Devices

Title: LLMs can implicitly learn from mistakes in-context

Title: Human-Centric Foundation Models: Perception, Generation and Agentic Modeling

Title: Brain Latent Progression: Individual-based Spatiotemporal Disease Progression on 3D Brain MRIs via Latent Diffusion

Title: Ultrasound Image Generation using Latent Diffusion Models

Title: Light-A-Video: Training-free Video Relighting via Progressive Light Fusion

Title: Enhancing Diffusion Models Efficiency by Disentangling Total-Variance and Signal-to-Noise Ratio

Title: CurvGAD: Leveraging Curvature for Enhanced Graph Anomaly Detection

Title: Continuous Cardiac Arrest Prediction in ICU using PPG Foundation Model

Title: CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation

Title: SwiftSketch: A Diffusion Model for Image-to-Vector Sketch Generation