2025-03-04

Title: A Systematic Review of Open Datasets Used in Text-to-Image (T2I) Gen AI Model Safety

Title: I see what you mean: Co-Speech Gestures for Reference Resolution in Multimodal Dialogue

Title: SSL4EO-S12 v1.1: A Multimodal, Multiseasonal Dataset for Pretraining, Updated

Title: PRISM: High-Resolution & Precise Counterfactual Medical Image Generation using Language-guided Stable Diffusion

Title: Llamarine: Open-source Maritime Industry-specific Large Language Model

Title: AnalogGenie: A Generative Engine for Automatic Discovery of Analog Circuit Topologies

Title: Foundation-Model-Boosted Multimodal Learning for fMRI-based Neuropathic Pain Drug Response Prediction

Title: Flow Matching for Medical Image Synthesis: Bridging the Gap Between Speed and Quality

Title: Learning to Animate Images from A Few Videos to Portray Delicate Human Actions

Title: Remasking Discrete Diffusion Models with Inference-Time Scaling

Title: DeepONet Augmented by Randomized Neural Networks for Efficient Operator Learning in PDEs

Title: More of the Same: Persistent Representational Harms Under Increased Representation

Title: SHAZAM: Self-Supervised Change Monitoring for Hazard Detection and Mapping

Title: Solving Instance Detection from an Open-World Perspective

Title: Octopus: Alleviating Hallucination via Dynamic Contrastive Decoding

Title: Approaching the Limits to EFL Writing Enhancement with AI-generated Text and Diverse Learners

Title: MIRROR: Multi-Modal Pathological Self-Supervised Representation Learning via Modality Alignment and Retention

Title: Taming Large Multimodal Agents for Ultra-low Bitrate Semantically Disentangled Image Compression

Title: Auto-encoding Molecules: Graph-Matching Capabilities Matter

Title: Split Adaptation for Pre-trained Vision Transformers

Title: G-OSR: A Comprehensive Benchmark for Graph Open-Set Recognition

Title: Towards High-fidelity 3D Talking Avatar with Personalized Dynamic Texture

Title: Periodic Materials Generation using Text-Guided Joint Diffusion Model

Title: End-To-End Learning of Gaussian Mixture Priors for Diffusion Sampler

Title: GaussianSeal: Rooting Adaptive Watermarks for 3D Gaussian Generation Model

Title: What Makes a Good Diffusion Planner for Decision Making?

Title: Streaming Video Question-Answering with In-context Video KV-Cache Retrieval

Title: Brain Foundation Models: A Survey on Advancements in Neural Signal Processing and Brain Discovery

Title: AesthetiQ: Enhancing Graphic Layout Design via Aesthetic-Aware Preference Alignment of Multi-modal Large Language Models

Title: SolidMark: Evaluating Image Memorization in Generative Models

Title: Synergy Between Sufficient Changes and Sparse Mixing Procedure for Disentangled Representation Learning

Title: How to Probe: Simple Yet Effective Techniques for Improving Post-hoc Explanations

Title: Discrete Codebook World Models for Continuous Control

Title: Transformer Based Self-Context Aware Prediction for Few-Shot Anomaly Detection in Videos

Title: Proteina: Scaling Flow-based Protein Structure Generative Models

Title: OpenECG: Benchmarking ECG Foundation Models with Public 1.2 Million Records

Title: Shazam: Unifying Multiple Foundation Models for Advanced Computational Pathology

Title: FaceShot: Bring Any Character into Life

Title: Confounder-Aware Medical Data Selection for Fine-Tuning Pretrained Vision Models

Title: Dynamic Gradient Sparsification Training for Few-Shot Fine-tuning of CT Lymph Node Segmentation Foundation Model

Title: Edge Prompt Tuning for Graph Neural Networks

Title: Wavelet-Driven Masked Image Modeling: A Path to Efficient Visual Representation

Title: MFM-DA: Instance-Aware Adaptor and Hierarchical Alignment for Efficient Domain Adaptation in Medical Foundation Models

Title: HiMo: High-Speed Objects Motion Compensation in Point Clouds

Title: Toward Stable and Consistent Evaluation Results: A New Methodology for Base Model Evaluation

Title: Foundation Models Secretly Understand Neural Network Weights: Enhancing Hypernetwork Architectures with Foundation Models

Title: CyberCScope: Mining Skewed Tensor Streams and Online Anomaly Detection in Cybersecurity Systems

Title: A Simple and Effective Reinforcement Learning Method for Text-to-Image Diffusion Fine-tuning

Title: From Poses to Identity: Training-Free Person Re-Identification via Feature Centralization

Title: Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Think

Title: Dynamical Diffusion: Learning Temporal Dynamics with Diffusion Models

Title: Using Synthetic Images to Augment Small Medical Image Datasets

Title: Molecule Generation for Target Protein Binding with Hierarchical Consistency Diffusion Model

Title: Underdamped Diffusion Bridges with Applications to Sampling

Title: Data Unlearning in Diffusion Models

Title: Scientific Reasoning: Assessment of Multimodal Generative LLMs

Title: All Roads Lead to Likelihood: The Value of Reinforcement Learning in Fine-Tuning

Title: Depth-Adaptive Graph Neural Networks via Learnable Bakry-'Emery Curvature

Title: Fence Theorem: Preprocessing is Dual-Objective Semantic Structure Isolator in 3D Anomaly Detection

Title: Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator

Title: VideoHandles: Editing 3D Object Compositions in Videos Using Video Generative Priors

Title: WeGen: A Unified Model for Interactive Multimodal Generation as We Chat

Title: ACCORD: Alleviating Concept Coupling through Dependence Regularization for Text-to-Image Diffusion Personalization

Title: DPR: Diffusion Preference-based Reward for Offline Reinforcement Learning

Title: One-shot In-context Part Segmentation

Title: CoInD: Enabling Logical Compositions in Diffusion Models

Title: Unify and Anchor: A Context-Aware Transformer for Cross-Domain Time Series Forecasting

Title: EasyCraft: A Robust and Efficient Framework for Automatic Avatar Crafting

Title: Split Gibbs Discrete Diffusion Posterior Sampling

Title: Med-LEGO: Editing and Adapting toward Generalist Medical Image Diagnosis

Title: Enhancing Vision-Language Compositional Understanding with Multimodal Synthetic Data

Title: Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics

Title: SAR-W-MixMAE: SAR Foundation Model Training Using Backscatter Power Weighting

Title: Language-Assisted Feature Transformation for Anomaly Detection

Title: DifIISR: A Diffusion Model with Gradient Guidance for Infrared Image Super-Resolution

Title: Enhancing Retinal Vessel Segmentation Generalization via Layout-Aware Generative Modelling

Title: Hypergraph Foundation Model

Title: Tera-MIND: Tera-scale mouse brain simulation via spatial mRNA-guided diffusion

Title: Enhancing Network Security Management in Water Systems using FM-based Attack Attribution

Title: OIPR: Evaluation for Time-series Anomaly Detection Inspired by Operator Interest

Title: Reconciling Stochastic and Deterministic Strategies for Zero-shot Image Restoration using Diffusion Model in Dual

Title: SemGeoMo: Dynamic Contextual Human Motion Generation with Semantic and Geometric Guidance

Title: PA-CLIP: Enhancing Zero-Shot Anomaly Detection through Pseudo-Anomaly Awareness

Title: Fine-Grained Controllable Apparel Showcase Image Generation via Garment-Centric Outpainting

Title: MINT: Multi-modal Chain of Thought in Unified Generative Models for Enhanced Image Generation

Title: OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging

Title: CacheQuant: Comprehensively Accelerated Diffusion Models

Title: Jailbreaking Generative AI: Empowering Novices to Conduct Phishing Attacks

Title: Divide and Conquer: Heterogeneous Noise Integration for Diffusion-based Adversarial Purification

Title: DLF: Extreme Image Compression with Dual-generative Latent Fusion

Title: Generative Human Geometry Distribution

Title: Llama-3.1-Sherkala-8B-Chat: An Open Large Language Model for Kazakh

Title: Meta Learning-Driven Iterative Refinement for Robust Anomaly Detection in Industrial Inspection

Title: MRI super-resolution reconstruction using efficient diffusion probabilistic model with residual shifting

Title: EliteKV: Scalable KV Cache Compression via RoPE Frequency Selection and Joint Low-Rank Projection

Title: In-context Learning vs. Instruction Tuning: The Case of Small and Multilingual Language Models

Title: A General Purpose Spectral Foundational Model for Both Proximal and Remote Sensing Spectral Imaging

Title: SparseMamba-PCL: Scribble-Supervised Medical Image Segmentation via SAM-Guided Progressive Collaborative Learning

Title: DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models

Title: ToLo: A Two-Stage, Training-Free Layout-To-Image Generation Framework For High-Overlap Layouts

Title: GRNFormer: A Biologically-Guided Framework for Integrating Gene Regulatory Networks into RNA Foundation Models

Title: Generate, Discriminate, Evolve: Enhancing Context Faithfulness via Fine-Grained Sentence-Level Self-Evolution

Title: KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation

Title: Quality Measures for Dynamic Graph Generative Models

Title: Self-attention-based Diffusion Model for Time-series Imputation in Partial Blackout Scenarios

Title: VideoUFO: A Million-Scale User-Focused Dataset for Text-to-Video Generation

Title: ECG-EmotionNet: Nested Mixture of Expert (NMoE) Adaptation of ECG-Foundation Model for Driver Emotion Recognition

Title: Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models

Title: Beyond Matryoshka: Revisiting Sparse Coding for Adaptive Representation

Title: OFF-CLIP: Improving Normal Detection Confidence in Radiology CLIP with Simple Off-Diagonal Term Auto-Adjustment

Title: On the Power of Context-Enhanced Learning in LLMs

Title: Open-source framework for detecting bias and overfitting for large pathology images

Title: Denoising Functional Maps: Diffusion Models for Shape Correspondence