2025-06-04

Title: Towards Unsupervised Training of Matching-based Graph Edit Distance Solver via Preference-aware GAN

Title: Improvement of AMPs Identification with Generative Adversarial Network and Ensemble Classification

Title: EWGN: Elastic Weight Generation and Context Switching in Deep Learning

Title: Developing a Risk Identification Framework for Foundation Model Uses

Title: An Introduction to Flow Matching and Diffusion Models

Title: RATFM: Retrieval-augmented Time Series Foundation Model for Anomaly Detection

Title: Cycle Consistency as Reward: Learning Image-Text Alignment without Human Preferences

Title: Revisiting LRP: Positional Attribution as the Missing Ingredient for Transformer Explainability

Title: Constrained Sliced Wasserstein Embedding

Title: Diff2Flow: Training Flow Matching Models via Diffusion Model Alignment

Title: Investigating the Impact of Word Informativeness on Speech Emotion Recognition

Title: Motion aware video generative model

Title: Latent Stochastic Interpolants

Title: Sounding Like a Winner? Prosodic Differences in Post-Match Interviews

Title: Improving Knowledge Distillation Under Unknown Covariate Shift Through Confidence-Guided Data Augmentation

Title: MINT: Multimodal Instruction Tuning with Multimodal Interaction Grouping

Title: Absorb and Converge: Provable Convergence Guarantee for Absorbing Discrete Diffusion Models

Title: Medical World Model: Generative Simulation of Tumor Evolution for Treatment Planning

Title: Auto-Labeling Data for Object Detection

Title: Approximate Borderline Sampling using Granular-Ball for Classification Tasks

Title: SFBD Flow: A Continuous-Optimization Framework for Training Diffusion Models with Noisy Samples

Title: Exploring Explanations Improves the Robustness of In-Context Learning

Title: The Devil is in the Darkness: Diffusion-Based Nighttime Dehazing Anchored in Brightness Perception

Title: Modelship Attribution: Tracing Multi-Stage Manipulations Across Generative Models

Title: Revisiting End-to-End Learning with Slide-level Supervision in Computational Pathology

Title: SingaKids: A Multilingual Multimodal Dialogic Tutor for Language Learning

Title: Guiding Registration with Emergent Similarity from Pre-Trained Diffusion Models

Title: Empowering Functional Neuroimaging: A Pre-trained Generative Framework for Unified Representation of Neural Signals

Title: SViMo: Synchronized Diffusion for Video and Motion Generation in Hand-object Interaction Scenarios

Title: ANT: Adaptive Neural Temporal-Aware Text-to-Motion Model

Title: ReSpace: Text-Driven 3D Scene Synthesis and Editing with Preference Alignment

Title: Generative Perception of Shape and Material from Differential Motion

Title: Towards Better De-raining Generalization via Rainy Characteristics Memorization and Replay

Title: Flexiffusion: Training-Free Segment-Wise Neural Architecture Search for Efficient Diffusion Models

Title: LumosFlow: Motion-Guided Long Video Generation

Title: KARE-RAG: Knowledge-Aware Refinement and Enhancement for RAG

Title: RelationAdapter: Learning and Transferring Visual Relation with Diffusion Transformers

Title: MemoryOut: Learning Principal Features via Multimodal Sparse Filtering Network for Semi-supervised Video Anomaly Detection

Title: Rethinking Post-Unlearning Behavior of Large Vision-Language Models

Title: Technical Report for Ego4D Long-Term Action Anticipation Challenge 2025

Title: SurgVLM: A Large Vision-Language Model and Systematic Evaluation Benchmark for Surgical Intelligence

Title: Kernel-based Unsupervised Embedding Alignment for Enhanced Visual Representation in Vision-language Models

Title: DCI: Dual-Conditional Inversion for Boosting Diffusion-Based Image Editing

Title: Prosodic Structure Beyond Lexical Content: A Study of Self-Supervised Learning

Title: Hyperspectral Image Generation with Unmixing Guided Diffusion Model

Title: One-Step Diffusion-based Real-World Image Super-Resolution with Visual Perception Distillation

Title: Simple, Good, Fast: Self-Supervised World Models Free of Baggage

Title: HGOT: Self-supervised Heterogeneous Graph Neural Network with Optimal Transport

Title: Synthetic Iris Image Databases and Identity Leakage: Risks and Mitigation Strategies

Title: ControlMambaIR: Conditional Controls with State-Space Model for Image Restoration

Title: Small Aid, Big Leap: Efficient Test-Time Adaptation for Vision-Language Models with AdaptNet

Title: Solving Inverse Problems with FLAIR

Title: Large-scale Self-supervised Video Foundation Model for Intelligent Surgery

Title: LayoutRAG: Retrieval-Augmented Model for Content-agnostic Conditional Layout Generation

Title: Smoothed Preference Optimization via ReNoise Inversion for Aligning Diffusion Models with Varied Human Preferences

Title: Investigating Mask-aware Prototype Learning for Tabular Anomaly Detection

Title: Exploiting the English Vocabulary Profile for L2 word-level vocabulary assessment with LLMs

Title: FreeScene: Mixed Graph Diffusion for 3D Scene Synthesis from Free Prompts

Title: CART-based Synthetic Tabular Data Generation for Imbalanced Regression

Title: Enhancing Abnormality Identification: Robust Out-of-Distribution Strategies for Deepfake Detection

Title: Pan-Arctic Permafrost Landform and Human-built Infrastructure Feature Detection with Vision Transformers and Location Embeddings

Title: Token and Span Classification for Entity Recognition in French Historical Encyclopedias

Title: Cell-o1: Training LLMs to Solve Single-Cell Reasoning Puzzles with Reinforcement Learning

Title: Towards Auto-Annotation from Annotation Guidelines: A Benchmark through 3D LiDAR Detection

Title: INESC-ID @ eRisk 2025: Exploring Fine-Tuned, Similarity-Based, and Prompt-Based Approaches to Depression Symptom Identification

Title: FORLA:Federated Object-centric Representation Learning with Slot Attention

Title: Expanding before Inferring: Enhancing Factuality in Large Language Models through Premature Layers Interpolation

Title: On the Robustness of Tabular Foundation Models: Test-Time Attacks and In-Context Defenses

Title: Astrophotography turbulence mitigation via generative models

Title: Implicit Regularization of the Deep Inverse Prior Trained with Inertia

Title: DFBench: Benchmarking Deepfake Image Detection Capability of Large Multimodal Models

Title: Sample complexity of Schrödinger potential estimation

Title: Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers

Title: EDITOR: Effective and Interpretable Prompt Inversion for Text-to-Image Diffusion Models

Title: ORV: 4D Occupancy-centric Robot Video Generation

Title: SG2VID: Scene Graphs Enable Fine-Grained Control for Video Synthesis

Title: Retrieval-Augmented Generation as Noisy In-Context Learning: A Unified Theory and Risk Bounds

Title: ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions

Title: Rectified Flows for Fast Multiscale Fluid Flow Modeling

Title: Targeted Forgetting of Image Subgroups in CLIP Models

Title: Controllable Human-centric Keyframe Interpolation with Generative Prior

Title: DCM: Dual-Expert Consistency Model for Efficient and High-Quality Video Generation

Title: AnimeShooter: A Multi-Shot Animation Dataset for Reference-Guided Video Generation

Title: Zero-Shot Time Series Forecasting with Covariates via In-Context Learning

Title: Native-Resolution Image Synthesis

Title: UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Title: Self-Supervised Spatial Correspondence Across Modalities

Title: IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation