2025-08-08

Title: LumiGen: An LVLM-Enhanced Iterative Framework for Fine-Grained Text-to-Image Generation

Title: Edge-Assisted Collaborative Fine-Tuning for Multi-User Personalized Artificial Intelligence Generated Content (AIGC)

Title: Enhancing Dialogue Annotation with Speaker Characteristics Leveraging a Frozen LLM

Title: CoMAD: A Multiple-Teacher Self-Supervised Distillation Framework

Title: Single-Step Reconstruction-Free Anomaly Detection and Segmentation via Diffusion Models

Title: Unified Flow Matching for Long Horizon Event Forecasting

Title: Multi-Stage Knowledge-Distilled VGAE and GAT for Robust Controller-Area-Network Intrusion Detection

Title: Retrieval-Augmented Water Level Forecasting for Everglades

Title: Extending Foundational Monocular Depth Estimators to Fisheye Cameras with Calibration Tokens

Title: AdvDINO: Domain-Adversarial Self-Supervised Representation Learning for Spatial Proteomics

Title: MENDR: Manifold Explainable Neural Data Representations

Title: Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast Image Compression

Title: Propagating Sparse Depth via Depth Foundation Model for Out-of-Distribution Depth Completion

Title: A Novel Image Similarity Metric for Scene Composition Structure

Title: DualMat: PBR Material Estimation via Coherent Dual-Path Diffusion

Title: Automatic Image Colorization with Convolutional Neural Networks and Generative Adversarial Networks

Title: FLUX-Makeup: High-Fidelity, Identity-Consistent, and Robust Makeup Transfer via Diffusion Transformer

Title: AdaFusion: Prompt-Guided Inference with Adaptive Fusion of Pathology Foundation Models

Title: Cold Start Active Preference Learning in Socio-Economic Domains

Title: PoseGen: In-Context LoRA Finetuning for Pose-Controllable Long Human Video Generation

Title: Attention Basin: Why Contextual Position Matters in Large Language Models

Title: FAITH: A Framework for Assessing Intrinsic Tabular Hallucinations in finance

Title: Segmenting the Complex and Irregular in Two-Phase Flows: A Real-World Empirical Study with SAM2

Title: ArbiViewGen: Controllable Arbitrary Viewpoint Camera Data Generation for Autonomous Driving via Stable Diffusion Models

Title: CF3: Compact and Fast 3D Feature Fields

Title: SGDFuse: SAM-Guided Diffusion for High-Fidelity Infrared and Visible Image Fusion

Title: FlowState: Sampling Rate Invariant Time Series Forecasting

Title: SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens

Title: Textual Inversion for Efficient Adaptation of Open-Vocabulary Object Detectors Without Forgetting

Title: UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation

Title: MolSnap: Snap-Fast Molecular Generation with Latent Variational Mean Flow

Title: How and Why: Taming Flow Matching for Unsupervised Anomaly Detection and Localization

Title: SMOL-MapSeg: Show Me One Label

Title: AutoIAD: Manager-Driven Multi-Agent Collaboration for Automated Industrial Anomaly Detection

Title: Symmetry Understanding of 3D Shapes via Chirality Disentanglement

Title: MagicHOI: Leveraging 3D Priors for Accurate Hand-object Reconstruction from Short Monocular Video Clips

Title: Revealing Latent Information: A Physics-inspired Self-supervised Pre-training Framework for Noisy and Sparse Events

Title: When Deepfake Detection Meets Graph Neural Network:a Unified and Lightweight Learning Framework

Title: Tractable Sharpness-Aware Learning of Probabilistic Circuits

Title: Follow-Your-Instruction: A Comprehensive MLLM Agent for World Data Synthesis

Title: WeTok: Powerful Discrete Tokenization for High-Fidelity Visual Reconstruction

Title: LLaVA-RE: Binary Image-Text Relevancy Evaluation with Multimodal Large Language Model

Title: Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity

Title: Test-Time Reinforcement Learning for GUI Grounding via Region Consistency

Title: GAP: Gaussianize Any Point Clouds with Text Guidance

Title: FaceAnonyMixer: Cancelable Faces via Identity Consistent Latent Space Mixing