2025-03-17

Title: Evaluating Local and Cloud-Based Large Language Models for Simulating Consumer Choices in Energy Stated Preference Surveys

Title: Video Anomaly Detection with Structured Keywords

Title: Text-to-3D Generation using Jensen-Shannon Score Distillation

Title: A Survey on Knowledge-Oriented Retrieval-Augmented Generation

Title: VRMDiff: Text-Guided Video Referring Matting Generation of Diffusion

Title: End-to-end Learning of Sparse Interventions on Activations to Steer Generation

Title: Understanding the Quality-Diversity Trade-off in Diffusion Language Models

Title: Open-World Skill Discovery from Unsegmented Demonstrations

Title: VFM-UDA++: Improving Network Architectures and Data Strategies for Unsupervised Domain Adaptive Semantic Segmentation

Title: Context-guided Responsible Data Augmentation with Diffusion Models

Title: Zero-Shot Subject-Centric Generation for Creative Application Using Entropy Fusion

Title: TA-V2A: Textually Assisted Video-to-Audio Generation

Title: Error Analyses of Auto-Regressive Video Diffusion Models: A Unified Framework

Title: Team NYCU at Defactify4: Robust Detection and Source Identification of AI-Generated Images Using CNN and CLIP-Based Models

Title: Numerical and statistical analysis of NeuralODE with Runge-Kutta time integration

Title: Visual Polarization Measurement Using Counterfactual Image Generation

Title: Panopticon: Advancing Any-Sensor Foundation Models for Earth Observation

Title: RI3D: Few-Shot Gaussian Splatting With Repair and Inpainting Diffusion Priors

Title: Memory-Efficient 3D High-Resolution Medical Image Synthesis Using CRF-Guided GANs

Title: OuroMamba: A Data-Free Quantization Framework for Vision Mamba Models

Title: EmoDiffusion: Enhancing Emotional 3D Facial Animation with Latent Diffusion Models

Title: ACMo: Attribute Controllable Motion Generation

Title: InverseBench: Benchmarking Plug-and-Play Diffusion Priors for Inverse Problems in Physical Sciences

Title: PSF-4D: A Progressive Sampling Framework for View Consistent 4D Editing

Title: Measuring Similarity in Causal Graphs: A Framework for Semantic and Structural Analysis

Title: Towards Privacy-preserved Pre-training of Remote Sensing Foundation Models with Federated Mutual-guidance Learning

Title: Flow to the Mode: Mode-Seeking Diffusion Autoencoders for State-of-the-Art Image Tokenization

Title: Generative Modelling for Mathematical Discovery

Title: Falcon: A Remote Sensing Vision-Language Foundation Model

Title: Harnessing Frequency Spectrum Insights for Image Copyright Protection Against Diffusion Models

Title: Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models

Title: Understanding Flatness in Generative Models: Its Role and Benefits

Title: A Survey of Cross-domain Graph Learning: Progress and Future Directions

Title: Multi-View Industrial Anomaly Detection with Epipolar Constrained Cross-View Fusion

Title: Open3DVQA: A Benchmark for Comprehensive Spatial Reasoning with Multimodal Large Language Model in Open Space

Title: A Novel Decomposed Feature-Oriented Framework for Open-Set Semantic Segmentation on LiDAR Data

Title: A Survey on Self-supervised Contrastive Learning for Multimodal Text-Image Analysis

Title: Quantifying Interpretability in CLIP Models with Concept Consistency

Title: DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation

Title: SpaceSeg: A High-Precision Intelligent Perception Segmentation Method for Multi-Spacecraft On-Orbit Targets

Title: GaussianIP: Identity-Preserving Realistic 3D Human Generation via Human-Centric Diffusion Prior

Title: Neurons: Emulating the Human Visual Cortex Improves Fidelity and Interpretability in fMRI-to-Video Reconstruction

Title: Multi-Stage Generative Upscaler: Reconstructing Football Broadcast Images via Diffusion Models

Title: Palette of Language Models: A Solver for Controlled Text Generation

Title: Toward Generalized Image Quality Assessment: Relaxing the Perfect Reference Quality Assumption

Title: Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards

Title: Spherical Tree-Sliced Wasserstein Distance

Title: Federated Koopman-Reservoir Learning for Large-Scale Multivariate Time-Series Anomaly Detection

Title: Noise Synthesis for Low-Light Image Denoising with Diffusion Models

Title: CyclePose -- Leveraging Cycle-Consistency for Annotation-Free Nuclei Segmentation in Fluorescence Microscopy

Title: OPTIMUS: Predicting Multivariate Outcomes in Alzheimer's Disease Using Multi-modal Data amidst Missing Values

Title: BriLLM: Brain-inspired Large Language Model

Title: Leveraging Diffusion Knowledge for Generative Image Compression with Fractal Frequency-Aware Band Learning

Title: Self-Supervised Pretraining for Fine-Grained Plankton Recognition

Title: AIstorian lets AI be a historian: A KG-powered multi-agent system for accurate biography generation

Title: PARIC: Probabilistic Attention Regularization for Language Guided Image Classification from Pre-trained Vision Language Models

Title: PBR3DGen: A VLM-guided Mesh Generation with High-quality PBR Texture

Title: Watch and Learn: Leveraging Expert Knowledge and Language for Surgical Video Understanding

Title: Towards A Correct Usage of Cryptography in Semantic Watermarks for Diffusion Models

Title: A Neural Network Architecture Based on Attention Gate Mechanism for 3D Magnetotelluric Forward Modeling

Title: Empowering Time Series Analysis with Synthetic Data: A Survey and Outlook in the Era of Foundation Models

Title: MTV-Inpaint: Multi-Task Long Video Inpainting

Title: From Generative AI to Innovative AI: An Evolutionary Roadmap

Title: TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation

Title: Text Compression for Efficient Language Generation

Title: Remote Photoplethysmography in Real-World and Extreme Lighting Scenarios

Title: T2I-FineEval: Fine-Grained Compositional Metric for Text-to-Image Evaluation

Title: Unicorn: A Universal and Collaborative Reinforcement Learning Approach Towards Generalizable Network-Wide Traffic Signal Control

Title: TikZero: Zero-Shot Text-Guided Graphics Program Synthesis

Title: Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models

Title: Bottom-up Iterative Anomalous Diffusion Detector (BI-ADD)

Title: AugGen: Synthetic Augmentation Can Improve Discriminative Models

Title: Advancing 3D Gaussian Splatting Editing with Complementary and Consensus Information

Title: From Denoising Score Matching to Langevin Sampling: A Fine-Grained Error Analysis in the Gaussian Setting

Title: ReCamMaster: Camera-Controlled Generative Rendering from A Single Video