2025-01-07

Title: SmartSpatial: Enhancing the 3D Spatial Arrangement Capabilities of Stable Diffusion Models and Introducing a Novel 3D Spatial Evaluation Framework

Title: Information Subtraction: Learning Representations for Conditional Entropy

Title: Machine Learning-Based Differential Diagnosis of Parkinson's Disease Using Kinematic Feature Extraction and Selection

Title: 3D Cloud reconstruction through geospatially-aware Masked Autoencoders

Title: Advancing Pancreatic Cancer Prediction with a Next Visit Token Prediction Head on top of Med-BERT

Title: Active Learning Enables Extrapolation in Molecular Generative Models

Title: AGGA: A Dataset of Academic Guidelines for Generative AI and Large Language Models

Title: ArtCrafter: Text-Image Aligning Style Transfer via Embedding Reframing

Title: Counterfactual Explanation for Auto-Encoder Based Time-Series Anomaly Detection

Title: Online Detection of Water Contamination Under Concept Drift

Title: Plasma-CycleGAN: Plasma Biomarker-Guided MRI to PET Cross-modality Translation Using Conditional CycleGAN

Title: Generating Multimodal Images with GAN: Integrating Text, Image, and Style

Title: CPTuning: Contrastive Prompt Tuning for Generative Relation Extraction

Title: Self-Supervised Learning for Detecting AI-Generated Faces as Anomalies

Title: Diffusion Model-Based Data Synthesis Aided Federated Semi-Supervised Learning

Title: MagicFace: High-Fidelity Facial Expression Editing with Action-Unit Control

Title: Unsupervised Class Generation to Expand Semantic Segmentation Datasets

Title: TDM: Temporally-Consistent Diffusion Model for All-in-One Real-World Video Restoration

Title: DiffGraph: Heterogeneous Graph Diffusion Model

Title: Validity Arguments For Constructed Response Scoring Using Generative Artificial Intelligence Applications

Title: CorrFill: Enhancing Faithfulness in Reference-based Inpainting with Correspondence Guidance in Diffusion Models

Title: Generalizable Origin Identification for Text-Guided Image-to-Image Diffusion Models

Title: MedSegDiffNCA: Diffusion Models With Neural Cellular Automata for Skin Lesion Segmentation

Title: GCP: Guarded Collaborative Perception with Spatial-Temporal Aware Malicious Agent Detection

Title: Enhancing Contrastive Learning for Retinal Imaging via Adjusted Augmentation Scales

Title: Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera

Title: DeTrack: In-model Latent Denoising Learning for Visual Object Tracking

Title: ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling

Title: Layout2Scene: 3D Semantic Layout Guided Scene Generation via Geometry and Appearance Diffusion Priors

Title: Face-MakeUp: Multimodal Facial Prompts for Text-to-Image Generation

Title: Vision-Driven Prompt Optimization for Large Language Models in Multimodal Generative Tasks

Title: Decoding fMRI Data into Captions using Prefix Language Modeling

Title: LeetDecoding: A PyTorch Library for Exponentially Decaying Causal Linear Attention with CUDA Implementations

Title: DepthMaster: Taming Diffusion Models for Monocular Depth Estimation

Title: Representation Learning of Lab Values via Masked AutoEncoder

Title: A New Interpretation of the Certainty-Equivalence Approach for PAC Reinforcement Learning with a Generative Model

Title: GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking

Title: Multilevel Semantic-Aware Model for AI-Generated Video Quality Assessment

Title: Brick-Diffusion: Generating Long Videos with Brick-to-Wall Denoising

Title: GraphDART: Graph Distillation for Efficient Advanced Persistent Threat Detection

Title: First-place Solution for Streetscape Shop Sign Recognition Competition

Title: InpDiffusion: Image Inpainting Localization via Conditional Diffusion Models

Title: Large Language Models for Video Surveillance Applications

Title: Seeing the Whole in the Parts in Self-Supervised Representation Learning

Title: Conditional Mutual Information Based Diffusion Posterior Sampling for Solving Inverse Problems

Title: FoundPAD: Foundation Models Reloaded for Face Presentation Attack Detection

Title: Skillful High-Resolution Ensemble Precipitation Forecasting with an Integrated Deep Learning Framework

Title: Pointmap-Conditioned Diffusion for Consistent Novel View Synthesis

Title: Unsupervised Tomato Split Anomaly Detection using Hyperspectral Imaging and Variational Autoencoders

Title: The Tabular Foundation Model TabPFN Outperforms Specialized Time Series Forecasting Models Based on Simple Features

Title: SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild

Title: Human Gaze Boosts Object-Centered Representation Learning

Title: LOHA: Direct Graph Spectral Contrastive Learning Between Low-pass and High-pass Views

Title: STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

Title: TransPixar: Advancing Text-to-Video Generation with Transparency

Title: Sentiment-guided Commonsense-aware Response Generation for Mental Health Counseling

Title: CAT: Content-Adaptive Image Tokenization

Title: Segment Anything Model for Zero-shot Single Particle Tracking in Liquid Phase Transmission Electron Microscopy

Title: Deep-Relative-Trust-Based Diffusion for Decentralized Deep Learning

Title: Semantic Captioning: Benchmark Dataset and Graph-Aware Few-Shot In-Context Learning for SQL2Text

Title: MObI: Multimodal Object Inpainting Using Diffusion Models

Title: Leveraging Explainable AI for LLM Text Attribution: Differentiating Human-Written and Multiple LLMs-Generated Text

Title: ProTracker: Probabilistic Integration for Robust and Accurate Point Tracking

Title: BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning

Title: Gaussian Masked Autoencoders