2025-01-17

Title: Synthetic Data and Health Privacy

Title: Do generative video models learn physical principles from watching videos?

Title: Pseudolabel guided pixels contrast for domain adaptive semantic segmentation

Title: Generative Visual Commonsense Answering and Explaining with Generative Scene Graph Constructing

Title: CookingDiffusion: Cooking Procedural Image Generation with Stable Diffusion

Title: Spatio-Temporal Foundation Models: Vision, Challenges, and Opportunities

Title: Generating Realistic Synthetic Head Rotation Data for Extended Reality using Deep Learning

Title: SHYI: Action Support for Contrastive Learning in High-Fidelity Text-to-Image Generation

Title: Generative Medical Image Anonymization Based on Latent Code Projection and Optimization

Title: Deep Self-Supervised Disturbance Mapping with the OPERA Sentinel-1 Radiometric Terrain Corrected SAR Backscatter Product

Title: Few-Shot Adaptation of Training-Free Foundation Model for 3D Medical Image Segmentation

Title: Evaluating GenAI for Simplifying Texts for Education: Improving Accuracy and Consistency for Enhanced Readability

Title: Attention is All You Need Until You Need Retention

Title: Grounding Text-To-Image Diffusion Models For Controlled High-Quality Image Generation

Title: Unified Few-shot Crack Segmentation and its Precise 3D Automatic Measurement in Concrete Structures

Title: Leveraging Scale-aware Representations for improved Concept-Representation Alignment in ViTs

Title: Foundations of Large Language Models

Title: Task Vectors in In-Context Learning: Emergence, Formation, and Benefit

Title: Perspective Transition of Large Language Models for Solving Subjective Tasks

Title: Text-guided Synthetic Geometric Augmentation for Zero-shot 3D Understanding

Title: A Study of In-Context-Learning-Based Text-to-SQL Errors

Title: UVRM: A Scalable 3D Reconstruction Model from Unposed Videos

Title: Strategic Base Representation Learning via Feature Augmentations for Few-Shot Class Incremental Learning

Title: Evaluating LLM Abilities to Understand Tabular Electronic Health Records: A Comprehensive Study of Patient Data Extraction and Retrieval

Title: Towards Robust and Realistic Human Pose Estimation via WiFi Signals

Title: AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring

Title: CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation

Title: Scaling up self-supervised learning for improved surgical foundation models

Title: Teaching Wav2Vec2 the Language of the Brain

Title: Pruning for Sparse Diffusion Models based on Gradient Flow

Title: DEFOM-Stereo: Depth Foundation Model Based Stereo Matching

Title: VanGogh: A Unified Multimodal Diffusion-based Framework for Video Colorization

Title: AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation

Title: Confidence Estimation for Error Detection in Text-to-SQL Systems

Title: Text-driven Adaptation of Foundation Models for Few-shot Surgical Workflow Analysis

Title: Sequential PatchCore: Anomaly Detection for Surface Inspection using Synthetic Impurities

Title: Cueless EEG imagined speech for subject identification: dataset and benchmarks

Title: Domain Adaptation of Foundation LLMs for e-Commerce

Title: Comparative Insights from 12 Machine Learning Models in Extracting Economic Ideology from Political Text

Title: A Simple Aerial Detection Baseline of Multimodal Language Models

Title: Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Title: ComplexVAD: Detecting Interaction Anomalies in Video

Title: Learnings from Scaling Visual Tokenizers for Reconstruction and Generation

Title: SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces