2025-01-22

Title: Towards General Purpose Robots at Scale: Lifelong Learning and Learning to Use Memory

Title: BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation

Title: 4bit-Quantization in Vector-Embedding for RAG

Title: Towards Data-Centric AI: A Comprehensive Survey of Traditional, Reinforcement, and Generative Approaches for Tabular Data Transformation

Title: Mutual Regression Distance

Title: EMO2: End-Effector Guided Audio-Driven Avatar Video Generation

Title: GAUDA: Generative Adaptive Uncertainty-guided Diffusion-based Augmentation for Surgical Segmentation

Title: Addressing Multilabel Imbalance with an Efficiency-Focused Approach Using Diffusion Model-Generated Synthetic Samples

Title: Diffusion-Based Imitation Learning for Social Pose Generation

Title: Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

Title: Know "No" Better: A Data-Driven Approach for Enhancing Negation Awareness in CLIP

Title: Data Enrichment Opportunities for Distribution Grid Cable Networks using Variational Autoencoders

Title: Generative Physical AI in Vision: A Survey

Title: Beyond Any-Shot Adaptation: Predicting Optimization Outcome for Robustness Gains without Extra Pay

Title: BF-STVSR: B-Splines and Fourier-Best Friends for High Fidelity Spatial-Temporal Video Super-Resolution

Title: Enhancing Sample Utilization in Noise-Robust Deep Metric Learning With Subgroup-Based Positive-Pair Selection

Title: Unit Region Encoding: A Unified and Compact Geometry-aware Representation for Floorplan Applications

Title: Rethinking Pseudo-Label Guided Learning for Weakly Supervised Temporal Action Localization from the Perspective of Noise Correction

Title: CLOFAI: A Dataset of Real And Fake Image Classification Tasks for Continual Learning

Title: Advancing Oyster Phenotype Segmentation with Multi-Network Ensemble and Multi-Scale mechanism

Title: Leveraging GANs For Active Appearance Models Optimized Model Fitting

Title: Successive Interference Cancellation-aided Diffusion Models for Joint Channel Estimation and Data Detection in Low Rank Channel Scenarios

Title: A New Formulation of Lipschitz Constrained With Functional Gradient Learning for GANs

Title: Nested Annealed Training Scheme for Generative Adversarial Networks

Title: CatV2TON: Taming Diffusion Transformers for Vision-Based Virtual Try-On with Temporal Concatenation

Title: GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video

Title: Block Flow: Learning Straight Flow on Data Blocks

Title: A Survey on Diffusion Models for Anomaly Detection

Title: UltraFusion: Ultra High Dynamic Imaging using Exposure Fusion

Title: Explainable Lane Change Prediction for Near-Crash Scenarios Using Knowledge Graph Embeddings and Retrieval Augmented Generation

Title: Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution

Title: Recurrent Diffusion for Large-Scale Parameter Generation

Title: GL-ICNN: An End-To-End Interpretable Convolutional Neural Network for the Diagnosis and Prediction of Alzheimer's Disease

Title: SILO: Solving Inverse Problems with Latent Operators

Title: Are generative models fair? A study of racial bias in dermatological image generation

Title: EfficientVITON: An Efficient Virtual Try-On Model using Optimized Diffusion Process

Title: Glinthawk: A Two-Tiered Architecture for High-Throughput LLM Inference

Title: Generating visual explanations from deep networks using implicit neural representations

Title: CogMorph: Cognitive Morphing Attacks for Text-to-Image Models

Title: PXGen: A Post-hoc Explainable Method for Generative Models

Title: Survey on Monocular Metric Depth Estimation

Title: Fast Underwater Scene Reconstruction using Multi-View Stereo and Physical Imaging

Title: ALoFTRAG: Automatic Local Fine Tuning for Retrieval Augmented Generation

Title: MeshONet: A Generalizable and Efficient Operator Learning Method for Structured Mesh Generation

Title: TabularARGN: A Flexible and Efficient Auto-Regressive Framework for Generating High-Fidelity Synthetic Data

Title: Foreign object segmentation in chest x-rays through anatomy-guided shape insertion

Title: A Multi-annotated and Multi-modal Dataset for Wide-angle Video Quality Assessment

Title: Proxies for Distortion and Consistency with Applications for Real-World Image Restoration

Title: ComposeAnyone: Controllable Layout-to-Human Generation with Decoupled Multimodal Conditions

Title: Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Title: Fixing Imbalanced Attention to Mitigate In-Context Hallucination of Large Vision-Language Model

Title: TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space

Title: InsTALL: Context-aware Instructional Task Assistance with Multi-modal Large Language Models

Title: HAC++: Towards 100X Compression of 3D Gaussian Splatting

Title: VipDiff: Towards Coherent and Diverse Video Inpainting via Training-free Denoising Diffusion Models

Title: Regressor-Guided Image Editing Regulates Emotional Response to Reduce Online Engagement

Title: A Hybrid Supervised and Self-Supervised Graph Neural Network for Edge-Centric Applications

Title: VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model

Title: Vision-Language Models for Automated Chest X-ray Interpretation: Leveraging ViT and GPT-2

Title: InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model

Title: Video Depth Anything: Consistent Depth Estimation for Super-Long Videos

Title: Parallel Sequence Modeling via Generalized Spatial Propagation Network

Title: DiffDoctor: Diagnosing Image Diffusion Models Before Treating

Title: Taming Teacher Forcing for Masked Autoregressive Video Generation

Title: GPS as a Control Signal for Image Generation