2025-01-22

Title: Towards General Purpose Robots at Scale: Lifelong Learning and Learning to Use Memory

Title: Uncovering Bias in Foundation Models: Impact, Testing, Harm, and Mitigation

Title: BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation

Title: Improving the Efficiency of Self-Supervised Adversarial Training through Latent Clustering-Based Selection

Title: Tabular-TX: Theme-Explanation Structure-based Table Summarization via In-Context Learning

Title: Towards Data-Centric AI: A Comprehensive Survey of Traditional, Reinforcement, and Generative Approaches for Tabular Data Transformation

Title: Mutual Regression Distance

Title: AI/ML Based Detection and Categorization of Covert Communication in IPv6 Network

Title: HOPS: High-order Polynomials with Self-supervised Dimension Reduction for Load Forecasting

Title: Unveiling the Mystery of Weight in Large Foundation Models: Gaussian Distribution Never Fades

Title: Deep Operator Networks for Bayesian Parameter Estimation in PDEs

Title: EMO2: End-Effector Guided Audio-Driven Avatar Video Generation

Title: A CNN-Transformer for Classification of Longitudinal 3D MRI Images -- A Case Study on Hepatocellular Carcinoma Prediction

Title: GAUDA: Generative Adaptive Uncertainty-guided Diffusion-based Augmentation for Surgical Segmentation

Title: Addressing Multilabel Imbalance with an Efficiency-Focused Approach Using Diffusion Model-Generated Synthetic Samples

Title: Visual RAG: Expanding MLLM visual knowledge without fine-tuning

Title: Diffusion-Based Imitation Learning for Social Pose Generation

Title: CEReBrO: Compact Encoder for Representations of Brain Oscillations Using Efficient Alternating Attention

Title: Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

Title: Data Enrichment Opportunities for Distribution Grid Cable Networks using Variational Autoencoders

Title: Generative Physical AI in Vision: A Survey

Title: Beyond Any-Shot Adaptation: Predicting Optimization Outcome for Robustness Gains without Extra Pay

Title: Generative AI-driven Cross-layer Covert Communication: Fundamentals, Framework and Case Study

Title: Tell me about yourself: LLMs are aware of their learned behaviors

Title: CLOFAI: A Dataset of Real And Fake Image Classification Tasks for Continual Learning

Title: Leveraging GANs For Active Appearance Models Optimized Model Fitting

Title: Successive Interference Cancellation-aided Diffusion Models for Joint Channel Estimation and Data Detection in Low Rank Channel Scenarios

Title: A New Formulation of Lipschitz Constrained With Functional Gradient Learning for GANs

Title: Enhancing SAR Object Detection with Self-Supervised Pre-training on Masked Auto-Encoders

Title: Advancing Multi-Party Dialogue Systems with Speaker-ware Contrastive Learning

Title: MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching

Title: Anomaly Detection for Industrial Applications, Its Challenges, Solutions, and Future Directions: A Review

Title: Nested Annealed Training Scheme for Generative Adversarial Networks

Title: StyleSSP: Sampling StartPoint Enhancement for Training-free Diffusion-based Method for Style Transfer

Title: CatV2TON: Taming Diffusion Transformers for Vision-Based Virtual Try-On with Temporal Concatenation

Title: GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video

Title: Block Flow: Learning Straight Flow on Data Blocks

Title: A Survey on Diffusion Models for Anomaly Detection

Title: Generative AI and Large Language Models in Language Preservation: Opportunities and Challenges

Title: UltraFusion: Ultra High Dynamic Imaging using Exposure Fusion

Title: Graph Defense Diffusion Model

Title: Recurrent Diffusion for Large-Scale Parameter Generation

Title: Trojan Detection Through Pattern Recognition for Large Language Models

Title: Class Imbalance in Anomaly Detection: Learning from an Exactly Solvable Model

Title: Spatially-Delineated Domain-Adapted AI Classification: An Application for Oncology Data

Title: GL-ICNN: An End-To-End Interpretable Convolutional Neural Network for the Diagnosis and Prediction of Alzheimer's Disease

Title: Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks

Title: SILO: Solving Inverse Problems with Latent Operators

Title: Are generative models fair? A study of racial bias in dermatological image generation

Title: EfficientVITON: An Efficient Virtual Try-On Model using Optimized Diffusion Process

Title: CogMorph: Cognitive Morphing Attacks for Text-to-Image Models

Title: PXGen: A Post-hoc Explainable Method for Generative Models

Title: Is your LLM trapped in a Mental Set? Investigative study on how mental sets affect the reasoning capabilities of LLMs

Title: Survey on Monocular Metric Depth Estimation

Title: Noise-Resilient Point-wise Anomaly Detection in Time Series Using Weak Segment Labels

Title: TAD-Bench: A Comprehensive Benchmark for Embedding-Based Text Anomaly Detection

Title: TabularARGN: A Flexible and Efficient Auto-Regressive Framework for Generating High-Fidelity Synthetic Data

Title: Are Traditional Deep Learning Model Approaches as Effective as a Retinal-Specific Foundation Model for Ocular and Systemic Disease Detection?

Title: Comparative Analysis of Pre-trained Deep Learning Models and DINOv2 for Cushing's Syndrome Diagnosis in Facial Analysis

Title: Unified 3D MRI Representations via Sequence-Invariant Contrastive Learning

Title: Proxies for Distortion and Consistency with Applications for Real-World Image Restoration

Title: Teacher Encoder-Student Decoder Denoising Guided Segmentation Network for Anomaly Detection

Title: ComposeAnyone: Controllable Layout-to-Human Generation with Decoupled Multimodal Conditions

Title: Extend Adversarial Policy Against Neural Machine Translation via Unknown Token

Title: Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Title: Explainability for Vision Foundation Models: A Survey

Title: Score Combining for Contrastive OOD Detection

Title: Fixing Imbalanced Attention to Mitigate In-Context Hallucination of Large Vision-Language Model

Title: You Can't Eat Your Cake and Have It Too: The Performance Degradation of LLMs with Jailbreak Defense

Title: Exploring Temporally-Aware Features for Point Tracking

Title: TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space

Title: InsTALL: Context-aware Instructional Task Assistance with Multi-modal Large Language Models

Title: Memory Storyboard: Leveraging Temporal Segmentation for Streaming Self-Supervised Learning from Egocentric Videos

Title: CBVLM: Training-free Explainable Concept-based Large Vision Language Models for Medical Image Classification

Title: VipDiff: Towards Coherent and Diverse Video Inpainting via Training-free Denoising Diffusion Models

Title: With Great Backbones Comes Great Adversarial Transferability

Title: Regressor-Guided Image Editing Regulates Emotional Response to Reduce Online Engagement

Title: Towards Accurate Unified Anomaly Segmentation

Title: A Hybrid Supervised and Self-Supervised Graph Neural Network for Edge-Centric Applications

Title: Diffusion-aware Censored Gaussian Processes for Demand Modelling

Title: MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Title: DiffDoctor: Diagnosing Image Diffusion Models Before Treating

Title: GPS as a Control Signal for Image Generation

Title: Towards Affordance-Aware Articulation Synthesis for Rigged Objects