2024-03-19

Title: VISREAS: Complex Visual Reasoning with Unanswerable Questions

Title: Semi-Supervised Learning for Anomaly Traffic Detection via Bidirectional Normalizing Flows

Title: Training Self-localization Models for Unseen Unfamiliar Places via Teacher-to-Student Data-Free Knowledge Transfer

Title: Learning to Watermark LLM-generated Text via Reinforcement Learning

Title: Second-Order Information Matters: Revisiting Machine Unlearning for Large Language Models

Title: Adaptive Hybrid Masking Strategy for Privacy-Preserving Face Recognition Against Model Inversion Attack

Title: Generative Models and Connected and Automated Vehicles: A Survey in Exploring the Intersection of Transportation and AI

Title: Counter-Samples: A Stateless Strategy to Neutralize Black Box Adversarial Attacks

Title: Cooling-Guide Diffusion Model for Battery Cell Arrangement

Title: Symbiotic Game and Foundation Models for Cyber Deception Operations in Strategic Cyber Warfare

Title: Autoregressive Queries for Adaptive Tracking with Spatio-TemporalTransformers

Title: Ignore Me But Don't Replace Me: Utilizing Non-Linguistic Elements for Pretraining on the Cybersecurity Domain

Title: From Algorithms to Outcomes: Reviewing AI's Role in Non-Muscle-Invasive Bladder Cancer Recurrence Prediction

Title: Neural Erosion: Emulating Controlled Neurodegeneration and Aging in AI Systems

Title: SurvRNC: Learning Ordered Representations for Survival Prediction using Rank-N-Contrast

Title: LightIt: Illumination Modeling and Control for Diffusion Models

Title: DiPaCo: Distributed Path Composition

Title: Leveraging CLIP for Inferring Sensitive Information and Improving Model Fairness

Title: MeDSLIP: Medical Dual-Stream Language-Image Pre-training for Fine-grained Alignment

Title: A Survey of Source Code Representations for Machine Learning-Based Cybersecurity Tasks

Title: PALM: Pushing Adaptive Learning Rate Mechanisms for Continual Test-Time Adaptation

Title: Improving Fairness in Credit Lending Models using Subgroup Threshold Optimization

Title: Towards Practical Fabrication Stage Attacks Using Interrupt-Resilient Hardware Trojans

Title: SwinMTL: A Shared Architecture for Simultaneous Depth Estimation and Semantic Segmentation from Monocular Camera Images

Title: Not Just Change the Labels, Learn the Features: Watermarking Deep Neural Networks with Multi-View Data

Title: GS-Pose: Cascaded Framework for Generalizable Segmentation-based 6D Object Pose Estimation

Title: MYTE: Morphology-Driven Byte Encoding for Better and Fairer Multilingual Language Modeling

Title: On the low-shot transferability of [V]-Mamba

Title: Robust Influence-based Training Methods for Noisy Brain MRI

Title: IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation

Title: PERL: Parameter Efficient Reinforcement Learning from Human Feedback

Title: Uncovering Latent Themes of Messaging on Social Media by Integrating LLMs: A Case Study on Climate Campaigns

Title: Backdoor Secrets Unveiled: Identifying Backdoor Data with Optimized Scaled Prediction Consistency

Title: Giving a Hand to Diffusion Models: a Two-Stage Approach to Improving Conditional Human Image Generation

Title: Leveraging Synthetic Data for Generalizable and Fair Facial Action Unit Detection

Title: Depression Detection on Social Media with Large Language Models

Title: Rules still work for Open Information Extraction

Title: ODE Discovery for Longitudinal Heterogeneous Treatment Effects Inference

Title: Detecting Bias in Large Language Models: Fine-tuned KcBERT

Title: HCF-Net: Hierarchical Context Fusion Network for Infrared Small Object Detection

Title: LLM-based Conversational AI Therapist for Daily Functioning Screening and Psychotherapeutic Intervention via Everyday Smart Devices

Title: Segment Any Object Model (SAOM): Real-to-Simulation Fine-Tuning Strategy for Multi-Class Multi-Instance Segmentation

Title: StableGarment: Garment-Centric Generation via Stable Diffusion

Title: Time Series Representation Learning with Supervised Contrastive Temporal Transformer

Title: From Words to Routes: Applying Large Language Models to Vehicle Routing

Title: Unsupervised Collaborative Metric Learning with Mixed-Scale Groups for General Object Retrieval

Title: Efficient Pruning of Large Language Model with Adaptive Estimation Fusion

Title: Model Reprogramming Outperforms Fine-tuning on Out-of-distribution Data in Text-Image Encoders

Title: Securely Fine-tuning Pre-trained Encoders Against Adversarial Examples

Title: Anomaly Detection Based on Isolation Mechanisms: A Survey

Title: DarkGS: Learning Neural Illumination and 3D Gaussians Relighting for Robotic Exploration in the Dark

Title: Active Label Correction for Semantic Segmentation with Foundation Models

Title: Do Large Language Models understand Medical Codes?

Title: VisionCLIP: An Med-AIGC based Ethical Language-Image Foundation Model for Generalizable Retina Image Analysis

Title: Affective Behaviour Analysis via Integrating Multi-Modal Knowledge

Title: Exploring Learning-based Motion Models in Multi-Object Tracking

Title: Data Availability and Decentralization: New Techniques for zk-Rollups in Layer 2 Blockchain Networks

Title: DUE: Dynamic Uncertainty-Aware Explanation Supervision via 3D Imputation

Title: Twin Transformer using Gated Dynamic Learnable Attention mechanism for Fault Detection and Diagnosis in the Tennessee Eastman Process

Title: RETINAQA : A Knowledge Base Question Answering Model Robust to both Answerable and Unanswerable Questions

Title: Just Say the Name: Online Continual Learning with Category Names Only via Data Generation

Title: A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment

Title: Zero-shot Generative Linguistic Steganography

Title: RetMIL: Retentive Multiple Instance Learning for Histopathological Whole Slide Image Classification

Title: Characterizing the Solana NFT Ecosystem

Title: Optimizing Language Augmentation for Multilingual Large Language Models: A Case Study on Korean

Title: Improving Adversarial Transferability of Visual-Language Pre-training Models through Collaborative Multimodal Interaction

Title: Fuzzy Rank-based Late Fusion Technique for Cytology image Segmentation

Title: A Watermark-Conditioned Diffusion Model for IP Protection

Title: Towards Robustness and Diversity: Continual Learning in Dialog Generation with Text-Mixup and Batch Nuclear-Norm Maximization

Title: Rethinking Multi-view Representation Learning via Distilled Disentangling

Title: BEnQA: A Question Answering and Reasoning Benchmark for Bengali and English

Title: DTOR: Decision Tree Outlier Regressor to explain anomalies

Title: Graph Regularized NMF with L20-norm for Unsupervised Feature Learning

Title: Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation

Title: FishNet: Deep Neural Networks for Low-Cost Fish Stock Estimation

Title: Batch-oriented Element-wise Approximate Activation for Privacy-Preserving Neural Networks

Title: Interpretable Machine Learning for TabPFN

Title: Understanding Robustness of Visual State Space Models for Image Classification

Title: ScanTalk: 3D Talking Heads from Unregistered Scans

Title: SelfIE: Self-Interpretation of Large Language Model Embeddings

Title: Ctrl123: Consistent Novel View Synthesis via Closed-Loop Transcription

Title: Energy-Based Models with Applications to Speech and Language Processing

Title: Exploiting Topological Prior for Boosting Point Cloud Generation

Title: Pointer-Generator Networks for Low-Resource Machine Translation: Don't Copy That!

Title: Enhancing IoT Security Against DDoS Attacks through Federated Learning

Title: Task-Aware Low-Rank Adaptation of Segment Anything Model

Title: OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models

Title: IoTCO2: Assessing the End-To-End Carbon Footprint of Internet-of-Things-Enabled Deep Learning

Title: Boosting Flow-based Generative Super-Resolution Models via Learned Prior

Title: Edge Private Graph Neural Networks with Singular Value Perturbation

Title: N2F2: Hierarchical Scene Understanding with Nested Neural Feature Fields

Title: MASSM: An End-to-End Deep Learning Framework for Multi-Anatomy Statistical Shape Modeling Directly From Images

Title: EfficientMorph: Parameter-Efficient Transformer-Based Architecture for 3D Image Registration

Title: Reward Guided Latent Consistency Distillation

Title: Texture Edge detection by Patch consensus (TEP)

Title: FAGH: Accelerating Federated Learning with Approximated Global Hessian

Title: From Pixels to Predictions: Spectrogram and Vision Transformer for Better Time Series Forecasting

Title: Endora: Video Generation Models as Endoscopy Simulators

Title: Unveiling and Mitigating Memorization in Text-to-image Diffusion Models through Cross Attention

Title: Large Language Models Powered Context-aware Motion Prediction

Title: Intelligent Railroad Grade Crossing: Leveraging Semantic Segmentation and Object Detection for Enhanced Safety

Title: Tokensome: Towards a Genetic Vision-Language GPT for Explainable and Cognitive Karyotyping

Title: Audio-Visual Segmentation via Unlabeled Frame Exploitation

Title: Zippo: Zipping Color and Transparency Distributions into a Single Diffusion Model

Title: RobustSentEmbed: Robust Sentence Embeddings Using Adversarial Self-Supervised Contrastive Learning

Title: Customizing Visual-Language Foundation Models for Multi-modal Anomaly Detection and Reasoning

Title: Programming Frameworks for Differential Privacy

Title: Lost in Translation? Translation Errors and Challenges for Fair Assessment of Text-to-Image Models on Multilingual Concepts

Title: Hierarchical Generative Network for Face Morphing Attacks

Title: ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models

Title: Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models

Title: Self-supervised co-salient object detection via feature correspondence at multiple scales

Title: 3D Human Reconstruction in the Wild with Synthetic Data Using Generative Models

Title: Local-consistent Transformation Learning for Rotation-invariant Point Cloud Analysis

Title: PhD: A Prompted Visual Hallucination Evaluation Dataset

Title: Unifying Feature and Cost Aggregation with Transformers for Semantic and Visual Correspondence

Title: LERENet: Eliminating Intra-class Differences for Metal Surface Defect Few-shot Semantic Segmentation

Title: Scaling Data Diversity for Fine-Tuning Language Models in Human Alignment

Title: Beyond Static Evaluation: A Dynamic Approach to Assessing AI Assistants' API Invocation Capabilities

Title: Enhancing Event Causality Identification with Rationale and Structure-Aware Causal Question Answering

Title: Exploring Tokenization Strategies and Vocabulary Sizes for Enhanced Arabic Language Models

Title: Omni-Recon: Towards General-Purpose Neural Radiance Fields for Versatile 3D Applications

Title: Is Mamba Effective for Time Series Forecasting?

Title: Evaluation Ethics of LLMs in Legal Domain

Title: Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model

Title: CGI-DM: Digital Copyright Authentication for Diffusion Models via Contrasting Gradient Inversion

Title: Pencil: Private and Extensible Collaborative Learning without the Non-Colluding Assumption

Title: Correcting misinformation on social media with a large language model

Title: A Tip for IOTA Privacy: IOTA Light Node Deanonymization via Tip Selection

Title: Artifact Feature Purification for Cross-domain Detection of AI-generated Images

Title: Quality-Aware Image-Text Alignment for Real-World Image Quality Assessment

Title: usfAD Based Effective Unknown Attack Detection Focused IDS Framework

Title: DuPL: Dual Student with Trustworthy Progressive Learning for Robust Weakly Supervised Semantic Segmentation

Title: NetTrack: Tracking Highly Dynamic Objects with a Net

Title: Boosting Semi-Supervised Temporal Action Localization by Learning from Non-Target Classes

Title: MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation

Title: TAG: Guidance-free Open-Vocabulary Semantic Segmentation

Title: TRELM: Towards Robust and Efficient Pre-training for Knowledge-Enhanced Language Models

Title: MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data

Title: THOR: Text to Human-Object Interaction Diffusion via Relation Intervention

Title: RCdpia: A Renal Carcinoma Digital Pathology Image Annotation dataset based on pathologists

Title: SpikeNeRF: Learning Neural Radiance Fields from Continuous Spike Stream

Title: Cheap Ways of Extracting Clinical Markers from Texts

Title: Concatenate, Fine-tuning, Re-training: A SAM-enabled Framework for Semi-supervised 3D Medical Image Segmentation

Title: Compact 3D Gaussian Splatting For Dense Visual SLAM

Title: Uncertainty-Aware Pseudo-Label Filtering for Source-Free Unsupervised Domain Adaptation

Title: Understanding Diffusion Models by Feynman's Path Integral

Title: Stylized Face Sketch Extraction via Generative Prior with Limited Data

Title: Forging the Forger: An Attempt to Improve Authorship Verification via Data Augmentation

Title: BrightDreamer: Generic 3D Gaussian Generative Framework for Fast Text-to-3D Synthesis

Title: Fast Personalized Text-to-Image Syntheses With Attention Injection

Title: Advanced Knowledge Extraction of Physical Design Drawings, Translation and conversion to CAD formats using Deep Learning

Title: A Modified Word Saliency-Based Adversarial Attack on Text Classification Models

Title: SQ-LLaVA: Self-Questioning for Large Vision-Language Assistant

Title: A Brief Study of Computer Network Security Technologies

Title: Mixture-of-Prompt-Experts for Multi-modal Semantic Understanding

Title: Reasoning in Transformers - Mitigating Spurious Correlations and Reasoning Shortcuts

Title: Few-Shot VQA with Frozen LLMs: A Tale of Two Approaches

Title: StateFlow: Enhancing LLM Task-Solving through State-Driven Workflows

Title: GeoGaussian: Geometry-aware Gaussian Splatting for Scene Rendering

Title: Domain-Guided Masked Autoencoders for Unique Player Identification

Title: Enhancing Bandwidth Efficiency for Video Motion Transfer Applications using Deep Learning Based Keypoint Prediction

Title: Federated Transfer Learning with Differential Privacy

Title: COLEP: Certifiably Robust Learning-Reasoning Conformal Prediction via Probabilistic Circuits

Title: IGANN Sparse: Bridging Sparsity and Interpretability with Non-linear Insight

Title: JORA: JAX Tensor-Parallel LoRA Library for Retrieval Augmented Fine-Tuning

Title: What Makes Math Word Problems Challenging for LLMs?

Title: DynamicGlue: Epipolar and Time-Informed Data Association in Dynamic Environments using Graph Neural Networks

Title: Reconstruct before Query: Continual Missing Modality Learning with Decomposed Prompt Collaboration

Title: ShapeFormer: Shape Prior Visible-to-Amodal Transformer-based Amodal Instance Segmentation

Title: Investigating the Benefits of Projection Head for Representation Learning

Title: Automated data processing and feature engineering for deep learning and big data applications: a survey

Title: Defense Against Adversarial Attacks on No-Reference Image Quality Models with Gradient Norm Regularization

Title: X-LLaVA: Optimizing Bilingual Large Vision-Language Alignment

Title: Scene-LLM: Extending Language Model for 3D Visual Understanding and Reasoning

Title: DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image Manipulation

Title: VmambaIR: Visual State Space Model for Image Restoration

Title: Benchmarking the Robustness of UAV Tracking Against Common Corruptions

Title: Narrative Feature or Structured Feature? A Study of Large Language Models to Identify Cancer Patients at Risk of Heart Failure

Title: BAGS: Building Animatable Gaussian Splatting from a Monocular Video with Diffusion Priors

Title: A Novel Paradigm Boosting Translation Capabilities of Large Language Models

Title: InsCL: A Data-efficient Continual Learning Paradigm for Fine-tuning Large Language Models with Instructions

Title: StyleChat: Learning Recitation-Augmented Memory in LLMs for Stylized Dialogue Generation

Title: Boosting Continuous Emotion Recognition with Self-Pretraining using Masked Autoencoders, Temporal Convolutional Networks, and Transformers

Title: Budget Recycling Differential Privacy

Title: Robust Overfitting Does Matter: Test-Time Adversarial Purification With FGSM

Title: Graph Partial Label Learning with Potential Cause Discovering

Title: CasSR: Activating Image Power for Real-World Image Super-Resolution

Title: HateCOT: An Explanation-Enhanced Dataset for Generalizable Offensive Speech Detection via Large Language Models

Title: Fed3DGS: Scalable 3D Gaussian Splatting with Federated Learning

Title: Siamese Learning with Joint Alignment and Regression for Weakly-Supervised Video Paragraph Grounding

Title: FedSPU: Personalized Federated Learning for Resource-constrained Devices with Stochastic Parameter Update

Title: Collage Prompting: Budget-Friendly Visual Recognition with GPT-4V

Title: Generative Motion Stylization within Canonical Motion Space

Title: Span-Based Optimal Sample Complexity for Weakly Communicating and General Average Reward MDPs

Title: VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding

Title: SeisFusion: Constrained Diffusion Model with Input Guidance for 3D Seismic Data Interpolation and Reconstruction

Title: Uncertainty-Calibrated Test-Time Model Adaptation without Forgetting

Title: CCC++: Optimized Color Classified Colorization with Segment Anything Model (SAM) Empowered Object Selective Color Harmonization

Title: Semantic-Enhanced Representation Learning for Road Networks with Temporal Dynamics

Title: Do CLIPs Always Generalize Better than ImageNet Models?

Title: Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors

Title: Circle Representation for Medical Instance Object Segmentation

Title: DEE: Dual-stage Explainable Evaluation Method for Text Generation

Title: SSAP: A Shape-Sensitive Adversarial Patch for Comprehensive Disruption of Monocular Depth Estimation in Autonomous Navigation Applications

Title: Efficient and Privacy-Preserving Federated Learning based on Full Homomorphic Encryption

Title: Video Object Segmentation with Dynamic Query Modulation

Title: Continual Forgetting for Pre-trained Vision Models

Title: EchoReel: Enhancing Action Generation of Existing Video Diffusion Models

Title: OCR is All you need: Importing Multi-Modality into Image-based Defect Detection System

Title: Reinforcement Learning with Token-level Feedback for Controllable Text Generation

Title: Learning Unified Reference Representation for Unsupervised Multi-class Anomaly Detection

Title: EffiVED:Efficient Video Editing via Text-instruction Diffusion Models

Title: Augment Before Copy-Paste: Data and Memory Efficiency-Oriented Instance Segmentation Framework for Sport-scenes

Title: MISS: Memory-efficient Instance Segmentation Framework By Visual Inductive Priors Flow Propagation

Title: 3DGS-Calib: 3D Gaussian Splatting for Multimodal SpatioTemporal Calibration

Title: OurDB: Ouroboric Domain Bridging for Multi-Target Domain Adaptive Semantic Segmentation

Title: Linguacodus: A Synergistic Framework for Transformative Code Generation in Machine Learning Pipelines

Title: End-to-end multi-modal product matching in fashion e-commerce

Title: CRS-Diff: Controllable Generative Remote Sensing Foundation Model

Title: Let's Focus on Neuron: Neuron-Level Supervised Fine-tuning for Large Language Model

Title: LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models

Title: Arc2Face: A Foundation Model of Human Faces

Title: Diffusion-Based Environment-Aware Trajectory Prediction

Title: LocalStyleFool: Regional Video Style Transfer Attack Using Segment Anything Model

Title: Normalized Validity Scores for DNNs in Regression based Eye Feature Extraction

Title: Binary Noise for Binary Tasks: Masked Bernoulli Diffusion for Unsupervised Anomaly Detection

Title: Semantic Data Representation for Explainable Windows Malware Detection Models

Title: Better (pseudo-)labels for semi-supervised instance segmentation

Title: NEDS-SLAM: A Novel Neural Explicit Dense Semantic SLAM Framework using 3D Gaussian Splatting

Title: Crystalformer: Infinitely Connected Attention for Periodic Structure Encoding

Title: TTT-KD: Test-Time Training for 3D Semantic Segmentation through Knowledge Distillation from Foundation Models

Title: Urban Scene Diffusion through Semantic Occupancy Map

Title: PITA: Physics-Informed Trajectory Autoencoder

Title: LSKNet: A Foundation Lightweight Backbone for Remote Sensing

Title: Post-Quantum Cryptography: Securing Digital Communication in the Quantum Era

Title: Embedded Named Entity Recognition using Probing Classifiers

Title: Relational Representation Learning Network for Cross-Spectral Image Patch Matching

Title: Revisiting The Classics: A Study on Identifying and Rectifying Gender Stereotypes in Rhymes and Poems

Title: Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs

Title: DVN-SLAM: Dynamic Visual Neural SLAM Based on Local-Global Encoding

Title: Infinite-ID: Identity-preserved Personalization via ID-semantics Decoupling Paradigm

Title: Construction of Hyper-Relational Knowledge Graphs Using Pre-Trained Large Language Models

Title: Deep Medial Voxels: Learned Medial Axis Approximations for Anatomical Shape Modeling

Title: SETA: Semantic-Aware Token Augmentation for Domain Generalization

Title: Reasoning Abilities of Large Language Models: In-Depth Analysis on the Abstraction and Reasoning Corpus

Title: Low-Cost Privacy-Aware Decentralized Learning

Title: Is It Really You Who Forgot the Password? When Account Recovery Meets Risk-Based Authentication

Title: Counting-Stars: A Simple, Efficient, and Reasonable Strategy for Evaluating Long-Context Large Language Models

Title: Federated Modality-specific Encoders and Multimodal Anchors for Personalized Brain Tumor Segmentation

Title: Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation

Title: Metaphor Understanding Challenge Dataset for LLMs

Title: Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery

Title: Problem space structural adversarial attacks for Network Intrusion Detection Systems based on Graph Neural Networks

Title: SSCAE -- Semantic, Syntactic, and Context-aware natural language Adversarial Examples generator

Title: Towards Understanding the Relationship between In-context Learning and Compositional Generalization

Title: Agent3D-Zero: An Agent for Zero-shot 3D Understanding

Title: Ensuring Safe and High-Quality Outputs: A Guideline Library Approach for Language Models

Title: Near-Optimal Solutions of Constrained Learning Problems

Title: GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection

Title: Complete and Efficient Graph Transformers for Crystal Material Property Prediction

Title: GPT-4 as Evaluator: Evaluating Large Language Models on Pest Management in Agriculture

Title: Towards automated formal security analysis of SAML V2.0 Web Browser SSO standard - the POST/Artifact use case

Title: IDF-CR: Iterative Diffusion Process for Divide-and-Conquer Cloud Removal in Remote-sensing Images

Title: CO3: Low-resource Contrastive Co-training for Generative Conversational Query Rewrite

Title: Towards Real-Time Fast Unmanned Aerial Vehicle Detection Using Dynamic Vision Sensors

Title: InTeX: Interactive Text-to-texture Synthesis via Unified Depth-aware Inpainting

Title: ReGenNet: Towards Human Action-Reaction Synthesis

Title: QueryAgent: A Reliable and Efficient Reasoning Framework with Environmental Feedback based Self-Correction

Title: SuperLoRA: Parameter-Efficient Unified Adaptation of Multi-Layer Attention Modules

Title: KnFu: Effective Knowledge Fusion

Title: From explainable to interpretable deep learning for natural language processing in healthcare: how far from reality?

Title: Investigating Markers and Drivers of Gender Bias in Machine Translations

Title: Larimar: Large Language Models with Episodic Memory Control

Title: CICLe: Conformal In-Context Learning for Largescale Multi-Class Food Risk Classification

Title: RoGUENeRF: A Robust Geometry-Consistent Universal Enhancer for NeRF

Title: LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model

Title: HyperColorization: Propagating spatially sparse noisy spectral clues for reconstructing hyperspectral images

Title: Subjective-Aligned Dateset and Metric for Text-to-Video Quality Assessment

Title: Enhanced Event-Based Video Reconstruction with Motion Compensation

Title: Transfer Learning Beyond Bounded Density Ratios

Title: Informed Spectral Normalized Gaussian Processes for Trajectory Prediction

Title: Unveil Conditional Diffusion Models with Classifier-free Guidance: A Sharp Statistical Theory

Title: Diffusion Denoising as a Certified Defense against Clean-label Poisoning

Title: Using Generative Text Models to Create Qualitative Codebooks for Student Evaluations of Teaching

Title: GetMesh: A Controllable Model for High-quality Mesh Generation and Manipulation

Title: Accelerating Scientific Discovery with Generative Knowledge Extraction, Graph-Based Representation, and Multimodal Intelligent Graph Reasoning

Title: Learning Useful Representations of Recurrent Neural Network Weight Matrices

Title: HIRI-ViT: Scaling Vision Transformer with High Resolution Inputs

Title: DreamMotion: Space-Time Self-Similarity Score Distillation for Zero-Shot Video Editing

Title: GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning

Title: SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion

Title: Leveraging Spatial and Semantic Feature Extraction for Skin Cancer Diagnosis with Capsule Networks and Graph Neural Networks

Title: VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model

Title: HOIDiffusion: Generating Realistic 3D Hand-Object Interaction Data

Title: GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image

Title: EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents

Title: Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion Distillation

Title: Supervised Fine-Tuning as Inverse Reinforcement Learning

Title: LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation

Title: FlexCap: Generating Rich, Localized, and Flexible Captions in Images

Title: From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models

Title: Align and Distill: Unifying and Improving Domain Adaptive Object Detection

Title: ROUTERBENCH: A Benchmark for Multi-LLM Routing System

Title: Generic 3D Diffusion Adapter Using Controlled Multi-View Editing

Title: HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation

Title: VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models

Title: One-Step Image Translation with Text-to-Image Models

Title: MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control

Title: Zero-Shot Image Feature Consensus with Deep Functional Maps

Title: Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation