2025-02-06

Title: MIND: Microstructure INverse Design with Generative Hybrid Neural Representation

Title: e-SimFT: Alignment of Generative Models with Simulation Feedback for Pareto-Front Design Exploration

Title: On Teacher Hacking in Language Model Distillation

Title: Blind Visible Watermark Removal with Morphological Dilation

Title: Controllable Video Generation with Provable Disentanglement

Title: A Unified Understanding and Evaluation of Steering Methods

Title: LLM Bandit: Cost-Efficient LLM Generation via Preference-Conditioned Dynamic Routing

Title: A Survey of Sample-Efficient Deep Learning for Change Detection in Remote Sensing: Tasks, Strategies, and Challenges

Title: PH-VAE: A Polynomial Hierarchical Variational Autoencoder Towards Disentangled Representation Learning

Title: Elucidating the Preconditioning in Consistency Distillation

Title: Fast T2T: Optimization Consistency Speeds Up Diffusion-Based Training-to-Testing Solving for Combinatorial Optimization

Title: Membership Inference Attack Should Move On to Distributional Statistics for Distilled Generative Models

Title: Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

Title: Symmetry-Aware Bayesian Flow Networks for Crystal Generation

Title: PICBench: Benchmarking LLMs for Photonic Integrated Circuits Design

Title: MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent

Title: General Time-series Model for Universal Knowledge Representation of Multivariate Time-Series data

Title: RadVLM: A Multitask Conversational Vision-Language Model for Radiology

Title: Can Text-to-Image Generative Models Accurately Depict Age? A Comparative Study on Synthetic Portrait Generation and Age Estimation

Title: TruePose: Human-Parsing-guided Attention Diffusion for Full-ID Preserving Pose Transfer

Title: Masked Autoencoders Are Effective Tokenizers for Diffusion Models

Title: Dress-1-to-3: Single Image to Simulation-Ready 3D Outfit with Diffusion Prior and Differentiable Physics

Title: A Schema-Guided Reason-while-Retrieve framework for Reasoning on Scene Graphs with Large-Language-Models (LLMs)

Title: SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living