2024-11-27

Title: Leveraging Conversational Generative AI for Anomaly Detection in Digital Substations

Title: Conditional Text-to-Image Generation with Reference Guidance

Title: TPIE: Topology-Preserved Image Editing With Text Instructions

Title: PaRCE: Probabilistic and Reconstruction-based Competency Estimation for CNN-based Image Classification

Title: Importance-based Token Merging for Diffusion Models

Title: $\textit{Revelio}$: Interpreting and leveraging semantic information in diffusion models

Title: EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion

Title: Classifier-Free Guidance inside the Attraction Basin May Cause Memorization

Title: FollowGen: A Scaled Noise Conditional Diffusion Model for Car-Following Trajectory Prediction

Title: LetsTalk: Latent Diffusion Transformer for Talking Video Synthesis

Title: AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks

Title: PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation

Title: Visual Counter Turing Test (VCT^2): Discovering the Challenges for AI-Generated Image Detection and Introducing Visual AI Index (V_AI)

Title: SHuBERT: Self-Supervised Sign Language Representation Learning via Multi-Stream Cluster Prediction

Title: Revisiting DDIM Inversion for Controlling Defect Generation by Disentangling the Background

Title: In-Context Experience Replay Facilitates Safety Red-Teaming of Text-to-Image Diffusion Models

Title: MICAS: Multi-grained In-Context Adaptive Sampling for 3D Point Cloud Processing

Title: SynDiff-AD: Improving Semantic Segmentation and End-to-End Autonomous Driving with Synthetic Data from Latent Diffusion Models

Title: NovelGS: Consistent Novel-view Denoising via Large Gaussian Reconstruction Model

Title: Scaling Laws for Black box Adversarial Attacks

Title: CoCoNO: Attention Contrast-and-Complete for Initial Noise Optimization in Text-to-Image Synthesis

Title: Contrastive Multi-graph Learning with Neighbor Hierarchical Sifting for Semi-supervised Text Classification

Title: TIDE: Training Locally Interpretable Domain Generalization Models Enables Test-time Correction

Title: From Diffusion to Resolution: Leveraging 2D Diffusion Models for 3D Super-Resolution Task

Title: ST-Align: A Multimodal Foundation Model for Image-Gene Alignment in Spatial Transcriptomics

Title: Phase-Informed Tool Segmentation for Manual Small-Incision Cataract Surgery

Title: Phys4DGen: A Physics-Driven Framework for Controllable and Efficient 4D Content Generation from a Single Image

Title: Controllable Human Image Generation with Personalized Multi-Garments

Title: Abnormality-Driven Representation Learning for Radiology Imaging

Title: Discrete to Continuous: Generating Smooth Transition Poses from Sign Language Observation

Title: Pathways on the Image Manifold: Image Editing via Video Generation

Title: DetailGen3D: Generative 3D Geometry Enhancement via Data-Dependent Flow

Title: Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing

Title: Seq2Time: Sequential Knowledge Transfer for Video LLM Temporal Grounding

Title: MotionWavelet: Human Motion Prediction via Wavelet Manifold Learning

Title: ZoomLDM: Latent Diffusion Model for multi-scale image generation

Title: SatVision-TOA: A Geospatial Foundation Model for Coarse-Resolution All-Sky Remote Sensing Imagery

Title: Words Matter: Leveraging Individual Text Embeddings for Code Generation in CLIP Test-Time Adaptation

Title: TED-VITON: Transformer-Empowered Diffusion Models for Virtual Try-On

Title: D$^2$-World: An Efficient World Model through Decoupled Dynamic Flow

Title: Free$^2$Guide: Gradient-Free Path Integral Control for Enhancing Text-to-Video Generation with Large Vision-Language Models

Title: A generalised novel loss function for computational fluid dynamics

Title: Contrastive Graph Condensation: Advancing Data Versatility through Self-Supervised Learning

Title: Relations, Negations, and Numbers: Looking for Logic in Generative Text-to-Image Models

Title: Contrastive CFG: Improving CFG in Diffusion Models by Contrasting Positive and Negative Concepts

Title: PassionSR: Post-Training Quantization with Adaptive Scale in One-Step Diffusion based Image Super-Resolution

Title: Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation

Title: OSDFace: One-Step Diffusion Model for Face Restoration

Title: ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting

Title: LiteVAR: Compressing Visual Autoregressive Modelling with Efficient Attention and Quantization

Title: PhysMotion: Physics-Grounded Dynamics From a Single Image

Title: SelfSplat: Pose-Free and 3D Prior-Free Generalizable 3D Gaussian Splatting

Title: Promptable Anomaly Segmentation with SAM Through Self-Perception Tuning

Title: GraphSubDetector: Time Series Subsequence Anomaly Detection via Density-Aware Adaptive Graph Neural Network

Title: DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting

Title: From Graph Diffusion to Graph Classification

Title: Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration

Title: DiffSLT: Enhancing Diversity in Sign Language Translation via Diffusion Model

Title: APT: Architectural Planning and Text-to-Blueprint Construction Using Large Language Models for Open-World Agents

Title: Using Large Language Models for Expert Prior Elicitation in Predictive Modelling

Title: Reward Incremental Learning in Text-to-Image Generation

Title: InsightEdit: Towards Better Instruction Following for Image Editing

Title: The Extractive-Abstractive Spectrum: Uncovering Verifiability Trade-offs in LLM Generations

Title: RealTraj: Towards Real-World Pedestrian Trajectory Forecasting

Title: AnchorCrafter: Animate CyberAnchors Saling Your Products via Human-Object Interacting Video Generation

Title: One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models

Title: CoA: Chain-of-Action for Generative Semantic Labels

Title: DRiVE: Diffusion-based Rigging Empowers Generation of Versatile and Expressive Characters

Title: Self-supervised Video Instance Segmentation Can Boost Geographic Entity Alignment in Historical Maps

Title: Rewiring Techniques to Mitigate Oversquashing and Oversmoothing in GNNs: A Survey

Title: "Stupid robot, I want to speak to a human!" User Frustration Detection in Task-Oriented Dialog Systems

Title: Identity-Preserving Text-to-Video Generation by Frequency Decomposition

Title: VLRewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models

Title: WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model

Title: Learning 3D Representations from Procedural 3D Programs

Title: Towards Precise Scaling Laws for Video Diffusion Transformers

Title: Unlocking the Potential of Text-to-Image Diffusion with PAC-Bayesian Theory

Title: Probing the Mid-level Vision Capabilities of Self-Supervised Learning

Title: SuperMat: Physically Consistent PBR Material Estimation at Interactive Rates

Title: FTMoMamba: Motion Generation with Frequency and Text State Space Models

Title: IMPROVE: Improving Medical Plausibility without Reliance on HumanValidation - An Enhanced Prototype-Guided Diffusion Framework

Title: AI-Augmented Ethical Hacking: A Practical Examination of Manual Exploitation and Privilege Escalation in Linux Environments

Title: Pre-training for Action Recognition with Automatically Generated Fractal Datasets

Title: VideoDirector: Precise Video Editing via Text-to-Video Models

Title: Accelerating Vision Diffusion Transformers with Skip Branches

Title: A robust image encryption scheme based on new 4-D hyperchaotic system and elliptic curve

Title: How do Multimodal Foundation Models Encode Text and Speech? An Analysis of Cross-Lingual and Cross-Modal Representations

Title: SketchAgent: Language-Driven Sequential Sketch Generation

Title: GenDeg: Diffusion-Based Degradation Synthesis for Generalizable All-in-One Image Restoration

Title: ScribbleLight: Single Image Indoor Relighting with Scribbles

Title: StableAnimator: High-Quality Identity-Preserving Human Image Animation