2025-01-16

Title: SCOT: Self-Supervised Contrastive Pretraining For Zero-Shot Compositional Retrieval

Title: Is Stochastic Gradient Descent Effective? A PDE Perspective on Machine Learning processes

Title: Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Title: Time series forecasting for multidimensional telemetry data using GAN and BiLSTM in a Digital Twin

Title: Selective Attention Merging for low resource tasks: A case study of Child ASR

Title: Detecting Contextual Anomalies by Discovering Consistent Spatial Regions

Title: Benchmarking Classical, Deep, and Generative Models for Human Activity Recognition

Title: Yuan: Yielding Unblemished Aesthetics Through A Unified Network for Visual Imperfections Removal in Generated Images

Title: DynamicFace: High-Quality and Consistent Video Face Swapping using Composable 3D Facial Priors

Title: Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation

Title: Watermarking in Diffusion Model: Gaussian Shading with Exact Diffusion Inversion via Coupled Transformations (EDICT)

Title: RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation

Title: Transformer-based Multivariate Time Series Anomaly Localization

Title: MAGNET: Augmenting Generative Decoders with Representation Learning and Infilling Capabilities

Title: Joint Learning of Depth and Appearance for Portrait Image Animation

Title: StereoGen: High-quality Stereo Image Generation from a Single Image

Title: FlexiClip: Locality-Preserving Free-Form Character Animation

Title: Investigating Parameter-Efficiency of Hybrid QuGANs Based on Geometric Properties of Generated Sea Route Graphs

Title: RealVVT: Towards Photorealistic Video Virtual Try-on via Spatio-Temporal Consistency

Title: Self-supervised Transformation Learning for Equivariant Representations

Title: The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Learning Capabilities

Title: Transformed Low-rank Adaptation via Tensor Decomposition and Its Applications to Text-to-image Models

Title: Few-Shot Learner Generalizes Across AI-Generated Image Detection

Title: Enhanced Large Language Models for Effective Screening of Depression and Anxiety

Title: Admitting Ignorance Helps the Video Question Answering Models to Answer

Title: Exploring ChatGPT for Face Presentation Attack Detection in Zero and Few-Shot in-Context Learning

Title: MANTA: Diffusion Mamba for Efficient and Effective Stochastic Long-Term Dense Anticipation

Title: Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous Driving

Title: Enhanced Multi-Scale Cross-Attention for Person Image Generation

Title: Applying General Turn-taking Models to Conversational Human-Robot Interaction

Title: CityLoc: 6 DoF Localization of Text Descriptions in Large-Scale Scenes with Gaussian Representation

Title: CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities

Title: RepVideo: Rethinking Cross-Layer Representation for Video Generation

Title: VECT-GAN: A variationally encoded generative model for overcoming data scarcity in pharmaceutical science

Title: Aegis2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails

Title: SimGen: A Diffusion-Based Framework for Simultaneous Surgical Image and Segmentation Mask Generation

Title: Ouroboros-Diffusion: Exploring Consistent Content Generation in Tuning-free Long Video Diffusion