2025-04-15

Title: Embedding Hidden Adversarial Capabilities in Pre-Trained Diffusion Models

Title: Enhancing NER Performance in Low-Resource Pakistani Languages using Cross-Lingual Data Augmentation

Title: InfoGain Wavelets: Furthering the Design of Diffusion Wavelets for Graph-Structured Data

Title: Generative AI in Live Operations: Evidence of Productivity Gains in Cybersecurity and Endpoint Management

Title: Mechanistic Anomaly Detection for "Quirky" Language Models

Title: Probabilistic QoS Metric Forecasting in Delay-Tolerant Networks Using Conditional Diffusion Models on Latent Dynamics

Title: PatchTrAD: A Patch-Based Transformer focusing on Patch-Wise Reconstruction Error for Time Series Anomaly Detection

Title: Datum-wise Transformer for Synthetic Tabular Data Detection in the Wild

Title: Mimic In-Context Learning for Multimodal Tasks

Title: Knowledge Graph-extended Retrieval Augmented Generation for Question Answering

Title: Position: Beyond Euclidean -- Foundation Models Should Embrace Non-Euclidean Geometries

Title: LookingGlass: Generative Anamorphoses via Laplacian Pyramid Warping

Title: Robust SAM: On the Adversarial Robustness of Vision Foundation Models

Title: HyperCore: The Core Framework for Building Hyperbolic Foundation Models with Comprehensive Modules

Title: Parameter-Free Fine-tuning via Redundancy Elimination for Vision Foundation Models

Title: Long Context In-Context Compression by Getting to the Gist of Gisting

Title: MotionDreamer: One-to-Many Motion Synthesis with Localized Generative Masked Transformer

Title: Sculpting Memory: Multi-Concept Forgetting in Diffusion Models via Dynamic Mask and Concept-Aware Optimization

Title: Multimodal 3D Genome Pre-training

Title: Privacy Preservation in Gen AI Applications

Title: BIGS: Bimanual Category-agnostic Interaction Reconstruction from Monocular Videos via 3D Gaussian Splatting

Title: CAShift: Benchmarking Log-Based Cloud Attack Detection under Normality Shift

Title: Self-Supervised Autoencoder Network for Robust Heart Rate Extraction from Noisy Photoplethysmogram: Applying Blind Source Separation to Biosignal Analysis

Title: MatWheel: Addressing Data Scarcity in Materials Science Through Synthetic Data

Title: Secure Physical Layer Communications for Low-Altitude Economy Networking: A Survey

Title: Evolved Hierarchical Masking for Self-Supervised Learning

Title: Can postgraduate translation students identify machine-generated text?

Title: From Visual Explanations to Counterfactual Explanations with Latent Diffusion

Title: VideoAds for Fast-Paced Video Understanding: Where Opensource Foundation Models Beat GPT-4o & Gemini-1.5 Pro

Title: Enhancing Contrastive Demonstration Selection with Semantic Diversity for Robust In-Context Machine Translation

Title: MedIL: Implicit Latent Spaces for Generating Heterogeneous Medical Images at Arbitrary Resolutions

Title: Text To 3D Object Generation For Scalable Room Assembly

Title: REMEMBER: Retrieval-based Explainable Multimodal Evidence-guided Modeling for Brain Evaluation and Reasoning in Zero- and Few-shot Neurodegenerative Diagnosis

Title: Structure-Accurate Medical Image Translation based on Dynamic Frequency Balance and Knowledge Guidance

Title: D$^2$iT: Dynamic Diffusion Transformer for Accurate Image Generation

Title: CamMimic: Zero-Shot Image To Camera Motion Personalized Video Generation Using Diffusion Models

Title: Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation

Title: GenEDA: Unleashing Generative Reasoning on Netlist via Multimodal Encoder-Decoder Aligned Foundation Model

Title: MADLLM: Multivariate Anomaly Detection via Pre-trained LLMs

Title: DiffuMural: Restoring Dunhuang Murals with Multi-scale Diffusion

Title: Causal integration of chemical structures improves representations of microscopy images for morphological profiling

Title: SD-ReID: View-aware Stable Diffusion for Aerial-Ground Person Re-Identification

Title: Mitigating Long-tail Distribution in Oracle Bone Inscriptions: Dataset, Model, and Benchmark

Title: TextSplat: Text-Guided Semantic Fusion for Generalizable Gaussian Splatting

Title: Mixture-of-Shape-Experts (MoSE): End-to-End Shape Dictionary Framework to Prompt SAM for Generalizable Medical Segmentation

Title: Mitigating Many-Shot Jailbreaking

Title: Early-Bird Diffusion: Investigating and Leveraging Timestep-Aware Early-Bird Tickets in Diffusion Models for Efficient Training

Title: Iterative Self-Training for Code Generation via Reinforced Re-Ranking

Title: KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation

Title: Computer-Aided Layout Generation for Building Design: A Review

Title: ToolTipNet: A Segmentation-Driven Deep Learning Baseline for Surgical Instrument Tip Detection

Title: Dynamical symmetries in the fluctuation-driven regime: an application of Noether's theorem to noisy dynamical systems

Title: Executable Functional Abstractions: Inferring Generative Programs for Advanced Math Problems

Title: EquiVDM: Equivariant Video Diffusion Models with Temporally Consistent Noise

Title: VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents

Title: Enhanced Semantic Extraction and Guidance for UGC Image Super Resolution

Title: TWSSenti: A Novel Hybrid Framework for Topic-Wise Sentiment Analysis on Social Media Using Transformer Models

Title: Learning to Erase Private Knowledge from Multi-Documents for Retrieval-Augmented Large Language Models

Title: Enhancing Multi-task Learning Capability of Medical Generalist Foundation Model via Image-centric Multi-annotation Data

Title: GaussVideoDreamer: 3D Scene Generation with Video Diffusion and Inconsistency-Aware Gaussian Splatting

Title: Investigating the Role of Bilateral Symmetry for Inpainting Brain MRI

Title: A Computational Cognitive Model for Processing Repetitions of Hierarchical Relations

Title: AGO: Adaptive Grounding for Open World 3D Occupancy Prediction

Title: Hierarchical and Step-Layer-Wise Tuning of Attention Specialty for Multi-Instance Synthesis in Diffusion Transformers

Title: Efficient Generative Model Training via Embedded Representation Warmup

Title: A Model Zoo of Vision Transformers

Title: DiffMOD: Progressive Diffusion Point Denoising for Moving Object Detection in Remote Sensing

Title: $α$-Flow: A Unified Framework for Continuous-State Discrete Flow Matching Models

Title: Noise2Ghost: Self-supervised deep convolutional reconstruction for ghost imaging

Title: Analysis of Attention in Video Diffusion Transformers

Title: SlowFastVAD: Video Anomaly Detection via Integrating Simple Detector and RAG-Enhanced Vision-Language Model

Title: LL-Gaussian: Low-Light Scene Reconstruction and Enhancement via Gaussian Splatting for Novel View Synthesis

Title: Multimodal Representation Learning Techniques for Comprehensive Facial State Analysis

Title: Satellite Federated Fine-Tuning for Foundation Models in Space Computing Power Networks

Title: Foundation models for electronic health records: representation dynamics and transferability

Title: MonoDiff9D: Monocular Category-Level 9D Object Pose Estimation via Diffusion Model

Title: Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing

Title: Art3D: Training-Free 3D Generation from Flat-Colored Illustration

Title: REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers

Title: Decoupled Diffusion Sparks Adaptive Scene Generation