2025-05-13

Title: Show or Tell? A Benchmark To Evaluate Visual and Textual Prompts in Semantic Segmentation

Title: A Data-Driven Probabilistic Framework for Cascading Urban Risk Analysis Using Bayesian Networks

Title: InfoNCE is a Free Lunch for Semantically guided Graph Contrastive Learning

Title: UniCO: Towards a Unified Model for Combinatorial Optimization Problems

Title: Lossless Compression of Large Language Model-Generated Text via Next-Token Prediction

Title: Prompting Large Language Models for Training-Free Non-Intrusive Load Monitoring

Title: NSF-MAP: Neurosymbolic Multimodal Fusion for Robust and Interpretable Anomaly Prediction in Assembly Pipelines

Title: The ML.ENERGY Benchmark: Toward Automated Inference Energy Measurement and Optimization

Title: Engineering Risk-Aware, Security-by-Design Frameworks for Assurance of Large-Scale Autonomous AI Models

Title: My Emotion on your face: The use of Facial Keypoint Detection to preserve Emotions in Latent Space Editing

Title: Probing In-Context Learning: Impact of Task Complexity and Model Architecture on Generalization and Efficiency

Title: Learning from the Good Ones: Risk Profiling-Based Defenses Against Evasion Attacks on DNNs

Title: System Prompt Poisoning: Persistent Attacks on Large Language Models Beyond User Injection

Title: HCMA: Hierarchical Cross-model Alignment for Grounded Text-to-Image Generation

Title: Improving Generalization of Medical Image Registration Foundation Model

Title: ProFashion: Prototype-guided Fashion Video Generation with Multiple Reference Images

Title: HDGlyph: A Hierarchical Disentangled Glyph-Based Framework for Long-Tail Text Rendering in Diffusion Models

Title: ReplayCAD: Generative Diffusion Replay for Continual Anomaly Detection

Title: AI-Powered Anomaly Detection with Blockchain for Real-Time Security and Reliability in Autonomous Vehicles

Title: Dataset Distillation with Probabilistic Latent Features

Title: TS-SUPERB: A Target Speech Processing Benchmark for Speech Self-Supervised Learning Models

Title: StableMotion: Repurposing Diffusion-Based Image Priors for Motion Estimation

Title: Video Dataset Condensation with Diffusion Models

Title: Jailbreaking the Text-to-Video Generative Models

Title: Model Steering: Learning with a Reference Model Improves Generalization Bounds and Scaling Laws

Title: SimMIL: A Universal Weakly Supervised Pre-Training Framework for Multi-Instance Learning in Whole Slide Pathology Images

Title: Learning Graph Representation of Agent Diffuser

Title: Topology Guidance: Controlling the Outputs of Generative Models via Vector Field Topology

Title: Joint Low-level and High-level Textual Representation Learning with Multiple Masking Strategies

Title: Image Classification Using a Diffusion Model as a Pre-Training Model

Title: Bi-directional Self-Registration for Misaligned Infrared-Visible Image Fusion

Title: A systematic review of challenges and proposed solutions in modeling multimodal data

Title: Unsupervised Learning for Class Distribution Mismatch

Title: Boosting Cross-spectral Unsupervised Domain Adaptation for Thermal Semantic Segmentation

Title: Replay-Based Continual Learning with Dual-Layered Distillation and a Streamlined U-Net for Efficient Text-to-Image Generation

Title: CMD: Controllable Multiview Diffusion for 3D Editing and Progressive Generation

Title: A Vision-Language Foundation Model for Leaf Disease Identification

Title: DAPE: Dual-Stage Parameter-Efficient Fine-Tuning for Consistent Video Editing with Diffusion Models

Title: Seed1.5-VL Technical Report

Title: Scaling Laws and Representation Learning in Simple Hierarchical Languages: Transformers vs. Convolutional Architectures

Title: Semantic-Guided Diffusion Model for Single-Step Image Super-Resolution

Title: Discovering Concept Directions from Diffusion-based Counterfactuals via Latent Clustering

Title: Towards Scalable IoT Deployment for Visual Anomaly Detection via Efficient Compression

Title: Real-Time Bit-Level Encryption of Full High-Definition Video Without Diffusion

Title: Generalizable Pancreas Segmentation via a Dual Self-Supervised Learning Framework

Title: Critique Before Thinking: Mitigating Hallucination through Rationale-Augmented Instruction Tuning

Title: Incomplete In-context Learning

Title: Synthetic Similarity Search in Automotive Production

Title: AttentionInfluence: Adopting Attention Head Influence for Weak-to-Strong Pretraining Data Selection

Title: Generative Pre-trained Autoregressive Diffusion Transformer

Title: QUPID: Quantified Understanding for Enhanced Performance, Insights, and Decisions in Korean Search Engines

Title: From Search To Sampling: Generative Models For Robust Algorithmic Recourse

Title: Boosting Global-Local Feature Matching via Anomaly Synthesis for Multi-Class Point Cloud Anomaly Detection

Title: Unified Continuous Generative Models

Title: You Only Look One Step: Accelerating Backpropagation in Diffusion Sampling with Gradient Shortcuts

Title: Addressing degeneracies in latent interpolation for diffusion models

Title: EAGLE: Contrastive Learning for Efficient Graph Anomaly Detection

Title: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning

Title: Noise Optimized Conditional Diffusion for Domain Adaptation

Title: Self-Supervised Event Representations: Towards Accurate, Real-Time Perception on SoC FPGAs

Title: Evaluating Modern Visual Anomaly Detection Approaches in Semiconductor Manufacturing: A Comparative Study

Title: Deep Learning Advances in Vision-Based Traffic Accident Anticipation: A Comprehensive Review of Methods,Datasets,and Future Directions

Title: ShotAdapter: Text-to-Multi-Shot Video Generation with Diffusion Models

Title: Multimodal Survival Modeling in the Age of Foundation Models

Title: Spoken Language Understanding on Unseen Tasks With In-Context Learning

Title: LAMM-ViT: AI Face Detection via Layer-Aware Modulation of Region-Guided Attention

Title: Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets

Title: Synthesizing Diverse Network Flow Datasets with Scalable Dynamic Multigraph Generation

Title: Learning Dynamics in Continual Pre-Training for Large Language Models

Title: Continuous Visual Autoregressive Generation via Score Maximization

Title: DanceGRPO: Unleashing GRPO on Visual Generation