2025-05-13

Title: Dialz: A Python Toolkit for Steering Vectors

Title: A machine learning model for skillful climate system prediction

Title: PARM: Multi-Objective Test-Time Alignment via Preference-Aware Autoregressive Reward Model

Title: A Data-Driven Probabilistic Framework for Cascading Urban Risk Analysis Using Bayesian Networks

Title: DMRL: Data- and Model-aware Reward Learning for Data Extraction

Title: UniCO: Towards a Unified Model for Combinatorial Optimization Problems

Title: Lossless Compression of Large Language Model-Generated Text via Next-Token Prediction

Title: QiMeng-TensorOp: Automatically Generating High-Performance Tensor Operators with Hardware Primitives

Title: Collaborative Multi-LoRA Experts with Achievement-based Multi-Tasks Loss for Unified Multimodal Information Extraction

Title: GraphComp: Extreme Error-bounded Compression of Scientific Data via Temporal Graph Autoencoders

Title: Learn to Think: Bootstrapping LLM Reasoning Capability Through Graph Learning

Title: The ML.ENERGY Benchmark: Toward Automated Inference Energy Measurement and Optimization

Title: Toward Advancing License Plate Super-Resolution in Real-World Scenarios: A Dataset and Benchmark

Title: My Emotion on your face: The use of Facial Keypoint Detection to preserve Emotions in Latent Space Editing

Title: PromptIQ: Who Cares About Prompts? Let System Handle It -- A Component-Aware Framework for T2I Generation

Title: HCMA: Hierarchical Cross-model Alignment for Grounded Text-to-Image Generation

Title: ProFashion: Prototype-guided Fashion Video Generation with Multiple Reference Images

Title: HDGlyph: A Hierarchical Disentangled Glyph-Based Framework for Long-Tail Text Rendering in Diffusion Models

Title: ReplayCAD: Generative Diffusion Replay for Continual Anomaly Detection

Title: Dataset Distillation with Probabilistic Latent Features

Title: Jailbreaking the Text-to-Video Generative Models

Title: UnfoldIR: Rethinking Deep Unfolding Network in Illumination Degradation Image Restoration

Title: Learning Graph Representation of Agent Diffuser

Title: Multimodal Fake News Detection: MFND Dataset and Shallow-Deep Multitask Learning

Title: Topology Guidance: Controlling the Outputs of Generative Models via Vector Field Topology

Title: Joint Low-level and High-level Textual Representation Learning with Multiple Masking Strategies

Title: Image Classification Using a Diffusion Model as a Pre-Training Model

Title: Multi-Modal Explainable Medical AI Assistant for Trustworthy Human-AI Collaboration

Title: Building a Human-Verified Clinical Reasoning Dataset via a Human LLM Hybrid Pipeline for Trustworthy Medical AI

Title: A systematic review of challenges and proposed solutions in modeling multimodal data

Title: High-Frequency Prior-Driven Adaptive Masking for Accelerating Image Super-Resolution

Title: Learning Value of Information towards Joint Communication and Control in 6G V2X

Title: BridgeIV: Bridging Customized Image and Video Generation through Test-Time Autoregressive Identity Propagation

Title: Replay-Based Continual Learning with Dual-Layered Distillation and a Streamlined U-Net for Efficient Text-to-Image Generation

Title: Hallucination-Aware Multimodal Benchmark for Gastrointestinal Image Analysis with Large Vision-Language Models

Title: CMD: Controllable Multiview Diffusion for 3D Editing and Progressive Generation

Title: MELLM: Exploring LLM-Powered Micro-Expression Understanding Enhanced by Subtle Motion Perception

Title: DAPE: Dual-Stage Parameter-Efficient Fine-Tuning for Consistent Video Editing with Diffusion Models

Title: Scaling Laws and Representation Learning in Simple Hierarchical Languages: Transformers vs. Convolutional Architectures

Title: Semantic-Guided Diffusion Model for Single-Step Image Super-Resolution

Title: Multi-Objective-Guided Discrete Flow Matching for Controllable Biological Sequence Design

Title: Generalizable Pancreas Segmentation via a Dual Self-Supervised Learning Framework

Title: Causal View of Time Series Imputation: Some Identification Results on Missing Mechanism

Title: Language-Driven Dual Style Mixing for Single-Domain Generalized Object Detection

Title: Compression, Regularity, Randomness and Emergent Structure: Rethinking Physical Complexity in the Data-Driven Era

Title: Generative Pre-trained Autoregressive Diffusion Transformer

Title: From Search To Sampling: Generative Models For Robust Algorithmic Recourse

Title: Unified Continuous Generative Models

Title: You Only Look One Step: Accelerating Backpropagation in Diffusion Sampling with Gradient Shortcuts

Title: Addressing degeneracies in latent interpolation for diffusion models

Title: FLUXSynID: A Framework for Identity-Controlled Synthetic Face Generation with Document and Live Images

Title: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning

Title: Noise Optimized Conditional Diffusion for Domain Adaptation

Title: Generating Skyline Explanations for Graph Neural Networks

Title: ShotAdapter: Text-to-Multi-Shot Video Generation with Diffusion Models

Title: Anatomical Attention Alignment representation for Radiology Report Generation

Title: Gameplay Highlights Generation

Title: LAMM-ViT: AI Face Detection via Layer-Aware Modulation of Region-Guided Attention

Title: Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets

Title: Synthesizing Diverse Network Flow Datasets with Scalable Dynamic Multigraph Generation

Title: MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering

Title: Continuous Visual Autoregressive Generation via Score Maximization

Title: DanceGRPO: Unleashing GRPO on Visual Generation