2025-02-18

Title: Leveraging Constraint Violation Signals For Action-Constrained Reinforcement Learning

Title: Towards Copyright Protection for Knowledge Bases of Retrieval-augmented Language Models via Ownership Verification with Reasoning

Title: FlexControl: Computation-Aware ControlNet with Differentiable Router for Text-to-Image Generation

Title: I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models

Title: LLM4GNAS: A Large Language Model Based Toolkit for Graph Neural Architecture Search

Title: SinSim: Sinkhorn-Regularized SimCLR

Title: SWA-LDM: Toward Stealthy Watermarks for Latent Diffusion Models

Title: Hallucinations and Truth: A Comprehensive Accuracy Evaluation of RAG, LoRA and DoRA

Title: Preference learning made easy: Everything should be understood through win rate

Title: Efficient Hierarchical Contrastive Self-supervising Learning for Time Series Classification via Importance-aware Resolution Selection

Title: Classifier-free Guidance with Adaptive Scaling

Title: Towards Self-Supervised Covariance Estimation in Deep Heteroscedastic Regression

Title: Post-training an LLM for RAG? Train on Self-Generated Demonstrations

Title: Federated Learning-Driven Cybersecurity Framework for IoT Networks with Privacy-Preserving and Real-Time Threat Detection Capabilities

Title: HIPPo: Harnessing Image-to-3D Priors for Model-free Zero-shot 6D Pose Estimation

Title: Lost in the Passage: Passage-level In-context Learning Does Not Necessarily Need a "Passage"

Title: Is Self-Supervised Pre-training on Satellite Imagery Better than ImageNet? A Systematic Study with Sentinel-2

Title: Reading Your Heart: Learning ECG Words and Sentences via Pre-training ECG Language Model

Title: A Computational Model for Ransomware Detection Using Cross-Domain Entropy Signatures

Title: FuncGenFoil: Airfoil Generation and Editing Model in Function Space

Title: Disentangle Nighttime Lens Flares: Self-supervised Generation-based Lens Flare Removal

Title: BASE-SQL: A powerful open source Text-To-SQL baseline approach

Title: Preconditioned Inexact Stochastic ADMM for Deep Model

Title: Epidemic-guided deep learning for spatiotemporal forecasting of Tuberculosis outbreak

Title: PDA: Generalizable Detection of AI-Generated Images via Post-hoc Distribution Alignment

Title: HybriDNA: A Hybrid Transformer-Mamba2 Long-Range DNA Language Model

Title: BalanceBenchmark: A Survey for Imbalanced Learning

Title: The Vendiscope: An Algorithmic Microscope For Data Collections

Title: SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers

Title: Do Deepfake Detectors Work in Reality?

Title: Exploring Contextual Flux in Large Language Models: A Novel Approach to Self-Modulating Semantic Networks

Title: Skillful Nowcasting of Convective Clouds With a Cascade Diffusion Model

Title: ControlText: Unlocking Controllable Fonts in Multilingual Text Rendering without Font Annotations

Title: Prompt Inject Detection with Generative Explanation as an Investigative Tool

Title: Collaborative Deterministic-Diffusion Model for Probabilistic Urban Spatiotemporal Prediction

Title: ClimateLLM: Efficient Weather Forecasting via Frequency-Aware Large Language Models

Title: Are Generative Models Underconfident? An Embarrassingly Simple Quality Estimation Approach

Title: FELLE: Autoregressive Speech Synthesis with Token-Wise Coarse-to-Fine Flow Matching

Title: Machine Learning-Based Intrusion Detection and Prevention System for IIoT Smart Metering Networks: Challenges and Solutions

Title: AnyRefill: A Unified, Data-Efficient Framework for Left-Prompt-Guided Vision Tasks

Title: LogiDynamics: Unraveling the Dynamics of Logical Inference in Large Language Model Reasoning

Title: MaskFlow: Discrete Flows For Flexible and Efficient Long Video Generation

Title: Span-Agnostic Optimal Sample Complexity and Oracle Inequalities for Average-Reward RL

Title: Uncertainty-Aware Step-wise Verification with Generative Reward Models

Title: Exploiting Point-Language Models with Dual-Prompts for 3D Anomaly Detection

Title: ALGEN: Few-shot Inversion Attacks on Textual Embeddings using Alignment and Generation

Title: Inverse Flow and Consistency Models

Title: WRT-SAM: Foundation Model-Driven Segmentation for Generalized Weld Radiographic Testing

Title: Blessing of Multilinguality: A Systematic Analysis of Multilingual In-Context Learning

Title: Without Paired Labeled Data: An End-to-End Self-Supervised Paradigm for UAV-View Geo-Localization

Title: MARS: Mesh AutoRegressive Model for 3D Shape Detailization

Title: Following the Autoregressive Nature of LLM Embeddings via Compression and Alignment

Title: Training-Free Guidance Beyond Differentiability: Scalable Path Steering with Tree Search in Diffusion and Flow Models

Title: ADO: Automatic Data Optimization for Inputs in LLM Prompts

Title: SAFE-SQL: Self-Augmented In-Context Learning with Fine-grained Example Selection for Text-to-SQL

Title: An Efficient Row-Based Sparse Fine-Tuning

Title: Medical Image Registration Meets Vision Foundation Model: Prototype Learning and Contour Awareness

Title: Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation

Title: Balanced Multi-Factor In-Context Learning for Multilingual Large Language Models

Title: DifCluE: Generating Counterfactual Explanations with Diffusion Autoencoders and modal clustering

Title: SayAnything: Audio-Driven Lip Synchronization with Conditional Video Diffusion

Title: Training Large Language Models to be Better Rule Followers

Title: Control-CLIP: Decoupling Category and Style Guidance in CLIP for Specific-Domain Generation

Title: DCAD-2000: A Multilingual Dataset across 2000+ Languages with Data Cleaning as Anomaly Detection

Title: Continuous Diffusion Model for Language Modeling

Title: Towards a Trustworthy Anomaly Detection for Critical Applications through Approximated Partial AUC Loss

Title: Syllables to Scenes: Literary-Guided Free-Viewpoint 3D Scene Synthesis from Japanese Haiku

Title: iMOVE: Instance-Motion-Aware Video Understanding

Title: GraphThought: Graph Combinatorial Optimization with Thought Generation

Title: Maximum Entropy Reinforcement Learning with Diffusion Policy

Title: In-Context Parametric Inference: Point or Distribution Estimators?

Title: Membership Inference Attacks for Face Images Against Fine-Tuned Latent Diffusion Models

Title: GaussianMotion: End-to-End Learning of Animatable Gaussian Avatars with Pose Guidance from Text

Title: Hyperspherical Energy Transformer with Recurrent Depth

Title: Object-Centric Image to Video Generation with Language Guidance

Title: MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction

Title: RIDE: Enhancing Large Language Model Alignment through Restyled In-Context Learning Demonstration Exemplars

Title: Improve LLM-as-a-Judge Ability as a General Ability

Title: MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow

Title: The Worse The Better: Content-Aware Viewpoint Generation Network for Projection-related Point Cloud Quality Assessment

Title: Component-aware Unsupervised Logical Anomaly Generation for Industrial Anomaly Detection

Title: Proactive Depot Discovery: A Generative Framework for Flexible Location-Routing

Title: ILIAS: Instance-Level Image retrieval At Scale

Title: Language Models Can See Better: Visual Contrastive Decoding For LLM Multimodal Reasoning

Title: BackdoorDM: A Comprehensive Benchmark for Backdoor Learning in Diffusion Model

Title: Intuitive physics understanding emerges from self-supervised pretraining on natural videos

Title: Enhanced Anomaly Detection in IoMT Networks using Ensemble AI Models on the CICIoMT2024 Dataset

Title: Understanding In-Context Machine Translation for Low-Resource Languages: A Case Study on Manchu

Title: DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation

Title: Continual Learning Should Move Beyond Incremental Classification

Title: Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction

Title: Image Inversion: A Survey from GANs to Diffusion and Beyond

Title: Unsupervised Structural-Counterfactual Generation under Domain Shift

Title: HumanGif: Single-View Human Diffusion with Generative Prior

Title: Unifying Explainable Anomaly Detection and Root Cause Analysis in Dynamical Systems

Title: Descriminative-Generative Custom Tokens for Vision-Language Models

Title: A-MEM: Agentic Memory for LLM Agents

Title: LaM-SLidE: Latent Space Modeling of Spatial Dynamical Systems via Linked Entities

Title: MagicArticulate: Make Your 3D Models Articulation-Ready

Title: Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening

Title: HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation

Title: Diffusion Models without Classifier-free Guidance