2025-02-18

Title: Leveraging Constraint Violation Signals For Action-Constrained Reinforcement Learning

Title: A Survey of Representation Learning, Optimization Strategies, and Applications for Omnidirectional Vision

Title: FlexControl: Computation-Aware ControlNet with Differentiable Router for Text-to-Image Generation

Title: Quaternion-Hadamard Network: A Novel Defense Against Adversarial Attacks with a New Dataset

Title: One Example Shown, Many Concepts Known! Counterexample-Driven Conceptual Reasoning in Mathematical LLMs

Title: I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models

Title: LLM4GNAS: A Large Language Model Based Toolkit for Graph Neural Architecture Search

Title: Preference learning made easy: Everything should be understood through win rate

Title: KernelBench: Can LLMs Write Efficient GPU Kernels?

Title: PolyPath: Adapting a Large Multimodal Model for Multi-slide Pathology Report Generation

Title: Classifier-free Guidance with Adaptive Scaling

Title: Data-driven Super-Resolution of Flood Inundation Maps using Synthetic Simulations

Title: ControllableGPT: A Ground-Up Designed Controllable GPT for Molecule Optimization

Title: LLM-Lasso: A Robust Framework for Domain-Informed Feature Selection and Regularization

Title: Hybrid Deepfake Image Detection: A Comprehensive Dataset-Driven Approach Integrating Convolutional and Attention Mechanisms with Frequency Domain Features

Title: FuncGenFoil: Airfoil Generation and Editing Model in Function Space

Title: Disentangle Nighttime Lens Flares: Self-supervised Generation-based Lens Flare Removal

Title: VarGes: Improving Variation in Co-Speech 3D Gesture Generation via StyleCLIPS

Title: Bone Soups: A Seek-and-Soup Model Merging Approach for Controllable Multi-Objective Generation

Title: Preconditioned Inexact Stochastic ADMM for Deep Model

Title: HybriDNA: A Hybrid Transformer-Mamba2 Long-Range DNA Language Model

Title: The Vendiscope: An Algorithmic Microscope For Data Collections

Title: SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers

Title: Implicit Neural Representations of Molecular Vector-Valued Functions

Title: Super Resolution image reconstructs via total variation-based image deconvolution: a majorization-minimization approach

Title: Automatic Quality Assessment of First Trimester Crown-Rump-Length Ultrasound Images

Title: LLM-driven Knowledge Distillation for Dynamic Text-Attributed Graphs

Title: Do Deepfake Detectors Work in Reality?

Title: TPCap: Unlocking Zero-Shot Image Captioning with Trigger-Augmented and Multi-Modal Purification Modules

Title: Simplify RLHF as Reward-Weighted SFT: A Variational Method

Title: Diversified Sampling Improves Scaling LLM inference

Title: Deep Incomplete Multi-view Learning via Cyclic Permutation of VAEs

Title: Accelerating Anchors via Specialization and Feature Transformation

Title: Phantom: Subject-consistent video generation via cross-modal alignment

Title: AnyRefill: A Unified, Data-Efficient Framework for Left-Prompt-Guided Vision Tasks

Title: SURGE: On the Potential of Large Language Models as General-Purpose Surrogate Code Executors

Title: MaskFlow: Discrete Flows For Flexible and Efficient Long Video Generation

Title: Span-Agnostic Optimal Sample Complexity and Oracle Inequalities for Average-Reward RL

Title: Exploiting Point-Language Models with Dual-Prompts for 3D Anomaly Detection

Title: Inverse Flow and Consistency Models

Title: Biases in Edge Language Models: Detection, Analysis, and Mitigation

Title: MARS: Mesh AutoRegressive Model for 3D Shape Detailization

Title: DiSCo: Device-Server Collaborative LLM-Based Text Streaming Services

Title: Training-Free Guidance Beyond Differentiability: Scalable Path Steering with Tree Search in Diffusion and Flow Models

Title: Connector-S: A Survey of Connectors in Multi-modal Large Language Models

Title: Towards Efficient Pre-training: Exploring FP4 Precision in Large Language Models

Title: GiFT: Gibbs Fine-Tuning for Code Generation

Title: Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation

Title: DATA: Decomposed Attention-based Task Adaptation for Rehearsal-Free Continual Learning

Title: Accelerated Gradient-based Design Optimization Via Differentiable Physics-Informed Neural Operator: A Composites Autoclave Processing Case Study

Title: A GNN-based Spectral Filtering Mechanism for Imbalance Classification in Network Digital Twin

Title: DifCluE: Generating Counterfactual Explanations with Diffusion Autoencoders and modal clustering

Title: SayAnything: Audio-Driven Lip Synchronization with Conditional Video Diffusion

Title: Control-CLIP: Decoupling Category and Style Guidance in CLIP for Specific-Domain Generation

Title: Syllables to Scenes: Literary-Guided Free-Viewpoint 3D Scene Synthesis from Japanese Haiku

Title: GraphThought: Graph Combinatorial Optimization with Thought Generation

Title: Maximum Entropy Reinforcement Learning with Diffusion Policy

Title: Membership Inference Attacks for Face Images Against Fine-Tuned Latent Diffusion Models

Title: Neural Interpretable Reasoning

Title: GaussianMotion: End-to-End Learning of Animatable Gaussian Avatars with Pose Guidance from Text

Title: Hyperspherical Energy Transformer with Recurrent Depth

Title: MMXU: A Multi-Modal and Multi-X-ray Understanding Dataset for Disease Progression

Title: Object-Centric Image to Video Generation with Language Guidance

Title: MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction

Title: MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow

Title: The Worse The Better: Content-Aware Viewpoint Generation Network for Projection-related Point Cloud Quality Assessment

Title: Component-aware Unsupervised Logical Anomaly Generation for Industrial Anomaly Detection

Title: Proactive Depot Discovery: A Generative Framework for Flexible Location-Routing

Title: No-reference geometry quality assessment for colorless point clouds via list-wise rank learning

Title: Language Models Can See Better: Visual Contrastive Decoding For LLM Multimodal Reasoning

Title: From Selection to Generation: A Survey of LLM-based Active Learning

Title: DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation

Title: Continual Learning Should Move Beyond Incremental Classification

Title: Image Inversion: A Survey from GANs to Diffusion and Beyond

Title: Unsupervised Structural-Counterfactual Generation under Domain Shift

Title: HumanGif: Single-View Human Diffusion with Generative Prior

Title: Descriminative-Generative Custom Tokens for Vision-Language Models

Title: LaM-SLidE: Latent Space Modeling of Spatial Dynamical Systems via Linked Entities

Title: MagicArticulate: Make Your 3D Models Articulation-Ready

Title: HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation

Title: VoLUT: Efficient Volumetric streaming enhanced by LUT-based super-resolution