2025-05-28

Title: Beyond Demonstrations: Dynamic Vector Construction from Latent Representations

Title: Joint-stochastic-approximation Random Fields with Application to Semi-supervised Learning

Title: Decision Flow Policy Optimization

Title: FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation

Title: GraLoRA: Granular Low-Rank Adaptation for Parameter-Efficient Fine-Tuning

Title: What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models

Title: SEMMA: A Semantic Aware Knowledge Graph Foundation Model

Title: In-context Language Learning for Endangered Languages in Speech Recognition

Title: Time Series Generation Under Data Scarcity: A Unified Generative Modeling Approach

Title: DIPO: Dual-State Images Controlled Articulated Object Generation Powered by Diverse Data

Title: WeatherEdit: Controllable Weather Editing with 4D Gaussian Field

Title: Gatsby Without the 'E': Crafting Lipograms with LLMs

Title: CPathAgent: An Agent-based Foundation Model for Interpretable High-Resolution Pathology Image Analysis Mimicking Pathologists' Diagnostic Logic

Title: MetaWriter: Personalized Handwritten Text Recognition Using Meta-Learned Prompt Tuning

Title: MultLFG: Training-free Multi-LoRA composition using Frequency-domain Guidance

Title: Causality and "In-the-Wild" Video-Based Person Re-ID: A Survey

Title: Emotion Classification In-Context in Spanish

Title: Ctrl-DNA: Controllable Cell-Type-Specific Regulatory DNA Design via Constrained RL

Title: Prot2Token: A Unified Framework for Protein Modeling via Next-Token Prediction

Title: Towards Pretraining Robust ASR Foundation Model with Acoustic-Aware Data Augmentation

Title: OccLE: Label-Efficient 3D Semantic Occupancy Prediction

Title: Incorporating Flexible Image Conditioning into Text-to-Video Diffusion Models without Training

Title: Test-Time Learning for Large Language Models

Title: Continuous-Time Attention: PDE-Guided Mechanisms for Long-Sequence Transformers

Title: Pretraining Language Models to Ponder in Continuous Space

Title: Generating Hypotheses of Dynamic Causal Graphs in Neuroscience: Leveraging Generative Factor Models of Observed Time Series

Title: LeDiFlow: Learned Distribution-guided Flow Matching to Accelerate Image Generation

Title: Intern-GS: Vision Model Guided Sparse-View 3D Gaussian Splatting

Title: MoPFormer: Motion-Primitive Transformer for Wearable-Sensor Activity Recognition

Title: Uni-Instruct: One-step Diffusion Model through Unified Diffusion Divergence Instruction

Title: Robust and Explainable Detector of Time Series Anomaly via Augmenting Multiclass Pseudo-Anomalies

Title: Integrating Intermediate Layer Optimization and Projected Gradient Descent for Solving Inverse Problems with Diffusion Models

Title: Not All Thats Rare Is Lost: Causal Paths to Rare Concept Synthesis

Title: Frame-Level Captions for Long Video Generation with Complex Multi Scenes

Title: HAD: Hybrid Architecture Distillation Outperforms Teacher in Genomic Sequence Modeling

Title: Exploring Timeline Control for Facial Motion Generation

Title: Respond to Change with Constancy: Instruction-tuning with LLM for Non-I.I.D. Network Traffic Classification

Title: In Context Learning with Vision Transformers: Case Study

Title: Dub-S2ST: Textless Speech-to-Speech Translation for Seamless Dubbing

Title: Create Anything Anywhere: Layout-Controllable Personalized Diffusion Model for Multiple Subjects

Title: Geometry-Editable and Appearance-Preserving Object Compositon

Title: NatADiff: Adversarial Boundary Guidance for Natural Adversarial Diffusion

Title: ISAC: Training-Free Instance-to-Semantic Attention Control for Improving Multi-Instance Generation

Title: Unveiling Impact of Frequency Components on Membership Inference Attacks for Diffusion Models

Title: OrienText: Surface Oriented Textual Image Generation

Title: Personalized Query Auto-Completion for Long and Short-Term Interests with Adaptive Detoxification Generation

Title: DreamBoothDPO: Improving Personalized Generation using Direct Preference Optimization

Title: Facial Attribute Based Text Guided Face Anonymization

Title: Uncertainty Unveiled: Can Exposure to More In-context Examples Mitigate Uncertainty for Large Language Models?

Title: Efficient and Unbiased Sampling from Boltzmann Distributions via Variance-Tuned Diffusion Models

Title: FeatInv: Spatially resolved mapping from feature space to input space using conditional diffusion models

Title: RainFusion: Adaptive Video Generation Acceleration via Multi-Dimensional Visual Redundancy

Title: Advancing high-fidelity 3D and Texture Generation with 2.5D latents

Title: Minute-Long Videos with Dual Parallelisms

Title: Conditional Diffusion Models with Classifier-Free Gibbs-like Guidance

Title: A Lightweight Multi-Expert Generative Language Model System for Engineering Information and Knowledge Extraction

Title: Differentiable Solver Search for Fast Diffusion Sampling

Title: ReassembleNet: Learnable Keypoints and Diffusion for 2D Fresco Reconstruction

Title: Learning Single Index Models with Diffusion Priors

Title: Scaling and Prompting for Improved End-to-End Spoken Grammatical Error Correction

Title: Leveraging LLM and Self-Supervised Training Models for Speech Recognition in Chinese Dialects: A Comparative Analysis

Title: FastFace: Tuning Identity Preservation in Distilled Diffusion via Guidance and Attention

Title: Assessment of L2 Oral Proficiency using Speech Large Language Models

Title: RoBiS: Robust Binary Segmentation for High-Resolution Industrial Images

Title: STEB: In Search of the Best Evaluation Approach for Synthetic Time Series

Title: Normalized Attention Guidance: Universal Negative Guidance for Diffusion Model

Title: Sci-Fi: Symmetric Constraint for Frame Inbetweening

Title: Is Hyperbolic Space All You Need for Medical Anomaly Detection?

Title: LMCD: Language Models are Zeroshot Cognitive Diagnosis Learners

Title: BindEnergyCraft: Casting Protein Structure Predictors as Energy-Based Models for Binder Design

Title: Evaluation of LLMs in Medical Text Summarization: The Role of Vocabulary Adaptation in High OOV Settings

Title: Supervised and self-supervised land-cover segmentation & classification of the Biesbosch wetlands

Title: Learnable Kernel Density Estimation for Graphs

Title: A Cross Modal Knowledge Distillation & Data Augmentation Recipe for Improving Transcriptomics Representations through Morphological Features

Title: MagicTryOn: Harnessing Diffusion Transformer for Garment-Preserving Video Virtual Try-on

Title: AgriFM: A Multi-source Temporal Remote Sensing Foundation Model for Crop Mapping

Title: GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution

Title: ZigzagPointMamba: Spatial-Semantic Mamba for Point Cloud Understanding

Title: DeCAF: Decentralized Consensus-And-Factorization for Low-Rank Adaptation of Foundation Models

Title: Automatically Identify and Rectify: Robust Deep Contrastive Multi-view Clustering in Noisy Scenarios

Title: A Convergence Theory for Diffusion Language Models: An Information-Theoretic Perspective

Title: Mentor3AD: Feature Reconstruction-based 3D Anomaly Detection via Multi-modality Mentor Learning

Title: Can Large Reasoning Models Self-Train?

Title: OmniSync: Towards Universal Lip Synchronization via Diffusion Transformers

Title: Designing Cyclic Peptides via Harmonic SDE with Atom-Bond Modeling

Title: M3S-UPD: Efficient Multi-Stage Self-Supervised Learning for Fine-Grained Encrypted Traffic Classification with Unknown Pattern Discovery

Title: Accelerating Diffusion Language Model Inference via Efficient KV Caching and Guided Diffusion

Title: MV-CoLight: Efficient Object Compositing with Consistent Lighting and Shadow Generation

Title: Be Decisive: Noise-Induced Layouts for Multi-Subject Generation

Title: Frame In-N-Out: Unbounded Controllable Image-to-Video Generation