2025-01-31

Title: Explainable Machine Learning: An Illustration of Kolmogorov-Arnold Network Model for Airfoil Lift Prediction

Title: Shared DIFF Transformer

Title: Generative AI for Vision: A Comprehensive Study of Frameworks and Applications

Title: FinanceQA: A Benchmark for Evaluating Financial Analysis Capabilities of Large Language Models

Title: LLMs can see and hear without any training

Title: Free-T2M: Frequency Enhanced Text-to-Motion Diffusion Model With Consistency Loss

Title: MAMS: Model-Agnostic Module Selection Framework for Video Captioning

Title: Efficient Neural Theorem Proving via Fine-grained Proof Structure Analysis

Title: A Video-grounded Dialogue Dataset and Metric for Event-driven Activities

Title: State Stream Transformer (SST) : Emergent Metacognitive Behaviours Through Latent State Persistence

Title: MatIR: A Hybrid Mamba-Transformer Image Restoration Model

Title: SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer

Title: CLoQ: Enhancing Fine-Tuning of Quantized LLMs via Calibrated LoRA Initialization

Title: HSRMamba: Contextual Spatial-Spectral State Space Model for Single Hyperspectral Super-Resolution

Title: Integrating Spatial and Frequency Information for Under-Display Camera Image Restoration

Title: UDC-VIT: A Real-World Video Dataset for Under-Display Cameras

Title: Token-Hungry, Yet Precise: DeepSeek R1 Highlights the Need for Multi-Step Reasoning Over Speed in MATH

Title: Diffusion Autoencoders are Scalable Image Tokenizers