2025-12-24

Title: Reducing Label Dependency in Human Activity Recognition with Wearables: From Supervised Learning to Novel Weakly Self-Supervised Approaches

Title: High-Performance Self-Supervised Learning by Joint Training of Flow Matching

Title: CoPHo: Classifier-guided Conditional Topology Generation with Persistent Homology

Title: Simulation-Driven Railway Delay Prediction: An Imitation Learning Approach

Title: OpComm: A Reinforcement Learning Framework for Adaptive Buffer Control in Warehouse Volume Forecasting

Title: Generating the Past, Present and Future from a Motion-Blurred Image

Title: Learning to Refocus with Video Diffusion Models

Title: Fine-Tuned In-Context Learners for Efficient Adaptation

Title: How well do Large Language Models Recognize Instructional Moves? Establishing Baselines for Foundation Models in Educational Discourse

Title: Modeling Non-Ergodic Path Effects Using Conditional Generative Model for Fourier Amplitude Spectra

Title: The Seismic Wavefield Common Task Framework

Title: SE360: Semantic Edit in 360$^\circ$ Panoramas via Hierarchical Data Construction

Title: How Much 3D Do Video Foundation Models Encode?

Title: A Dual-Branch Local-Global Framework for Cross-Resolution Land Cover Mapping

Title: Few-Shot-Based Modular Image-to-Video Adapter for Diffusion Models

Title: Control Variate Score Matching for Diffusion Models

Title: IoT-based Android Malware Detection Using Graph Neural Network With Adversarial Defense

Title: FlashLips: 100-FPS Mask-Free Latent Lip-Sync using Reconstruction Instead of Diffusion or GANs

Title: PairFlow: Closed-Form Source-Target Coupling for Few-Step Generation in Discrete Flow Models

Title: Spatio-Temporal Graphs Beyond Grids: Benchmark for Maritime Anomaly Detection

Title: UMAMI: Unifying Masked Autoregressive Models and Deterministic Rendering for View Synthesis

Title: Retrieval-augmented Prompt Learning for Pre-trained Foundation Models

Title: CoDi -- an exemplar-conditioned diffusion model for low-shot counting

Title: AMoE: Agglomerative Mixture-of-Experts Vision Foundation Model

Title: Optimistic TEE-Rollups: A Hybrid Architecture for Scalable and Verifiable Generative AI Inference on Blockchain

Title: Generative Latent Coding for Ultra-Low Bitrate Image Compression

Title: How I Met Your Bias: Investigating Bias Amplification in Diffusion Models

Title: HGAN-SDEs: Learning Neural Stochastic Differential Equations with Hermite-Guided Adversarial Training

Title: SpidR: Learning Fast and Stable Linguistic Units for Spoken Language Models Without Supervision

Title: Can LLMs Solve My Grandma's Riddle? Evaluating Multilingual Large Language Models on Reasoning Traditional Bangla Tricky Riddles

Title: The devil is in the details: Enhancing Video Virtual Try-On via Keyframe-Driven Details Injection

Title: Inverse Autoregressive Flows for Zero Degree Calorimeter fast simulation

Title: Field-Space Attention for Structure-Preserving Earth System Transformers

Title: CRAFT: Continuous Reasoning and Agentic Feedback Tuning for Multimodal Text-to-Image Generation

Title: SmartSplat: Feature-Smart Gaussians for Scalable Compression of Ultra-High-Resolution Images

Title: Chain-of-Anomaly Thoughts with Large Vision-Language Models

Title: High Dimensional Data Decomposition for Anomaly Detection of Textured Images

Title: UTDesign: A Unified Framework for Stylized Text Editing and Generation in Graphic Design Images

Title: Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

Title: Fail Fast, Win Big: Rethinking the Drafting Strategy in Speculative Decoding via Diffusion LLMs

Title: MoE-DiffuSeq: Enhancing Long-Document Diffusion Models with Sparse Attention and Mixture of Experts

Title: Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

Title: Repurposing Video Diffusion Transformers for Robust Point Tracking

Title: Active Intelligence in Video Avatars via Closed-loop World Modeling

Title: SemanticGen: Video Generation in Semantic Space