2025-07-18

Title: Spatially Grounded Explanations in Vision Language Models for Document Visual Question Answering

Title: Scaling Up RL: Unlocking Diverse Reasoning in LLMs via Prolonged Training

Title: IncA-DES: An incremental and adaptive dynamic ensemble selection approach using online K-d tree neighborhood search for data streams with concept drift

Title: Assay2Mol: large language model-based drug design using BioAssay context

Title: Federated Learning in Open- and Closed-Loop EMG Decoding: A Privacy and Performance Perspective

Title: Think-Before-Draw: Decomposing Emotion Semantics & Fine-Grained Controllable Expressive Talking Head Generation

Title: World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving

Title: Local Representative Token Guided Merging for Text-to-Image Generation

Title: DeQA-Doc: Adapting DeQA-Score to Document Image Quality Assessment

Title: ATL-Diff: Audio-Driven Talking Head Generation with Early Landmarks-Guide Noise Diffusion

Title: RONOM: Reduced-Order Neural Operator Modeling

Title: FIQ: Fundamental Question Generation with the Integration of Question Embeddings for Video Question Answering

Title: SEMT: Static-Expansion-Mesh Transformer Network Architecture for Remote Sensing Image Captioning

Title: An Investigation of Ear-EEG Signals for a Novel Biometric Authentication System

Title: DMQ: Dissecting Outliers of Diffusion Models for Post-Training Quantization

Title: Insights into a radiology-specialised multimodal large language model with sparse autoencoders

Title: LoViC: Efficient Long Video Generation with Context Compression

Title: FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers

Title: A Distributed Generative AI Approach for Heterogeneous Multi-Domain Environments under Data Sharing constraints

Title: Fault detection and diagnosis for the engine electrical system of a space launcher based on a temporal convolutional autoencoder and calibrated classifiers

Title: Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation

Title: R^2MoE: Redundancy-Removal Mixture of Experts for Lifelong Concept Learning

Title: NGTM: Substructure-based Neural Graph Topic Model for Interpretable Graph Generation

Title: Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models

Title: Synthesizing Reality: Leveraging the Generative AI-Powered Platform Midjourney for Construction Worker Detection

Title: Leveraging Pre-Trained Visual Models for AI-Generated Video Detection

Title: VITA: Vision-to-Action Flow Matching Policy

Title: Leveraging Asynchronous Cross-border Market Data for Improved Day-Ahead Electricity Price Forecasting in European Markets

Title: FashionPose: Text to Pose to Relight Image Generation for Personalized Fashion Visualization

Title: Taming Diffusion Transformer for Real-Time Mobile Video Generation

Title: Imbalance in Balance: Online Concept Balancing in Generation Models

Title: AutoPartGen: Autogressive 3D Part Generation and Discovery

Title: Hierarchical Rectified Flow Matching with Mini-Batch Couplings