2025-08-20

Title: Contextual Attention-Based Multimodal Fusion of LLM and CNN for Sentiment Analysis

Title: Strategies for training point distributions in physics-informed neural networks

Title: MIRAGE: Towards AI-Generated Image Detection in the Wild

Title: DianJin-OCR-R1: Enhancing OCR Capabilities via a Reasoning-and-Tool Interleaved Vision-Language Model

Title: GaitCrafter: Diffusion Model for Biometric Preserving Gait Synthesis

Title: Efficient Constraint-Aware Flow Matching via Randomized Exploration

Title: X-MoE: Enabling Scalable Training for Emerging Mixture-of-Experts Architectures on HPC Platforms

Title: Counterfactual Probabilistic Diffusion with Expert Models

Title: NovoMolGen: Rethinking Molecular Language Model Pretraining

Title: EventTSF: Event-Aware Non-Stationary Time Series Forecasting

Title: Structured Prompting and Multi-Agent Knowledge Distillation for Traffic Video Interpretation and Risk Inference

Title: EDTalk++: Full Disentanglement for Controllable Talking Head Synthesis

Title: Revisiting MLLM Token Technology through the Lens of Classical Visual Coding

Title: MINR: Efficient Implicit Neural Representations for Multi-Image Encoding

Title: 2D Gaussians Meet Visual Tokenizer

Title: Evaluating Open-Source Vision Language Models for Facial Emotion Recognition against Traditional Deep Learning Models

Title: MuFlex: A Scalable, Physics-based Platform for Multi-Building Flexibility Analysis and Coordination

Title: EAvatar: Expression-Aware Head Avatar Reconstruction with Generative Geometry Priors

Title: FLAIR: Frequency- and Locality-Aware Implicit Neural Representations

Title: A Lightweight Dual-Mode Optimization for Generative Face Video Coding

Title: Color Spike Data Generation via Bio-inspired Neuron-like Encoding with an Artificial Photoreceptor Layer

Title: Prediction of Hospital Associated Infections During Continuous Hospital Stays

Title: Generative Model-Based Feature Attention Module for Video Action Analysis

Title: Bridging Clear and Adverse Driving Conditions

Title: PersonaVlog: Personalized Multimodal Vlog Generation with Multi-Agent Collaboration and Iterative Self-Correction

Title: Towards a Larger Model via One-Shot Federated Learning on Heterogeneous Client Models

Title: DiffIER: Optimizing Diffusion Models with Iterative Error Reduction

Title: Text2Weight: Bridging Natural Language and Neural Network Weight Spaces

Title: Personalized Subgraph Federated Learning with Sheaf Collaboration

Title: Disentangled Deep Smoothed Bootstrap for Fair Imbalanced Regression

Title: SAGA: Learning Signal-Aligned Distributions for Improved Text-to-Image Generation

Title: Revisiting Diffusion Q-Learning: From Iterative Denoising to One-Step Action Generation

Title: DIME-Net: A Dual-Illumination Adaptive Enhancement Network Based on Retinex and Mixture-of-Experts

Title: ViT-FIQA: Assessing Face Image Quality using Vision Transformers

Title: ROVR-Open-Dataset: A Large-Scale Depth Dataset for Autonomous Driving

Title: Physics-Based 3D Simulation for Synthetic Data Generation and Failure Analysis in Packaging Stability Assessment

Title: ResPlan: A Large-Scale Vector-Graph Dataset of 17,000 Residential Floor Plans

Title: InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing

Title: GeoSAM2: Unleashing the Power of SAM2 for 3D Part Segmentation