2026-01-15

Title: Bias Detection and Rotation-Robustness Mitigation in Vision-Language Models and Generative Image Models

Title: R$^2$BD: A Reconstruction-Based Method for Generalizable and Efficient Detection of Fake Images

Title: ForensicFormer: Hierarchical Multi-Scale Reasoning for Cross-Domain Image Forgery Detection

Title: The Semantic Lifecycle in Embodied AI: Acquisition, Representation and Storage via Foundation Models

Title: TAG-MoE: Task-Aware Gating for Unified Generative Mixture-of-Experts

Title: Compressing Vision Transformers in Geospatial Transfer Learning with Manifold-Constrained Optimization

Title: Spectral Generative Flow Models: A Physics-Inspired Replacement for Vectorized Large Language Models

Title: DriftGuard: A Hierarchical Framework for Concept Drift Detection and Remediation in Supply Chain Forecasting

Title: Breaking the Bottlenecks: Scalable Diffusion Models for 3D Molecular Generation

Title: TranslateGemma Technical Report

Title: Depth-Wise Representation Development Under Blockwise Self-Supervised Learning for Video Vision Transformers

Title: How Many Human Judgments Are Enough? Feasibility Limits of Human Preference Evaluation

Title: Vision Foundation Models for Domain Generalisable Cross-View Localisation in Planetary Ground-Aerial Robotic Teams

Title: Small but Mighty: Dynamic Wavelet Expert-Guided Fine-Tuning of Large-Scale Models for Optical Remote Sensing Object Segmentation

Title: SAM-Aug: Leveraging SAM Priors for Few-Shot Parcel Segmentation in Satellite Time Series

Title: Discrete Solution Operator Learning for Geometry-Dependent PDEs

Title: SSVP: Synergistic Semantic-Visual Prompting for Industrial Zero-Shot Anomaly Detection

Title: Architecture inside the mirage: evaluating generative image models on architectural style, elements, and typologies

Title: OrthoGeoLoRA: Geometric Parameter-Efficient Fine-Tuning for Structured Social Science Concept Retrieval on theWeb

Title: Affostruction: 3D Affordance Grounding with Generative Reconstruction

Title: SpikeVAEDiff: Neural Spike-based Natural Visual Scene Reconstruction via VD-VAE and Versatile Diffusion

Title: Knowledge-Embedded and Hypernetwork-Guided Few-Shot Substation Meter Defect Image Generation Method

Title: PhyRPR: Training-Free Physics-Constrained Video Generation

Title: GaussianFluent: Gaussian Simulation for Dynamic Scenes with Mixed Materials

Title: Enhancing Spatial Reasoning in Large Language Models for Metal-Organic Frameworks Structure Prediction

Title: Explainable Autoencoder-Based Anomaly Detection in IEC 61850 GOOSE Networks

Title: Frequency Error-Guided Under-sampling Optimization for Multi-Contrast MRI Reconstruction

Title: Beyond the final layer: Attentive multilayer fusion for vision transformers

Title: Relation Extraction Capabilities of LLMs on Clinical Text: A Bilingual Evaluation for English and Turkish

Title: MAD: Motion Appearance Decoupling for efficient Driving World Models

Title: Terminally constrained flow-based generative models from an optimal control perspective

Title: GlovEgo-HOI: Bridging the Synthetic-to-Real Gap for Industrial Egocentric Human-Object Interaction Detection

Title: Trustworthy Longitudinal Brain MRI Completion: A Deformation-Based Approach with KAN-Enhanced Diffusion Model

Title: CogRail: Benchmarking VLMs in Cognitive Intrusion Perception for Intelligent Railway Transportation Systems

Title: TaxoBell: Gaussian Box Embeddings for Self-Supervised Taxonomy Expansion

Title: Exploring Fine-Tuning for Tabular Foundation Models

Title: Self-Supervised Animal Identification for Long Videos

Title: STEP3-VL-10B Technical Report

Title: Contrastive Geometric Learning Unlocks Unified Structure- and Ligand-Based Drug Design

Title: LLMs can Compress LLMs: Adaptive Pruning by Agents

Title: Efficient Camera-Controlled Video Generation of Static Scenes via Sparse Diffusion and 3D Rendering

Title: COMPOSE: Hypergraph Cover Optimization for Multi-view 3D Human Pose Estimation