2025-11-20

Title: Empowering Multi-Turn Tool-Integrated Reasoning with Group Turn Policy Optimization

Title: B-Rep Distance Functions (BR-DF): How to Represent a B-Rep Model by Volumetric Distance Functions?

Title: GeoSceneGraph: Geometric Scene Graph Diffusion Model for Text-guided 3D Indoor Scene Synthesis

Title: InstructMix2Mix: Consistent Sparse-View Editing Through Multi-View Model Personalization

Title: nnMIL: A generalizable multiple instance learning framework for computational pathology

Title: X-WIN: Building Chest Radiograph World Model via Predictive Sensing

Title: Unsupervised Discovery of Long-Term Spatiotemporal Periodic Workflows in Human Activities

Title: Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

Title: BokehFlow: Depth-Free Controllable Bokeh Rendering via Flow Matching

Title: Fourier-KAN-Mamba: A Novel State-Space Equation Approach for Time-Series Anomaly Detection

Title: Jointly Conditioned Diffusion Model for Multi-View Pose-Guided Person Image Synthesis

Title: MAIF: Enforcing AI Trust and Provenance with an Artifact-Centric Agentic Paradigm

Title: A Comprehensive Study on Visual Token Redundancy for Discrete Diffusion-based Multimodal Large Language Models

Title: Unbiased Semantic Decoding with Vision Foundation Models for Few-shot Segmentation

Title: Learning Depth from Past Selves: Self-Evolution Contrast for Robust Depth Estimation

Title: FaultDiffusion: Few-Shot Fault Time Series Generation with Diffusion Model

Title: Masked Auto-Regressive Variational Acceleration: Fast Inference Makes Practical Reinforcement Learning

Title: Insert In Style: A Zero-Shot Generative Framework for Harmonious Cross-Domain Object Composition

Title: Trustworthy GenAI over 6G: Integrated Applications and Security Frameworks

Title: Reasoning in Diffusion Large Language Models is Concentrated in Dynamic Confusion Zones

Title: Taming Generative Synthetic Data for X-ray Prohibited Item Detection

Title: Adapt-As-You-Walk Through the Clouds: Training-Free Online Test-Time Adaptation of 3D Vision-Language Foundation Models

Title: Adaptive thresholding pattern for fingerprint forgery detection

Title: On the Internal Semantics of Time-Series Foundation Models

Title: STREAM-VAE: Dual-Path Routing for Slow and Fast Dynamics in Vehicle Telemetry Anomaly Detection

Title: Parameter Importance-Driven Continual Learning for Foundation Models

Title: EVA-Net: Interpretable Brain Age Prediction via Continuous Aging Prototypes from EEG

Title: ShelfOcc: Native 3D Supervision beyond LiDAR for Vision-Based Occupancy Estimation

Title: Towards Understanding Layer Contributions in Tabular In-Context Learning Models

Title: TSFM in-context learning for time-series classification of bearing-health status

Title: A Hybrid CNN-ViT-GNN Framework with GAN-Based Augmentation for Intelligent Weed Detection in Precision Agriculture

Title: Computer-Use Agents as Judges for Generative User Interface

Title: GEO-Bench-2: From Performance to Capability, Rethinking Evaluation in Geospatial AI

Title: Walrus: A Cross-Domain Foundation Model for Continuum Dynamics

Title: Think Visually, Reason Textually: Vision-Language Synergy in ARC

Title: RoMa v2: Harder Better Faster Denser Feature Matching