2025-11-24

Title: Joint Design of Protein Surface and Structure Using a Diffusion Bridge Model

Title: Motion Transfer-Enhanced StyleGAN for Generating Diverse Macaque Facial Expressions

Title: Password Strength Analysis Through Social Network Data Exposure: A Combined Approach Relying on Data Reconstruction and Generative Models

Title: SVG360: Multi-View SVG Generation with Geometric and Color Consistency from a Single SVG

Title: WorldGen: From Text to Traversable and Interactive 3D Worlds

Title: ManifoldFormer: Geometric Deep Learning for Neural Dynamics on Riemannian Manifolds

Title: PEPPER: Perception-Guided Perturbation for Robust Backdoor Defense in Text-to-Image Diffusion Models

Title: Better audio representations are more brain-like: linking model-brain alignment with performance in downstream auditory tasks

Title: The use of vocal biomarkers in the detection of Parkinson's disease: a robust statistical performance comparison of classic machine learning models

Title: Align & Invert: Solving Inverse Problems with Diffusion and Flow-based Models via Representational Alignment

Title: Predicting the Formation of Induction Heads

Title: Warm Diffusion: Recipe for Blur-Noise Mixture Diffusion Models

Title: Q-REAL: Towards Realism and Plausibility Evaluation for AI-Generated Content

Title: PepEVOLVE: Position-Aware Dynamic Peptide Optimization via Group-Relative Advantage

Title: UniModel: A Visual-Only Framework for Unified Multimodal Understanding and Generation

Title: DeltaDeno: Zero-Shot Anomaly Generation via Delta-Denoising Attribution

Title: Rethinking Diffusion Model-Based Video Super-Resolution: Leveraging Dense Guidance from Aligned Features

Title: CroTad: A Contrastive Reinforcement Learning Framework for Online Trajectory Anomaly Detection

Title: Neighbor GRPO: Contrastive ODE Policy Optimization Aligns Flow Models

Title: MatPedia: A Universal Generative Foundation for High-Fidelity Material Synthesis

Title: Real-Time Cooked Food Image Synthesis and Visual Cooking Progress Monitoring on Edge Devices

Title: The Finer the Better: Towards Granular-aware Open-set Domain Generalization

Title: DReX: Pure Vision Fusion of Self-Supervised and Convolutional Representations for Image Complexity Prediction

Title: FLUID: Training-Free Face De-identification via Latent Identity Substitution

Title: Energy Scaling Laws for Diffusion Models: Quantifying Compute and Carbon Emissions in Image Generation

Title: OmniPT: Unleashing the Potential of Large Vision Language Models for Pedestrian Tracking and Understanding

Title: ReBrain: Brain MRI Reconstruction from Sparse CT Slice via Retrieval-Augmented Diffusion

Title: Diversity Has Always Been There in Your Visual Autoregressive Models

Title: SPAGS: Sparse-View Articulated Object Reconstruction from Single State via Planar Gaussian Splatting

Title: Sparse Reasoning is Enough: Biological-Inspired Framework for Video Anomaly Detection with Large Pre-trained Models

Title: AutoGraphAD: A novel approach using Variational Graph Autoencoders for anomalous network flow detection

Title: Training Foundation Models on a Full-Stack AMD Platform: Compute, Networking, and System Design

Title: Four decades of circumpolar super-resolved satellite land surface temperature data

Title: One-Step Diffusion Transformer for Controllable Real-World Image Super-Resolution

Title: DiffRefiner: Coarse to Fine Trajectory Planning via Diffusion Refinement with Semantic Interaction for End to End Autonomous Driving

Title: Investigating self-supervised representations for audio-visual deepfake detection

Title: Continual Alignment for SAM: Rethinking Foundation Models for Medical Image Segmentation in Continual Learning

Title: Scaling Self-Supervised and Cross-Modal Pretraining for Volumetric CT Transformers

Title: DelTriC: A Novel Clustering Method with Accurate Outlier

Title: QueryOcc: Query-based Self-Supervision for 3D Semantic Occupancy

Title: FlexiFlow: decomposable flow matching for generation of flexible molecular ensemble

Title: Intervene-All-Paths: Unified Mitigation of LVLM Hallucinations across Alignment Formats

Title: A Little More Like This: Text-to-Image Retrieval with Vision-Language Models Using Relevance Feedback

Title: Range-Edit: Semantic Mask Guided Outdoor LiDAR Scene Editing

Title: SpatialGeo: Boosting Spatial Reasoning in Multimodal LLMs via Geometry-Semantics Fusion

Title: MuM: Multi-View Masked Image Modeling for 3D Vision

Title: Self-supervised denoising of raw tomography detector data for improved image reconstruction

Title: ReBaPL: Repulsive Bayesian Prompt Learning

Title: Refracting Reality: Generating Images with Realistic Transparent Objects

Title: Loomis Painter: Reconstructing the Painting Process

Title: DSeq-JEPA: Discriminative Sequential Joint-Embedding Predictive Architecture

Title: UAM: A Unified Attention-Mamba Backbone of Multimodal Framework for Tumor Cell Classification

Title: SuperQuadricOcc: Multi-Layer Gaussian Approximation of Superquadrics for Real-Time Self-Supervised Occupancy Estimation

Title: Designing and Generating Diverse, Equitable Face Image Datasets for Face Verification Tasks

Title: Sparse Mixture-of-Experts for Multi-Channel Imaging: Are All Channel Interactions Required?

Title: Self-Supervised Learning by Curvature Alignment

Title: REMSA: An LLM Agent for Foundation Model Selection in Remote Sensing

Title: Planning with Sketch-Guided Verification for Physics-Aware Video Generation

Title: Improving Multimodal Distillation for 3D Semantic Segmentation under Domain Shift

Title: Masked-and-Reordered Self-Supervision for Reinforcement Learning from Verifiable Rewards

Title: Counterfactual World Models via Digital Twin-conditioned Video Diffusion

Title: Radar2Shape: 3D Shape Reconstruction from High-Frequency Radar using Multiresolution Signed Distance Functions

Title: An Artificial Intelligence Framework for Measuring Human Spine Aging Using MRI

Title: EvDiff: High Quality Video with an Event Camera