2025-06-24

Title: Mechanistic Interpretability of Diffusion Models: Circuit-Level Analysis and Causal Validation

Title: Recursive Learning-Based Virtual Buffering for Analytical Global Placement

Title: Securing Generative AI Agentic Workflows: Risks, Mitigation, and a Proposed Firewall Architecture

Title: Mercury: Ultra-Fast Language Models Based on Diffusion

Title: Fine-Scale Soil Mapping in Alaska with Multimodal Machine Learning

Title: Origins of Creativity in Attention-Based Diffusion Models

Title: AndroIDS : Android-based Intrusion Detection System using Federated Learning

Title: Spatial-Temporal Pre-Training for Embryo Viability Prediction Using Time-Lapse Videos

Title: Leveraging LLMs to Assess Tutor Moves in Real-Life Dialogues: A Feasibility Study

Title: When Every Millisecond Counts: Real-Time Anomaly Detection via the Multimodal Asynchronous Hybrid Network

Title: Probing for Phonology in Self-Supervised Speech Representations: A Case Study on Accent Perception

Title: Accelerating Residual Reinforcement Learning with Uncertainty Estimation

Title: OpenMAP-BrainAge: Generalizable and Interpretable Brain Age Predictor

Title: Histopathology Image Report Generation by Vision Language Model with Multimodal In-Context Learning

Title: SSAVSV: Towards Unified Model for Self-Supervised Audio-Visual Speaker Verification

Title: DreamJourney: Perpetual View Generation with Video Diffusion Models

Title: Programmable-Room: Interactive Textured 3D Room Meshes Generation Empowered by Large Language Models

Title: PhysID: Physics-based Interactive Dynamics from a Single-view Image

Title: Towards a Unified Textual Graph Framework for Spectral Reasoning via Physical and Chemical Information Fusion

Title: PhysiX: A Foundation Model for Physics Simulations

Title: Reimagining Parameter Space Exploration with Diffusion Models

Title: Time-Contrastive Pretraining for In-Context Image and Video Segmentation

Title: In-Context Learning Strategies Emerge Rationally

Title: How Alignment Shrinks the Generative Horizon

Title: EgoWorld: Translating Exocentric View to Egocentric View using Rich Exocentric Observations

Title: Adapting Vision-Language Models for Evaluating World Models

Title: Enabling PSO-Secure Synthetic Data Sharing Using Diversity-Aware Diffusion Models

Title: Imputation of Longitudinal Data Using GANs: Challenges and Implications for Classification

Title: On the Robustness of Human-Object Interaction Detection against Distribution Shift

Title: TAB: Unified Benchmarking of Time Series Anomaly Detection Methods

Title: CLGRPO: Reasoning Ability Enhancement for Small VLMs

Title: Deep Supervised LSTM for 3D morphology estimation from Multi-View RGB Images of Wheat Spikes

Title: Evaluating Prompt-Based and Fine-Tuned Approaches to Czech Anaphora Resolution

Title: ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation

Title: Enhancing VICReg: Random-Walk Pairing for Improved Generalization and Better Global Semantics Capturing

Title: $ϕ^{\infty}$: Clause Purification, Embedding Realignment, and the Total Suppression of the Em Dash in Autoregressive Language Models

Title: Targeted False Positive Synthesis via Detector-guided Adversarial Diffusion Attacker for Robust Polyp Detection

Title: Pattern-Based Phase-Separation of Tracer and Dispersed Phase Particles in Two-Phase Defocusing Particle Tracking Velocimetry

Title: CDG-MAE: Learning Correspondences from Diffusion Generated Views

Title: Non-equilibrium Annealed Adjoint Sampler

Title: Joint Embedding Predictive Architecture for self-supervised pretraining on polymer molecular graphs

Title: Cross-Architecture Knowledge Distillation (KD) for Retinal Fundus Image Anomaly Detection on NVIDIA Jetson Nano

Title: Semantic Structure-Aware Generative Attacks for Enhanced Adversarial Transferability

Title: YouTube-Occ: Learning Indoor 3D Semantic Occupancy Prediction from YouTube Videos

Title: ARD-LoRA: Dynamic Rank Allocation for Parameter-Efficient Fine-Tuning of Foundation Models with Heterogeneous Adaptation Needs

Title: Adaptive Mask-guided K-space Diffusion for Accelerated MRI Reconstruction

Title: Learning Causal Graphs at Scale: A Foundation Model Approach

Title: Learning High-Quality Latent Representations for Anomaly Detection and Signal Integrity Enhancement in High-Speed Signals

Title: Instability in Diffusion ODEs: An Explanation for Inaccurate Image Reconstruction

Title: NSFW-Classifier Guided Prompt Sanitization for Safe Text-to-Image Generation

Title: Geometry-Aware Preference Learning for 3D Texture Generation

Title: Controlled Generation with Equivariant Variational Flow Matching

Title: Sequential keypoint density estimator: an overlooked baseline of skeleton-based video anomaly detection

Title: Benchmarking Foundation Models and Parameter-Efficient Fine-Tuning for Prognosis Prediction in Medical Imaging

Title: CPAM: Context-Preserving Adaptive Manipulation for Zero-Shot Real Image Editing

Title: DIP: Unsupervised Dense In-Context Post-training of Visual Representations

Title: GANs vs. Diffusion Models for virtual staining with the HER2match dataset

Title: MeRF: Motivation-enhanced Reinforcement Finetuning for Large Reasoning Models

Title: Multi-Scale Representation of Follicular Lymphoma Pathology Images in a Single Hyperbolic Space

Title: Auto-Regressively Generating Multi-View Consistent Images

Title: End-to-End Spoken Grammatical Error Correction

Title: Normality Prior Guided Multi-Semantic Fusion Network for Unsupervised Image Anomaly Detection

Title: Resampling Augmentation for Time Series Contrastive Learning: Application to Remote Sensing

Title: No Training Wheels: Steering Vectors for Bias Correction at Inference Time

Title: Simulation-Free Differential Dynamics through Neural Conservation Laws

Title: ReDit: Reward Dithering for Improved LLM Policy Optimization

Title: Benchmarking histopathology foundation models in a multi-center dataset for skin cancer subtyping

Title: MCN-SLAM: Multi-Agent Collaborative Neural SLAM with Hybrid Implicit Neural Scene Representation

Title: Matrix-Game: Interactive World Foundation Model

Title: Benchmarking the Pedagogical Knowledge of Large Language Models

Title: Towards Group Fairness with Multiple Sensitive Attributes in Federated Foundation Models

Title: ContinualFlow: Learning and Unlearning with Neural Flow Matching

Title: 3D Arena: An Open Platform for Generative 3D Evaluation

Title: ViDAR: Video Diffusion-Aware 4D Reconstruction From Monocular Inputs

Title: RWESummary: A Framework and Test for Choosing Large Language Models to Summarize Real-World Evidence (RWE) Studies

Title: 4Real-Video-V2: Fused View-Time Attention and Feedforward Reconstruction for 4D Scene Generation

Title: OmniAvatar: Efficient Audio-Driven Avatar Video Generation with Adaptive Body Animation

Title: OmniGen2: Exploration to Advanced Multimodal Generation

Title: Let Your Video Listen to Your Music!

Title: Universal Video Temporal Grounding with Generative Multi-modal Large Language Models

Title: 4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time

Title: Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations

Title: FilMaster: Bridging Cinematic Principles and Generative AI for Automated Film Generation

Title: Audit & Repair: An Agentic Framework for Consistent Story Visualization in Text-to-Image Diffusion Models