2025-05-02

Title: Eye2Eye: A Simple Approach for Monocular-to-Stereo Video Synthesis

Title: Detecting and Mitigating Hateful Content in Multimodal Memes with Vision-Language Models

Title: GEOM-Drugs Revisited: Toward More Chemically Accurate Benchmarks for 3D Molecule Generation

Title: Direct Motion Models for Assessing Generated Videos

Title: Generative Machine Learning in Adaptive Control of Dynamic Manufacturing Processes: A Review

Title: Online Federation For Mixtures of Proprietary Agents with Black-Box Encoders

Title: Predicting Estimated Times of Restoration for Electrical Outages Using Longitudinal Tabular Transformers

Title: ReXGradient-160K: A Large-Scale Publicly Available Dataset of Chest Radiographs with Free-text Reports

Title: Scaling On-Device GPU Inference for Large Generative Models

Title: Empowering Agentic Video Analytics Systems with Video Language Models

Title: AI-Assisted Decision-Making for Clinical Assessment of Auto-Segmented Contour Quality

Title: Quaternion Wavelet-Conditioned Diffusion Models for Image Super-Resolution

Title: T2VPhysBench: A First-Principles Benchmark for Physical Consistency in Text-to-Video Generation

Title: JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers

Title: KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution

Title: Leveraging Partial SMILES Validation Scheme for Enhanced Drug Design in Reinforcement Learning Frameworks

Title: A Robust Deep Networks based Multi-Object MultiCamera Tracking System for City Scale Traffic

Title: Towards Autonomous Micromobility through Scalable Urban Simulation

Title: T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT