2025-03-10

Title: Self-Evolved Preference Optimization for Enhancing Mathematical Reasoning in Small Language Models

Title: Invisible Strings: Revealing Latent Dancer-to-Dancer Interactions with Graph Neural Networks

Title: StickMotion: Generating 3D Human Motions by Drawing a Stickman

Title: Adversarial Training for Multimodal Large Language Models against Jailbreak Attacks

Title: Combined Physics and Event Camera Simulator for Slip Detection

Title: ZAugNet for Z-Slice Augmentation in Bio-Imaging

Title: End-to-End Human Pose Reconstruction from Wearable Sensors for 6G Extended Reality Systems

Title: Toward Lightweight and Fast Decoders for Diffusion Models in Image and Video Generation

Title: FirePlace: Geometric Refinements of LLM Common Sense Reasoning for 3D Object Placement

Title: Metadata-free Georegistration of Ground and Airborne Imagery

Title: Incentivizing Multi-Tenant Split Federated Learning for Foundation Models at the Network Edge

Title: Energy-Weighted Flow Matching for Offline Reinforcement Learning

Title: LVLM-Compress-Bench: Benchmarking the Broader Impact of Large Vision-Language Model Compression

Title: Leveraging Large Language Models For Scalable Vector Graphics Processing: A Review

Title: Taming Video Diffusion Prior with Scene-Grounding Guidance for 3D Gaussian Splatting from Sparse Inputs

Title: Fake It To Make It: Virtual Multiviews to Enhance Monocular Indoor Semantic Scene Completion

Title: TS-LIF: A Temporal Segment Spiking Neuron Network for Time Series Forecasting

Title: Development and Enhancement of Text-to-Image Diffusion Models

Title: Accelerating Diffusion Transformer via Gradient-Optimized Cache

Title: Narrating the Video: Boosting Text-Video Retrieval via Comprehensive Utilization of Frame-Level Captions

Title: RecipeGen: A Benchmark for Real-World Recipe Image Generation

Title: Unified Reward Model for Multimodal Understanding and Generation

Title: Frequency Autoregressive Image Generation with Continuous Tokens

Title: PhysicsGen: Can Generative Models Learn from Images to Predict Complex Physical Relations?

Title: Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts

Title: Automatic Teaching Platform on Vision Language Retrieval Augmented Generation

Title: FastMap: Fast Queries Initialization Based Vectorized HD Map Reconstruction Framework

Title: Mol-CADiff: Causality-Aware Autoregressive Diffusion for Molecule Generation

Title: Post-Hoc Concept Disentanglement: From Correlated to Isolated Concept Representations

Title: Global graph features unveiled by unsupervised geometric deep learning

Title: QArtSR: Quantization via Reverse-Module and Timestep-Retraining in One-Step Diffusion based Image Super-Resolution

Title: Anti-Diffusion: Preventing Abuse of Modifications of Diffusion-Based Models

Title: TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models

Title: VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control

Title: AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data

Title: GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous Driving

Title: Multi-Fidelity Policy Gradient Algorithms