2025-10-10

Title: ConCuR: Conciseness Makes State-of-the-Art Kernel Generation

Title: DynamicEval: Rethinking Evaluation for Dynamic Text-to-Video Synthesis

Title: Black-box Detection of LLM-generated Text Using Generalized Jensen-Shannon Divergence

Title: MLLM4TS: Leveraging Vision and Multimodal Language Models for General Time-Series Analysis

Title: D2RA: Dual Domain Regeneration Attack

Title: TRAVL: A Recipe for Making Video-Language Models Better Judges of Physics Implausibility

Title: Cross-Modal Attention Guided Unlearning in Vision-Language Models

Title: Symbolic-Diffusion: Deep Learning Based Symbolic Regression with D3PM Discrete Token Diffusion

Title: LLM Unlearning Under the Microscope: A Full-Stack View on Methods and Metrics

Title: PIT-QMM: A Large Multimodal Model For No-Reference Point Cloud Quality Assessment

Title: Once Is Enough: Lightweight DiT-Based Video Virtual Try-On via One-Time Garment Appearance Injection

Title: Controllable Video Synthesis via Variational Inference

Title: SyncHuman: Synchronizing 2D and 3D Generative Models for Single-view Human Reconstruction

Title: GeoGen: A Two-stage Coarse-to-Fine Framework for Fine-grained Synthetic Location-based Social Network Trajectory Generation

Title: A Unified Multi-Task Learning Framework for Generative Auto-Bidding with Validation-Aligned Optimization

Title: MMHOI: Modeling Complex 3D Multi-Human Multi-Object Interactions

Title: MetaDefense: Defending Finetuning-based Jailbreak Attack Before and During Generation

Title: IsoSignVid2Aud: Sign Language Video to Audio Conversion without Text Intermediaries

Title: GRADE: Personalized Multi-Task Fusion via Group-relative Reinforcement Learning with Adaptive Dirichlet Exploratio

Title: TTOM: Test-Time Optimization and Memorization for Compositional Video Generation

Title: CVD-STORM: Cross-View Video Diffusion with Spatial-Temporal Reconstruction Model for Autonomous Driving

Title: Latent Harmony: Synergistic Unified UHD Image Restoration via Latent Space Regularization and Controllable Refinement

Title: Is Architectural Complexity Always the Answer? A Case Study on SwinIR vs. an Efficient CNN

Title: Real-Time Motion-Controllable Autoregressive Video Diffusion

Title: UniMMVSR: A Unified Multi-Modal Framework for Cascaded Video Super-Resolution

Title: Beyond Textual CoT: Interleaved Text-Image Chains with Deep Confidence Reasoning for Image Editing

Title: Bidirectional Representations Augmented Autoregressive Biological Sequence Generation:Application in De Novo Peptide Sequencing

Title: Expressive Value Learning for Scalable Offline Reinforcement Learning

Title: Fine-grained text-driven dual-human motion generation via dynamic hierarchical interaction

Title: Adaptive Gradient Calibration for Single-Positive Multi-Label Learning in Remote Sensing Image Scene Classification

Title: Bridging the Physics-Data Gap with FNO-Guided Conditional Flow Matching: Designing Inductive Bias through Hierarchical Physical Constraints

Title: LinVideo: A Post-Training Framework towards O(n) Attention in Efficient Video Generation

Title: Hyperspectral data augmentation with transformer-based diffusion models

Title: Guided Star-Shaped Masked Diffusion

Title: UniVideo: Unified Understanding, Generation, and Editing for Videos

Title: FlyLoRA: Boosting Task Decoupling and Parameter Efficiency via Implicit Rank-Wise Mixture-of-Experts

Title: VideoVerse: How Far is Your T2V Generator from a World Model?

Title: Biology-driven assessment of deep learning super-resolution imaging of the porosity network in dentin

Title: Large Scale Diffusion Distillation via Score-Regularized Continuous-Time Consistency

Title: Synthetic Series-Symbol Data Generation for Time Series Foundation Models

Title: SummDiff: Generative Modeling of Video Summarization with Diffusion

Title: MoA-VR: A Mixture-of-Agents System Towards All-in-One Video Restoration

Title: FlexTraj: Image-to-Video Generation with Flexible Point Trajectory Control

Title: Kontinuous Kontext: Continuous Strength Control for Instruction-based Image Editing

Title: MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization

Title: Improving Reasoning for Diffusion Language Models via Group Diffusion Policy Optimization

Title: VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning

Title: MultiCOIN: Multi-Modal COntrollable Video INbetweening

Title: Who Said Neural Networks Aren't Linear?