2025-10-17

Title: CoLoR-GAN: Continual Few-Shot Learning with Low-Rank Adaptation in Generative Adversarial Networks

Title: Joint Discriminative-Generative Modeling via Dual Adversarial Training

Title: REAP the Experts: Why Pruning Prevails for One-Shot MoE compression

Title: NAPPure: Adversarial Purification for Robust Image Classification under Non-Additive Perturbations

Title: Vgent: Graph-based Retrieval-Reasoning-Augmented Generation For Long Video Understanding

Title: CausalVerse: Benchmarking Causal Representation Learning with Configurable High-Fidelity Simulations

Title: Synchronization of Multiple Videos

Title: Capture, Canonicalize, Splat: Zero-Shot 3D Gaussian Avatars from Unstructured Phone Images

Title: Briding Diffusion Posterior Sampling and Monte Carlo methods: a survey

Title: Virtually Being: Customizing Camera-Controllable Video Diffusion Models with Multi-View Performance Captures

Title: Contrastive Diffusion Alignment: Learning Structured Latents for Controllable Generation

Title: LOTA: Bit-Planes Guided AI-Generated Image Detection

Title: Scaling Test-Time Compute to Achieve IOI Gold Medal with Open-Weight Models

Title: PIA: Deepfake Detection Using Phoneme-Temporal and Identity-Dynamic Analysis

Title: Identity-Preserving Image-to-Video Generation via Reward-Guided Optimization

Title: Identity-GRPO: Optimizing Multi-Human Identity-preserving Video Generation via Reinforcement Learning

Title: Nonparametric Data Attribution for Diffusion Models

Title: A Multi-domain Image Translative Diffusion StyleGAN for Iris Presentation Attack Detection

Title: Stop-RAG: Value-Based Retrieval Control for Iterative RAG

Title: DOS: Directional Object Separation in Text Embeddings for Multi-Object Image Generation

Title: Towards geological inference with process-based and deep generative modeling, part 1: training on fluvial deposits

Title: Coder as Editor: Code-driven Interpretable Molecular Optimization

Title: Unsupervised Deep Generative Models for Anomaly Detection in Neuroimaging: A Systematic Scoping Review

Title: Pruning Overparameterized Multi-Task Networks for Degraded Web Image Restoration

Title: Noise Projection: Closing the Prompt-Agnostic Gap Behind Text-to-Image Misalignment in Diffusion Models

Title: Exploring Image Representation with Decoupled Classical Visual Descriptors

Title: Consistent text-to-image generation via scene de-contextualization

Title: STANCE: Motion Coherent Video Generation Via Sparse-to-Dense Anchored Encoding

Title: Multimodal RAG for Unstructured Data:Leveraging Modality-Aware Knowledge Graphs with Hybrid Retrieval

Title: Knowledge-based Visual Question Answer with Multimodal Processing, Retrieval and Filtering

Title: LeapFactual: Reliable Visual Counterfactual Explanation Using Conditional Flow Matching

Title: Adapting Self-Supervised Representations as a Latent Space for Efficient Generation

Title: In-Context Learning with Unpaired Clips for Instruction-based Video Editing

Title: Leveraging Learned Image Prior for 3D Gaussian Compression

Title: Beyond Multi-Token Prediction: Pretraining LLMs with Future Summaries

Title: LightQANet: Quantized and Adaptive Feature Learning for Low-Light Image Enhancement

Title: FraQAT: Quantization Aware Training with Fractional bits

Title: To Infinity and Beyond: Tool-Use Unlocks Length Generalization in State Space Models

Title: ImagerySearch: Adaptive Test-Time Search for Video Generation Beyond Semantic Dependency Constraints

Title: Benchmarking Multimodal Large Language Models for Face Recognition

Title: TOUCH: Text-guided Controllable Generation of Free-Form Hand-Object Interactions

Title: ScaleWeaver: Weaving Efficient Controllable T2I Generation with Multi-Scale Reference Attention

Title: 3D Scene Prompting for Scene-Consistent Camera-Controllable Video Generation

Title: OmniMotion: Multimodal Motion Generation with Continuous Masked Autoregression

Title: RealDPO: Real or Not Real, that is the Preference

Title: MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning

Title: Efficient Parallel Samplers for Recurrent-Depth Models and Their Connection to Diffusion Language Models

Title: pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation

Title: WithAnyone: Towards Controllable and ID Consistent Image Generation

Title: Terra: Explorable Native 3D World Model with Point Latents