2025-09-29

Title: Random Direct Preference Optimization for Radiography Report Generation

Title: Automated Prompt Generation for Creative and Counterfactual Text-to-image Synthesis

Title: In silico Deep Learning Protocols for Label-Free Super-Resolution Microscopy: A Comparative Study of Network Architectures and SNR Dependence

Title: ShipwreckFinder: A QGIS Tool for Shipwreck Detection in Multibeam Sonar Data

Title: Large AI Model-Enabled Generative Semantic Communications for Image Transmission

Title: Downscaling climate projections to 1 km with single-image super resolution

Title: JaiLIP: Jailbreaking Vision-Language Models via Loss Guided Image Perturbation

Title: QuadGPT: Native Quadrilateral Mesh Generation with Autoregressive Models

Title: DyME: Dynamic Multi-Concept Erasure in Diffusion Models with Bi-Level Orthogonal LoRA Adaptation

Title: Score-based Idempotent Distillation of Diffusion Models

Title: Are Hallucinations Bad Estimations?

Title: d2: Improved Techniques for Training Reasoning Diffusion Language Models

Title: Filtering with Confidence: When Data Augmentation Meets Conformal Prediction

Title: GraphPFN: A Prior-Data Fitted Graph Foundation Model

Title: SlimDiff: Training-Free, Activation-Guided Hands-free Slimming of Diffusion Models

Title: Contrastive Mutual Information Learning: Toward Robust Representations without Positive-Pair Augmentations

Title: DistillKac: Few-Step Image Generation via Damped Wave Equations

Title: Preemptive Detection and Steering of LLM Misalignment via Latent Reachability

Title: Expert-guided Clinical Text Augmentation via Query-Based Model Collaboration

Title: No Alignment Needed for Generation: Learning Linearly Separable Representations in Diffusion Models

Title: X-Streamer: Unified Human World Modeling with Audiovisual Interaction

Title: What Happens Next? Anticipating Future Motion by Generating Point Trajectories

Title: GenUQ: Predictive Uncertainty Estimates via Generative Hyper-Networks

Title: FantasyWorld: Geometry-Consistent World Modeling via Unified Video and 3D Prediction

Title: Neuroprobe: Evaluating Intracranial Brain Responses to Naturalistic Stimuli

Title: SpecMER: Fast Protein Generation with K-mer Guided Speculative Decoding

Title: UISim: An Interactive Image-Based UI Simulator for Dynamic Mobile Environments

Title: UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models

Title: Beyond Formula Complexity: Effective Information Criterion Improves Performance and Interpretability for Symbolic Regression

Title: LongScape: Advancing Long-Horizon Embodied World Models with Context-Aware MoE

Title: FastGRPO: Accelerating Policy Optimization via Concurrency-aware Speculative Decoding and Online Draft Learning

Title: MoWM: Mixture-of-World-Models for Embodied Planning via Latent-to-Pixel Feature Modulation

Title: On the Complexity Theory of Masked Discrete Diffusion: From $\mathrm{poly}(1/ε)$ to Nearly $ε$-Free

Title: DiTraj: training-free trajectory control for video diffusion transformer

Title: A Comprehensive Evaluation of Transformer-Based Question Answering Models and RAG-Enhanced Design

Title: Graph of Agents: Principled Long Context Modeling by Emergent Multi-Agent Collaboration

Title: SRHand: Super-Resolving Hand Images and 3D Shapes via View/Pose-aware Neural Image Representations and Explicit 3D Meshes

Title: MolSpectLLM: A Molecular Foundation Model Bridging Spectroscopy, Molecule Elucidation, and 3D Structure Generation

Title: Deepfakes: we need to re-think the concept of "real" images

Title: Beyond RAG vs. Long-Context: Learning Distraction-Aware Retrieval for Efficient Knowledge Grounding

Title: Abductive Logical Rule Induction by Bridging Inductive Logic Programming and Multimodal Large Language Models

Title: Drag4D: Align Your Motion with Text-Driven 3D Scene Generation

Title: Syncphony: Synchronized Audio-to-Video Generation with Diffusion Transformers

Title: Discrete Guidance Matching: Exact Guidance for Discrete Flow Matching

Title: Generation Properties of Stochastic Interpolation under Finite Training Set

Title: SemanticControl: A Training-Free Approach for Handling Loosely Aligned Visual Conditions in ControlNet

Title: Structural Information-based Hierarchical Diffusion for Offline Reinforcement Learning

Title: MultiCrafter: High-Fidelity Multi-Subject Generation via Spatially Disentangled Attention and Identity-Aware Reinforcement Learning

Title: No-Reference Image Contrast Assessment with Customized EfficientNet-B0

Title: Mind-the-Glitch: Visual Correspondence for Detecting Inconsistencies in Subject-Driven Generation

Title: WAVE: Learning Unified & Versatile Audio-Visual Embeddings with Multimodal LLM

Title: FailureAtlas:Mapping the Failure Landscape of T2I Models via Active Exploration

Title: Exposing Hallucinations To Suppress Them: VLMs Representation Editing With Generative Anchors

Title: Goal-Guided Efficient Exploration via Large Language Model in Reinforcement Learning

Title: CoFFT: Chain of Foresight-Focus Thought for Visual Language Models

Title: Latent Diffusion : Multi-Dimension Stable Diffusion Latent Space Explorer

Title: High-Quality Sound Separation Across Diverse Categories via Visually-Guided Generative Modeling

Title: Large Material Gaussian Model for Relightable 3D Generation

Title: Countering adversarial evasion in regression analysis

Title: REFINE-CONTROL: A Semi-supervised Distillation Method For Conditional Image Generation

Title: MultiMat: Multimodal Program Synthesis for Procedural Materials using Large Multimodal Models

Title: Lightweight error mitigation strategies for post-training N:M activation sparsity in LLMs

Title: UrbanFeel: A Comprehensive Benchmark for Temporal and Perceptual Understanding of City Scenes through Human Perspective

Title: Beyond Classification Accuracy: Neural-MedBench and the Need for Deeper Reasoning Benchmarks

Title: UniMapGen: A Generative Framework for Large-Scale Map Construction from Multi-modal Data

Title: MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning

Title: Jailbreaking on Text-to-Video Models via Scene Splitting Strategy

Title: Aurora: Towards Universal Generative Multimodal Time Series Forecasting

Title: HiGS: History-Guided Sampling for Plug-and-Play Enhancement of Diffusion Models

Title: RAPID^3: Tri-Level Reinforced Acceleration Policies for Diffusion Transformer

Title: CircuitSense: A Hierarchical Circuit System Benchmark Bridging Visual Comprehension and Symbolic Reasoning in Engineering Design Process

Title: SurvDiff: A Diffusion Model for Generating Synthetic Data in Survival Analysis

Title: Stochastic activations

Title: Text Adversarial Attacks with Dynamic Outputs

Title: Closing the Safety Gap: Surgical Concept Erasure in Visual Autoregressive Models

Title: MoveFM-R: Advancing Mobility Foundation Models via Language-driven Semantic Reasoning

Title: RAU: Reference-based Anatomical Understanding with Vision Language Models

Title: Fast-Forward Lattice Boltzmann: Learning Kinetic Behaviour with Physics-Informed Neural Operators

Title: LucidFlux: Caption-Free Universal Image Restoration via a Large-Scale Diffusion Transformer

Title: Explaining multimodal LLMs via intra-modal token interactions

Title: Overclocking Electrostatic Generative Models

Title: Nonlinear Optimization with GPU-Accelerated Neural Network Constraints

Title: Learning the Neighborhood: Contrast-Free Multimodal Self-Supervised Molecular Graph Pretraining

Title: Bézier Meets Diffusion: Robust Generation Across Domains for Medical Image Segmentation

Title: Group Critical-token Policy Optimization for Autoregressive Image Generation

Title: Where MLLMs Attend and What They Rely On: Explaining Autoregressive Token Generation

Title: JointDiff: Bridging Continuous and Discrete in Multi-Agent Trajectory Generation

Title: From Parameters to Behavior: Unsupervised Compression of the Policy Space

Title: Transport Based Mean Flows for Generative Modeling

Title: LongLive: Real-time Interactive Long Video Generation

Title: A Theoretical Analysis of Discrete Flow Matching Generative Models

Title: SPARK: Synergistic Policy And Reward Co-Evolving Framework

Title: Training-Free Synthetic Data Generation with Dual IP-Adapter Guidance

Title: Scale-Wise VAR is Secretly Discrete Diffusion

Title: Learning Human-Perceived Fakeness in AI-Generated Videos via Multimodal LLMs

Title: RefAM: Attention Magnets for Zero-Shot Referral Segmentation