2025-12-05

Title: Towards Contextual Sensitive Data Detection

Title: Tipping the Dominos: Topology-Aware Multi-Hop Attacks on LLM-Based Multi-Agent Systems

Title: Decoding Large Language Diffusion Models with Foreseeing Movement

Title: ReasonX: MLLM-Guided Intrinsic Image Decomposition

Title: ActVAE: Modelling human activity schedules with a deep conditional generative approach

Title: MVRoom: Controllable 3D Indoor Scene Generation with Multi-View Diffusion Models

Title: UniLight: A Unified Representation for Lighting

Title: The Initialization Determines Whether In-Context Learning Is Gradient Descent

Title: Inference-time Stochastic Refinement of GRU-Normalizing Flow for Real-time Video Motion Transfer

Title: Plug-and-Play Image Restoration with Flow Matching: A Continuous Viewpoint

Title: SQuARE: Structured Query & Adaptive Retrieval Engine For Tabular Formats

Title: Gamma-from-Mono: Road-Relative, Metric, Self-Supervised Monocular Geometry for Vehicular Applications

Title: Data-regularized Reinforcement Learning for Diffusion Models at Scale

Title: STeP-Diff: Spatio-Temporal Physics-Informed Diffusion Models for Mobile Fine-Grained Pollution Forecasting

Title: MindDrive: An All-in-One Framework Bridging World Models and Vision-Language Model for End-to-End Autonomous Driving

Title: GuidNoise: Single-Pair Guided Diffusion for Generalized Noise Synthesis

Title: dVLM-AD: Enhance Diffusion Vision-Language-Model for Driving via Controllable Reasoning

Title: UniTS: Unified Time Series Generative Model for Remote Sensing

Title: GraphBench: Next-generation graph learning benchmarking

Title: DeRA: Decoupled Representation Alignment for Video Tokenization

Title: Back to Basics: Motion Representation Matters for Human Motion Generation Using Diffusion Model

Title: UltraImage: Rethinking Resolution Extrapolation in Image Diffusion Transformers

Title: DuGI-MAE: Improving Infrared Mask Autoencoders via Dual-Domain Guidance

Title: EgoLCD: Egocentric Video Generation with Long Context Diffusion

Title: VideoSSM: Autoregressive Long Video Generation with Hybrid State-Space Memory

Title: Boundary-Aware Test-Time Adaptation for Zero-Shot Medical Image Segmentation

Title: PhyVLLM: Physics-Guided Video Language Model with Motion-Appearance Disentanglement

Title: Refaçade: Editing Object with Given Reference Texture

Title: X-Humanoid: Robotize Human Videos to Generate Humanoid Videos at Scale

Title: Diffusion Fine-Tuning via Reparameterized Policy Gradient of the Soft Q-Function

Title: LeMat-GenBench: A Unified Evaluation Framework for Crystal Generative Models

Title: Exploiting \texttt{ftrace}'s \texttt{function\_graph} Tracer Features for Machine Learning: A Case Study on Encryption Detection

Title: QoSDiff: An Implicit Topological Embedding Learning Framework Leveraging Denoising Diffusion and Adversarial Attention for Robust QoS Prediction

Title: Natural Language Actor-Critic: Scalable Off-Policy Learning in Language Space

Title: Denoise to Track: Harnessing Video Diffusion Priors for Robust Correspondence

Title: Federated Learning for Anomaly Detection in Maritime Movement Data

Title: Cryptanalysis of Gleeok-128

Title: Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Title: Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation

Title: TRINITY: An Evolved LLM Coordinator

Title: OmniScaleSR: Unleashing Scale-Controlled Diffusion Prior for Faithful and Realistic Arbitrary-Scale Image Super-Resolution

Title: Measuring the Unspoken: A Disentanglement Model and Benchmark for Psychological Analysis in the Wild

Title: Order Matters: 3D Shape Generation from Sequential VR Sketches

Title: PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling

Title: LaFiTe: A Generative Latent Field for 3D Native Texturing

Title: LatentFM: A Latent Flow Matching Approach for Generative Medical Image Segmentation

Title: FreeGen: Feed-Forward Reconstruction-Generation Co-Training for Free-Viewpoint Driving Scene Synthesis

Title: Tokenizing Buildings: A Transformer for Layout Synthesis

Title: A Novel Trust-Based DDoS Cyberattack Detection Model for Smart Business Environments

Title: Logic-Driven Cybersecurity: A Novel Framework for System Log Anomaly Detection using Answer Set Programming

Title: Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion

Title: LiteVGGT: Boosting Vanilla VGGT via Geometry-aware Cached Token Merging

Title: Towards Adaptive Fusion of Multimodal Deep Networks for Human Action Recognition

Title: Rethinking the Use of Vision Transformers for AI-Generated Image Detection

Title: Efficient Generative Transformer Operators For Million-Point PDEs

Title: Reflection Removal through Efficient Adaptation of Diffusion Transformers

Title: Self-Supervised Learning for Transparent Object Depth Completion Using Depth from Non-Transparent Objects

Title: Generative Neural Video Compression via Video Diffusion Prior

Title: RAMEN: Resolution-Adjustable Multimodal Encoder for Earth Observation

Title: Semantic-Guided Two-Stage GAN for Face Inpainting with Hybrid Perceptual Encoding

Title: Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image

Title: Personalizing Agent Privacy Decisions via Logical Entailment

Title: Hybrid Quantum-Classical Autoencoders for Unsupervised Network Intrusion Detection

Title: David vs. Goliath: Can Small Models Win Big with Agentic AI in Hardware Design?

Title: BulletTime: Decoupled Control of Time and Camera Pose for Video Generation

Title: Object Reconstruction under Occlusion with Generative Priors and Contact-induced Constraints

Title: OMTRA: A Multi-Task Generative Model for Structure-Based Drug Design

Title: Deep Forcing: Training-Free Long Video Generation with Deep Sink and Participative Compression

Title: The Geometry of Intelligence: Deterministic Functional Topology as a Foundation for Real-World Perception

Title: TV2TV: A Unified Framework for Interleaved Language and Video Generation

Title: NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation

Title: ShadowDraw: From Any Object to Shadow-Drawing Compositional Art

Title: ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

Title: Light-X: Generative 4D Video Rendering with Camera and Illumination Control

Title: Value Gradient Guidance for Flow Matching Alignment