2026-01-22

Title: SOSControl: Enhancing Human Motion Generation through Saliency-Aware Symbolic Orientation and Timing Control

Title: A Cloud-Based Cross-Modal Transformer for Emotion Recognition and Adaptive Human-Computer Interaction

Title: GCG Attack On A Diffusion LLM

Title: On the Limits of Learned Importance Scoring for KV Cache Compression

Title: Beyond Affinity: A Benchmark of 1D, 2D, and 3D Methods Reveals Critical Trade-offs in Structure-Based Drug Design

Title: Chain-of-Memory: Lightweight Memory Construction with Dynamic Evolution for LLM Agents

Title: LURE: Latent Space Unblocking for Multi-Concept Reawakening in Diffusion Models

Title: VJEPA: Variational Joint Embedding Predictive Architectures as Probabilistic World Models

Title: Large-Scale Label Quality Assessment for Medical Segmentation via a Vision-Language Judge and Synthetic Data

Title: Vision-Based Natural Language Scene Understanding for Autonomous Driving: An Extended Dataset and a New Model for Traffic Scene Description Generation

Title: Search over Self-Edit Strategies for LLM Adaptation

Title: Report for NSF Workshop on AI for Electronic Design Automation

Title: QMC: Efficient SLM Edge Inference via Outlier-Aware Quantization and Emergent Memories Co-Design

Title: Anatomically Guided Latent Diffusion for Brain MRI Progression Modeling

Title: Counterfactual Modeling with Fine-Tuned LLMs for Health Intervention Design and Sensor Data Augmentation

Title: 3D Space as a Scratchpad for Editable Text-to-Image Generation

Title: Mirai: Autoregressive Visual Generation Needs Foresight

Title: LaVR: Scene Latent Conditioned Generative Video Trajectory Re-Rendering using Large 4D Reconstruction Models

Title: A comprehensive overview of deep learning models for object detection from videos/images

Title: DeepMoLM: Leveraging Visual and Geometric Structural Information for Molecule-Text Modeling

Title: Safeguarding Facial Identity against Diffusion-based Face Swapping via Cascading Pathway Disruption

Title: Enhancing Text-to-Image Generation via End-Edge Collaborative Hybrid Super-Resolution

Title: ReinPath: A Multimodal Reinforcement Learning Approach for Pathology

Title: Mechanism Shift During Post-training from Autoregressive to Masked Diffusion Language Models

Title: Reconstruction-Anchored Diffusion Model for Text-to-Motion Generation

Title: Synthetic Data Augmentation for Multi-Task Chinese Porcelain Classification: A Stable Diffusion Approach

Title: Reflecting in the Reflection: Integrating a Socratic Questioning Framework into Automated AI-Based Question Generation

Title: Tailoring Adverse Event Prediction in Type 1 Diabetes with Patient-Specific Deep Learning Models

Title: TempViz: On the Evaluation of Temporal Knowledge in Text-to-Image Models

Title: Improving Regret Approximation for Unsupervised Dynamic Environment Generation

Title: Towards Holistic Modeling for Video Frame Interpolation with Auto-regressive Diffusion Transformers

Title: InstructTime++: Time Series Classification with Multimodal Language Modeling via Implicit Feature Enhancement

Title: SpatialV2A: Visual-Guided High-fidelity Spatial Audio Generation

Title: HyperNet-Adaptation for Diffusion-Based Test Case Generation

Title: Deep Leakage with Generative Flow Matching Denoiser

Title: Differential Privacy Image Generation with Reconstruction Loss and Noise Injection Using an Error Feedback SGD

Title: Field-Space Autoencoder for Scalable Climate Emulators

Title: Overcoming In-Memory Bottlenecks in Graph Foundation Models via Retrieval-Augmented Generation

Title: DeepFedNAS: A Unified Framework for Principled, Hardware-Aware, and Predictor-Free Federated Neural Architecture Search

Title: ScenDi: 3D-to-2D Scene Diffusion Cascades for Urban Generation

Title: FlowSSC: Universal Generative Monocular Semantic Scene Completion via One-Step Latent Diffusion

Title: MolecularIQ: Characterizing Chemical Reasoning Capabilities Through Symbolic Verification on Molecular Graphs

Title: StableWorld: Towards Stable and Consistent Long Interactive Video Generation

Title: Rethinking Video Generation Model for the Embodied World

Title: LuxRemix: Lighting Decomposition and Remixing for Indoor Scenes

Title: Walk through Paintings: Egocentric World Models from Internet Priors

Title: Iterative Refinement Improves Compositional Image Generation