2025-07-22

Title: Self-Improving Language Models for Evolutionary Program Synthesis: A Case Study on ARC-AGI

Title: LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models

Title: Developing an AI-Guided Assistant Device for the Deaf and Hearing Impaired

Title: Rethinking Individual Fairness in Deepfake Detection

Title: Solo Connection: A Parameter Efficient Fine-Tuning Technique for Transformers

Title: Hallucination Score: Towards Mitigating Hallucinations in Generative Image Super-Resolution

Title: DUSTrack: Semi-automated point tracking in ultrasound videos

Title: It's Not That Simple. An Analysis of Simple Test-Time Scaling

Title: Adaptive 3D Gaussian Splatting Video Streaming: Visual Saliency-Aware Tiling and Meta-Learning-Based Bitrate Adaptation

Title: Benefit from Reference: Retrieval-Augmented Cross-modal Point Cloud Completion

Title: Efficient Whole Slide Pathology VQA via Token Compression

Title: Generative Distribution Distillation

Title: Clutter Detection and Removal by Multi-Objective Analysis for Photographic Guidance

Title: Benchmarking GANs, Diffusion Models, and Flow Matching for T1w-to-T2w MRI Translation

Title: A Transformer-Based Conditional GAN with Multiple Instance Learning for UAV Signal Detection and Classification

Title: BusterX++: Towards Unified Cross-Modal AI-Generated Content Detection and Explanation with MLLM

Title: Docopilot: Improving Multimodal Models for Document-Level Understanding

Title: GCC-Spam: Spam Detection via GAN, Contrastive Learning, and Character Similarity Networks

Title: Fraud is Not Just Rarity: A Causal Prototype Attention Approach to Realistic Synthetic Oversampling

Title: Exploring the Dynamic Scheduling Space of Real-Time Generative AI Applications on Emerging Heterogeneous Systems

Title: Beyond the Single-Best Model: Rashomon Partial Dependence Profile for Trustworthy Explanations in AutoML

Title: Omni-Think: Scaling Cross-Domain Generalization in LLMs via Multi-Task RL with Hybrid Rewards

Title: Distilling Parallel Gradients for Fast ODE Solvers of Diffusion Models

Title: Exploring Scalable Unified Modeling for General Low-Level Vision

Title: SegQuant: A Semantics-Aware and Generalizable Quantization Framework for Diffusion Models

Title: Paired Image Generation with Diffusion-Guided Diffusion Models

Title: The Invisible Leash: Why RLVR May Not Escape Its Origin

Title: Grounding Degradations in Natural Language for All-In-One Video Restoration

Title: Stereo-GS: Multi-View Stereo Vision Model for Generalizable 3D Gaussian Splatting Reconstruction

Title: OmniVTON: Training-Free Universal Virtual Try-On

Title: Time-RA: Towards Time Series Reasoning for Anomaly with LLM Feedback

Title: Aesthetics is Cheap, Show me the Text: An Empirical Evaluation of State-of-the-Art Generative Models for OCR

Title: AnalogFed: Federated Discovery of Analog Circuit Topologies with Generative AI

Title: Resonant-Tunnelling Diode Reservoir Computing System for Image Recognition

Title: Designing User-Centric Metrics for Evaluation of Counterfactual Explanations

Title: Better Models and Algorithms for Learning Ising Models from Dynamics

Title: MeshMamba: State Space Models for Articulated 3D Mesh Generation and Reconstruction

Title: Improving Joint Embedding Predictive Architecture with Diffusion Noise

Title: Hierarchical Part-based Generative Model for Realistic 3D Blood Vessel

Title: Cross-Domain Few-Shot Learning with Coalescent Projections and Latent Space Reservation

Title: FreeCus: Free Lunch Subject-driven Customization in Diffusion Transformers

Title: CHORDS: Diffusion Sampling Accelerator with Multi-core Hierarchical ODE Solvers

Title: Conditional Video Generation for High-Efficiency Video Compression

Title: RoadFusion: Latent Diffusion Model for Pavement Defect Detection

Title: SAIGFormer: A Spatially-Adaptive Illumination-Guided Network for Low-Light Image Enhancement

Title: Red-Team Multi-Agent Reinforcement Learning for Emergency Braking Scenario

Title: SegDT: A Diffusion Transformer-Based Segmentation Model for Medical Imaging

Title: Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos

Title: Optimal Batch-Size Control for Low-Latency Federated Learning with Device Heterogeneity

Title: CylinderPlane: Nested Cylinder Representation for 3D-aware Image Generation

Title: Accelerating HEC-RAS: A Recurrent Neural Operator for Rapid River Forecasting

Title: Data Mixing Agent: Learning to Re-weight Domains for Continual Pre-training

Title: Visual-Language Model Knowledge Distillation Method for Image Quality Assessment

Title: Efficient Face Image Quality Assessment via Self-training and Knowledge Distillation

Title: A Practical Investigation of Spatially-Controlled Image Generation with Transformers

Title: TokensGen: Harnessing Condensed Tokens for Long Video Generation

Title: Label tree semantic losses for rich multi-class medical image segmentation

Title: Diffusion models for multivariate subsurface generation and efficient probabilistic inversion

Title: Can Your Model Separate Yolks with a Water Bottle? Benchmarking Physical Commonsense Understanding in Video Generation Models

Title: FASTGEN: Fast and Cost-Effective Synthetic Tabular Data Generation with LLMs

Title: Latent Denoising Makes Good Visual Tokenizers