2025-12-23

Title: Towards Reasoning-Preserving Unlearning in Multimodal Large Language Models

Title: Graph-O1 : Monte Carlo Tree Search with Reinforcement Learning for Text-Attributed Graph Reasoning

Title: Q-KVComm: Efficient Multi-Agent Communication Via Adaptive KV Cache Compression

Title: Learning to Prioritize IT Tickets: A Comparative Evaluation of Embedding-based Approaches and Fine-Tuned Transformer Models

Title: KVReviver: Reversible KV Cache Compression with Sketch-Based Token Reconstruction

Title: Separating Constraint Compliance from Semantic Accuracy: A Novel Benchmark for Evaluating Instruction-Following Under Compression

Title: A 96pJ/Frame/Pixel and 61pJ/Event Anti-UAV System with Hybrid Object Tracking Modes

Title: NystagmusNet: Explainable Deep Learning for Photosensitivity Risk Prediction

Title: What's the Price of Monotonicity? A Multi-Dataset Benchmark of Monotone-Constrained Gradient Boosting for Credit PD

Title: SuperFlow: Training Flow Matching Models with RL on the Fly

Title: Seeing Beyond the Scene: Analyzing and Mitigating Background Bias in Action Recognition

Title: SCS-SupCon: Sigmoid-based Common and Style Supervised Contrastive Learning with Adaptive Decision Boundaries

Title: A Modular Framework for Single-View 3D Reconstruction of Indoor Environments

Title: Convolutional-neural-operator-based transfer learning for solving PDEs

Title: Parameter-Efficient Fine-Tuning for HAR: Integrating LoRA and QLoRA into Transformer Models

Title: A Hybrid Inductive-Transductive Network for Traffic Flow Imputation on Unsampled Locations

Title: MoE-TransMov: A Transformer-based Model for Next POI Prediction in Familiar & Unfamiliar Movements

Title: FedOAED: Federated On-Device Autoencoder Denoiser for Heterogeneous Data under Limited Client Availability

Title: Enhancing Tea Leaf Disease Recognition with Attention Mechanisms and Grad-CAM Visualization

Title: Name That Part: 3D Part Segmentation and Naming

Title: Seeing Justice Clearly: Handwritten Legal Document Translation with OCR and Vision-Language Models

Title: Towards Benchmarking Privacy Vulnerabilities in Selective Forgetting with Large Language Models

Title: Securing Agentic AI Systems -- A Multilayer Security Framework

Title: YolovN-CBi: A Lightweight and Efficient Architecture for Real-Time Detection of Small UAVs

Title: FOODER: Real-time Facial Authentication and Expression Recognition

Title: FPBench: A Comprehensive Benchmark of Multimodal Large Language Models for Fingerprint Analysis

Title: Uncertainty-Gated Region-Level Retrieval for Robust Semantic Segmentation

Title: Microstructure-based Variational Neural Networks for Robust Uncertainty Quantification in Materials Digital Twins

Title: Learning Generalizable Neural Operators for Inverse Problems

Title: TraCeR: Transformer-Based Competing Risk Analysis with Longitudinal Covariates

Title: PermuteV: A Performant Side-channel-Resistant RISC-V Core Securing Edge AI Inference

Title: Grad: Guided Relation Diffusion Generation for Graph Augmentation in Graph Fraud Detection

Title: Local Patches Meet Global Context: Scalable 3D Diffusion Priors for Computed Tomography Reconstruction

Title: Conscious Data Contribution via Community-Driven Chain-of-Thought Distillation

Title: Atlas is Your Perfect Context: One-Shot Customization for Generalizable Foundational Medical Image Segmentation

Title: FairExpand: Individual Fairness on Graphs with Partial Similarity Information

Title: MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation

Title: Is There a Better Source Distribution than Gaussian? Exploring Source Distributions for Image Flow Matching

Title: ALIGN: Advanced Query Initialization with LiDAR-Image Guidance for Occlusion-Robust 3D Object Detection

Title: Multi-Part Object Representations via Graph Structures and Co-Part Discovery

Title: PROVEX: Enhancing SOC Analyst Trust with Explainable Provenance-Based IDS

Title: FedWiLoc: Federated Learning for Privacy-Preserving WiFi Indoor Localization

Title: Stable and Efficient Single-Rollout RL for Multimodal Reasoning

Title: GeoSense-AI: Fast Location Inference from Crisis Microblogs

Title: Multifaceted Exploration of Spatial Openness in Rental Housing: A Big Data Analysis in Tokyo's 23 Wards

Title: Joint Learning of Depth, Pose, and Local Radiance Field for Large Scale Monocular 3D Reconstruction

Title: SG-RIFE: Semantic-Guided Real-Time Intermediate Flow Estimation with Diffusion-Competitive Perceptual Quality

Title: Breaking Minds, Breaking Systems: Jailbreaking Large Language Models via Human-like Psychological Manipulation

Title: Spectral Discrepancy and Cross-modal Semantic Consistency Learning for Object Detection in Hyperspectral Image

Title: Loom: Diffusion-Transformer for Interleaved Generation

Title: Who Can See Through You? Adversarial Shielding Against VLM-Based Attribute Inference Attacks

Title: LeJOT: An Intelligent Job Cost Orchestration Solution for Databricks Platform

Title: FedSUM Family: Efficient Federated Learning Methods under Arbitrary Client Participation

Title: UniMPR: A Unified Framework for Multimodal Place Recognition with Arbitrary Sensor Configurations

Title: AL-GNN: Privacy-Preserving and Replay-Free Continual Graph Learning via Analytic Learning

Title: InstructNet: A Novel Approach for Multi-Label Instruction Classification through Advanced Deep Learning

Title: MORPHEUS: A Multidimensional Framework for Modeling, Measuring, and Mitigating Human Factors in Cybersecurity

Title: Embedded Safety-Aligned Intelligence via Differentiable Internal Alignment Embeddings

Title: MatE: Material Extraction from Single-Image via Geometric Prior

Title: MatSpray: Fusing 2D Material World Knowledge on 3D Geometry

Title: Trustworthy and Explainable Deep Reinforcement Learning for Safe and Energy-Efficient Process Control: A Use Case in Industrial Compressed Air Systems

Title: LIR$^3$AG: A Lightweight Rerank Reasoning Strategy Framework for Retrieval-Augmented Generation

Title: A two-stream network with global-local feature fusion for bone age assessment

Title: Towards Efficient Agents: A Co-Design of Inference Architecture and System

Title: Theodosian: A Deep Dive into Memory-Hierarchy-Centric FHE Acceleration

Title: LLM-based Few-Shot Early Rumor Detection with Imitation Agent

Title: DACE For Railway Acronym Disambiguation

Title: SRS-Stories: Vocabulary-constrained multilingual story generation for language learning

Title: Efficient Zero-Shot Inpainting with Decoupled Diffusion Guidance

Title: Towards Guided Descent: Optimization Algorithms for Training Neural Networks At Scale

Title: RecurGS: Interactive Scene Modeling via Discrete-State Recurrent Gaussian Fusion

Title: AraToken: Optimizing Arabic Tokenization with Normalization Pipeline and Language Extension for Qwen3

Title: Automated Mosaic Tesserae Segmentation via Deep Learning Techniques

Title: MoE Pathfinder: Trajectory-driven Expert Pruning

Title: Federated Learning Based Decentralized Adaptive Intelligent Transmission Protocol for Privacy Preserving 6G Networks

Title: MeniMV: A Multi-view Benchmark for Meniscus Injury Severity Grading

Title: An Agentic AI Framework for Training General Practitioner Student Skills

Title: On the Universality of Transformer Architectures; How Much Attention Is Enough?

Title: Object-Centric Framework for Video Moment Retrieval

Title: Secret mixtures of experts inside your LLM

Title: Out-of-Distribution Detection in Molecular Complexes via Diffusion Models for Irregular Graphs

Title: Plasticine: A Traceable Diffusion Model for Medical Image Translation

Title: SoK: Understanding (New) Security Issues Across AI4Code Use Cases

Title: Self-organizing maps for water quality assessment in reservoirs and lakes: A systematic literature review

Title: APC-GNN++: An Adaptive Patient-Centric GNN with Context-Aware Attention and Mini-Graph Explainability for Diabetes Classification

Title: Research on a hybrid LSTM-CNN-Attention model for text-based web content classification

Title: QLink: Quantum-Safe Bridge Architecture for Blockchain Interoperability

Title: Enhancing Decision-Making in Windows PE Malware Classification During Dataset Shifts with Uncertainty Estimation

Title: Adaptive-VoCo: Complexity-Aware Visual Token Compression for Vision-Language Models

Title: PlantDiseaseNet-RT50: A Fine-tuned ResNet50 Architecture for High-Accuracy Plant Disease Detection Beyond Standard CNNs

Title: NASTaR: NovaSAR Automated Ship Target Recognition Dataset

Title: Teaching and Critiquing Conceptualization and Operationalization in NLP

Title: Feature-Enhanced Graph Neural Networks for Classification of Synthetic Graph Generative Models: A Benchmarking Study

Title: Detection of AI Generated Images Using Combined Uncertainty Measures and Particle Swarm Optimised Rejection Mechanism

Title: WoundNet-Ensemble: A Novel IoMT System Integrating Self-Supervised Deep Learning and Multi-Model Fusion for Automated, High-Accuracy Wound Classification and Healing Progression Monitoring

Title: Generalization Gaps in Political Fake News Detection: An Empirical Study on the LIAR Dataset

Title: SecureCode v2.0: A Production-Grade Dataset for Training Security-Aware Code Generation Models

Title: LLMs on Drugs: Language Models Are Few-Shot Consumers

Title: Enhancing Medical Large Vision-Language Models via Alignment Distillation

Title: Proof of Authenticity of General IoT Information with Tamper-Evident Sensors and Blockchain

Title: OpenView: Empowering MLLMs with Out-of-view VQA

Title: Comparing Dynamical Models Through Diffeomorphic Vector Field Alignment

Title: Placenta Accreta Spectrum Detection Using an MRI-based Hybrid CNN-Transformer Model

Title: SD2AIL: Adversarial Imitation Learning from Synthetic Demonstrations via Diffusion Models

Title: DNA-HHE: Dual-mode Near-network Accelerator for Hybrid Homomorphic Encryption on the Edge

Title: From Scratch to Fine-Tuned: A Comparative Study of Transformer Training Strategies for Legal Machine Translation

Title: Benchmarking neural surrogates on realistic spatiotemporal multiphysics flows

Title: Commercial Vehicle Braking Optimization: A Robust SIFT-Trajectory Approach

Title: SimpleCall: A Lightweight Image Restoration Agent in Label-Free Environments with MLLM Perceptual Feedback

Title: The Interaction Bottleneck of Deep Neural Networks: Discovery, Proof, and Modulation

Title: A Comparative Study of Light-weight Language Models for PII Masking and their Deployment for Real Conversational Texts

Title: Text2Graph VPR: A Text-to-Graph Expert System for Explainable Place Recognition in Changing Environments

Title: LLM-CAS: Dynamic Neuron Perturbation for Real-Time Hallucination Correction

Title: Multi-user Pufferfish Privacy

Title: From Shortcut to Induction Head: How Data Diversity Shapes Algorithm Selection in Transformers

Title: Uni-Neur2Img: Unified Neural Signal-Guided Image Generation, Editing, and Stylization via Diffusion Transformers

Title: Volley Revolver: A Novel Matrix-Encoding Method for Privacy-Preserving Deep Learning (Inference++)

Title: Adversarial Robustness in Zero-Shot Learning:An Empirical Study on Class and Concept-Level Vulnerabilities

Title: Does It Tie Out? Towards Autonomous Legal Agents in Venture Capital

Title: PMPGuard: Catching Pseudo-Matched Pairs in Remote Sensing Image-Text Retrieval

Title: SmartSight: Mitigating Hallucination in Video-LLMs Without Compromising Video Understanding via Temporal Attention Collapse

Title: AsyncDiff: Asynchronous Timestep Conditioning for Enhanced Text-to-Image Diffusion Inference

Title: brat: Aligned Multi-View Embeddings for Brain MRI Analysis

Title: Solver-Independent Automated Problem Formulation via LLMs for High-Cost Simulation-Driven Design

Title: A Study of Finetuning Video Transformers for Multi-view Geometry Tasks

Title: Fusion of Multiscale Features Via Centralized Sparse-attention Network for EEG Decoding

Title: EcoSplat: Efficiency-controllable Feed-forward 3D Gaussian Splatting from Multi-view Images

Title: Generating Risky Samples with Conformity Constraints via Diffusion Models

Title: A Theoretical Lens for RL-Tuned Language Models via Energy-Based Models

Title: Explainable and Fine-Grained Safeguarding of LLM Multi-Agent Systems via Bi-Level Graph Anomaly Detection

Title: Is Your Conditional Diffusion Model Actually Denoising?

Title: Memorize-and-Generate: Towards Long-Term Consistency in Real-Time Video Generation

Title: MemEvolve: Meta-Evolution of Agent Memory Systems

Title: IPCV: Information-Preserving Compression for MLLM Visual Encoders

Title: Context-Aware Network Based on Multi-scale Spatio-temporal Attention for Action Recognition in Videos

Title: ISADM: An Integrated STRIDE, ATT&CK, and D3FEND Model for Threat Modeling Against Real-world Adversaries

Title: MaskFocus: Focusing Policy Optimization on Critical Steps for Masked Image Generation

Title: In-Context Audio Control of Video Diffusion Transformers

Title: Eff-GRot: Efficient and Generalizable Rotation Estimation with Transformers

Title: Tempo as the Stable Cue: Hierarchical Mixture of Tempo and Beat Experts for Music to 3D Dance Generation

Title: FedVideoMAE: Efficient Privacy-Preserving Federated Video Moderation

Title: EchoMotion: Unified Human Video and Motion Generation via Dual-Modality Diffusion Transformer

Title: Controllable Probabilistic Forecasting with Stochastic Decomposition Layers

Title: From Word to World: Can Large Language Models be Implicit Text-based World Models?

Title: Generative Modeling through Spectral Analysis of Koopman Operator

Title: MDToC: Metacognitive Dynamic Tree of Concepts for Boosting Mathematical Problem-Solving of Large Language Models

Title: Brain-Gen: Towards Interpreting Neural Signals for Stimulus Reconstruction Using Transformers and Latent Diffusion Models

Title: VizDefender: Unmasking Visualization Tampering through Proactive Localization and Intent Inference

Title: Toward Human-Centered AI-Assisted Terminology Work

Title: Cross-modal Counterfactual Explanations: Uncovering Decision Factors and Dataset Biases in Subjective Classification

Title: CrashChat: A Multimodal Large Language Model for Multitask Traffic Crash Video Analysis

Title: Can LLMs Estimate Student Struggles? Human-AI Difficulty Alignment with Proficiency Simulation for Item Difficulty Prediction

Title: Remedy-R: Generative Reasoning for Machine Translation Evaluation without Error Annotations

Title: Delta-LLaVA: Base-then-Specialize Alignment for Token-Efficient Vision-Language Models

Title: Merging of Kolmogorov-Arnold networks trained on disjoint datasets

Title: The Ensemble Schr{ö}dinger Bridge filter for Nonlinear Data Assimilation

Title: LouvreSAE: Sparse Autoencoders for Interpretable and Controllable Style Transfer

Title: DPSR: Differentially Private Sparse Reconstruction via Multi-Stage Denoising for Recommender Systems

Title: Point What You Mean: Visually Grounded Instruction Policy

Title: When Less is More: 8-bit Quantization Improves Continual Learning in Large Language Models

Title: FASTRIC: Prompt Specification Language for Verifiable LLM Interactions

Title: Learning Hierarchical Procedural Memory for LLM Agents through Bayesian Selection and Contrastive Refinement

Title: Symmetrization of 3D Generative Models

Title: VOIC: Visible-Occluded Decoupling for Monocular 3D Semantic Scene Completion

Title: Scaling Online Distributionally Robust Reinforcement Learning: Sample-Efficient Guarantees with General Function Approximation

Title: DVI: Disentangling Semantic and Visual Identity for Training-Free Personalized Generation

Title: Lag Operator SSMs: A Geometric Framework for Structured State Space Modeling

Title: Total Curvature Regularization and its_Minimization for Surface and Image Smoothing

Title: R-GenIMA: Integrating Neuroimaging and Genetics with Interpretable Multimodal AI for Alzheimer's Disease Progression

Title: ICP-4D: Bridging Iterative Closest Point and LiDAR Panoptic Segmentation

Title: Evaluating the Challenges of LLMs in Real-world Medical Follow-up: A Comparative Study and An Optimized Framework

Title: Context-Aware Initialization for Reducing Generative Path Length in Diffusion Language Models

Title: Quantum-Resistant Cryptographic Models for Next-Gen Cybersecurity

Title: The 6th International Verification of Neural Networks Competition (VNN-COMP 2025): Summary and Results

Title: Efficient Jailbreak Mitigation Using Semantic Linear Classification in a Multi-Staged Pipeline

Title: DREAM: Dynamic Red-teaming across Environments for AI Models

Title: Optimizer Dynamics at the Edge of Stability with Differential Privacy

Title: CETCAM: Camera-Controllable Video Generation via Consistent and Extensible Tokenization

Title: VLNVerse: A Benchmark for Vision-Language Navigation with Versatile, Embodied, Realistic Simulation and Evaluation

Title: Steering Vision-Language Pre-trained Models for Incremental Face Presentation Attack Detection

Title: The Erasure Illusion: Stress-Testing the Generalization of LLM Forgetting Evaluation

Title: Finer-Personalization Rank: Fine-Grained Retrieval Examines Identity Preservation for Personalized Generation

Title: Automatic Neuronal Activity Segmentation in Fast Four Dimensional Spatio-Temporal Fluorescence Imaging using Bayesian Approach

Title: Elevating Intrusion Detection and Security Fortification in Intelligent Networks through Cutting-Edge Machine Learning Paradigms

Title: WaTeRFlow: Watermark Temporal Robustness via Flow Consistency

Title: Decoupled Generative Modeling for Human-Object Interaction Synthesis

Title: Efficient Personalization of Generative Models via Optimal Experimental Design

Title: 6DAttack: Backdoor Attacks in the 6DoF Pose Estimation

Title: Retrieving Objects from 3D Scenes with Box-Guided Open-Vocabulary Instance Segmentation

Title: Auditing Significance, Metric Choice, and Demographic Fairness in Medical AI Challenges

Title: A Large Language Model Based Method for Complex Logical Reasoning over Knowledge Graphs

Title: Dual Model Deep Learning for Alzheimer Prognostication

Title: Timely Parameter Updating in Over-the-Air Federated Learning

Title: HyperLoad: A Cross-Modality Enhanced Large Language Model-Based Framework for Green Data Center Cooling Load Prediction

Title: Generative Giants, Retrieval Weaklings: Why do Multimodal Large Language Models Fail at Multimodal Retrieval?

Title: Stop saying LLM: Large Discourse Models (LDM) and Artificial Discursive Agent (ADA)?

Title: ShadowBlock: Efficient Dynamic Anonymous Blocklisting and Its Cross-chain Application

Title: SAP: Syntactic Attention Pruning for Transformer-based Language Models

Title: AWPO: Enhancing Tool-Use of Large Language Models through Explicit Integration of Reasoning Rewards

Title: QuCo-RAG: Quantifying Uncertainty from the Pre-training Corpus for Dynamic Retrieval-Augmented Generation

Title: RP-CATE: Recurrent Perceptron-based Channel Attention Transformer Encoder for Industrial Hybrid Modeling

Title: Beyond Sliding Windows: Learning to Manage Memory in Non-Markovian Environments

Title: OmniMoGen: Unifying Human Motion Generation via Learning from Interleaved Text-Motion Instructions

Title: JEPA-Reasoner: Decoupling Latent Reasoning from Token Generation

Title: Practical Quantum-Classical Feature Fusion for complex data Classification

Title: Operator-Based Generalization Bound for Deep Learning: Insights on Multi-Task Learning

Title: Evaluating MCC for Low-Frequency Cyberattack Detection in Imbalanced Intrusion Detection Data

Title: MixKVQ: Query-Aware Mixed-Precision KV Cache Quantization for Long-Context Reasoning

Title: InvCoSS: Inversion-driven Continual Self-supervised Learning in Medical Multi-modal Image Pre-training

Title: HippMetric: A skeletal-representation-based framework for cross-sectional and longitudinal hippocampal substructural morphometry

Title: Towards Minimal Fine-Tuning of VLMs

Title: Regression generation adversarial network based on dual data evaluation strategy for industrial application

Title: Identifying Features Associated with Bias Against 93 Stigmatized Groups in Language Models and Guardrail Model Safety Mitigation

Title: ChemATP: A Training-Free Chemical Reasoning Framework for Large Language Models

Title: VisionDirector: Vision-Language Guided Closed-Loop Refinement for Generative Image Synthesis

Title: Auto-Prompting with Retrieval Guidance for Frame Detection in Logistics

Title: Small Language Models as Compiler Experts: Auto-Parallelization for Heterogeneous Systems

Title: Is Visual Realism Enough? Evaluating Gait Biometric Fidelity in Generative AI Human Animation

Title: Hand-Aware Egocentric Motion Reconstruction with Sequence-Level Context

Title: GShield: Mitigating Poisoning Attacks in Federated Learning

Title: Causal-Guided Detoxify Backdoor Attack of Open-Weight LoRA Models

Title: RMLer: Synthesizing Novel Objects across Diverse Categories via Reinforcement Mixing Learning

Title: Bridging Semantics and Geometry: A Decoupled LVLM-SAM Framework for Reasoning Segmentation in Remote Sensing

Title: CienaLLM: Generative Climate-Impact Extraction from News Articles with Autoregressive LLMs

Title: Time-Vertex Machine Learning for Optimal Sensor Placement in Temporal Graph Signals: Applications in Structural Health Monitoring

Title: MixFlow Training: Alleviating Exposure Bias with Slowed Interpolation Mixture

Title: Protecting Quantum Circuits Through Compiler-Resistant Obfuscation

Title: Neural Implicit Heart Coordinates: 3D cardiac shape reconstruction from sparse segmentations

Title: Alternative positional encoding functions for neural transformers

Title: DeltaMIL: Gated Memory Integration for Efficient and Discriminative Whole Slide Image Analysis

Title: GANeXt: A Fully ConvNeXt-Enhanced Generative Adversarial Network for MRI- and CBCT-to-CT Synthesis

Title: ReasonCD: A Multimodal Reasoning Large Model for Implicit Change-of-Interest Semantic Mining

Title: Interpretable Hybrid Deep Q-Learning Framework for IoT-Based Food Spoilage Prediction with Synthetic Data Generation and Hardware Validation

Title: From Points to Coalitions: Hierarchical Contrastive Shapley Values for Prioritizing Data Samples

Title: Efficient Spike-driven Transformer for High-performance Drone-View Geo-Localization

Title: HATS: High-Accuracy Triple-Set Watermarking for Large Language Models

Title: OmniMER: Indonesian Multimodal Emotion Recognition via Auxiliary-Enhanced LLM Adaptation

Title: Brain-Grounded Axes for Reading and Steering LLM States

Title: Kunnafonidilaw ka Cadeau: an ASR dataset of present-day Bambara

Title: From Retrieval to Reasoning: A Framework for Cyber Threat Intelligence NER with Explicit and Adaptive Instructions

Title: CodeSimpleQA: Scaling Factuality in Code Large Language Models

Title: Attention Is Not What You Need

Title: MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive, and MCP-Augmented Environments

Title: dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models

Title: MT-Mark: Rethinking Image Watermarking via Mutual-Teacher Collaboration with Adaptive Feature Modulation

Title: An Inverse Scattering Inspired Fourier Neural Operator for Time-Dependent PDE Learning

Title: D2Pruner: Debiased Importance and Structural Diversity for MLLM Token Pruning

Title: Sign Language Recognition using Parallel Bidirectional Reservoir Computing

Title: SiamGPT: Quality-First Fine-Tuning for Stable Thai Text Generation

Title: Activations as Features: Probing LLMs for Generalizable Essay Scoring Representations

Title: Multi-Layer Confidence Scoring for Detection of Out-of-Distribution Samples, Adversarial Attacks, and In-Distribution Misclassifications

Title: A Large-Language-Model Framework for Automated Humanitarian Situation Reporting

Title: Emotion-Director: Bridging Affective Shortcut in Emotion-Oriented Image Generation

Title: Lightweight Intrusion Detection in IoT via SHAP-Guided Feature Pruning and Knowledge-Distilled Kronecker Networks

Title: FusionNet: Physics-Aware Representation Learning for Multi-Spectral and Thermal Data via Trainable Signal-Processing Priors

Title: Anatomy-R1: Enhancing Anatomy Reasoning in Multimodal Large Language Models via Anatomical Similarity Curriculum and Group Diversity Augmentation

Title: LacaDM: A Latent Causal Diffusion Model for Multiobjective Reinforcement Learning

Title: A Convolutional Neural Deferred Shader for Physics Based Rendering

Title: Initialization of a Polyharmonic Cascade, Launch and Testing

Title: Multi-Modal Soccer Scene Analysis with Masked Pre-Training

Title: Learning Continuous Solvent Effects from Transient Flow Data: A Graph Neural Network Benchmark on Catechol Rearrangement

Title: Event Extraction in Large Language Model

Title: StoryMem: Multi-shot Long Video Storytelling with Memory

Title: ActAvatar: Temporally-Aware Precise Action Control for Talking Avatars

Title: BabyFlow: 3D modeling of realistic and expressive infant faces

Title: Increasing the Thinking Budget is Not All You Need

Title: No Data? No Problem: Robust Vision-Tabular Learning with Missing Values

Title: MapTrace: Scalable Data Generation for Route Tracing on Maps

Title: MauBERT: Universal Phonetic Inductive Biases for Few-Shot Acoustic Units Discovery

Title: Exploring the features used for summary evaluation by Human and GPT

Title: Generative diffusion models for agricultural AI: plant image generation, indoor-to-outdoor translation, and expert preference alignment

Title: The Best of Both Worlds: Hybridizing Neural Operators and Solvers for Stable Long-Horizon Inference

Title: Exploring Zero-Shot ACSA with Unified Meaning Representation in Chain-of-Thought Prompting

Title: Over++: Generative Video Compositing for Layer Interaction Effects

Title: Beyond CLIP: Knowledge-Enhanced Multimodal Transformers for Cross-Modal Alignment in Diabetic Retinopathy Diagnosis

Title: Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

Title: Efficient Vision Mamba for MRI Super-Resolution via Hybrid Selective Scanning

Title: WorldWarp: Propagating 3D Geometry with Asynchronous Video Diffusion

Title: GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators

Title: From Indoor to Open World: Revealing the Spatial Reasoning Gap in MLLMs

Title: Interact2Ar: Full-Body Human-Human Interaction Generation via Autoregressive Diffusion Models