2025-08-07

Title: PLA: Prompt Learning Attack against Text-to-Image Generative Models

Title: Text2VR: Automated instruction Generation in Virtual Reality using Large language Models for Assembly Task

Title: How Deep Is Representational Bias in LLMs? The Cases of Caste and Religion

Title: FeynTune: Large Language Models for High-Energy Theory

Title: Multimodal Video Emotion Recognition with Reliable Reasoning Priors

Title: From Waveforms to Pixels: A Survey on Audio-Visual Segmentation

Title: A Large Language Model Powered Integrated Circuit Footprint Geometry Understanding

Title: Hierarchical Verification of Speculative Beams for Accelerating LLM Inference

Title: TIR-Diffusion: Diffusion-based Thermal Infrared Image Denoising via Latent and Wavelet Domain Optimization

Title: Privileged Contrastive Pretraining for Multimodal Affect Modelling

Title: What is Beneath Misogyny: Misogynous Memes Classification and Explanation

Title: CX-Mind: A Pioneering Multimodal Large Language Model for Interleaved Reasoning in Chest X-ray via Curriculum-Guided Reinforcement Learning

Title: StorySync: Training-Free Subject Consistency in Text-to-Image Generation via Region Harmonization

Title: Fusion of Pervasive RF Data with Spatial Images via Vision Transformers for Enhanced Mapping in Smart Cities

Title: VQ-DeepISC: Vector Quantized-Enabled Digital Semantic Communication with Channel Adaptive Image Transmission

Title: Latent Knowledge Scalpel: Precise and Massive Knowledge Editing for Large Language Models

Title: Closed-Circuit Television Data as an Emergent Data Source for Urban Rail Platform Crowding Estimation

Title: GlaBoost: A multimodal Structured Framework for Glaucoma Risk Stratification

Title: Modular Transformer Architecture for Precision Agriculture Imaging

Title: Generating Synthetic Invoices via Layout-Preserving Content Replacement

Title: LRTuckerRep: Low-rank Tucker Representation Model for Multi-dimensional Data Completion

Title: Refine-IQA: Multi-Stage Reinforcement Finetuning for Perceptual Image Quality Assessment

Title: LLM-Prior: A Framework for Knowledge-Driven Prior Elicitation and Aggregation

Title: Provably Near-Optimal Distributionally Robust Reinforcement Learning in Online Settings

Title: GTPO: Trajectory-Based Policy Optimization in Large Language Models

Title: U-PINet: End-to-End Hierarchical Physics-Informed Learning With Sparse Graph Coupling for 3D EM Scattering Modeling

Title: 4D-PreNet: A Unified Preprocessing Framework for 4D-STEM Data Analysis

Title: SoilNet: A Multimodal Multitask Model for Hierarchical Classification of Soil Horizons

Title: HPSv3: Towards Wide-Spectrum Human Preference Score

Title: AttnTrace: Attention-based Context Traceback for Long-Context LLMs

Title: Majority Bit-Aware Watermarking For Large Language Models

Title: DP-NCB: Privacy Preserving Fair Bandits

Title: VAE-DNN: Energy-Efficient Trainable-by-Parts Surrogate Model For Parametric Partial Differential Equations

Title: Hallucination to Truth: A Review of Fact-Checking and Factuality Evaluation in Large Language Models

Title: Data-Driven Spectrum Demand Prediction: A Spatio-Temporal Framework with Transfer Learning

Title: An Entity Linking Agent for Question Answering

Title: RX-INT: A Kernel Engine for Real-Time Detection and Analysis of In-Memory Threats

Title: Simulating Cyberattacks through a Breach Attack Simulation (BAS) Platform empowered by Security Chaos Engineering (SCE)

Title: Calibrating Biophysical Models for Grape Phenology Prediction via Multi-Task Learning

Title: Sotopia-RL: Reward Design for Social Intelligence

Title: CoAct-1: Computer-using Agents with Coding as Actions

Title: Point-Based Shape Representation Generation with a Correspondence-Preserving Diffusion Model

Title: Next Generation Equation-Free Multiscale Modelling of Crowd Dynamics via Machine Learning

Title: Markov Chain Estimation with In-Context Learning

Title: CAP-LLM: Context-Augmented Personalized Large Language Models for News Headline Generation

Title: ASTRA: Autonomous Spatial-Temporal Red-teaming for AI Software Assistants

Title: FairPOT: Balancing AUC Performance and Fairness with Proportional Optimal Transport

Title: Policy to Assist Iteratively Local Segmentation: Optimising Modality and Location Selection for Prostate Cancer Localisation

Title: BubbleONet: A Physics-Informed Neural Operator for High-Frequency Bubble Dynamics

Title: RAVID: Retrieval-Augmented Visual Detection: A Knowledge-Driven Approach for AI-Generated Image Identification

Title: Data and AI governance: Promoting equity, ethics, and fairness in large language models

Title: Dynamic User-controllable Privacy-preserving Few-shot Sensing Framework

Title: Are Today's LLMs Ready to Explain Well-Being Concepts?

Title: Investigating the Impact of Large-Scale Pre-training on Nutritional Content Estimation from 2D Images

Title: JanusNet: Hierarchical Slice-Block Shuffle and Displacement for Semi-Supervised 3D Multi-Organ Segmentation

Title: Transferring Expert Cognitive Models to Social Robots via Agentic Concept Bottleneck Models

Title: Tensorized Clustered LoRA Merging for Multi-Task Interference

Title: CAD-Judge: Toward Efficient Morphological Grading and Verification for Text-to-CAD Generation

Title: Decoupled Contrastive Learning for Federated Learning

Title: HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Optimization

Title: Step More: Going Beyond Single Backpropagation in Meta Learning Based Model Editing

Title: $\text{S}^2$Q-VDiT: Accurate Quantized Video Diffusion Transformer with Salient Data and Sparse Token Distillation

Title: Can Large Multimodal Models Actively Recognize Faulty Inputs? A Systematic Evaluation Framework of Their Input Scrutiny Ability

Title: Prototype-Driven Structure Synergy Network for Remote Sensing Images Segmentation

Title: Radar-Based NLoS Pedestrian Localization for Darting-Out Scenarios Near Parked Vehicles with Camera-Assisted Point Cloud Interpretation

Title: ZARA: Zero-shot Motion Time-Series Analysis via Knowledge and Retrieval Driven LLM Agents

Title: Large Reasoning Models Are Autonomous Jailbreak Agents

Title: VisualTrans: A Benchmark for Real-World Visual Transformation Reasoning

Title: Iterative pseudo-labeling based adaptive copy-paste supervision for semi-supervised tumor segmentation

Title: FeDaL: Federated Dataset Learning for Time Series Foundation Models

Title: Quantum Temporal Fusion Transformer

Title: DOMR: Establishing Cross-View Segmentation via Dense Object Matching

Title: Towards Globally Predictable k-Space Interpolation: A White-box Transformer Approach

Title: Uni-DocDiff: A Unified Document Restoration Model Based on Diffusion

Title: PAIRS: Parametric-Verified Adaptive Information Retrieval and Selection for Efficient RAG

Title: TCSAFormer: Efficient Vision Transformer with Token Compression and Sparse Attention for Medical Image Segmentation

Title: Beyond the Visible: Benchmarking Occlusion Perception in Multimodal Large Language Models

Title: TNet: Terrace Convolutional Decoder Network for Remote Sensing Image Semantic Segmentation

Title: Fine-tuning for Better Few Shot Prompting: An Empirical Comparison for Short Answer Grading

Title: FLAT: Latent-Driven Arbitrary-Target Backdoor Attacks in Federated Learning

Title: Adversarial Fair Multi-View Clustering

Title: Efficient Strategy for Improving Large Language Model (LLM) Capabilities

Title: GM-PRM: A Generative Multimodal Process Reward Model for Multimodal Mathematical Reasoning

Title: Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework

Title: Isolate Trigger: Detecting and Eradicating Evade-Adaptive Backdoors

Title: Model Inversion Attacks on Vision-Language Models: Do They Leak What They Learn?

Title: DET-GS: Depth- and Edge-Aware Regularization for High-Fidelity 3D Gaussian Splatting

Title: SenseCrypt: Sensitivity-guided Selective Homomorphic Encryption for Joint Federated Learning in Cross-Device Scenarios

Title: NEARL-CLIP: Interacted Query Adaptation with Orthogonal Regularization for Medical Vision-Language Understanding

Title: Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decode

Title: Unveiling Over-Memorization in Finetuning LLMs for Reasoning Tasks

Title: Conditional Latent Diffusion Models for Zero-Shot Instance Segmentation

Title: Excavate the potential of Single-Scale Features: A Decomposition Network for Water-Related Optical Image Enhancement

Title: SVC 2025: the First Multimodal Deception Detection Challenge

Title: DS$^2$Net: Detail-Semantic Deep Supervision Network for Medical Image Segmentation

Title: UniFGVC: Universal Training-Free Few-Shot Fine-Grained Vision Classification via Attribute-Aware Multimodal Retrieval

Title: COPO: Consistency-Aware Policy Optimization

Title: IDCNet: Guided Video Diffusion for Metric-Consistent RGBD Scene Generation with Precise Camera Control

Title: Difficulty-Based Preference Data Selection by DPO Implicit Reward Gap

Title: Evaluating Selective Encryption Against Gradient Inversion Attacks

Title: ToxicTAGS: Decoding Toxic Memes with Rich Tag Annotations

Title: AD-FM: Multimodal LLMs for Anomaly Detection via Multi-Stage Reasoning and Fine-Grained Reward Optimization

Title: Uncertainty-Aware Spatial Color Correlation for Low-Light Image Enhancement

Title: Secure Development of a Hooking-Based Deception Framework Against Keylogging Techniques

Title: One Small Step with Fingerprints, One Giant Leap for emph{De Novo} Molecule Generation from Mass Spectra

Title: Hacking Hallucinations of MLLMs with Causal Sufficiency and Necessity

Title: BadTime: An Effective Backdoor Attack on Multivariate Long-Term Time Series Forecasting

Title: RPCANet++: Deep Interpretable Robust PCA for Sparse Object Segmentation

Title: From Learning to Unlearning: Biomedical Security Protection in Multimodal Large Language Models

Title: Eliciting and Analyzing Emergent Misalignment in State-of-the-Art Large Language Models

Title: Gather and Trace: Rethinking Video TextVQA from an Instance-oriented Perspective

Title: Reasoning Beyond Labels: Measuring LLM Sentiment in Low-Resource, Culturally Nuanced Contexts

Title: ReasoningGuard: Safeguarding Large Reasoning Models with Inference-time Safety Aha Moments

Title: DP-DocLDM: Differentially Private Document Image Generation using Latent Diffusion Models

Title: What Holds Back Open-Vocabulary Segmentation?

Title: Hierarchical Text Classification Using Black Box Large Language Models

Title: SplitGaussian: Reconstructing Dynamic Scenes via Visual Geometry Decomposition

Title: LayerT2V: Interactive Multi-Object Trajectory Layering for Video Generation

Title: Intention Enhanced Diffusion Model for Multimodal Pedestrian Trajectory Prediction

Title: Empowering Time Series Forecasting with LLM-Agents

Title: DocVCE: Diffusion-based Visual Counterfactual Explanations for Document Image Classification

Title: PIS3R: Very Large Parallax Image Stitching via Deep 3D Reconstruction

Title: DP-GPT4MTS: Dual-Prompt Large Language Model for Textual-Numerical Time Series Forecasting

Title: TalkDep: Clinically Grounded LLM Personas for Conversation-Centric Depression Screening

Title: T3Time: Tri-Modal Time Series Forecasting via Adaptive Multi-Head Alignment and Residual Fusion

Title: KVSink: Understanding and Enhancing the Preservation of Attention Sinks in KV Cache Quantization for LLMs

Title: Segment Any Vehicle: Semantic and Visual Context Driven SAM and A Benchmark

Title: Revisiting Continual Semantic Segmentation with Pre-trained Vision Models

Title: A Few Words Can Distort Graphs: Knowledge Poisoning Attacks on Graph-based Retrieval-Augmented Generation of Large Language Models

Title: Mockingbird: How does LLM perform in general machine learning tasks?

Title: Per-element Secure Aggregation against Data Reconstruction Attacks in Federated Learning

Title: PKSS-Align: Robust Point Cloud Registration on Pre-Kendall Shape Space

Title: Length Matters: Length-Aware Transformer for Temporal Sentence Grounding

Title: A Foundation Model for DAS Signal Recognition and Visual Prompt Tuning of the Pre-trained Model for Downstream Tasks

Title: TempFlow-GRPO: When Timing Matters for GRPO in Flow Models

Title: Beyond the Leaderboard: Rethinking Medical Benchmarks for Large Language Models

Title: Forgetting: A New Mechanism Towards Better Large Language Model Fine-tuning

Title: Modelling and Classifying the Components of a Literature Review

Title: From Split to Share: Private Inference with Distributed Feature Sharing

Title: GTPO and GRPO-S: Token and Sequence-Level Reward Shaping with Policy Entropy

Title: Chain of Questions: Guiding Multimodal Curiosity in Language Models

Title: Multi-Marginal Stochastic Flow Matching for High-Dimensional Snapshot Data at Irregular Time Points

Title: TSPO: Temporal Sampling Policy Optimization for Long-form Video Language Understanding

Title: ProtoN: Prototype Node Graph Neural Network for Unconstrained Multi-Impression Ear Recognition

Title: Improving Crash Data Quality with Large Language Models: Evidence from Secondary Crash Narratives in Kentucky

Title: Why are LLMs' abilities emergent?

Title: FlexQ: Efficient Post-training INT6 Quantization for LLM Serving via Algorithm-System Co-Design

Title: Deep Learning-based Scalable Image-to-3D Facade Parser for Generating Thermal 3D Building Models

Title: Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning

Title: Think Before You Segment: An Object-aware Reasoning Agent for Referring Audio-Visual Segmentation

Title: Efficient Inter-Task Attention for Multitask Transformer Models

Title: Composed Object Retrieval: Object-level Retrieval via Composed Expressions

Title: Decoding the Multimodal Maze: A Systematic Review on the Adoption of Explainability in Multimodal Attention-based Models

Title: Benchmarking Foundation Models for Mitotic Figure Classification

Title: Automated Generation of Curriculum-Aligned Multiple-Choice Questions for Malaysian Secondary Mathematics Using Generative AI

Title: Matrix-Free Two-to-Infinity and One-to-Two Norms Estimation

Title: Cloud Model Characteristic Function Auto-Encoder: Integrating Cloud Model Theory with MMD Regularization for Enhanced Generative Modeling

Title: Automatic LLM Red Teaming

Title: Small transformer architectures for task switching

Title: CARD: Cache-Assisted Parallel Speculative Decoding for Efficient Large Language Model Inference

Title: GFocal: A Global-Focal Neural Operator for Solving PDEs on Arbitrary Geometries

Title: 4DVD: Cascaded Dense-view Video Diffusion Model for High-quality 4D Content Generation

Title: FrEVL: Leveraging Frozen Pretrained Embeddings for Efficient Vision-Language Understanding

Title: FedHiP: Heterogeneity-Invariant Personalized Federated Learning Through Closed-Form Solutions

Title: Zero-Residual Concept Erasure via Progressive Alignment in Text-to-Image Model

Title: Emotion Detection Using Conditional Generative Adversarial Networks (cGAN): A Deep Learning Approach

Title: QuantVSR: Low-Bit Post-Training Quantization for Real-World Video Super-Resolution

Title: Learning Robust Intervention Representations with Delta Embeddings

Title: Causal Reflection with Language Models

Title: PRISM: Lightweight Multivariate Time-Series Classification through Symmetric Multi-Resolution Convolutional Layers

Title: Skeleton Motion Words for Unsupervised Skeleton-Based Temporal Action Segmentation

Title: Channel-Independent Federated Traffic Prediction

Title: RAIDX: A Retrieval-Augmented Generation and GRPO Reinforcement Learning Framework for Explainable Deepfake Detection

Title: StyliTruth : Unlocking Stylized yet Truthful LLM Generation via Disentangled Steering

Title: No Masks Needed: Explainable AI for Deriving Segmentation from Classification

Title: TopKD: Top-scaled Knowledge Distillation

Title: InceptoFormer: A Multi-Signal Neural Framework for Parkinson's Disease Severity Evaluation from Gait

Title: Privacy Risk Predictions Based on Fundamental Understanding of Personal Data and an Evolving Threat Landscape

Title: MSC: A Marine Wildlife Video Dataset with Grounded Segmentation and Clip-Level Captioning

Title: Two-Way Garment Transfer: Unified Diffusion Framework for Dressing and Undressing Synthesis

Title: Augmentation-based Domain Generalization and Joint Training from Multiple Source Domains for Whole Heart Segmentation

Title: One Model For All: Partial Diffusion for Unified Try-On and Try-Off in Any Pose

Title: Attack Pattern Mining to Discover Hidden Threats to Industrial Control Systems

Title: Drone Detection with Event Cameras

Title: TAlignDiff: Automatic Tooth Alignment assisted by Diffusion-based Transformation Learning

Title: Analyzing and Mitigating Object Hallucination: A Training Bias Perspective

Title: DDTracking: A Deep Generative Framework for Diffusion MRI Tractography with Streamline Local-Global Spatiotemporal Modeling

Title: Visual Bias and Interpretability in Deep Learning for Dermatological Image Analysis

Title: Share Your Attention: Transformer Weight Sharing via Matrix-based Dictionary Learning

Title: Measuring the Carbon Footprint of Cryptographic Privacy-Enhancing Technologies

Title: Pseudo Depth Meets Gaussian: A Feed-forward RGB SLAM Baseline

Title: TURA: Tool-Augmented Unified Retrieval Agent for AI Search

Title: Multitask Learning with Stochastic Interpolants

Title: Neuromorphic Cybersecurity with Semi-supervised Lifelong Learning

Title: OmniDepth: Bridging Monocular and Stereo Reasoning with Latent Alignment

Title: How Does Bilateral Ear Symmetry Affect Deep Ear Features?

Title: Lightweight Transformers for Zero-Shot and Fine-Tuned Text-to-SQL Generation Using Spider

Title: FinMMR: Make Financial Numerical Reasoning More Multimodal, Comprehensive, and Challenging

Title: P-Aligner: Enabling Pre-Alignment of Language Models via Principled Instruction Synthesis

Title: CaPulse: Detecting Anomalies by Tuning in to the Causal Rhythms of Time Series

Title: IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards

Title: 4-Swap: Achieving Grief-Free and Bribery-Safe Atomic Swaps Using Four Transactions

Title: X-SAM: From Segment Anything to Any Segmentation

Title: YOLOv8-Based Deep Learning Model for Automated Poultry Disease Detection and Health Monitoring paper

Title: Multi-module GRPO: Composing Policy Gradients and Prompt Optimization for Language Model Programs

Title: HierarchicalPrune: Position-Aware Compression for Large-Scale Diffusion Models

Title: Sculptor: Empowering LLMs with Cognitive Agency via Active Context Management

Title: Perch 2.0: The Bittern Lesson for Bioacoustics

Title: Robustly Learning Monotone Single-Index Models

Title: GeRe: Towards Efficient Anti-Forgetting in Continual Learning of LLM via General Samples Replay

Title: ANPrompt: Anti-noise Prompt Tuning for Vision-Language Models

Title: Hop, Skip, and Overthink: Diagnosing Why Reasoning Models Fumble during Multi-Hop Analysis

Title: BEVCon: Advancing Bird's Eye View Perception with Contrastive Learning