2025-03-26

Title: A Survey on Structured State Space Sequence (S4) Models

Title: FedSKD: Aggregation-free Model-heterogeneous Federated Learning using Multi-dimensional Similarity Knowledge Distillation

Title: Generative Data Imputation for Sparse Learner Performance Data Using Generative Adversarial Imputation Networks

Title: SplitFrozen: Split Learning with Device-side Model Frozen for Fine-Tuning LLM on Heterogeneous Resource-Constrained Devices

Title: A Novel Hat-Shaped Device-Cloud Collaborative Inference Framework for Large Language Models

Title: SRMIR: Shadow Reward Models Based on Introspective Reasoning for LLM Alignment

Title: Improving Food Image Recognition with Noisy Vision Transformer

Title: DisentTalk: Cross-lingual Talking Face Generation via Semantic Disentangled Diffusion Model

Title: Option Discovery Using LLM-guided Semantic Hierarchical Reinforcement Learning

Title: RomanTex: Decoupling 3D-aware Rotary Positional Embedded Multi-Attention Network for Texture Synthesis

Title: DiffV2IR: Visible-to-Infrared Diffusion Model via Vision-Language Understanding

Title: strideSEA: A STRIDE-centric Security Evaluation Approach

Title: Color Conditional Generation with Sliced Wasserstein Guidance

Title: LookAhead Tuning: Safer Language Models via Partial Answer Previews

Title: Coding Malware in Fancy Programming Languages for Fun and Profit

Title: Graph-Level Label-Only Membership Inference Attack against Graph Neural Networks

Title: HingeRLC-GAN: Combating Mode Collapse with Hinge Loss and RLC Regularization

Title: Paving the way for scientific foundation models: enhancing generalization and robustness in PDEs with constraint-aware pre-training

Title: LLM-Based Insight Extraction for Contact Center Analytics and Cost-Efficient Deployment

Title: Uncertainty-Aware Decomposed Hybrid Networks

Title: Masks and Mimicry: Strategic Obfuscation and Impersonation Attacks on Authorship Verification

Title: Anomaly Detection Using Computer Vision: A Comparative Analysis of Class Distinction and Performance Metrics

Title: Your ViT is Secretly an Image Segmentation Model

Title: Understanding and Improving Information Preservation in Prompt Compression for LLMs

Title: Where is this coming from? Making groundedness count in the evaluation of Document VQA models

Title: Overcoming Vocabulary Mismatch: Vocabulary-agnostic Teacher Guided Language Modeling

Title: MIRAGE: Multimodal Immersive Reasoning and Guided Exploration for Red-Team Jailbreak Attacks

Title: Activation Functions Considered Harmful: Recovering Neural Network Weights through Controlled Channels

Title: Compositional Caching for Training-free Open-vocabulary Attribute Detection

Title: Risk-Based Thresholding for Reliable Anomaly Detection in Concentrated Solar Power Plants

Title: HOIGPT: Learning Long Sequence Hand-Object Interaction with Language Models

Title: Language Model Uncertainty Quantification with Attention Chain

Title: Graph neural networks extrapolate out-of-distribution for shortest paths

Title: SoK: How Robust is Audio Watermarking in Generative AI models?

Title: Evaluating Bias in LLMs for Job-Resume Matching: Gender, Race, and Education

Title: FDS: Frequency-Aware Denoising Score for Text-Guided Latent Diffusion Image Editing

Title: Mining-Gym: A Configurable RL Benchmarking Environment for Truck Dispatch Scheduling

Title: Open-Vocabulary Functional 3D Scene Graphs for Real-World Indoor Spaces

Title: A Shared Low-Rank Adaptation Approach to Personalized RLHF

Title: Benchmarking Object Detectors under Real-World Distribution Shifts in Satellite Imagery

Title: Overtrained Language Models Are Harder to Fine-Tune

Title: FRESA:Feedforward Reconstruction of Personalized Skinned Avatars from Few Images

Title: Byzantine Resilient Federated Multi-Task Representation Learning

Title: Towards Terminology Management Automation for Arabic

Title: A Survey of Large Language Model Agents for Question Answering

Title: Face Spoofing Detection using Deep Learning

Title: SCI-IDEA: Context-Aware Scientific Ideation Using Token and Sentence Embeddings

Title: Adaptive Multi-Order Graph Regularized NMF with Dual Sparsity for Hyperspectral Unmixing

Title: Linguistic Blind Spots of Large Language Models

Title: Learning Hazing to Dehazing: Towards Realistic Haze Generation for Real-World Image Dehazing

Title: DWIM: Towards Tool-aware Visual Reasoning via Discrepancy-aware Workflow Generation & Instruct-Masking Tuning

Title: PHEONA: An Evaluation Framework for Large Language Model-based Approaches to Computational Phenotyping

Title: MARS: Memory-Enhanced Agents with Reflective Self-improvement

Title: Context-Aware Semantic Segmentation: Enhancing Pixel-Level Understanding with Large Language Models for Advanced Vision Applications

Title: Multiscale Feature Importance-based Bit Allocation for End-to-End Feature Coding for Machines

Title: Machine-assisted writing evaluation: Exploring pre-trained language models in analyzing argumentative moves

Title: ISPDiffuser: Learning RAW-to-sRGB Mappings with Texture-Aware Diffusion Models and Histogram-Guided Color Consistency

Title: No Black Box Anymore: Demystifying Clinical Predictive Modeling with Temporal-Feature Cross Attention Mechanism

Title: Exploring Semantic Feature Discrimination for Perceptual Image Super-Resolution and Opinion-Unaware No-Reference Image Quality Assessment

Title: UniMoMo: Unified Generative Modeling of 3D Molecules for De Novo Binder Design

Title: BIMII-Net: Brain-Inspired Multi-Iterative Interactive Network for RGB-T Road Scene Semantic Segmentation

Title: A Comprehensive Analysis of Mamba for 3D Volumetric Medical Image Segmentation

Title: Iterative Hypothesis Generation for Scientific Discovery with Monte Carlo Nash Equilibrium Self-Refining Trees

Title: LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer Text

Title: Efficient Adversarial Detection Frameworks for Vehicle-to-Microgrid Services in Edge Computing

Title: How to optimize K-means?

Title: Long-Context Autoregressive Video Modeling with Next-Frame Prediction

Title: ChA-MAEViT: Unifying Channel-Aware Masked Autoencoders and Multi-Channel Vision Transformers for Improved Cross-Channel Learning

Title: Membership Inference Attacks on Large-Scale Models: A Survey

Title: Efficient IoT Intrusion Detection with an Improved Attention-Based CNN-BiLSTM Architecture

Title: BADGR: Bundle Adjustment Diffusion Conditioned by GRadients for Wide-Baseline Floor Plan Reconstruction

Title: Stop Walking in Circles! Bailing Out Early in Projected Gradient Descent

Title: QUAD: Quantization and Parameter-Efficient Tuning of LLM with Activation Decomposition

Title: Data-driven Mesoscale Weather Forecasting Combining Swin-Unet and Diffusion Models

Title: ST-VLM: Kinematic Instruction Tuning for Spatio-Temporal Reasoning in Vision-Language Models

Title: Correcting Deviations from Normality: A Reformulated Diffusion Model for Multi-Class Unsupervised Anomaly Detection

Title: From Sparse to Dense: Camera Relocalization with Scene-Specific Detector from Feature Gaussian Splatting

Title: Show and Segment: Universal Medical Image Segmentation via In-Context Learning

Title: ImageSet2Text: Describing Sets of Images through Text

Title: VGAT: A Cancer Survival Analysis Framework Transitioning from Generative Visual Question Answering to Genomic Reconstruction

Title: EfficientMT: Efficient Temporal Adaptation for Motion Transfer in Text-to-Video Diffusion Models

Title: A Benign Activity Extraction Method for Malignant Activity Identification using Data Provenance

Title: DeClotH: Decomposable 3D Cloth and Human Body Reconstruction from a Single Image

Title: Interpretable Generative Models through Post-hoc Concept Bottlenecks

Title: Social Network User Profiling for Anomaly Detection Based on Graph Neural Networks

Title: MVPortrait: Text-Guided Motion and Emotion Control for Multi-view Vivid Portrait Animation

Title: Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing

Title: Exploring Textual Semantics Diversity for Image Transmission in Semantic Communication Systems using Visual Language Model

Title: QUIC-Fuzz: An Effective Greybox Fuzzer For The QUIC Protocol

Title: Multi-modal 3D Pose and Shape Estimation with Computed Tomography

Title: M$^2$CD: A Unified MultiModal Framework for Optical-SAR Change Detection with Mixture of Experts and Self-Distillation

Title: DeCAP: Context-Adaptive Prompt Generation for Debiasing Zero-shot Question Answering in Large Language Models

Title: Quantifying the Ease of Reproducing Training Data in Unconditional Diffusion Models

Title: COB-GS: Clear Object Boundaries in 3DGS Segmentation Based on Boundary-Adaptive Gaussian Splitting

Title: Towards Robust Time-of-Flight Depth Denoising with Confidence-Aware Diffusion Model

Title: SparseGS-W: Sparse-View 3D Gaussian Splatting in the Wild with Generative Priors

Title: Data-centric Federated Graph Learning with Large Language Models

Title: G-DexGrasp: Generalizable Dexterous Grasping Synthesis Via Part-Aware Prior Retrieval and Prior-Assisted Generation

Title: AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset

Title: Enhancing Small Language Models for Cross-Lingual Generalized Zero-Shot Classification with Soft Prompt Tuning

Title: A-MESS: Anchor based Multimodal Embedding with Semantic Synchronization for Multimodal Intent Recognition

Title: Extracting Interpretable Logic Rules from Graph Neural Networks

Title: TeLL Me what you cant see

Title: GenHancer: Imperfect Generative Models are Secretly Strong Vision-Centric Enhancers

Title: KSHSeek: Data-Driven Approaches to Mitigating and Detecting Knowledge-Shortcut Hallucinations in Generative Models

Title: Exploring Disentangled and Controllable Human Image Synthesis: From End-to-End to Stage-by-Stage

Title: SMT-EX: An Explainable Surrogate Modeling Toolbox for Mixed-Variables Design Exploration

Title: DomainCQA: Crafting Expert-Level QA from Domain-Specific Charts

Title: SparSamp: Efficient Provably Secure Steganography Based on Sparse Sampling

Title: Pose-Based Fall Detection System: Efficient Monitoring on Standard CPUs

Title: Improved Alignment of Modalities in Large Vision Language Models

Title: Towards Imperceptible Adversarial Attacks for Time Series Classification with Local Perturbations and Frequency Analysis

Title: FLEX: A Benchmark for Evaluating Robustness of Fairness in Large Language Models

Title: Tiling artifacts and trade-offs of feature normalization in the segmentation of large biological images

Title: Practical Fine-Tuning of Autoregressive Models on Limited Handwritten Texts

Title: Noise Resilient Over-The-Air Federated Learning In Heterogeneous Wireless Networks

Title: Scaling Laws of Synthetic Data for Language Models

Title: Dance Like a Chicken: Low-Rank Stylization for Human Motion Diffusion

Title: FedMM-X: A Trustworthy and Interpretable Framework for Federated Multi-Modal Learning in Dynamic Environments

Title: Improved tissue sodium concentration quantification in breast cancer by reducing partial volume effects: a preliminary study

Title: Context-Efficient Retrieval with Factual Decomposition

Title: Optimization through In-Context Learning and Iterative LLM Prompting for Nuclear Engineering Design Problems

Title: DynOPETs: A Versatile Benchmark for Dynamic Object Pose Estimation and Tracking in Moving Camera Scenarios

Title: Red Teaming with Artificial Intelligence-Driven Cyberattacks: A Scoping Review

Title: 1.4 Million Open-Source Distilled Reasoning Dataset to Empower Large Language Model Training

Title: Burst Image Super-Resolution with Mamba

Title: Substation Bill of Materials: A Novel Approach to Managing Supply Chain Cyber-risks on IEC 61850 Digital Substations

Title: An Efficient Data Reuse with Tile-Based Adaptive Stationary for Transformer Accelerators

Title: Show or Tell? Effectively prompting Vision-Language Models for semantic segmentation

Title: HausaNLP at SemEval-2025 Task 3: Towards a Fine-Grained Model-Aware Hallucination Detection

Title: Enhancing Graphical Lasso: A Robust Scheme for Non-Stationary Mean Data

Title: OpenSDI: Spotting Diffusion-Generated Images in the Open World

Title: RGB-Th-Bench: A Dense benchmark for Visual-Thermal Understanding of Vision Language Models

Title: BiblioPage: A Dataset of Scanned Title Pages for Bibliographic Metadata Extraction

Title: CoSimGen: Controllable Diffusion Model for Simultaneous Image and Mask Generation

Title: A multitask transformer to sign language translation using motion gesture primitives

Title: fine-CLIP: Enhancing Zero-Shot Fine-Grained Surgical Action Recognition with Vision-Language Models

Title: Unlocking the Hidden Potential of CLIP in Generalizable Deepfake Detection

Title: AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation

Title: Optimization of MedSAM model based on bounding box adaptive perturbation algorithm

Title: Bootstrap Your Own Views: Masked Ego-Exo Modeling for Fine-grained View-invariant Video Representations

Title: On What Depends the Robustness of Multi-source Models to Missing Data in Earth Observation?

Title: EventMamba: Enhancing Spatio-Temporal Locality with State Space Models for Event-Based Video Reconstruction

Title: CamSAM2: Segment Anything Accurately in Camouflaged Videos

Title: PCM : Picard Consistency Model for Fast Parallel Sampling of Diffusion Models

Title: How to RETIRE Tabular Data in Favor of Discrete Digital Signal Representation

Title: FUSE: Label-Free Image-Event Joint Monocular Depth Estimation via Frequency-Decoupled Alignment and Degradation-Robust Fusion

Title: ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation

Title: OpenLex3D: A New Evaluation Benchmark for Open-Vocabulary 3D Scene Representations

Title: A Managed Tokens Service for Securely Keeping and Distributing Grid Tokens

Title: BiPrompt-SAM: Enhancing Image Segmentation via Explicit Selection between Point and Text Prompts

Title: Resilient Sensor Fusion under Adverse Sensor Failures via Multi-Modal Expert Fusion

Title: LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation

Title: PyGraph: Robust Compiler Support for CUDA Graphs in PyTorch

Title: Fine-Grained Erasure in Text-to-Image Diffusion-based Foundation Models

Title: SITA: Structurally Imperceptible and Transferable Adversarial Attacks for Stylized Image Generation

Title: In the Blink of an Eye: Instant Game Map Editing using a Generative-AI Smart Brush

Title: PAVE: Patching and Adapting Video Large Language Models

Title: Unpaired Object-Level SAR-to-Optical Image Translation for Aircraft with Keypoints-Guided Diffusion Models

Title: SemEval-2025 Task 9: The Food Hazard Detection Challenge

Title: SeLIP: Similarity Enhanced Contrastive Language Image Pretraining for Multi-modal Head MRI

Title: Bitstream Collisions in Neural Image Compression via Adversarial Perturbations

Title: Domain-incremental White Blood Cell Classification with Privacy-aware Continual Learning

Title: FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model

Title: A Comparative Analysis of Word Segmentation, Part-of-Speech Tagging, and Named Entity Recognition for Historical Chinese Sources, 1900-1950

Title: Attention IoU: Examining Biases in CelebA using Attention Maps

Title: FALCONEye: Finding Answers and Localizing Content in ONE-hour-long videos with multi-modal LLMs

Title: Towards Online Multi-Modal Social Interaction Understanding

Title: Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking

Title: Capacity-Constrained Online Learning with Delays: Scheduling Frameworks and Regret Trade-offs

Title: NickPay, an Auditable, Privacy-Preserving, Nickname-Based Payment System

Title: CausalRAG: Integrating Causal Graphs into Retrieval-Augmented Generation

Title: Mask$^2$DiT: Dual Mask-based Diffusion Transformer for Multi-Scene Long Video Generation

Title: RCC-PFL: Robust Client Clustering under Noisy Labels in Personalized Federated Learning

Title: Scaling Down Text Encoders of Text-to-Image Diffusion Models

Title: CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning

Title: TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization

Title: ICE: Intrinsic Concept Extraction from a Single Image via Diffusion Models

Title: Tracktention: Leveraging Point Tracking to Attend Videos Faster and Better

Title: AvatarArtist: Open-Domain 4D Avatarization

Title: FullDiT: Multi-Task Video Generative Foundation Model with Full Attention

Title: CoLLM: A Large Language Model for Composed Image Retrieval

Title: SuperFlow++: Enhanced Spatiotemporal Consistency for Cross-Modal Data Pretraining

Title: PartRM: Modeling Part-Level Dynamics with Large Cross-State Reconstruction Model

Title: Learning 3D Object Spatial Relationships from Pre-trained 2D Diffusion Models

Title: EventFly: Event Camera Perception from Ground to the Sky