2025-04-16

Title: GPT Meets Graphs and KAN Splines: Testing Novel Frameworks on Multitask Fine-Tuned GPT-2 with LoRA

Title: LayerFlow: Layer-wise Exploration of LLM Embeddings using Uncertainty-aware Interlinked Projections

Title: ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness

Title: Federated Learning with Layer Skipping: Efficient Training of Large Language Models for Healthcare NLP

Title: MiMu: Mitigating Multiple Shortcut Learning Behavior of Transformers

Title: LEMUR Neural Network Dataset: Towards Seamless AutoML

Title: Beyond the Generative Learning Trilemma: Generative Model Assessment in Data Scarcity Domains

Title: VAE-based Feature Disentanglement for Data Augmentation and Compression in Generalized GNSS Interference Classification

Title: Efficient Process Reward Model Training via Active Learning

Title: Self-Controlled Dynamic Expansion Model for Continual Learning

Title: Data Augmentation Through Random Style Replacement

Title: H3AE: High Compression, High Speed, and High Quality AutoEncoder for Video Diffusion Models

Title: Demo: ViolentUTF as An Accessible Platform for Generative AI Red Teaming

Title: Energy Matching: Unifying Flow Matching and Energy-Based Models for Generative Modeling

Title: Beyond Chains of Thought: Benchmarking Latent-Space Reasoning Abilities in Large Language Models

Title: Skeleton-Based Intake Gesture Detection With Spatial-Temporal Graph Convolutional Networks

Title: Better Estimation of the KL Divergence Between Language Models

Title: Weight-of-Thought Reasoning: Exploring Neural Network Weights for Enhanced LLM Reasoning

Title: Relation-Rich Visual Document Generator for Visual Information Extraction

Title: Perturbed State Space Feature Encoders for Optical Flow with Event Cameras

Title: Achieving Optimal Tissue Repair Through MARL with Reward Shaping and Curriculum Learning

Title: Keyword Extraction, and Aspect Classification in Sinhala, English, and Code-Mixed Content

Title: EMAFusion: A Self-Optimizing System for Seamless LLM Selection and Integration

Title: The Tenth NTIRE 2025 Efficient Super-Resolution Challenge Report

Title: The Jailbreak Tax: How Useful are Your Jailbreak Outputs?

Title: Optimising Intrusion Detection Systems in Cloud-Edge Continuum with Knowledge Distillation for Privacy-Preserving and Efficient Communication

Title: Can LLMs Classify CVEs? Investigating LLMs Capabilities in Computing CVSS Vectors

Title: SpinMeRound: Consistent Multi-View Identity Generation Using Diffusion Models

Title: FuzzSense: Towards A Modular Fuzzing Framework for Autonomous Driving Software

Title: Leveraging Deep Operator Networks (DeepONet) for Acoustic Full Waveform Inversion (FWI)

Title: HELIOS: Adaptive Model And Early-Exit Selection for Efficient LLM Inference Serving

Title: Foundation Models for Remote Sensing: An Analysis of MLLMs for Object Localization

Title: PQ-CAN: A Framework for Simulating Post-Quantum Cryptography in Embedded Systems

Title: Frozen Layers: Memory-efficient Many-fidelity Hyperparameter Optimization

Title: CleanMAP: Distilling Multimodal LLMs for Confidence-Driven Crowdsourced HD Map Updates

Title: Encryption scheme based on Automorphism Group of Hermitian Function Field with Homomorphic Encryption

Title: Real-time Seafloor Segmentation and Mapping

Title: Minimal Sensing for Orienting a Solar Panel

Title: How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients

Title: The Art of Audience Engagement: LLM-Based Thin-Slicing of Scientific Talks

Title: Collaborative Bayesian Optimization via Wasserstein Barycenters

Title: AtlasD: Automatic Local Symmetry Discovery

Title: GUM-SAGE: A Novel Dataset and Approach for Graded Entity Salience Prediction

Title: Name of Thrones: Evaluating How LLMs Rank Student Names, Race, and Gender in Status Hierarchies

Title: The Sword of Damocles in ViTs: Computational Redundancy Amplifies Adversarial Transferability

Title: Power-scaled Bayesian Inference with Score-based Generative Models

Title: Tabular foundation model to detect empathy from visual cues

Title: GaSLight: Gaussian Splats for Spatially-Varying Lighting in HDR

Title: FlexiContracts: A Novel and Efficient Scheme for Upgrading Smart Contracts in Ethereum Blockchain

Title: FHBench: Towards Efficient and Personalized Federated Learning for Multimodal Healthcare

Title: IlluSign: Illustrating Sign Language Videos by Leveraging the Attention Mechanism

Title: CLASH: Evaluating Language Models on Judging High-Stakes Dilemmas from Multiple Perspectives

Title: OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding

Title: LayoutCoT: Unleashing the Deep Reasoning Potential of Large Language Models for Layout Generation

Title: Towards Spatially-Aware and Optimally Faithful Concept-Based Explanations

Title: LightFormer: A lightweight and efficient decoder for remote sensing image segmentation

Title: Moving Beyond Next-Token Prediction: Transformers are Context-Sensitive Language Generators

Title: How to Enhance Downstream Adversarial Robustness (almost) without Touching the Pre-Trained Foundation Model?

Title: ICAFS: Inter-Client-Aware Feature Selection for Vertical Federated Learning

Title: Enhancing Features in Long-tailed Data Using Large Vision Model

Title: PT-Mark: Invisible Watermarking for Text-to-image Diffusion Models via Semantic-aware Pivotal Tuning

Title: LVLM_CSP: Accelerating Large Vision Language Models via Clustering, Scattering, and Pruning for Reasoning Segmentation

Title: DAAF: Degradation-Aware Adaptive Fusion Framework for Robust Infrared and Visible Images Fusion

Title: Can Vision-Language Models Understand and Interpret Dynamic Gestures from Pedestrians? Pilot Datasets and Exploration Towards Instructive Nonverbal Commands for Cooperative Autonomous Vehicles

Title: Weather-Aware Object Detection Transformer for Domain Adaptation

Title: Large Language Model-Informed Feature Discovery Improves Prediction and Interpretation of Credibility Perceptions of Visual Content

Title: Safe-Construct: Redefining Construction Safety Violation Recognition as 3D Multi-View Engagement Task

Title: Bringing together invertible UNets with invertible attention modules for memory-efficient diffusion models

Title: CDUPatch: Color-Driven Universal Adversarial Patch Attack for Dual-Modal Visible-Infrared Detectors

Title: Bridging Distribution Gaps in Time Series Foundation Model Pretraining with Prototype-Guided Normalization

Title: InterAnimate: Taming Region-aware Diffusion Model for Realistic Human Interaction Animation

Title: Understanding LLMs' Cross-Lingual Context Retrieval: How Good It Is And Where It Comes From

Title: Towards A Universal Graph Structural Encoder

Title: Fast-Powerformer: A Memory-Efficient Transformer for Accurate Mid-Term Wind Power Forecasting

Title: Can LLMs Leverage Observational Data? Towards Data-Driven Causal Discovery with LLMs

Title: Improved MST3 Encryption scheme based on small Ree groups

Title: When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers

Title: An Efficient and Mixed Heterogeneous Model for Image Restoration

Title: AFiRe: Anatomy-Driven Self-Supervised Learning for Fine-Grained Representation in Radiographic Images

Title: Self-Supervised Enhancement of Forward-Looking Sonar Images: Bridging Cross-Modal Degradation Gaps through Feature Space Transformation and Multi-Frame Fusion

Title: Adaptive Decision Boundary for Few-Shot Class-Incremental Learning

Title: Exploring the Role of KG-Based RAG in Japanese Medical Question Answering with Small-Scale LLMs

Title: ProtFlow: Fast Protein Sequence Design via Flow Matching on Compressed Protein Language Model Embeddings

Title: Seeing like a Cephalopod: Colour Vision with a Monochrome Event Camera

Title: PraNet-V2: Dual-Supervised Reverse Attention for Medical Image Segmentation

Title: Leveraging Vertical Public-Private Split for Improved Synthetic Data Generation

Title: TMCIR: Token Merge Benefits Composed Image Retrieval

Title: ReZero: Enhancing LLM search ability by trying one-more-time

Title: Dynamic Compressing Prompts for Efficient Inference of Large Language Models

Title: MediSee: Reasoning-based Pixel-level Perception in Medical Images

Title: AnimeDL-2M: Million-Scale AI-Generated Anime Image Detection and Localization in Diffusion Era

Title: DRIFT open dataset: A drone-derived intelligence for traffic analysis in urban environment

Title: Easy3D: A Simple Yet Effective Method for 3D Interactive Segmentation

Title: Defending Against Frequency-Based Attacks with Diffusion Models

Title: QAVA: Query-Agnostic Visual Attack to Large Vision-Language Models

Title: LazyReview: A Dataset for Uncovering Lazy Thinking in NLP Peer Reviews

Title: Leveraging LLMs and attention-mechanism for automatic annotation of historical maps

Title: Crane: Context-Guided Prompt Learning and Attention Refinement for Zero-Shot Anomaly Detections

Title: UKDM: Underwater keypoint detection and matching using underwater image enhancement techniques

Title: Improving fingerprint presentation attack detection by an approach integrated into the personal verification stage

Title: Change State Space Models for Remote Sensing Change Detection

Title: FLSSM: A Federated Learning Storage Security Model with Homomorphic Encryption

Title: Vivid4D: Improving 4D Reconstruction from Monocular Video by Video Inpainting

Title: Using LLMs as prompt modifier to avoid biases in AI image generators

Title: Token-Level Constraint Boundary Search for Jailbreaking Text-to-Image Models

Title: KubeFence: Security Hardening of the Kubernetes Attack Surface

Title: Taming Consistency Distillation for Accelerated Human Image Animation

Title: GC-GAT: Multimodal Vehicular Trajectory Prediction using Graph Goal Conditioning and Cross-context Attention

Title: SAR-to-RGB Translation with Latent Diffusion for Earth Observation

Title: TSAL: Few-shot Text Segmentation Based on Attribute Learning

Title: Bypassing Prompt Injection and Jailbreak Detection in LLM Guardrails

Title: MuSeD: A Multimodal Spanish Dataset for Sexism Detection in Social Media Videos

Title: TerraMind: Large-Scale Generative Multimodality for Earth Observation

Title: TerraMesh: A Planetary Mosaic of Multimodal Earth Observation Data

Title: Exploring Backdoor Attack and Defense for LLM-empowered Recommendations

Title: Bias Beyond English: Evaluating Social Bias and Debiasing Methods in a Low-Resource Setting

Title: Benchmarking Next-Generation Reasoning-Focused Large Language Models in Ophthalmology: A Head-to-Head Evaluation on 5,888 Items

Title: R-TPT: Improving Adversarial Robustness of Vision-Language Models through Test-Time Prompt Tuning

Title: Efficient Distributed Retrieval-Augmented Generation for Enhancing Language Model Performance

Title: Video Summarization with Large Language Models

Title: Slice+Slice Baby: Generating Last-Level Cache Eviction Sets in the Blink of an Eye

Title: Diversity-Driven Learning: Tackling Spurious Correlations and Data Heterogeneity in Federated Models

Title: 3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians

Title: CAP-Net: A Unified Network for 6D Pose and Size Estimation of Categorical Articulated Parts from a Single RGB-D Image

Title: Leveraging multimodal explanatory annotations for video interpretation with Modality Specific Dataset

Title: Reconstructing Fine-Grained Network Data using Autoencoder Architectures with Domain Knowledge Penalties

Title: Enhanced Small Target Detection via Multi-Modal Fusion and Attention Mechanisms: A YOLOv5 Approach

Title: DeepSelective: Feature Gating and Representation Matching for Interpretable Clinical Prediction

Title: Distillation-Supervised Convolutional Low-Rank Adaptation for Efficient Image Super-Resolution

Title: From Misleading Queries to Accurate Answers: A Three-Stage Fine-Tuning Method for LLMs

Title: UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer

Title: Automated Python Translation

Title: Autoregressive Distillation of Diffusion Transformers

Title: Context-Aware Palmprint Recognition via a Relative Similarity Metric

Title: Big Brother is Watching: Proactive Deepfake Detection via Learnable Hidden Face

Title: Intelligent driving vehicle front multi-target tracking and detection based on YOLOv5 and point cloud 3D projection

Title: Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory Constraints

Title: PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild

Title: A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

Title: Interpretable Hybrid-Rule Temporal Point Processes

Title: DeepWheel: Generating a 3D Synthetic Wheel Dataset for Design and Performance Evaluation

Title: DataSentinel: A Game-Theoretic Detection of Prompt Injection Attacks

Title: Teaching Large Language Models to Reason through Learning and Forgetting

Title: A Decade of Wheat Mapping for Lebanon

Title: From Gaze to Insight: Bridging Human Visual Attention and Vision Language Model Explanation for Weakly-Supervised Medical Image Segmentation

Title: OpenTuringBench: An Open-Model-based Benchmark and Framework for Machine-Generated Text Detection and Attribution

Title: Cancer-Myth: Evaluating AI Chatbot on Patient Questions with False Presuppositions

Title: RankAlign: A Ranking View of the Generator-Validator Gap in Large Language Models

Title: DataDecide: How to Predict Best Pretraining Data with Small Experiments

Title: Robustness and sex differences in skin cancer detection: logistic regression vs CNNs

Title: Deep Learning-based Bathymetry Retrieval without In-situ Depths using Remote Sensing Imagery and SfM-MVS DSMs with Data Gaps

Title: Leveraging Point Transformers for Detecting Anatomical Landmarks in Digital Dentistry

Title: Reinforcing Compositional Retrieval: Retrieving Step-by-Step for Composing Informative Contexts

Title: ADT: Tuning Diffusion Models with Adversarial Supervision

Title: A Dual-Space Framework for General Knowledge Distillation of Large Language Models

Title: NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors

Title: Improving Statistical Privacy by Subsampling

Title: Masculine Defaults via Gendered Discourse in Podcasts and Large Language Models

Title: Enhancing Out-of-Distribution Detection with Extended Logit Normalization

Title: Mamba-Based Ensemble learning for White Blood Cell Classification

Title: TextArena

Title: Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion

Title: PARTFIELD: Learning 3D Feature Fields for Part Segmentation and Beyond

Title: A Clean Slate for Offline Reinforcement Learning

Title: Elucidating the Design Space of Multimodal Protein Language Models

Title: Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception