2025-03-12

Title: Psychological Counseling Ability of Large Language Models

Title: FourierNAT: A Fourier-Mixing-Based Non-Autoregressive Transformer for Parallel Sequence Generation

Title: Cross-modal Causal Relation Alignment for Video Question Grounding

Title: Is Pre-training Applicable to the Decoder for Dense Prediction?

Title: Mixture of Experts Made Intrinsically Interpretable

Title: BrainNet-MoE: Brain-Inspired Mixture-of-Experts Learning for Neurological Disease Identification

Title: ConstellationNet: Reinventing Spatial Clustering through GNNs

Title: BicliqueEncoder: An Efficient Method for Link Prediction in Bipartite Networks using Formal Concept Analysis and Transformer Encoder

Title: On the Importance of Clearsky Model in Short-Term Solar Radiation Forecasting

Title: The day-ahead scenario generation method for new energy based on an improved conditional generative diffusion model

Title: TS-RAG: Retrieval-Augmented Generation based Time Series Foundation Models are Stronger Zero-Shot Forecaster

Title: MergeQuant: Accurate 4-bit Static Quantization of Large Language Models by Channel-wise Calibration

Title: GraphT5: Unified Molecular Graph-Language Modeling via Multi-Modal Cross-Token Attention

Title: DriveTransformer: Unified Transformer for Scalable End-to-End Autonomous Driving

Title: SplitQuantV2: Enhancing Low-Bit Quantization of LLMs Without GPUs

Title: Disrupting Model Merging: A Parameter-Level Defense Without Sacrificing Accuracy

Title: Merge then Realign: Simple and Effective Modality-Incremental Continual Learning for Multimodal LLMs

Title: WECAR: An End-Edge Collaborative Inference and Training Framework for WiFi-Based Continuous Human Activity Recognition

Title: TVNet: A Novel Time Series Analysis Method Based on Dynamic Convolution and 3D-Variation

Title: PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity

Title: Hierarchical Balance Packing: Towards Efficient Supervised Fine-tuning for Long-Context LLM

Title: A Time Series Multitask Framework Integrating a Large Language Model, Pre-Trained Time Series Model, and Knowledge Graph

Title: Fair Text Classification via Transferable Representations

Title: PoisonedParrot: Subtle Data Poisoning Attacks to Elicit Copyright-Infringing Content from Large Language Models

Title: Graphint: Graph-based Time Series Clustering Visualisation Tool

Title: RayFlow: Instance-Aware Diffusion Acceleration via Adaptive Flow Trajectories

Title: Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model

Title: SIRE: SE(3) Intrinsic Rigidity Embeddings

Title: SANDRO: a Robust Solver with a Splitting Strategy for Point Cloud Registration

Title: SegResMamba: An Efficient Architecture for 3D Medical Image Segmentation

Title: Better Pose Initialization for Fast and Robust 2D/3D Pelvis Registration

Title: NimbleReg: A light-weight deep-learning framework for diffeomorphic image registration

Title: Evaluating LLaMA 3.2 for Software Vulnerability Detection

Title: Sublinear Algorithms for Wasserstein and Total Variation Distances: Applications to Fairness and Privacy Auditing

Title: Joint Explainability-Performance Optimization With Surrogate Models for AI-Driven Edge Services

Title: On the Semantic Security of NTRU -- with a gentle introduction to cryptography

Title: Self-supervised Normality Learning and Divergence Vector-guided Model Merging for Zero-shot Congenital Heart Disease Detection in Fetal Ultrasound Videos

Title: Towards Large Language Models that Benefit for All: Benchmarking Group Fairness in Reward Models

Title: Training Domain Draft Models for Speculative Decoding: Best Practices and Insights

Title: AgriField3D: A Curated 3D Point Cloud and Procedural Model Dataset of Field-Grown Maize from a Diversity Panel

Title: Group Fairness in Multi-Task Reinforcement Learning

Title: Strengthening the Internal Adversarial Robustness in Lifted Neural Networks

Title: Magnet: Multi-turn Tool-use Data Synthesis and Distillation via Graph Translation

Title: Modern Models, Medieval Texts: A POS Tagging Study of Old Occitan

Title: Fixing the RANSAC Stopping Criterion

Title: HalluVerse25: Fine-grained Multilingual Benchmark Dataset for LLM Hallucinations

Title: TwinTURBO: Semi-Supervised Fine-Tuning of Foundation Models via Mutual Information Decompositions for Downstream Task and Latent Spaces

Title: Learning and Evaluating Hierarchical Feature Representations

Title: Blind Video Super-Resolution based on Implicit Kernels

Title: Efficient Resource Management for Secure and Low-Latency O-RAN Communication

Title: Right Reward Right Time for Federated Learning

Title: MapQA: Open-domain Geospatial Question Answering on Map Data

Title: Topology-Preserving Loss for Accurate and Anatomically Consistent Cardiac Mesh Reconstruction

Title: Measuring directional bias amplification in image captions using predictability

Title: Datasets, Documents, and Repetitions: The Practicalities of Unequal Data Quality

Title: ReLATE: Resilient Learner Selection for Multivariate Time-Series Classification Against Adversarial Attacks

Title: Can Generative Geospatial Diffusion Models Excel as Discriminative Geospatial Foundation Models?

Title: Gemini Embedding: Generalizable Embeddings from Gemini

Title: Can Memory-Augmented Language Models Generalize on Reasoning-in-a-Haystack Tasks?

Title: Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning

Title: FunGraph: Functionality Aware 3D Scene Graphs for Language-Prompted Scene Interaction

Title: Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Title: From Slices to Sequences: Autoregressive Tracking Transformer for Cohesive and Consistent 3D Lymph Node Detection in CT Scans

Title: CAD-VAE: Leveraging Correlation-Aware Latents for Comprehensive Fair Disentanglement

Title: STRMs: Spatial Temporal Reasoning Models for Vision-Based Localization Rivaling GPS Precision

Title: BUFFER-X: Towards Zero-Shot Point Cloud Registration in Diverse Scenes

Title: Enhancing Sentiment Analysis through Multimodal Fusion: A BERT-DINOv2 Approach

Title: Text-RGBT Person Retrieval: Multilevel Global-Local Cross-Modal Alignment and A High-quality Benchmark

Title: EFPC: Towards Efficient and Flexible Prompt Compression

Title: Pre-trained Models Succeed in Medical Imaging with Representation Similarity Degradation

Title: Recent Advances in Hypergraph Neural Networks

Title: LabelCoRank: Revolutionizing Long Tail Multi-Label Classification with Co-Occurrence Reranking

Title: Detecting Backdoor Attacks in Federated Learning via Direction Alignment Inspection

Title: Regulatory DNA sequence Design with Reinforcement Learning

Title: DiffEGG: Diffusion-Driven Edge Generation as a Pixel-Annotation-Free Alternative for Instance Annotation

Title: CDI3D: Cross-guided Dense-view Interpolation for 3D Reconstruction

Title: A Survey on Wi-Fi Sensing Generalizability: Taxonomy, Techniques, Datasets, and Future Research Prospects

Title: Exploring Bias in over 100 Text-to-Image Generative Models

Title: GPT-PPG: A GPT-based Foundation Model for Photoplethysmography Signals

Title: Partial differential equation system for binarization of degraded document images

Title: Multi-Cue Adaptive Visual Token Pruning for Large Vision-Language Models

Title: In Prospect and Retrospect: Reflective Memory Management for Long-term Personalized Dialogue Agents

Title: Learning to Search Effective Example Sequences for In-Context Learning

Title: HOFAR: High-Order Augmentation of Flow Autoregressive Transformers

Title: Group Preference Alignment: Customized LLM Response Generation from In-Situ Conversations

Title: Generalized Kullback-Leibler Divergence Loss

Title: Accurate INT8 Training Through Dynamic Block-Level Fallback

Title: Structural and Statistical Texture Knowledge Distillation and Learning for Segmentation

Title: Adapting Large Language Models for Parameter-Efficient Log Anomaly Detection

Title: SphOR: A Representation Learning Perspective on Open-set Recognition for Identifying Unknown Classes in Deep Learning Models

Title: "We just did not have that on the embedded system": Insights and Challenges for Securing Microcontroller Systems from the Embedded CTF Competitions

Title: Unmasking the Unknown: Facial Deepfake Detection in the Open-Set Paradigm

Title: Odysseus Navigates the Sirens' Song: Dynamic Focus Decoding for Factual and Diverse Open-Ended Text Generation

Title: Symbolic Neural Ordinary Differential Equations

Title: Context-aware Biases for Length Extrapolation

Title: Seeing Beyond Haze: Generative Nighttime Image Dehazing

Title: Trend-Aware Supervision: On Learning Invariance for Semi-Supervised Facial Action Unit Intensity Estimation

Title: Advancing Sentiment Analysis: A Novel LSTM Framework with Multi-head Attention

Title: Degradation Self-Supervised Learning for Lithium-ion Battery Health Diagnostics

Title: PRISM: Privacy-Preserving Improved Stochastic Masking for Federated Generative Models

Title: SparseVoxFormer: Sparse Voxel-based Transformer for Multi-modal 3D Object Detection

Title: MVGSR: Multi-View Consistency Gaussian Splatting for Robust Surface Reconstruction

Title: MegaSR: Mining Customized Semantics and Expressive Guidance for Image Super-Resolution

Title: Whoever Started the Interference Should End It: Guiding Data-Free Model Merging via Task Vectors

Title: Accelerate 3D Object Detection Models via Zero-Shot Attention Key Pruning

Title: ACE: Concept Editing in Diffusion Models without Performance Degradation

Title: Convergence Dynamics and Stabilization Strategies of Co-Evolving Generative Models

Title: Uni$\textbf{F}^2$ace: Fine-grained Face Understanding and Generation with Unified Multimodal Models

Title: AG-VPReID: A Challenging Large-Scale Benchmark for Aerial-Ground Video-based Person Re-Identification

Title: Toward Stable World Models: Measuring and Addressing World Instability in Generative Environments

Title: Large Scale Multi-Task Bayesian Optimization with Large Language Models

Title: MGHanD: Multi-modal Guidance for authentic Hand Diffusion

Title: ArticulatedGS: Self-supervised Digital Twin Modeling of Articulated Objects using 3D Gaussian Splatting

Title: FlowDPS: Flow-Driven Posterior Sampling for Inverse Problems

Title: HOTFormerLoc: Hierarchical Octree Transformer for Versatile Lidar Place Recognition Across Ground and Aerial Views

Title: Scaling Probabilistic Circuits via Data Partitioning

Title: Bring Remote Sensing Object Detect Into Nature Language Model: Using SFT Method

Title: FilmComposer: LLM-Driven Music Production for Silent Film Clips

Title: Few-Shot Class-Incremental Model Attribution Using Learnable Representation From CLIP-ViT Features

Title: Domain Adaptation and Entanglement: an Optimal Transport Perspective

Title: Towards Large-scale Chemical Reaction Image Parsing via a Multimodal Large Language Model

Title: U-StyDiT: Ultra-high Quality Artistic Style Transfer Using Diffusion Transformers

Title: Concept-Driven Deep Learning for Enhanced Protein-Specific Molecular Generation

Title: OASIS: Order-Augmented Strategy for Improved Code Search

Title: XAI4Extremes: An interpretable machine learning framework for understanding extreme-weather precursors under climate change

Title: Multimodal Generation of Animatable 3D Human Models with AvatarForge

Title: TSCnet: A Text-driven Semantic-level Controllable Framework for Customized Low-Light Image Enhancement

Title: CQVPR: Landmark-aware Contextual Queries for Visual Place Recognition

Title: Towards All-in-One Medical Image Re-Identification

Title: Towards Synthesized and Editable Motion In-Betweening Through Part-Wise Phase Representation

Title: RigoChat 2: an adapted language model to Spanish using a bounded dataset and reduced hardware

Title: Automating Violence Detection and Categorization from Ancient Texts

Title: Dialogue Injection Attack: Jailbreaking LLMs through Context Manipulation

Title: A Cascading Cooperative Multi-agent Framework for On-ramp Merging Control Integrating Large Language Models

Title: Route Sparse Autoencoder to Interpret Large Language Models

Title: DeepRAG: Building a Custom Hindi Embedding Model for Retrieval Augmented Generation from Scratch

Title: MVD-HuGaS: Human Gaussians from a Single Image via 3D Human Multi-view Diffusion Prior

Title: CL-MVSNet: Unsupervised Multi-view Stereo with Dual-level Contrastive Learning

Title: EgoBlind: Towards Egocentric Visual Assistance for the Blind People

Title: A Grey-box Text Attack Framework using Explainable AI

Title: Modeling Variants of Prompts for Vision-Language Models

Title: EnergyFormer: Energy Attention with Fourier Embedding for Hyperspectral Image Classification

Title: Tangentially Aligned Integrated Gradients for User-Friendly Explanations

Title: Aligning Text to Image in Diffusion Models is Easier Than You Think

Title: SARA: Structural and Adversarial Representation Alignment for Training-efficient Diffusion Models

Title: SoK: A cloudy view on trust relationships of CVMs -- How Confidential Virtual Machines are falling short in Public Cloud

Title: DexGrasp Anything: Towards Universal Robotic Dexterous Grasping with Physics Awareness

Title: Adv-CPG: A Customized Portrait Generation Framework with Facial Adversarial Attacks

Title: LangTime: A Language-Guided Unified Model for Time Series Forecasting with Proximal Policy Optimization

Title: PromptLNet: Region-Adaptive Aesthetic Enhancement via Prompt Guidance in Low-Light Enhancement Net

Title: OminiControl2: Efficient Conditioning for Diffusion Transformers

Title: Neural cyberattacks applied to the vision under realistic visual stimuli

Title: SegDesicNet: Lightweight Semantic Segmentation in Remote Sensing with Geo-Coordinate Embeddings for Domain Adaptation

Title: Large Language Models for Outpatient Referral: Problem Definition, Benchmarking and Challenges

Title: A systematic literature review of unsupervised learning algorithms for anomalous traffic detection based on flows

Title: D3PO: Preference-Based Alignment of Discrete Diffusion Models

Title: Privacy for Free: Leveraging Local Differential Privacy Perturbed Data from Multiple Services

Title: Large Language Model as Meta-Surrogate for Data-Driven Many-Task Optimization: A Proof-of-Principle Study

Title: $^R$FLAV: Rolling Flow matching for infinite Audio Video generation

Title: i-WiViG: Interpretable Window Vision GNN

Title: Evaluating Interpretable Reinforcement Learning by Distilling Policies into Programs

Title: Towards Scalable and Cross-Lingual Specialist Language Models for Oncology

Title: Prototype-based Heterogeneous Federated Learning for Blade Icing Detection in Wind Turbines with Class Imbalanced Data

Title: MFRS: A Multi-Frequency Reference Series Approach to Scalable and Accurate Time-Series Forecasting

Title: MINT-Demo: Membership Inference Test Demonstrator

Title: Prompt2LVideos: Exploring Prompts for Understanding Long-Form Multimodal Videos

Title: Diffusion Transformer Meets Random Masks: An Advanced PET Reconstruction Framework

Title: Attention Reallocation: Towards Zero-cost and Controllable Hallucination Mitigation of MLLMs

Title: DIV-FF: Dynamic Image-Video Feature Fields For Environment Understanding in Egocentric Videos

Title: Pathology-Aware Adaptive Watermarking for Text-Driven Medical Image Synthesis

Title: Design and Implementation of FourCropNet: A CNN-Based System for Efficient Multi-Crop Disease Detection and Management

Title: Robust Latent Matters: Boosting Image Generation with Sampling Error

Title: Debiased Prompt Tuning in Vision-Language Model without Annotations

Title: nnInteractive: Redefining 3D Promptable Segmentation

Title: Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens

Title: Twinner: Shining Light on Digital Twins in a Few Snaps

Title: Recognition-Synergistic Scene Text Editing

Title: V-Max: Making RL practical for Autonomous Driving

Title: DyArtbank: Diverse Artistic Style Transfer via Pre-trained Stable Diffusion and Dynamic Style Prompt Artbank

Title: OpenRAG: Optimizing RAG End-to-End via In-Context Retrieval Learning

Title: Fact-checking with Generative AI: A Systematic Cross-Topic Examination of LLMs Capacity to Detect Veracity of Political Information

Title: WildSeg3D: Segment Any 3D Objects in the Wild from 2D Images

Title: Generalizable and Explainable Deep Learning for Medical Image Computing: An Overview

Title: Using Powerful Prior Knowledge of Diffusion Model in Deep Unfolding Networks for Image Compressive Sensing

Title: Controlling Latent Diffusion Using Latent CLIP

Title: TrackOcc: Camera-based 4D Panoptic Occupancy Tracking

Title: NullFace: Training-Free Localized Face Anonymization

Title: Generalizable AI-Generated Image Detection Based on Fractal Self-Similarity in the Spectrum

Title: A Triple-Inertial Accelerated Alternating Optimization Method for Deep Learning Training

Title: Enhancing Multi-Hop Fact Verification with Structured Knowledge-Augmented Large Language Models

Title: Learning to Match Unpaired Data with Minimum Entropy Coupling

Title: CFNet: Optimizing Remote Sensing Change Detection through Content-Aware Enhancement

Title: ReviewAgents: Bridging the Gap Between Human and AI-Generated Paper Reviews

Title: Referring to Any Person

Title: DISTINGUISH Workflow: A New Paradigm of Dynamic Well Placement Using Generative Machine Learning

Title: SAS: Segment Any 3D Scene with Integrated 2D Priors

Title: Segmentation-Guided CT Synthesis with Pixel-Wise Conformal Uncertainty Bounds

Title: High-Quality 3D Head Reconstruction from Any Single Portrait Image

Title: Position-Aware Depth Decay Decoding ($D^3$): Boosting Large Language Model Inference Efficiency

Title: GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training

Title: ChromaFormer: A Scalable and Accurate Transformer Architecture for Land Cover Classification

Title: DAFE: LLM-Based Evaluation Through Dynamic Arbitration for Free-Form Question-Answering

Title: Transferring Extreme Subword Style Using Ngram Model-Based Logit Scaling

Title: An Analysis of Safety Guarantees in Multi-Task Bayesian Optimization

Title: ComicsPAP: understanding comic strips by picking the correct panel

Title: DeepReview: Improving LLM-based Paper Review with Human-like Deep Thinking Process

Title: Modular Customization of Diffusion Models via Blockwise-Parameterized Low-Rank Adaptation

Title: RAG-Adapter: A Plug-and-Play RAG-enhanced Framework for Long Video Understanding

Title: MsaMIL-Net: An End-to-End Multi-Scale Aware Multiple Instance Learning Network for Efficient Whole Slide Image Classification

Title: HierarQ: Task-Aware Hierarchical Q-Former for Enhanced Video Understanding

Title: BiasEdit: Debiasing Stereotyped Language Models via Model Editing

Title: 3D Point Cloud Generation via Autoregressive Up-sampling

Title: NSF-SciFy: Mining the NSF Awards Database for Scientific Claims

Title: LiSu: A Dataset and Method for LiDAR Surface Normal Estimation

Title: CellStyle: Improved Zero-Shot Cell Segmentation via Style Transfer

Title: Tuning-Free Multi-Event Long Video Generation via Synchronized Coupled Sampling

Title: LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization

Title: SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories

Title: Secret-Key Generation from Private Identifiers under Channel Uncertainty

Title: How Does Overparameterization Affect Machine Unlearning of Deep Neural Networks?

Title: Birds look like cars: Adversarial analysis of intrinsically interpretable deep learning

Title: Coefficient-to-Basis Network: A Fine-Tunable Operator Learning Framework for Inverse Problems with Adaptive Discretizations and Theoretical Guarantees

Title: MF-VITON: High-Fidelity Mask-Free Virtual Try-On with Minimal Input

Title: Extra Clients at No Extra Cost: Overcome Data Heterogeneity in Federated Learning with Filter Decomposition

Title: Exploring the Word Sense Disambiguation Capabilities of Large Language Models

Title: MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention

Title: REGEN: Learning Compact Video Embedding with (Re-)Generative Decoder

Title: Language-Depth Navigated Thermal and Visible Image Fusion

Title: OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting

Title: Self-Taught Self-Correction for Small Language Models

Title: "Principal Components" Enable A New Language of Images

Title: OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models