2025-03-14

Title: FedMSGL: A Self-Expressive Hypergraph Based Federated Multi-View Learning

Title: Inductive Spatio-Temporal Kriging with Physics-Guided Increment Training Strategy for Air Quality Inference

Title: LLM-PS: Empowering Large Language Models for Time Series Forecasting with Temporal Patterns and Semantics

Title: Týr-the-Pruner: Unlocking Accurate 50% Structural Pruning for LLMs via Global Sparsity Distribution Optimization

Title: Towards Robust Model Evolution with Algorithmic Recourse

Title: Towards Hardware Supported Domain Generalization in DNN-Based Edge Computing Devices for Health Monitoring

Title: CoRe^2: Collect, Reflect and Refine to Generate Better and Faster

Title: Blockchain-Enabled Management Framework for Federated Coalition Networks

Title: Silent Branding Attack: Trigger-free Data Poisoning Attack on Text-to-Image Diffusion Models

Title: Probabilistic Reasoning with LLMs for k-anonymity Estimation

Title: Accelerating Diffusion Sampling via Exploiting Local Transition Coherence

Title: Have LLMs Made Active Learning Obsolete? Surveying the NLP Community

Title: Revisiting semi-supervised learning in the era of foundation models

Title: Revisiting Backdoor Attacks on Time Series Classification in the Frequency Domain

Title: Towards Causal Model-Based Policy Optimization

Title: Finding the Muses: Identifying Coresets through Loss Trajectories

Title: The Pitfalls of Imitation Learning when Actions are Continuous

Title: How Feasible is Augmenting Fake Nodes with Learnable Features as a Counter-strategy against Link Stealing Attacks?

Title: All Your Knowledge Belongs to Us: Stealing Knowledge Graphs via Reasoning APIs

Title: I2V3D: Controllable image-to-video generation with 3D guidance

Title: Enhancing Adversarial Example Detection Through Model Explanation

Title: Unveiling Hidden Pivotal Players with GoalNet: A GNN-Based Soccer Player Evaluation System

Title: Review GIDE -- Restaurant Review Gastrointestinal Illness Detection and Extraction with Large Language Models

Title: Solving Bayesian inverse problems with diffusion priors and off-policy RL

Title: SASNet: Spatially-Adaptive Sinusoidal Neural Networks

Title: Distributionally Robust Multi-Agent Reinforcement Learning for Dynamic Chute Mapping

Title: BiasConnect: Investigating Bias Interactions in Text-to-Image Models

Title: Designing Graph Convolutional Neural Networks for Discrete Choice with Network Effects

Title: Constrained Language Generation with Discrete Diffusion Models

Title: Minimal Time Series Transformer

Title: SeqSAM: Autoregressive Multiple Hypothesis Prediction for Medical Image Segmentation using SAM

Title: Communication-Efficient Language Model Training Scales Reliably and Robustly: Scaling Laws for DiLoCo

Title: Evaluating the Impact of Synthetic Data on Object Detection Tasks in Autonomous Driving

Title: Temporal Difference Flows

Title: Attention Reveals More Than Tokens: Training-Free Long-Context Reasoning with Attention-guided Retrieval

Title: Generative AI for Named Entity Recognition in Low-Resource Language Nepali

Title: Data Traceability for Privacy Alignment

Title: Isolated Channel Vision Transformers: From Single-Channel Pretraining to Multi-Channel Finetuning

Title: Resolution Invariant Autoencoder

Title: Exploring Position Encoding in Diffusion U-Net for Training-free High-resolution Image Generation

Title: A Comprehensive Review on Understanding the Decentralized and Collaborative Approach in Machine Learning

Title: Who Are You Behind the Screen? Implicit MBTI and Gender Detection Using Artificial Intelligence

Title: Foundation X: Integrating Classification, Localization, and Segmentation through Lock-Release Pretraining Strategy for Chest X-ray Analysis

Title: EquiPy: Sequential Fairness using Optimal Transport in Python

Title: LuciBot: Automated Robot Policy Learning from Generated Videos

Title: FDCT: Frequency-Aware Decomposition and Cross-Modal Token-Alignment for Multi-Sensor Target Classification

Title: CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation

Title: Tracking the Best Expert Privately

Title: What's In Your Field? Mapping Scientific Research with Knowledge Graphs and Large Language Models

Title: A Semantic-Loss Function Modeling Framework With Task-Oriented Machine Learning Perspectives

Title: eXpLogic: Explaining Logic Types and Patterns in DiffLogic Networks

Title: Inter-environmental world modeling for continuous and compositional dynamics

Title: PluralLLM: Pluralistic Alignment in LLMs via Federated Learning

Title: VideoMerge: Towards Training-free Long Video Generation

Title: Emotion Recognition with CLIP and Sequential Learning

Title: PanoGen++: Domain-Adapted Text-Guided Panoramic Environment Generation for Vision-and-Language Navigation

Title: A Chaotic Image Encryption Scheme Using Novel Geometric Block Permutation and Dynamic Substitution

Title: TGP: Two-modal occupancy prediction with 3D Gaussian and sparse points for 3D Environment Awareness

Title: Cosh-DiT: Co-Speech Gesture Video Synthesis via Hybrid Audio-Visual Diffusion Transformers

Title: Identifying Trustworthiness Challenges in Deep Learning Models for Continental-Scale Water Quality Prediction

Title: UVE: Are MLLMs Unified Evaluators for AI-Generated Videos?

Title: Target-aware Bidirectional Fusion Transformer for Aerial Object Tracking

Title: X-Cross: Image Encryption Featuring Novel Dual-Layer Block Permutation and Dynamic Substitution Techniques

Title: Exploring Mutual Empowerment Between Wireless Networks and RL-based LLMs: A Survey

Title: Take Off the Training Wheels Progressive In-Context Learning for Effective Alignment

Title: Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identification

Title: ExtremeAIGC: Benchmarking LMM Vulnerability to AI-Generated Extremist Content

Title: Detecting Dataset Bias in Medical AI: A Generalized and Modality-Agnostic Auditing Framework

Title: Uncertainty-aware Long-tailed Weights Model the Utility of Pseudo-labels for Semi-supervised Learning

Title: From Equations to Insights: Unraveling Symbolic Structures in PDEs with LLMs

Title: Channel-wise Noise Scheduled Diffusion for Inverse Rendering in Indoor Scenes

Title: TIME: Temporal-sensitive Multi-dimensional Instruction Tuning and Benchmarking for Video-LLMs

Title: MetricGrids: Arbitrary Nonlinear Approximation with Elementary Metric Grids based Implicit Neural Representation

Title: One-Shot Federated Unsupervised Domain Adaptation with Scaled Entropy Attention and Multi-Source Smoothed Pseudo Labeling

Title: Using Context to Improve Word Segmentation

Title: Investigating and Improving Counter-Stereotypical Action Relation in Text-to-Image Diffusion Models

Title: How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in An Extensible Escape Game

Title: FourierSR: A Fourier Token-based Plugin for Efficient Image Super-Resolution

Title: Enhancing Multi-Agent Systems via Reinforcement Learning with LLM-based Planner and Graph-based Policy

Title: Deep Learning Approaches for Anti-Money Laundering on Mobile Transactions: Review, Framework, and Directions

Title: Provably Secure Covert Messaging Using Image-based Diffusion Processes

Title: Demoting Security via Exploitation of Cache Demote Operation in Intel's Latest ISA Extension

Title: Image Quality Assessment: From Human to Machine Preference

Title: Information Density Principle for MLLM Benchmarks

Title: AdvPaint: Protecting Images from Inpainting Manipulation via Adversarial Attention Disruption

Title: Why Does Your CoT Prompt (Not) Work? Theoretical Analysis of Prompt Space Complexity, its Interaction with Answer Space During CoT Reasoning with LLMs: A Recurrent Perspective

Title: Enhanced Route Planning with Calibrated Uncertainty Set

Title: Representation-based Reward Modeling for Efficient Safety Alignment of Large Language Model

Title: Cognitive-Mental-LLM: Leveraging Reasoning in Large Language Models for Mental Health Prediction via Online Text

Title: Semantic Latent Motion for Portrait Video Generation

Title: SOLA-GCL: Subgraph-Oriented Learnable Augmentation Method for Graph Contrastive Learning

Title: Improving Diffusion-based Inverse Algorithms under Few-Step Constraint via Learnable Linear Extrapolation

Title: Mamba-VA: A Mamba-based Approach for Continuous Emotion Recognition in Valence-Arousal Space

Title: MoEdit: On Learning Quantity Perception for Multi-object Image Editing

Title: Hybrid Agents for Image Restoration

Title: Proxy-Tuning: Tailoring Multimodal Autoregressive Models for Subject-Driven Image Generation

Title: PlanGen: Towards Unified Layout Planning and Image Generation in Auto-Regressive Vision Language Models

Title: Deep Learning-Based Direct Leaf Area Estimation using Two RGBD Datasets for Model Development

Title: Gumiho: A Hybrid Architecture to Prioritize Early Tokens in Speculative Decoding

Title: Unlocking Generalization Power in LiDAR Point Cloud Registration

Title: Retrieval-Augmented Generation with Hierarchical Knowledge

Title: "Well, Keep Thinking": Enhancing LLM Reasoning with Adaptive Injection Decoding

Title: Verifiable, Efficient and Confidentiality-Preserving Graph Search with Transparency

Title: PRISM: Preference Refinement via Implicit Scene Modeling for 3D Vision-Language Preference-Based Reinforcement Learning

Title: Robustness Tokens: Towards Adversarial Robustness of Transformers

Title: ST-FlowNet: An Efficient Spiking Neural Network for Event-Based Optical Flow Estimation

Title: Deep Learning for Time Series Forecasting: A Survey

Title: LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents

Title: Efficient Implementation of CRYSTALS-KYBER Key Encapsulation Mechanism on ESP32

Title: Adaptive Inner Speech-Text Alignment for LLM-based Speech Translation

Title: MouseGPT: A Large-scale Vision-Language Model for Mouse Behavior Analysis

Title: CoStoDet-DDPM: Collaborative Training of Stochastic and Deterministic Models Improves Surgical Workflow Anticipation and Recognition

Title: Efficient Federated Fine-Tuning of Large Language Models with Layer Dropout

Title: Moss: Proxy Model-based Full-Weight Aggregation in Federated Learning with Heterogeneous Models

Title: Probability-Flow ODE in Infinite-Dimensional Function Spaces

Title: Unveiling the Invisible: Reasoning Complex Occlusions Amodally with AURA

Title: Policy Teaching via Data Poisoning in Learning from Human Preferences

Title: R.U.Psycho? Robust Unified Psychometric Testing of Language Models

Title: Post Quantum Migration of Tor

Title: I Can Tell Your Secrets: Inferring Privacy Attributes from Mini-app Interaction History in Super-apps

Title: MinorBench: A hand-built benchmark for content-based risks for children

Title: Interpretable Image Classification via Non-parametric Part Prototype Learning

Title: SVIP: Semantically Contextualized Visual Patches for Zero-Shot Learning

Title: PIMRL: Physics-Informed Multi-Scale Recurrent Learning for Spatiotemporal Prediction

Title: An Open-RAN Testbed for Detecting and Mitigating Radio-Access Anomalies

Title: ROODI: Reconstructing Occluded Objects with Denoising Inpainters

Title: AMR-Transformer: Enabling Efficient Long-range Interaction for Complex Neural Fluid Simulation

Title: A Multi-Modal Federated Learning Framework for Remote Sensing Image Classification

Title: An Expanded Massive Multilingual Dataset for High-Performance Language Technologies

Title: Targeted Data Poisoning for Black-Box Audio Datasets Ownership Verification

Title: HyperArm Bandit Optimization: A Novel approach to Hyperparameter Optimization and an Analysis of Bandit Algorithms in Stochastic and Adversarial Settings

Title: VicaSplat: A Single Run is All You Need for 3D Gaussian Splatting and Camera Estimation from Unposed Video Frames

Title: MaterialMVP: Illumination-Invariant Material Generation via Multi-view PBR Diffusion

Title: VisualPRM: An Effective Process Reward Model for Multimodal Reasoning

Title: Proceedings of the ISCA/ITG Workshop on Diversity in Large Speech and Language Models

Title: Eye on the Target: Eye Tracking Meets Rodent Tracking

Title: IDEA: Inverted Text with Cooperative Deformable Aggregation for Multi-modal Object Re-Identification

Title: OSMa-Bench: Evaluating Open Semantic Mapping Under Varying Lighting Conditions

Title: Generative Binary Memory: Pseudo-Replay Class-Incremental Learning on Binarized Embeddings

Title: KV-Distill: Nearly Lossless Learnable Context Compression for LLMs

Title: DreamInsert: Zero-Shot Image-to-Video Object Insertion from A Single Image

Title: Enhancing Facial Privacy Protection via Weakening Diffusion Purification

Title: New Trends for Modern Machine Translation with Large Reasoning Models

Title: A Hybrid Architecture with Efficient Fine Tuning for Abstractive Patent Document Summarization

Title: ConceptGuard: Continual Personalized Text-to-Image Generation with Forgetting and Confusion Mitigation

Title: Piece it Together: Part-Based Concepting with IP-Priors

Title: G-Boost: Boosting Private SLMs with General LLMs

Title: Probabilistic Forecasting via Autoregressive Flow Matching

Title: CINEMA: Coherent Multi-Subject Video Generation via MLLM-Based Guidance

Title: RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing

Title: Hyper3D: Efficient 3D Representation via Hybrid Triplane and Octree Feature for Enhanced 3D Shape Variational Auto-Encoders

Title: RealGeneral: Unifying Visual Generation via Temporal In-Context Learning with Video Models

Title: Understanding the Logical Capabilities of Large Language Models via Out-of-Context Representation Learning

Title: Public Channel-Based Fair Exchange Protocols with Advertising

Title: dFLMoE: Decentralized Federated Learning via Mixture of Experts for Medical Data Analysis

Title: Category Prompt Mamba Network for Nuclei Segmentation and Classification

Title: Improving Medical Waste Classification with Hybrid Capsule Networks

Title: BeamLLM: Vision-Empowered mmWave Beam Prediction with Large Language Models

Title: 4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models

Title: DynaCode: A Dynamic Complexity-Aware Code Benchmark for Evaluating Large Language Models in Code Generation

Title: Sentiment Analysis in SemEval: A Review of Sentiment Identification Approaches

Title: LLMs in Disease Diagnosis: A Comparative Study of DeepSeek-R1 and O3 Mini Across Chronic Health Conditions

Title: Streaming Generation of Co-Speech Gestures via Accelerated Rolling Diffusion

Title: Source-primed Multi-turn Conversation Helps Large Language Models Translate Documents

Title: MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation

Title: OmniSTVG: Toward Spatio-Temporal Omni-Object Video Grounding

Title: TokenCarve: Information-Preserving Visual Token Compression in Multimodal Large Language Models

Title: Hoi2Anomaly: An Explainable Anomaly Detection Approach Guided by Human-Object Interaction

Title: SySLLM: Generating Synthesized Policy Summaries for Reinforcement Learning Agents Using Large Language Models

Title: Conformal Prediction Sets for Deep Generative Models via Reduction to Conformal Regression

Title: Probing LLMs for Multilingual Discourse Generalization Through a Unified Label Set

Title: CountPath: Automating Fragment Counting in Digital Pathology

Title: NeighborRetr: Balancing Hub Centrality in Cross-Modal Retrieval

Title: PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models

Title: Lightweight Models for Emotional Analysis in Video

Title: The Impact of Item-Writing Flaws on Difficulty and Discrimination in Item Response Theory

Title: DP-GPL: Differentially Private Graph Prompt Learning

Title: MASQUE: A Text-Guided Diffusion-Based Framework for Localized and Customized Adversarial Makeup

Title: ASIDE: Architectural Separation of Instructions and Data in Language Models

Title: FedPCA: Noise-Robust Fair Federated Learning via Performance-Capacity Analysis

Title: Radar: Fast Long-Context Decoding for Any Transformer

Title: Unveiling the Mathematical Reasoning in DeepSeek Models: A Comparative Study of Large Language Models

Title: VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search

Title: Unlock the Power of Unlabeled Data in Language Driving Model

Title: The Spectral Bias of Shallow Neural Network Learning is Shaped by the Choice of Non-linearity

Title: Long Context Tuning for Video Generation

Title: CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models

Title: GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding

Title: TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention

Title: Dual-Stage Cross-Modal Network with Dynamic Feature Fusion for Emotional Mimicry Intensity Estimation

Title: MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction

Title: OCCUQ: Exploring Efficient Uncertainty Quantification for 3D Occupancy Prediction

Title: CoSTA$\ast$: Cost-Sensitive Toolpath Agent for Multi-turn Image Editing

Title: ConsisLoRA: Enhancing Content and Style Consistency for LoRA-based Style Transfer

Title: R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization

Title: OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer

Title: Compositional Subspace Representation Fine-tuning for Adaptive Large Language Models

Title: DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation

Title: From TOWER to SPIRE: Adding the Speech Modality to a Text-Only LLM

Title: DriveLMM-o1: A Step-by-Step Reasoning Dataset and Large Multimodal Model for Driving Scenario Understanding

Title: Transformers without Normalization

Title: LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds

Title: NIL: No-data Imitation Learning by Leveraging Pre-trained Video Diffusion Models

Title: Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathology

Title: UniGoal: Towards Universal Zero-shot Goal-oriented Navigation

Title: HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model

Title: Kolmogorov-Arnold Attention: Is Learnable Attention Better For Vision Transformers?

Title: V2Edit: Versatile Video Diffusion Editor for Videos and 3D Scenes

Title: A Frustratingly Simple Yet Highly Effective Attack Baseline: Over 90% Success Rate Against the Strong Black-box Models of GPT-4.5/4o/o1

Title: Studying Classifier(-Free) Guidance From a Classifier-Centric Perspective

Title: GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing