2025-03-11

Title: What I cannot execute, I do not understand: Training and Evaluating LLMs on Program Execution Traces

Title: What Are They Filtering Out? A Survey of Filtering Strategies for Harm Reduction in Pretraining Datasets

Title: CSTRL: Context-Driven Sequential Transfer Learning for Abstractive Radiology Report Summarization

Title: Uncertainty-Aware Fusion: An Ensemble Framework for Mitigating Hallucinations in Large Language Models

Title: Geometric Properties and Graph-Based Optimization of Neural Networks: Addressing Non-Linearity, Dimensionality, and Scalability

Title: Graph Masked Language Models

Title: Evaluation of Missing Data Imputation for Time Series Without Ground Truth

Title: FAA-CLIP: Federated Adversarial Adaptation of CLIP

Title: Medical Hallucinations in Foundation Models and Their Impact on Healthcare

Title: DreamNet: A Multimodal Framework for Semantic and Emotional Analysis of Sleep Narratives

Title: FedMentalCare: Towards Privacy-Preserving Fine-Tuned LLMs to Analyze Mental Health Status Using Federated Learning Framework

Title: Emergent Abilities in Large Language Models: A Survey

Title: EXALT: EXplainable ALgorithmic Tools for Optimization Problems

Title: CBW: Towards Dataset Ownership Verification for Speaker Verification via Clustering-based Backdoor Watermarking

Title: How Do Consumers Really Choose: Exposing Hidden Preferences with the Mixture of Experts Model

Title: Federated Learning Framework via Distributed Mutual Learning

Title: Multi-agent Auto-Bidding with Latent Graph Diffusion Models

Title: A Transformer Model for Predicting Chemical Reaction Products from Generic Templates

Title: Randomized based restricted kernel machine for hyperspectral image classification

Title: Enhancing AUTOSAR-Based Firmware Over-the-Air Updates in the Automotive Industry with a Practical Implementation on a Steering System

Title: Slim attention: cut your context memory in half without loss of accuracy -- K-cache is all you need for MHA

Title: Extracting and Emulsifying Cultural Explanation to Improve Multilingual Capability of LLMs

Title: Encrypted Vector Similarity Computations Using Partially Homomorphic Encryption: Applications and Performance Analysis

Title: This Is Your Doge, If It Please You: Exploring Deception and Robustness in Mixture of LLMs

Title: QG-SMS: Enhancing Test Item Analysis via Student Modeling and Simulation

Title: MastermindEval: A Simple But Scalable Reasoning Benchmark

Title: Zero-shot Medical Event Prediction Using a Generative Pre-trained Transformer on Electronic Health Records

Title: IDEA Prune: An Integrated Enlarge-and-Prune Pipeline in Generative Language Model Pretraining

Title: DETQUS: Decomposition-Enhanced Transformers for QUery-focused Summarization

Title: CASP: Compression of Large Multimodal Models Based on Attention Sparsity

Title: Bayesian Fields: Task-driven Open-Set Semantic Gaussian Splatting

Title: A Survey on Tabular Data Generation: Utility, Alignment, Fidelity, Privacy, and Beyond

Title: SANDWiCH: Semantical Analysis of Neighbours for Disambiguating Words in Context ad Hoc

Title: Validating LLM-as-a-Judge Systems in the Absence of Gold Labels

Title: Generative Multi-Agent Q-Learning for Policy Optimization: Decentralized Wireless Networks

Title: A Real-time Multimodal Transformer Neural Network-powered Wildfire Forecasting System

Title: Is Your Video Language Model a Reliable Judge?

Title: MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice

Title: SINdex: Semantic INconsistency Index for Hallucination Detection in LLMs

Title: Black Box Causal Inference: Effect Estimation via Meta Prediction

Title: Integrating Frequency-Domain Representations with Low-Rank Adaptation in Vision-Language Models

Title: Nearly Optimal Differentially Private ReLU Regression

Title: Intent-Aware Self-Correction for Mitigating Social Biases in Large Language Models

Title: End-to-End HOI Reconstruction Transformer with Graph-based Encoding

Title: Towards Ambiguity-Free Spatial Foundation Model: Rethinking and Decoupling Depth Ambiguity

Title: GenieBlue: Integrating both Linguistic and Multimodal Capabilities for Large Language Models on Mobile Devices

Title: FedEM: A Privacy-Preserving Framework for Concurrent Utility Preservation in Federated Learning

Title: Data-Free Black-Box Federated Learning via Zeroth-Order Gradient Estimation

Title: SmartBench: Is Your LLM Truly a Good Chinese Smartphone Assistant?

Title: Towards Universal Text-driven CT Image Segmentation

Title: A Label-Free High-Precision Residual Moveout Picking Method for Travel Time Tomography based on Deep Learning

Title: Mitigating Memorization in LLMs using Activation Steering

Title: Improving SAM for Camouflaged Object Detection via Dual Stream Adapters

Title: Constructions are Revealed in Word Distributions

Title: Fine-Grained Bias Detection in LLM: Enhancing detection mechanisms for nuanced biases

Title: Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practices

Title: TransParking: A Dual-Decoder Transformer Framework with Soft Localization for End-to-End Automatic Parking

Title: A Survey on Post-training of Large Language Models

Title: GEM: Empowering MLLM for Grounded ECG Understanding with Time Series and Images

Title: Towards Conversational AI for Disease Management

Title: An Empirical Study of Causal Relation Extraction Transfer: Design and Data

Title: Biased Federated Learning under Wireless Heterogeneity

Title: Exploring Interpretability for Visual Prompt Tuning with Hierarchical Concepts

Title: Fish2Mesh Transformer: 3D Human Mesh Recovery from Egocentric Vision

Title: Theta Theory: operads and coloring

Title: Clustering-based Meta Bayesian Optimization with Theoretical Guarantee

Title: PointDiffuse: A Dual-Conditional Diffusion Model for Enhanced Point Cloud Semantic Segmentation

Title: Attention-Based Synthetic Data Generation for Calibration-Enhanced Survival Analysis: A Case Study for Chronic Kidney Disease Using Electronic Health Records

Title: Patch-Depth Fusion: Dichotomous Image Segmentation via Fine-Grained Patch Strategy and Depth Integrity-Prior

Title: ULTHO: Ultra-Lightweight yet Efficient Hyperparameter Optimization in Deep Reinforcement Learning

Title: Handwritten Digit Recognition: An Ensemble-Based Approach for Superior Performance

Title: AF-KAN: Activation Function-Based Kolmogorov-Arnold Networks for Efficient Representation Learning

Title: SecureGS: Boosting the Security and Fidelity of 3D Gaussian Splatting Steganography

Title: Unlocking Pretrained LLMs for Motion-Related Multimodal Generation: A Fine-Tuning Approach to Unify Diffusion and Next-Token Prediction

Title: BlackGoose Rimer: Harnessing RWKV-7 as a Simple yet Superior Replacement for Transformers in Large-Scale Time Series Modeling

Title: USP: Unified Self-Supervised Pretraining for Image Generation and Understanding

Title: X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation

Title: GSV3D: Gaussian Splatting-based Geometric Distillation with Stable Video Diffusion for Single-Image 3D Object Generation

Title: GRP: Goal-Reversed Prompting for Zero-Shot Evaluation with LLMs

Title: Boosting the Local Invariance for Better Adversarial Transferability

Title: Next Token Is Enough: Realistic Image Quality and Aesthetic Scoring with Multimodal Large Language Model

Title: VLForgery Face Triad: Detection, Localization and Attribution via Multimodal Large Language Models

Title: Adaptive UAV-Assisted Hierarchical Federated Learning: Optimizing Energy, Latency, and Resilience for Dynamic Smart IoT Networks

Title: Do Fairness Interventions Come at the Cost of Privacy: Evaluations for Binary Classifiers

Title: BioMoDiffuse: Physics-Guided Biomechanical Diffusion for Controllable and Authentic Human Motion Synthesis

Title: UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban Spaces

Title: Invariant Federated Learning: A Novel Approach to Addressing Challenges in Federated Learning for Edge Intelligence

Title: Feature-EndoGaussian: Feature Distilled Gaussian Splatting in Surgical Deformable Scene Reconstruction

Title: Secure On-Device Video OOD Detection Without Backpropagation

Title: Treble Counterfactual VLMs: A Causal Approach to Hallucination

Title: ROCM: RLHF on consistency models

Title: FORESCENE: FOREcasting human activity via latent SCENE graphs diffusion

Title: Lightweight Software Kernels and Hardware Extensions for Efficient Sparse Deep Neural Networks on Microcontrollers

Title: Sample-aware Adaptive Structured Pruning for Large Language Models

Title: PTDiffusion: Free Lunch for Generating Optical Illusion Hidden Pictures with Phase-Transferred Diffusion Model

Title: Attackers Can Do Better: Over- and Understated Factors of Model Stealing Attacks

Title: NeuroADDA: Active Discriminative Domain Adaptation in Connectomic

Title: Removing Multiple Hybrid Adverse Weather in Video via a Unified Model

Title: Explainable Synthetic Image Detection through Diffusion Timestep Ensembling

Title: CUPCase: Clinically Uncommon Patient Cases and Diagnoses Dataset

Title: Lifelong Learning with Task-Specific Adaptation: Addressing the Stability-Plasticity Dilemma

Title: StreamMind: Unlocking Full Frame Rate Streaming Video Dialogue through Event-Gated Cognition

Title: Vision-based 3D Semantic Scene Completion via Capture Dynamic Representations

Title: Reinforced Diffuser for Red Teaming Large Vision-Language Models

Title: WaveStitch: Flexible and Fast Conditional Time Series Generation with Diffusion Models

Title: Integrating Chain-of-Thought for Multimodal Alignment: A Study on 3D Vision-Language Learning

Title: Dynamically evolving segment anything model with continuous learning for medical image segmentation

Title: Can Atomic Step Decomposition Enhance the Self-structured Reasoning of Multimodal Large Models?

Title: MAD-MAX: Modular And Diverse Malicious Attack MiXtures for Automated LLM Red Teaming

Title: Poisoned-MRAG: Knowledge Poisoning Attacks to Multimodal Retrieval Augmented Generation

Title: From Captions to Rewards (CAREVL): Leveraging Large Language Model Experts for Enhanced Reward Modeling in Large Vision-Language Models

Title: Segment Anything, Even Occluded

Title: Get In Video: Add Anything You Want to the Video

Title: Using Mechanistic Interpretability to Craft Adversarial Attacks against Large Language Models

Title: Zero-AVSR: Zero-Shot Audio-Visual Speech Recognition with LLMs by Learning Language-Agnostic Speech Representations

Title: Exploring Adversarial Transferability between Kolmogorov-arnold Networks

Title: Mitigating Blockchain extractable value (BEV) threats by Distributed Transaction Sequencing in Blockchains

Title: Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding

Title: IteRABRe: Iterative Recovery-Aided Block Reduction

Title: MoEMoE: Question Guided Dense and Scalable Sparse Mixture-of-Expert for Multi-source Multi-modal Answering

Title: ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation

Title: Text2Story: Advancing Video Storytelling with Text Guidance

Title: GeoLangBind: Unifying Earth Observation with Agglomerative Vision-Language Foundation Models

Title: Advancing Autonomous Vehicle Intelligence: Deep Learning and Multimodal LLM for Traffic Sign Recognition and Robust Lane Detection

Title: End-to-End Action Segmentation Transformer

Title: Accurate and Efficient Two-Stage Gun Detection in Video

Title: Pretraining Generative Flow Networks with Inexpensive Rewards for Molecular Graph Generation

Title: Learning to Unlearn while Retaining: Combating Gradient Conflicts in Machine Unlearning

Title: Backdoor Attacks on Discrete Graph Diffusion Models

Title: GIN-Graph: A Generative Interpretation Network for Model-Level Explanation of Graph Neural Networks

Title: Language Model Personalization via Reward Factorization

Title: Adversarial Robustness of Discriminative Self-Supervised Learning in Vision

Title: Adaptive Audio-Visual Speech Recognition via Matryoshka-Based Multimodal LLMs

Title: Generative Video Bi-flow

Title: Machine Learning meets Algebraic Combinatorics: A Suite of Datasets Capturing Research-level Conjecturing Ability in Pure Mathematics

Title: VORTEX: Challenging CNNs at Texture Recognition by using Vision Transformers with Orderless and Randomized Token Encodings

Title: Spectral State Space Model for Rotation-Invariant~Visual~Representation~Learning

Title: EPR-GAIL: An EPR-Enhanced Hierarchical Imitation Learning Framework to Simulate Complex User Consumption Behaviors

Title: How LLMs Learn: Tracing Internal Representations with Sparse Autoencoders

Title: Removing Averaging: Personalized Lip-Sync Driven Characters Based on Identity Adapter

Title: FEDS: Feature and Entropy-Based Distillation Strategy for Efficient Learned Image Compression

Title: Consistent Image Layout Editing with Diffusion Models

Title: Training LLM-based Tutors to Improve Student Learning Outcomes in Dialogues

Title: Federated Learning for Diffusion Models

Title: Pre-Training Meta-Rule Selection Policy for Visual Generative Abductive Learning

Title: Graph Retrieval-Augmented LLM for Conversational Recommendation Systems

Title: OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection

Title: OT-DETECTOR: Delving into Optimal Transport for Zero-shot Out-of-Distribution Detection

Title: CtrTab: Tabular Data Synthesis with High-Dimensional and Limited Data

Title: A Quantitative Evaluation of the Expressivity of BMI, Pose and Gender in Body Embeddings for Recognition and Identification

Title: NaviDet: Efficient Input-level Backdoor Detection on Text-to-Image Synthesis via Neuron Activation Variation

Title: Privacy Protection in Prosumer Energy Management Based on Federated Learning

Title: DynCIM: Dynamic Curriculum for Imbalanced Multimodal Learning

Title: Geometric Knowledge-Guided Localized Global Distribution Alignment for Federated Learning

Title: Reconstructing Depth Images of Moving Objects from Wi-Fi CSI Data

Title: Long-tailed Adversarial Training with Self-Distillation

Title: SP3D: Boosting Sparsely-Supervised 3D Object Detection via Accurate Cross-Modal Semantic Prompts

Title: CalliReader: Contextualizing Chinese Calligraphy via an Embedding-Aligned Vision-Language Model

Title: A Mesh Is Worth 512 Numbers: Spectral-domain Diffusion Modeling for High-dimension Shape Generation

Title: PerturboLLaVA: Reducing Multimodal Hallucinations with Perturbative Visual Training

Title: A Study of Effectiveness of Brand Domain Identification Features for Phishing Detection in 2025

Title: VisualSimpleQA: A Benchmark for Decoupled Evaluation of Large Vision-Language Models in Fact-Seeking Question Answering

Title: Enhancing Malware Fingerprinting through Analysis of Evasive Techniques

Title: Evaluation of Safety Cognition Capability in Vision-Language Models for Autonomous Driving

Title: ExGes: Expressive Human Motion Retrieval and Modulation for Audio-Driven Gesture Synthesis

Title: Fine-Grained Alignment and Noise Refinement for Compositional Text-to-Image Generation

Title: HFedCKD: Toward Robust Heterogeneous Federated Learning via Data-free Knowledge Distillation and Two-way Contrast

Title: GFlowVLM: Enhancing Multi-step Reasoning in Vision-Language Models with Generative Flow Networks

Title: SAQ-SAM: Semantically-Aligned Quantization for Segment Anything Model

Title: Can Small Language Models Reliably Resist Jailbreak Attacks? A Comprehensive Evaluation

Title: Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement

Title: SGA-INTERACT: A 3D Skeleton-based Benchmark for Group Activity Understanding in Modern Basketball Tactic

Title: AnywhereDoor: Multi-Target Backdoor Attacks on Object Detection

Title: SafeSpeech: A Comprehensive and Interactive Tool for Analysing Sexist and Abusive Language in Conversations

Title: One-Step Diffusion Model for Image Motion-Deblurring

Title: ARMOR v0.1: Empowering Autoregressive Multimodal Understanding Model with Interleaved Multimodal Generation via Asymmetric Synergy

Title: QuantCache: Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video Generation

Title: BingoGuard: LLM Content Moderation Tools with Risk Levels

Title: BDPFL: Backdoor Defense for Personalized Federated Learning via Explainable Distillation

Title: Generative modelling with jump-diffusions

Title: MMARD: Improving the Min-Max Optimization Process in Adversarial Robustness Distillation

Title: TR-DQ: Time-Rotation Diffusion Quantization

Title: Future-Aware Interaction Network For Motion Forecasting

Title: Human Cognition Inspired RAG with Knowledge Graph for Complex Problem Solving

Title: Conceptrol: Concept Control of Zero-shot Personalized Image Generation

Title: Global-Aware Monocular Semantic Scene Completion with State Space Models

Title: SHIP: A Shapelet-based Approach for Interpretable Patient-Ventilator Asynchrony Detection

Title: Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation

Title: MultiCo3D: Multi-Label Voxel Contrast for One-Shot Incremental Segmentation of 3D Neuroimages

Title: StructVPR++: Distill Structural and Semantic Knowledge with Weighting Samples for Visual Place Recognition

Title: FW-Shapley: Real-time Estimation of Weighted Shapley Values

Title: Steerable Pyramid Weighted Loss: Multi-Scale Adaptive Weighting for Semantic Segmentation

Title: Interpretable Model Drift Detection

Title: GroMo: Plant Growth Modeling with Multiview Images

Title: Synthetic Data Generation for Minimum-Exposure Navigation in a Time-Varying Environment using Generative AI Models

Title: Dynamic Updates for Language Adaptation in Visual-Language Tracking

Title: Similarity-Guided Layer-Adaptive Vision Transformer for UAV Tracking

Title: DiffCLIP: Differential Attention Meets CLIP

Title: Revisiting Early Detection of Sexual Predators via Turn-level Optimization

Title: BTFL: A Bayesian-based Test-Time Generalization Method for Internal and External Data Distributions in Federated learning

Title: CLAD: Constrained Latent Action Diffusion for Vision-Language Procedure Planning

Title: Enhancing NLP Robustness and Generalization through LLM-Generated Contrast Sets: A Scalable Framework for Systematic Evaluation and Adversarial Training

Title: Adding Additional Control to One-Step Diffusion with Joint Distribution Matching

Title: AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation

Title: Exploring LLM Agents for Cleaning Tabular Machine Learning Datasets

Title: Attention, Please! PixelSHAP Reveals What Vision-Language Models Actually Focus On

Title: Emulating Self-attention with Convolution for Efficient Image Super-Resolution

Title: Learning Few-Step Diffusion Models by Trajectory Distribution Matching

Title: Seeing Delta Parameters as JPEG Images: Data-Free Delta Compression with Discrete Cosine Transform

Title: Dynamic Dictionary Learning for Remote Sensing Image Segmentation

Title: PixelPonder: Dynamic Patch Adaptation for Enhanced Multi-Conditional Text-to-Image Generation

Title: Asymmetric Decision-Making in Online Knowledge Distillation:Unifying Consensus and Divergence

Title: UniGenX: Unified Generation of Sequence and Structure with Autoregressive Diffusion

Title: Censoring-Aware Tree-Based Reinforcement Learning for Estimating Dynamic Treatment Regimes with Censored Outcomes

Title: InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models

Title: What's in a Latent? Leveraging Diffusion Latent Space for Domain Generalization

Title: Unsupervised Multi-Clustering and Decision-Making Strategies for 4D-STEM Orientation Mapping

Title: MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation

Title: PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts

Title: Alignment for Efficient Tool Calling of Large Language Models

Title: Delusions of Large Language Models

Title: Continuous Online Adaptation Driven by User Interaction for Medical Image Segmentation

Title: Enhancing CBMs Through Binary Distillation with Applications to Test-Time Intervention

Title: Data Efficient Subset Training with Differential Privacy

Title: D3DR: Lighting-Aware Object Insertion in Gaussian Splatting

Title: CoDa-4DGS: Dynamic Gaussian Splatting with Context and Deformation Awareness for Autonomous Driving

Title: Color Alignment in Diffusion

Title: DiffAtlas: GenAI-fying Atlas Segmentation via Image-Mask Diffusion

Title: Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models

Title: Primal-Dual Sample Complexity Bounds for Constrained Markov Decision Processes with Multiple Constraints

Title: Revisiting Invariant Learning for Out-of-Domain Generalization on Multi-Site Mammogram Datasets

Title: SemHiTok: A Unified Image Tokenizer via Semantic-Guided Hierarchical Codebook for Multimodal Understanding and Generation

Title: Large Language Models Are Effective Human Annotation Assistants, But Not Good Independent Annotators

Title: Dr Genre: Reinforcement Learning from Decoupled LLM Feedback for Generic Text Rewriting

Title: Key Establishment in the Space Environment

Title: GenDR: Lightning Generative Detail Restorator

Title: VideoPhy-2: A Challenging Action-Centric Physical Commonsense Evaluation in Video Generation

Title: Multimodal Emotion Recognition and Sentiment Analysis in Multi-Party Conversation Contexts

Title: Privacy Auditing of Large Language Models

Title: Mitigating Preference Hacking in Policy Optimization with Pessimism

Title: Towards Fine-Grained Video Question Answering

Title: HierDAMap: Towards Universal Domain Adaptive BEV Mapping via Hierarchical Perspective Priors

Title: eMoE: Task-aware Memory Efficient Mixture-of-Experts-Based (MoE) Model Inference

Title: GUIDE-CoT: Goal-driven and User-Informed Dynamic Estimation for Pedestrian Trajectory using Chain-of-Thought

Title: AttFC: Attention Fully-Connected Layer for Large-Scale Face Recognition with One GPU

Title: MADS: Multi-Attribute Document Supervision for Zero-Shot Image Classification

Title: From Image- to Pixel-level: Label-efficient Hyperspectral Image Reconstruction

Title: Towards Generalization of Tactile Image Generation: Reference-Free Evaluation in a Leakage-Free Setting

Title: Enhanced Multi-Tuple Extraction for Alloys: Integrating Pointer Networks and Augmented Attention

Title: ResMoE: Space-efficient Compression of Mixture of Experts LLMs via Residual Restoration

Title: Text-to-Image Diffusion Models Cannot Count, and Prompt Refinement Cannot Help

Title: ProBench: Judging Multimodal Foundation Models on Open-ended Multi-domain Expert Tasks

Title: Improving cognitive diagnostics in pathology: a deep learning approach for augmenting perceptional understanding of histopathology images

Title: CATANet: Efficient Content-Aware Token Aggregation for Lightweight Image Super-Resolution

Title: HiSTF Mamba: Hierarchical Spatiotemporal Fusion with Multi-Granular Body-Spatial Modeling for High-Fidelity Text-to-Motion Generation

Title: Illuminating Darkness: Enhancing Real-world Low-light Scenes with Smartphone Images

Title: DirectTriGS: Triplane-based Gaussian Splatting Field Representation for 3D Generation

Title: When Lighting Deceives: Exposing Vision-Language Models' Illumination Vulnerability Through Illumination Transformation Attack

Title: You Are Your Own Best Teacher: Achieving Centralized-level Performance in Federated Learning under Heterogeneous and Long-tailed Data

Title: Combinatorial Optimization via LLM-driven Iterated Fine-tuning

Title: From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers

Title: Complete Key Recovery of a DNA-based Encryption and Developing a Novel Stream Cipher for Color Image Encryption: Bio-SNOW

Title: Effect of Selection Format on LLM Performance

Title: FinTSBridge: A New Evaluation Suite for Real-world Financial Prediction with Advanced Time Series Models

Title: Post-Training Quantization for Diffusion Transformer via Hierarchical Timestep Grouping

Title: LLaFEA: Frame-Event Complementary Fusion for Fine-Grained Spatiotemporal Understanding in LMMs

Title: Modeling Human Skeleton Joint Dynamics for Fall Detection

Title: CineBrain: A Large-Scale Multi-Modal Brain Dataset During Naturalistic Audiovisual Narrative Processing

Title: Aligning Instance-Semantic Sparse Representation towards Unsupervised Object Segmentation and Shape Abstraction with Repeatable Primitives

Title: Large Language Model Guided Progressive Feature Alignment for Multimodal UAV Object Detection

Title: Lshan-1.0 Technical Report

Title: CtrlRAG: Black-box Adversarial Attacks Based on Masked Language Models in Retrieval-Augmented Language Generation

Title: Approximate Size Targets Are Sufficient for Accurate Semantic Segmentation

Title: Motion Anything: Any to Motion Generation

Title: Capture Global Feature Statistics for One-Shot Federated Learning

Title: MIGA: Mutual Information-Guided Attack on Denoising Models for Semantic Manipulation

Title: A Multimodal Benchmark Dataset and Model for Crop Disease Diagnosis

Title: Task-Specific Knowledge Distillation from the Vision Foundation Model for Enhanced Medical Image Segmentation

Title: Lightweight Multimodal Artificial Intelligence Framework for Maritime Multi-Scene Recognition

Title: Exploring Multimodal Perception in Large Language Models Through Perceptual Strength Ratings

Title: Learning Decision Trees as Amortized Structure Inference

Title: ConcreTizer: Model Inversion Attack via Occupancy Classification and Dispersion Control for 3D Point Cloud Restoration

Title: Social Bias Benchmark for Generation: A Comparison of Generation and QA-Based Evaluations

Title: Utilizing Jailbreak Probability to Attack and Safeguard Multimodal LLMs

Title: TiGer: Self-Supervised Purification for Time-evolving Graphs

Title: Are We Truly Forgetting? A Critical Re-examination of Machine Unlearning Evaluation Protocols

Title: CAPT: Class-Aware Prompt Tuning for Federated Long-Tailed Learning with Vision-Language Model

Title: Public space security management using digital twin technologies

Title: SOYO: A Tuning-Free Approach for Video Style Morphing via Style-Adaptive Interpolation in Diffusion Models

Title: Taking Notes Brings Focus? Towards Multi-Turn Multimodal Dialogue Learning

Title: Large Language Models Often Say One Thing and Do Another

Title: SDFA: Structure Aware Discriminative Feature Aggregation for Efficient Human Fall Detection in Video

Title: Toward Multi-Session Personalized Conversation: A Large-Scale Dataset and Hierarchical Tree Framework for Implicit Reasoning

Title: HybridReg: Robust 3D Point Cloud Registration with Hybrid Motions

Title: Erase Diffusion: Empowering Object Removal Through Calibrating Diffusion Pathways

Title: EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer

Title: Availability-aware Sensor Fusion via Unified Canonical Space for 4D Radar, LiDAR, and Camera

Title: Multimodal Human-AI Synergy for Medical Imaging Quality Control: A Hybrid Intelligence Framework with Adaptive Dataset Curation and Closed-Loop Evaluation

Title: Bot Wars Evolved: Orchestrating Competing LLMs in a Counterstrike Against Phone Scams

Title: Find your Needle: Small Object Image Retrieval via Multi-Object Attention Optimization

Title: TCM-3CEval: A Triaxial Benchmark for Assessing Responses from Large Language Models in Traditional Chinese Medicine

Title: MambaFlow: A Mamba-Centric Architecture for End-to-End Optical Flow Estimation

Title: Recovering Partially Corrupted Major Objects through Tri-modality Based Image Completion

Title: A Failure-Free and Efficient Discrete Laplace Distribution for Differential Privacy in MPC

Title: TIDE : Temporal-Aware Sparse Autoencoders for Interpretable Diffusion Transformers in Image Generation

Title: Generative method for aerodynamic optimization based on classifier-free guided denoising diffusion probabilistic model

Title: Breaking the Limits of Quantization-Aware Defenses: QADT-R for Robustness Against Patch-Based Adversarial Attacks in QNNs

Title: Boosting the Generalization and Reasoning of Vision Language Models with Curriculum Reinforcement Learning

Title: You Only Debias Once: Towards Flexible Accuracy-Fairness Trade-offs at Inference Time

Title: DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs

Title: XR-VLM: Cross-Relationship Modeling with Multi-part Prompts and Visual Features for Fine-Grained Recognition

Title: Linguistic Knowledge Transfer Learning for Speech Enhancement

Title: On the Generalization of Representation Uncertainty in Earth Observation

Title: A Novel Ophthalmic Benchmark for Evaluating Multimodal Large Language Models with Fundus Photographs and OCT Images

Title: OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation

Title: Explainable Android Malware Detection and Malicious Code Localization Using Graph Attention

Title: Exposure Bias Reduction for Enhancing Diffusion Transformer Feature Caching

Title: A Light Perspective for 3D Object Detection

Title: Application of Multiple Chain-of-Thought in Contrastive Reasoning for Implicit Sentiment Analysis

Title: MRCEval: A Comprehensive, Challenging and Accessible Machine Reading Comprehension Benchmark

Title: Controllable 3D Outdoor Scene Generation via Scene Graphs

Title: Ideas in Inference-time Scaling can Benefit Generative Pre-training Algorithms

Title: MIRAM: Masked Image Reconstruction Across Multiple Scales for Breast Lesion Risk Prediction

Title: Temporal Overlapping Prediction: A Self-supervised Pre-training Method for LiDAR Moving Object Segmentation

Title: Strategies for political-statement segmentation and labelling in unstructured text

Title: Contextual Cues in Machine Translation: Investigating the Potential of Multi-Source Input Strategies in LLMs and NMT Systems

Title: QKD-KEM: Hybrid QKD Integration into TLS with OpenSSL Providers

Title: Effective and Efficient Masked Image Generation Models

Title: How Well Can Differential Privacy Be Audited in One Run?

Title: A Formally Verified Lightning Network

Title: Synthetic Lung X-ray Generation through Cross-Attention and Affinity Transformation

Title: FedRand: Enhancing Privacy in Federated Learning with Randomized LoRA Subparameter Updates

Title: A Deep Learning Architecture for Land Cover Mapping Using Spatio-Temporal Sentinel-1 Features

Title: Boosting Diffusion-Based Text Image Super-Resolution Model Towards Generalized Real-World Scenarios

Title: CoT-Drive: Efficient Motion Forecasting for Autonomous Driving with LLMs and Chain-of-Thought Prompting

Title: Beyond the Edge of Function: Unraveling the Patterns of Type Recovery in Binary Code

Title: Semantic Communications with Computer Vision Sensing for Edge Video Transmission

Title: AnomalyPainter: Vision-Language-Diffusion Synergy for Zero-Shot Realistic and Diverse Industrial Anomaly Synthesis

Title: COMODO: Cross-Modal Video-to-IMU Distillation for Efficient Egocentric Human Activity Recognition

Title: Customized SAM 2 for Referring Remote Sensing Image Segmentation

Title: Federated Learning in NTNs: Design, Architecture and Challenges

Title: Efficient Distillation of Classifier-Free Guidance using Adapters

Title: A Systematic Review of ECG Arrhythmia Classification: Adherence to Standards, Fair Evaluation, and Embedded Feasibility

Title: A Graph-based Verification Framework for Fact-Checking

Title: Distilling Knowledge into Quantum Vision Transformers for Biomedical Image Classification

Title: Benchmarking Chinese Medical LLMs: A Medbench-based Analysis of Performance Gaps and Hierarchical Optimization Strategies

Title: AttenST: A Training-Free Attention-Driven Style Transfer Framework with Pre-Trained Diffusion Models

Title: Group-robust Sample Reweighting for Subpopulation Shifts via Influence Functions

Title: Assessing the Macro and Micro Effects of Random Seeds on Fine-Tuning Large Language Models

Title: Unleashing the Potential of Large Language Models for Text-to-Image Generation through Autoregressive Representation Alignment

Title: LEGO-Motion: Learning-Enhanced Grids with Occupancy Instance Modeling for Class-Agnostic Motion Prediction

Title: Probabilistic Segmentation for Robust Field of View Estimation

Title: Is My Text in Your AI Model? Gradient-based Membership Inference Test applied to LLMs

Title: TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models

Title: PersonaBooth: Personalized Text-to-Motion Generation

Title: SPEED: Scalable, Precise, and Efficient Concept Erasure for Diffusion Models

Title: Revisiting Noise in Natural Language Processing for Computational Social Science

Title: Q-MARL: A quantum-inspired algorithm using neural message passing for large-scale multi-agent reinforcement learning

Title: Keeping Representation Similarity in Finetuning for Medical Image Analysis

Title: REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding

Title: TimeStep Master: Asymmetrical Mixture of Timestep LoRA Experts for Versatile and Efficient Diffusion Models in Vision

Title: AR-Diffusion: Asynchronous Video Generation with Auto-Regressive Diffusion

Title: RePO: ReLU-based Preference Optimization

Title: Open-Set Gait Recognition from Sparse mmWave Radar Point Clouds

Title: Is a Good Foundation Necessary for Efficient Reinforcement Learning? The Computational Role of the Base Model in Exploration

Title: Anatomy-Aware Conditional Image-Text Retrieval

Title: LLMs syntactically adapt their language use to their conversational partner

Title: MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning

Title: Learning to Localize Leakage of Cryptographic Sensitive Variables

Title: YOLOE: Real-Time Seeing Anything

Title: Efficient Membership Inference Attacks by Bayesian Neural Network

Title: Poisoning Attacks to Local Differential Privacy Protocols for Trajectory Data

Title: Chameleon: Fast-slow Neuro-symbolic Lane Topology Extraction

Title: LLaVA-RadZ: Can Multimodal Large Language Models Effectively Tackle Zero-shot Radiology Recognition?

Title: V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation

Title: Trustworthy Machine Learning via Memorization and the Granular Long-Tail: A Survey on Interactions, Tradeoffs, and Beyond

Title: Think Before You Segment: High-Quality Reasoning Segmentation with GPT Chain of Thoughts

Title: From Centralized to Decentralized Federated Learning: Theoretical Insights, Privacy Preservation, and Robustness Challenges

Title: ADROIT: A Self-Supervised Framework for Learning Robust Representations for Active Learning

Title: PE3R: Perception-Efficient 3D Reconstruction

Title: Language Models Fail to Introspect About Their Knowledge of Language

Title: FastInstShadow: A Simple Query-Based Model for Instance Shadow Detection

Title: TokenButler: Token Importance is Predictable

Title: XIFBench: Evaluating Large Language Models on Multilingual Instruction Following

Title: KSOD: Knowledge Supplement for LLMs On Demand

Title: Federated Multimodal Learning with Dual Adapters and Selective Pruning for Communication and Computational Efficiency

Title: Alligat0R: Pre-Training Through Co-Visibility Segmentation for Relative Camera Pose Regression

Title: Inductive Moment Matching

Title: Runtime Detection of Adversarial Attacks in AI Accelerators Using Performance Counters

Title: Split-n-Chain: Privacy-Preserving Multi-Node Split Learning with Blockchain-Based Auditability

Title: Denoising Score Distillation: From Noisy Diffusion Pretraining to One-Step High-Quality Generation

Title: Robusto-1 Dataset: Comparing Humans and VLMs on real out-of-distribution Autonomous Driving VQA from Peru

Title: Detection Avoidance Techniques for Large Language Models

Title: HumanMM: Global Human Motion Recovery from Multi-shot Videos

Title: VACE: All-in-One Video Creation and Editing

Title: Balanced Image Stylization with Style Matching Score

Title: Implicit Reasoning in Transformers is Reasoning through Shortcuts

Title: SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models

Title: VoD: Learning Volume of Differences for Video-Based Deepfake Detection