2025-06-13

Title: Multimodal Cinematic Video Synthesis Using Text-to-Image and Audio Generation Models

Title: NOCL: Node-Oriented Conceptualization LLM for Graph Tasks without Message Passing

Title: A Survey of Automatic Evaluation Methods on Text, Visual and Speech Generations

Title: From Threat to Tool: Leveraging Refusal-Aware Injection Attacks for Safety Alignment

Title: LLMs Caught in the Crossfire: Malware Requests and Jailbreak Challenges

Title: Private Memorization Editing: Turning Memorization into a Defense to Strengthen Data Privacy in Large Language Models

Title: Mind the Gap: Revealing Security Barriers through Situational Awareness of Small and Medium Business Key Decision-Makers

Title: Secure Data Access in Cloud Environments Using Quantum Cryptography

Title: Evaluation empirique de la sécurisation et de l'alignement de ChatGPT et Gemini: analyse comparative des vulnérabilités par expérimentations de jailbreaks

Title: Safeguarding Multimodal Knowledge Copyright in the RAG-as-a-Service Environment

Title: Multiverse Privacy Theory for Contextual Risks in Complex User-AI Interactions

Title: GenBreak: Red Teaming Text-to-Image Generators Using Large Language Models

Title: Omni-DPO: A Dual-Perspective Paradigm for Dynamic Preference Learning of LLMs

Title: Textual Bayes: Quantifying Uncertainty in LLM-Based Systems

Title: A quantum semantic framework for natural language processing

Title: LoRA-Edit: Controllable First-Frame-Guided Video Editing via Mask-Aware LoRA Fine-Tuning

Title: DeepTraverse: A Depth-First Search Inspired Network for Algorithmic Visual Understanding

Title: Chat-of-Thought: Collaborative Multi-Agent System for Generating Domain Specific Information

Title: Optimizing Latent Dimension Allocation in Hierarchical VAEs: Balancing Attenuation and Information Retention for OOD Detection

Title: When Meaning Stays the Same, but Models Drift: Evaluating Quality of Service under Token-Level Behavioral Instability in LLMs

Title: EfficientVLA: Training-Free Acceleration and Compression for Vision-Language-Action Models

Title: Learning to Collaborate Over Graphs: A Selective Federated Multi-Task Learning Approach

Title: Expert-in-the-Loop Systems with Cross-Domain and In-Domain Few-Shot Learning for Software Vulnerability Detection

Title: NnD: Diffusion-based Generation of Physically-Nonnegative Objects

Title: ChartReasoner: Code-Driven Modality Bridging for Long-Chain Reasoning in Chart Question Answering

Title: Detecção da Psoríase Utilizando Visão Computacional: Uma Abordagem Comparativa Entre CNNs e Vision Transformers

Title: GRAIL: A Benchmark for GRaph ActIve Learning in Dynamic Sensing Environments

Title: D-LiFT: Improving LLM-based Decompiler Backend via Code Quality-driven Fine-tuning

Title: ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs

Title: Provable Sim-to-Real Transfer via Offline Domain Randomization

Title: Physiological-Model-Based Neural Network for Heart Rate Estimation during Daily Physical Activities

Title: RoCA: Robust Cross-Domain End-to-End Autonomous Driving

Title: Unconditionally Secure Wireless-Wired Ground-Satellite-Ground Communication Networks Utilizing Classical and Quantum Noise

Title: When Large Language Models are Reliable for Judging Empathic Communication

Title: Can LLMs Generate Good Stories? Insights and Challenges from a Narrative Planning Perspective

Title: Disclosure Audits for LLM Agents

Title: SPARKE: Scalable Prompt-Aware Diversity Guidance in Diffusion Models via RKE Score

Title: Retrieval of Surface Solar Radiation through Implicit Albedo Recovery from Temporal Context

Title: AURA: A Multi-Agent Intelligence Framework for Knowledge-Enhanced Cyber Threat Attribution

Title: Geometric Regularity in Deterministic Sampling of Diffusion-based Generative Models

Title: Scalable Non-Equivariant 3D Molecule Generation via Rotational Alignment

Title: Guardians of the Regime: When and Why Autocrats Create Secret Police

Title: DynaSubVAE: Adaptive Subgrouping for Scalable and Robust OOD Detection

Title: AWP: Activation-Aware Weight Pruning and Quantization with Projected Gradient Descent

Title: Cross-Learning Between ECG and PCG: Exploring Common and Exclusive Characteristics of Bimodal Electromechanical Cardiac Waveforms

Title: ScoreMix: Improving Face Recognition via Score Composition in Diffusion Generators

Title: California Crop Yield Benchmark: Combining Satellite Image, Climate, Evapotranspiration, and Soil Data Layers for County-Level Yield Forecasting of Over 70 Crops

Title: Classifying Unreliable Narrators with Large Language Models

Title: LaMAGIC2: Advanced Circuit Formulations for Language Model-Based Analog Topology Generation

Title: Prompt Attacks Reveal Superficial Knowledge Removal in Unlearning Methods

Title: A new type of federated clustering: A non-model-sharing approach

Title: ToxSyn-PT: A Large-Scale Synthetic Dataset for Hate Speech Detection in Portuguese

Title: Do Language Models Have Bayesian Brains? Distinguishing Stochastic and Deterministic Decision Patterns within Large Language Models

Title: Interior-Point Vanishing Problem in Semidefinite Relaxations for Neural Network Verification

Title: Graph-MLLM: Harnessing Multimodal Large Language Models for Multimodal Graph Learning

Title: HalLoc: Token-level Localization of Hallucinations for Vision Language Models

Title: ClusterUCB: Efficient Gradient-Based Data Selection for Targeted Fine-Tuning of LLMs

Title: Flick: Few Labels Text Classification using K-Aware Intermediate Learning in Multi-Task Low-Resource Languages

Title: "Check My Work?": Measuring Sycophancy in a Simulated Educational Context

Title: Scheduled Interleaved Speech-Text Training for Speech-to-Speech Translation with LLMs

Title: Uncertainty-Aware Deep Learning for Automated Skin Cancer Classification: A Comprehensive Evaluation

Title: ELFuzz: Efficient Input Generation via LLM-driven Synthesis Over Fuzzer Space

Title: A Comprehensive Survey of Unmanned Aerial Systems' Risks and Mitigation Strategies

Title: Research on Audio-Visual Quality Assessment Dataset and Method for User-Generated Omnidirectional Video

Title: GeoCAD: Local Geometry-Controllable CAD Generation

Title: Adaptive Chosen-Ciphertext Security of Distributed Broadcast Encryption

Title: Provably Learning from Language Feedback

Title: UrbanSense:AFramework for Quantitative Analysis of Urban Streetscapes leveraging Vision Large Language Models

Title: Code Execution as Grounded Supervision for LLM Reasoning

Title: PhysioWave: A Multi-Scale Wavelet-Transformer for Physiological Signal Representation

Title: History-Aware Neural Operator: Robust Data-Driven Constitutive Modeling of Path-Dependent Materials

Title: Motion-R1: Chain-of-Thought Reasoning and Reinforcement Learning for Human Motion Generation

Title: TreeLoRA: Efficient Continual Learning via Layer-Wise LoRAs Guided by a Hierarchical Gradient-Similarity Tree

Title: FaceLiVT: Face Recognition using Linear Vision Transformer with Structural Reparameterization For Mobile Device

Title: Can We Infer Confidential Properties of Training Data from LLMs?

Title: FSATFusion: Frequency-Spatial Attention Transformer for Infrared and Visible Image Fusion

Title: Revisiting Transformers with Insights from Image Filtering

Title: EQA-RM: A Generative Embodied Reward Model with Test-time Scaling

Title: DART: Differentiable Dynamic Adaptive Region Tokenizer for Vision Transformer and Mamba

Title: ReconMOST: Multi-Layer Sea Temperature Reconstruction with Observations-Guided Diffusion

Title: Pisces: An Auto-regressive Foundation Model for Image Understanding and Generation

Title: FicGCN: Unveiling the Homomorphic Encryption Efficiency from Irregular Graph Convolutional Networks

Title: Time To Impeach LLM-as-a-Judge: Programs are the Future of Evaluation

Title: Generative Algorithms for Wildfire Progression Reconstruction from Multi-Modal Satellite Active Fire Measurements and Terrain Height

Title: PAG: Multi-Turn Reinforced LLM Self-Correction with Policy as Generative Verifier

Title: Burn After Reading: Do Multimodal Large Language Models Truly Capture Order of Events in Image Sequences?

Title: Beyond the Battlefield: Framing Analysis of Media Coverage in Conflict Reporting

Title: SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks

Title: It's Not the Target, It's the Background: Rethinking Infrared Small Target Detection via Deep Patch-Free Low-Rank Representations

Title: MF2Summ: Multimodal Fusion for Video Summarization with Temporal Alignment

Title: System Identification Using Kolmogorov-Arnold Networks: A Case Study on Buck Converters

Title: MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices

Title: Fast on the Easy, Deep on the Hard: Efficient Reasoning via Powered Length Penalty

Title: Towards Robust Multimodal Emotion Recognition under Missing Modalities and Distribution Shifts

Title: Rethinking Generative Human Video Coding with Implicit Motion Transformation

Title: Boosting Adversarial Transferability for Hyperspectral Image Classification Using 3D Structure-invariant Transformation and Intermediate Feature Distance

Title: Starting Positions Matter: A Study on Better Weight Initialization for Neural Network Quantization

Title: MedSeg-R: Reasoning Segmentation in Medical Images with Multimodal Large Language Models

Title: Specification and Evaluation of Multi-Agent LLM Systems -- Prototype and Cybersecurity Applications

Title: LLMs Are Not Yet Ready for Deepfake Image Detection

Title: Table-Text Alignment: Explaining Claim Verification Against Tables in Scientific Papers

Title: Class-Incremental Learning for Honey Botanical Origin Classification with Hyperspectral Images: A Study with Continual Backpropagation

Title: Surface Fairness, Deep Bias: A Comparative Study of Bias in Language Models

Title: A Crack in the Bark: Leveraging Public Knowledge to Remove Tree-Ring Watermarks

Title: Semantic Localization Guiding Segment Anything Model For Reference Remote Sensing Image Segmentation

Title: Beyond Single-User Dialogue: Assessing Multi-User Dialogue State Tracking Capabilities of Large Language Models

Title: J-DDL: Surface Damage Detection and Localization System for Fighter Aircraft

Title: Reliable Reasoning Path: Distilling Effective Guidance for LLM Reasoning with Knowledge Graphs

Title: CogStream: Context-guided Streaming Video Question Answering

Title: ALBERT: Advanced Localization and Bidirectional Encoder Representations from Transformers for Automotive Damage Evaluation

Title: SLICK: Selective Localization and Instance Calibration for Knowledge-Enhanced Car Damage Segmentation in Automotive Insurance

Title: Equivariant Neural Diffusion for Molecule Generation

Title: Data-driven Day Ahead Market Prices Forecasting: A Focus on Short Training Set Windows

Title: From Images to Insights: Explainable Biodiversity Monitoring with Plain Language Habitat Explanations

Title: Balancing Tails when Comparing Distributions: Comprehensive Equity Index (CEI) with Application to Bias Evaluation in Operational Face Biometrics

Title: LRSLAM: Low-rank Representation of Signed Distance Fields in Dense Visual SLAM System

Title: DreamActor-H1: High-Fidelity Human-Product Demonstration Video Generation via Motion-designed Diffusion Transformers

Title: Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration

Title: DanceChat: Large Language Model-Guided Music-to-Dance Generation

Title: Text to Image for Multi-Label Image Recognition with Joint Prompt-Adapter Learning

Title: Harmonizing Geometry and Uncertainty: Diffusion with Hyperspheres

Title: Graph Neural Networks for Automatic Addition of Optimizing Components in Printed Circuit Board Schematics

Title: Rethinking Random Masking in Self Distillation on ViT

Title: Size-adaptive Hypothesis Testing for Fairness

Title: SoK: Evaluating Jailbreak Guardrails for Large Language Models

Title: Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection

Title: High-resolution efficient image generation from WiFi CSI using a pretrained latent diffusion model

Title: TexTailor: Customized Text-aligned Texturing via Effective Resampling

Title: Deep Learning-Based Digitization of Overlapping ECG Images with Open-Source Python Code

Title: Assessing the Resilience of Automotive Intrusion Detection Systems to Adversarial Manipulation

Title: SDialog: A Python Toolkit for Synthetic Dialogue Generation and Analysis

Title: NeuralNexus at BEA 2025 Shared Task: Retrieval-Augmented Prompting for Mistake Identification in AI Tutors

Title: Time Series Forecasting as Reasoning: A Slow-Thinking Approach with Reinforced LLMs

Title: Hessian Geometry of Latent Space in Generative Models

Title: Anatomy-Grounded Weakly Supervised Prompt Tuning for Chest X-ray Latent Diffusion Models

Title: Symmetrical Flow Matching: Unified Image Generation, Segmentation, and Classification with Score-Based Generative Models

Title: CyFence: Securing Cyber-Physical Controllers via Trusted Execution Environment

Title: GigaVideo-1: Advancing Video Generation via Automatic Feedback with 4 GPU-Hours Fine-Tuning

Title: Spelling-out is not Straightforward: LLMs' Capability of Tokenization from Token to Characters

Title: From IOCs to Group Profiles: On the Specificity of Threat Group Behaviors in CTI Knowledge Bases

Title: Data Shifts Hurt CoT: A Theoretical Study

Title: GOLIATH: A Decentralized Framework for Data Collection in Intelligent Transportation Systems

Title: PiPViT: Patch-based Visual Interpretable Prototypes for Retinal Image Analysis

Title: Saturation Self-Organizing Map

Title: Enhancing Deepfake Detection using SE Block Attention with CNN

Title: Unsourced Adversarial CAPTCHA: A Bi-Phase Adversarial CAPTCHA Framework

Title: Large Language Models for Detection of Life-Threatening Texts

Title: Underage Detection through a Multi-Task and MultiAge Approach for Screening Minors in Unconstrained Imagery

Title: Preserving Task-Relevant Information Under Linear Concept Removal

Title: ConTextTab: A Semantics-Aware Tabular In-Context Learner

Title: Uncertainty-Masked Bernoulli Diffusion for Camouflaged Object Detection Refinement

Title: Inferring Adjective Hypernyms with Language Models to Increase the Connectivity of Open English Wordnet

Title: Commitment Schemes for Multi-Party Computation

Title: TED-LaST: Towards Robust Backdoor Defense Against Adaptive Attacks

Title: Beyond True or False: Retrieval-Augmented Hierarchical Analysis of Nuanced Claims

Title: TaxoAdapt: Aligning LLM-Based Multidimensional Taxonomy Construction to Evolving Research Corpora

Title: PosterCraft: Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework

Title: ObfusBFA: A Holistic Approach to Safeguarding DNNs from Different Types of Bit-Flip Attacks

Title: One Tokenizer To Rule Them All: Emergent Language Plasticity via Multilingual Tokenizers

Title: Different Questions, Different Models: Fine-Grained Evaluation of Uncertainty and Calibration in Clinical QA with LLMs

Title: ME: Trigger Element Combination Backdoor Attack on Copyright Infringement

Title: SlotPi: Physics-informed Object-centric Reasoning Models

Title: Improving Named Entity Transcription with Contextual LLM-based Revision

Title: Human-Robot Navigation using Event-based Cameras and Reinforcement Learning

Title: Mitigating Negative Interference in Multilingual Sequential Knowledge Editing through Null-Space Constraints

Title: Dense Associative Memory with Epanechnikov Energy

Title: Detecting High-Stakes Interactions with Activation Probes

Title: Prompts to Summaries: Zero-Shot Language-Guided Video Summarization

Title: Occlusion-Aware 3D Hand-Object Pose Estimation with Masked AutoEncoders

Title: VideoDeepResearch: Long Video Understanding With Agentic Tool Using

Title: ReCUT: Balancing Reasoning Length and Accuracy in LLMs via Stepwise Trails and Preference Optimization

Title: Efficiency Robustness of Dynamic Deep Learning Systems

Title: Advanced fraud detection using machine learning models: enhancing financial transaction security

Title: Accelerating Diffusion Large Language Models with SlowFast: The Three Golden Principles

Title: Viability of Future Actions: Robust Safety in Reinforcement Learning via Entropy Regularization

Title: Slimming Down LLMs Without Losing Their Minds

Title: Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers

Title: Lattice Climber Attack: Adversarial attacks for randomized mixtures of classifiers

Title: The Diffusion Duality

Title: AIR: Zero-shot Generative Model Adaptation with Iterative Refinement

Title: BioClinical ModernBERT: A State-of-the-Art Long-Context Encoder for Biomedical and Clinical NLP

Title: Beyond Gold Standards: Epistemic Ensemble of LLM Judges for Formal Mathematical Reasoning

Title: NoLoCo: No-all-reduce Low Communication Training Method for Large Models

Title: Foundation Models for Causal Inference via Prior-Data Fitted Networks

Title: M4V: Multi-Modal Mamba for Text-to-Video Generation

Title: Sequential-Parallel Duality in Prefix Scannable Models

Title: Decomposing MLP Activations into Interpretable Features via Semi-Nonnegative Matrix Factorization

Title: Robustly Improving LLM Fairness in Realistic Settings via Interpretability

Title: Developing a High-performance Framework for Speech Emotion Recognition in Naturalistic Conditions Challenge for Emotional Attribute Prediction

Title: Dynamic Epistemic Friction in Dialogue

Title: VINCIE: Unlocking In-context Image Editing from Video

Title: Self-Adapting Language Models

Title: GUARD: Guided Unlearning and Retention via Data Attribution for Large Language Models

Title: Execution Guided Line-by-Line Code Generation

Title: Monitoring Decomposition Attacks in LLMs with Lightweight Sequential Monitors

Title: Build the web for agents, not agents for the web

Title: ReGuidance: A Simple Diffusion Wrapper for Boosting Sample Quality on Hard Inverse Problems

Title: Understanding In-Context Learning on Structured Manifolds: Bridging Attention to Kernel Methods

Title: ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark

Title: SpectralAR: Spectral Autoregressive Visual Generation

Title: MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning

Title: Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs

Title: Farseer: A Refined Scaling Law in Large Language Models

Title: AutoMind: Adaptive Knowledgeable Agent for Automated Data Science

Title: QuadricFormer: Scene as Superquadrics for 3D Semantic Occupancy Prediction

Title: Fine-Grained Perturbation Guidance via Attention Head Selection

Title: SceneCompleter: Dense 3D Scene Completion for Generative Novel View Synthesis

Title: Rethinking Losses for Diffusion Bridge Samplers