2025-07-01

Title: Robust Perspective Correction for Real-World Crack Evolution Tracking in Image-Based Structural Health Monitoring

Title: Learning Interpretable Rules from Neural Networks: Neurosymbolic AI for Radar Hand Gesture Recognition

Title: Active Learning for Forecasting Severity among Patients with Post Acute Sequelae of SARS-CoV-2

Title: Hierarchical Adversarially-Resilient Multi-Agent Reinforcement Learning for Cyber-Physical Systems Security

Title: EAGLE: Efficient Alignment of Generalized Latent Embeddings for Multimodal Survival Prediction with Interpretable Attribution Analysis

Title: Vision Transformers for Multi-Variable Climate Downscaling: Emulating Regional Climate Models with a Shared Encoder and Multi-Decoder Architecture

Title: Modulated Diffusion: Accelerating Generative Modeling with Modulated Quantization

Title: Hallucination Detection with Small Language Models

Title: PromptAug: Fine-grained Conflict Classification Using Data Augmentation

Title: ViFusionTST: Deep Fusion of Time-Series Image Representations from Load Signals for Early Bed-Exit Prediction

Title: Visual-Semantic Knowledge Conflicts in Operating Rooms: Synthetic Data Curation for Surgical Risk Perception in Multimodal Large Language Models

Title: How Can Multimodal Remote Sensing Datasets Transform Classification via SpatialNet-ViT?

Title: What Makes a Dribble Successful? Insights From 3D Pose Tracking Data

Title: Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection

Title: Weakly Supervised Object Segmentation by Background Conditional Divergence

Title: SABRE-FL: Selective and Accurate Backdoor Rejection for Federated Prompt Learning

Title: AgentStealth: Reinforcing Large Language Model for Anonymizing User-generated Text

Title: FreeDNA: Endowing Domain Adaptation of Diffusion-Based Dense Prediction with Training-Free Domain Noise Alignment

Title: Lightning the Night with Generative Artificial Intelligence

Title: In-context learning for the classification of manipulation techniques in phishing emails

Title: Can "consciousness" be observed from large language model (LLM) internal states? Dissecting LLM representations obtained from Theory of Mind test with Integrated Information Theory and Span Representation analysis

Title: Weak-to-Strong GraphRAG: Aligning Weak Retrievers with Large Language Models for Graph-based Retrieval Augmented Generation

Title: A Survey on Model Extraction Attacks and Defenses for Large Language Models

Title: MetaCipher: A General and Extensible Reinforcement Learning Framework for Obfuscation-Based Jailbreak Attacks on Black-Box LLMs

Title: Unifying Biomedical Vision-Language Expertise: Towards a Generalist Foundation Model via Multi-CLIP Knowledge Distillation

Title: Dual Atrous Separable Convolution for Improving Agricultural Semantic Segmentation

Title: The Hidden Link Between RLHF and Contrastive Learning

Title: LIGHT: Multi-Modal Text Linking on Historical Maps

Title: BrainMT: A Hybrid Mamba-Transformer Architecture for Modeling Long-Range Dependencies in Functional MRI Data

Title: RExBench: Can coding agents autonomously implement AI research extensions?

Title: Are Fast Methods Stable in Adversarially Robust Transfer Learning?

Title: A User-Centric, Privacy-Preserving, and Verifiable Ecosystem for Personal Data Management and Utilization

Title: Temperature Matters: Enhancing Watermark Robustness Against Paraphrasing Attacks

Title: Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning

Title: CaO$_2$: Rectifying Inconsistencies in Diffusion-Based Dataset Distillation

Title: Layer Importance for Mathematical Reasoning is Forged in Pre-Training and Invariant after Post-Training

Title: Fingerprinting SDKs for Mobile Apps and Where to Find Them: Understanding the Market for Device Fingerprinting

Title: VERA: Variational Inference Framework for Jailbreaking Large Language Models

Title: 3D Shape Generation: A Survey

Title: Assessing the feasibility of Large Language Models for detecting micro-behaviors in team interactions during space missions

Title: Mitigating Semantic Collapse in Generative Personalization with a Surprisingly Simple Test-Time Embedding Adjustment

Title: Residual Matrix Transformers: Scaling the Size of the Residual Stream

Title: Text Production and Comprehension by Human and Artificial Intelligence: Interdisciplinary Workshop Report

Title: General Autonomous Cybersecurity Defense: Learning Robust Policies for Dynamic Topologies and Diverse Attackers

Title: FairMarket-RL: LLM-Guided Fairness Shaping for Multi-Agent Reinforcement Learning in Peer-to-Peer Markets

Title: Generalized Linear Mode Connectivity for Transformers

Title: BEST-Route: Adaptive LLM Routing with Test-Time Optimal Compute

Title: Part Segmentation and Motion Estimation for Articulated Objects with Dynamic 3D Gaussians

Title: Kill Two Birds with One Stone! Trajectory enabled Unified Online Detection of Adversarial Examples and Backdoor Attacks

Title: The Translation Barrier Hypothesis: Multilingual Generation with Large Language Models Suffers from Implicit Translation Failure

Title: Convergent Privacy Framework with Contractive GNN Layers for Multi-hop Aggregations

Title: Robust Tensor Completion via Gradient Tensor Nulclear L1-L2 Norm for Traffic Data Recovery

Title: Enhancing Android Malware Detection with Retrieval-Augmented Generation

Title: Degradation-Modeled Multipath Diffusion for Tunable Metalens Photography

Title: RoboPearls: Editable Video Simulation for Robot Manipulation

Title: VSRM: A Robust Mamba-Based Framework for Video Super-Resolution

Title: Multimodal Atmospheric Super-Resolution With Deep Generative Models

Title: PhonemeFake: Redefining Deepfake Realism with Language-Driven Segmental Manipulation and Adaptive Bilevel Detection

Title: Single-Frame Point-Pixel Registration via Supervised Cross-Modal Feature Matching

Title: What's Privacy Good for? Measuring Privacy as a Shield from Harms due to Personal Data Use

Title: ContextCache: Context-Aware Semantic Cache for Multi-Turn Queries in Large Language Models

Title: RGE-GS: Reward-Guided Expansive Driving Scene Reconstruction via Diffusion Priors

Title: Riemannian-Geometric Fingerprints of Generative Models

Title: Intervening in Black Box: Concept Bottleneck Model for Enhancing Human Neural Network Mutual Understanding

Title: Concept Pinpoint Eraser for Text-to-image Diffusion Models via Residual Attention Gate

Title: FreqDGT: Frequency-Adaptive Dynamic Graph Networks with Transformer for Cross-subject EEG Emotion Recognition

Title: MedEthicsQA: A Comprehensive Question Answering Benchmark for Medical Ethics Evaluation of LLMs

Title: BayesLoRA: Task-Specific Uncertainty in Low-Rank Adapters

Title: Selecting and Merging: Towards Adaptable and Scalable Named Entity Recognition with Large Language Models

Title: Unleashing the Multi-View Fusion Potential: Noise Correction in VLM for Open-Vocabulary 3D Scene Understanding

Title: Prompting without Panic: Attribute-aware, Zero-shot, Test-Time Calibration

Title: Listener-Rewarded Thinking in VLMs for Image Preferences

Title: SemFaceEdit: Semantic Face Editing on Generative Radiance Manifolds

Title: FOCUS: Fine-grained Optimization with Semantic Guided Understanding for Pedestrian Attributes Recognition

Title: xLSTMAD: A Powerful xLSTM-based Method for Anomaly Detection

Title: AG-VPReID 2025: Aerial-Ground Video-based Person Re-identification Challenge Results

Title: Quantum Neural Networks for Wind Energy Forecasting: A Comparative Study of Performance and Scalability with Classical Models

Title: Boosting CTC-Based ASR Using LLM-Based Intermediate Loss Regularization

Title: DMD-Net: Deep Mesh Denoising Network

Title: Knowledge Augmented Finetuning Matters in both RAG and Agent Based Dialog Systems

Title: DICE-BENCH: Evaluating the Tool-Use Capabilities of Large Language Models in Multi-Round, Multi-Party Dialogues

Title: Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval

Title: Region-Aware CAM: High-Resolution Weakly-Supervised Defect Segmentation via Salient Region Perception

Title: STR-Match: Matching SpatioTemporal Relevance Score for Training-Free Video Editing

Title: P$^2$U: Progressive Precision Update For Efficient Model Distribution

Title: Decoupled Seg Tokens Make Stronger Reasoning Video Segmenter and Grounder

Title: CP-Guard: A Unified, Probability-Agnostic, and Adaptive Framework for Malicious Agent Detection and Defense in Multi-Agent Embodied Perception Systems

Title: Interpretable Time Series Autoregression for Periodicity Quantification

Title: Neural Cellular Automata: From Cells to Pixels

Title: Point Cloud Compression and Objective Quality Assessment: A Survey

Title: MagShield: Towards Better Robustness in Sparse Inertial Motion Capture Under Magnetic Disturbances

Title: Attention to Burstiness: Low-Rank Bilinear Prompt Tuning

Title: Towards Time Series Generation Conditioned on Unstructured Natural Language

Title: Towards Explainable Bilingual Multimodal Misinformation Detection and Localization

Title: Efficient Cybersecurity Assessment Using SVM and Fuzzy Evidential Reasoning for Resilient Infrastructure

Title: A Study on Semi-Supervised Detection of DDoS Attacks under Class Imbalance

Title: Infinite Sampling: Efficient and Stable Grouped RL Training for Large Language Models

Title: YM-WML: A new Yolo-based segmentation Model with Weighted Multi-class Loss for medical imaging

Title: Agent-to-Agent Theory of Mind: Testing Interlocutor Awareness among Large Language Models

Title: Peccavi: Visual Paraphrase Attack Safe and Distortion Free Image Watermarking Technique for AI-Generated Images

Title: ActAlign: Zero-Shot Fine-Grained Video Classification via Language-Guided Sequence Alignment

Title: A Systematic Study of Compositional Syntactic Transformer Language Models

Title: Probabilistic Prototype Calibration of Vision-Language Models for Generalized Few-shot Semantic Segmentation

Title: Revisiting CroPA: A Reproducibility Study and Enhancements for Cross-Prompt Adversarial Transferability in Vision-Language Models

Title: Cybersecurity-Focused Anomaly Detection in Connected Autonomous Vehicles Using Machine Learning

Title: A Reinforcement Learning Approach for Optimal Control in Microgrids

Title: A Novel Frame Identification and Synchronization Technique for Smartphone Visible Light Communication Systems Based on Convolutional Neural Networks

Title: MusiXQA: Advancing Visual Music Understanding in Multimodal Large Language Models

Title: Spectra 1.1: Scaling Laws and Efficient Inference for Ternary Language Models

Title: Feature-Wise Mixing for Mitigating Contextual Bias in Predictive Supervised Learning

Title: Fragile, Robust, and Antifragile: A Perspective from Parameter Responses in Reinforcement Learning Under Stress

Title: Inpainting is All You Need: A Diffusion-based Augmentation Method for Semi-supervised Medical Image Segmentation

Title: ReMem: Mutual Information-Aware Fine-tuning of Pretrained Vision Transformers for Effective Knowledge Distillation

Title: Ovis-U1 Technical Report

Title: Equivalence Classes in AES -- Part 1

Title: Double-Diffusion: Diffusion Conditioned Diffusion Probabilistic Model For Air Quality Prediction

Title: Measuring How LLMs Internalize Human Psychological Concepts: A preliminary analysis

Title: Boosting LLM's Molecular Structure Elucidation with Knowledge Enhanced Tree Search Reasoning

Title: CoreMark: Toward Robust and Universal Text Watermarking Technique

Title: Curious Causality-Seeking Agents Learn Meta Causal World

Title: Learning Counterfactually Decoupled Attention for Open-World Model Attribution

Title: Frequency-enhanced Multi-granularity Context Network for Efficient Vertebrae Segmentation

Title: Where, What, Why: Towards Explainable Driver Attention Prediction

Title: From Individuals to Interactions: Benchmarking Gender Bias in Multimodal Large Language Models from the Lens of Social Relationship

Title: DC-TTA: Divide-and-Conquer Framework for Test-Time Adaptation of Interactive Segmentation

Title: FairI Tales: Evaluation of Fairness in Indian Contexts with a Focus on Bias and Stereotypes

Title: MoCa: Modality-aware Continual Pre-training Makes Better Bidirectional Multimodal Embeddings

Title: Enhancing Spatial Reasoning in Multimodal Large Language Models through Reasoning-based Segmentation

Title: Decoding Memes: Benchmarking Narrative Role Classification across Multilingual and Multimodal Models

Title: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning

Title: Dare to Plagiarize? Plagiarized Painting Recognition and Retrieval

Title: Format-Adapter: Improving Reasoning Capability of LLMs by Adapting Suitable Format

Title: LLM-Assisted Question-Answering on Technical Documents Using Structured Data-Aware Retrieval Augmented Generation

Title: VisualPrompter: Prompt Optimization with Visual Feedback for Text-to-Image Synthesis

Title: Forget-MI: Machine Unlearning for Forgetting Multimodal Information in Healthcare Settings

Title: Learning-to-Context Slope: Evaluating In-Context Learning Effectiveness Beyond Performance Illusions

Title: V-SYNTHESIS: Task-Agnostic Synthesis of Consistent and Diverse In-Context Demonstrations from Scratch via V-Entropy

Title: MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow Estimation

Title: Dynamic View Synthesis from Small Camera Motion Videos

Title: Self-Supervised Contrastive Learning for Multi-Label Images

Title: Mirror Descent Policy Optimisation for Robust Constrained Markov Decision Processes

Title: Data Can Speak for Itself: Quality-guided Utilization of Wireless Synthetic Data

Title: Attribution assignment for deep-generative sequence models enables interpretability analysis using positive-only data

Title: A Practical and Secure Byzantine Robust Aggregator

Title: Trident: Detecting Face Forgeries with Adversarial Triplet Learning

Title: External Data-Enhanced Meta-Representation for Adaptive Probabilistic Load Forecasting

Title: Transformer-Based Person Search with High-Frequency Augmentation and Multi-Wave Mixing

Title: BridgeShape: Latent Diffusion Schrödinger Bridge for 3D Shape Completion

Title: TVG-SLAM: Robust Gaussian Splatting SLAM with Tri-view Geometric Constraints

Title: FedRef: Communication-Efficient Bayesian Fine Tuning with Reference Model

Title: UrbanLLaVA: A Multi-modal Large Language Model for Urban Intelligence with Spatial Reasoning and Understanding

Title: Masked Gated Linear Unit

Title: High-quality Pseudo-labeling for Point Cloud Segmentation with Scene-level Annotation

Title: Generalist Reward Models: Found Inside Large Language Models

Title: VolumetricSMPL: A Neural Volumetric Body Model for Efficient Interactions, Contacts, and Collisions

Title: Aggregating Local Saliency Maps for Semi-Global Explainable Image Classification

Title: DGE-YOLO: Dual-Branch Gathering and Attention for Accurate UAV Object Detection

Title: PixelBoost: Leveraging Brownian Motion for Realistic-Image Super-Resolution

Title: From Prompt Injections to Protocol Exploits: Threats in LLM-Powered AI Agents Workflows

Title: Causal-Entity Reflected Egocentric Traffic Accident Video Synthesis

Title: Token Activation Map to Visually Explain Multimodal LLMs

Title: Mettle: Meta-Token Learning for Memory-Efficient Audio-Visual Adaptation

Title: Why Settle for One? Text-to-ImageSet Generation and Evaluation

Title: Autoregressive Denoising Score Matching is a Good Video Anomaly Detector

Title: Hierarchical Quantized Diffusion Based Tree Generation Method for Hierarchical Representation and Lineage Analysis

Title: Two Spelling Normalization Approaches Based on Large Language Models

Title: DDL: A Dataset for Interpretable Deepfake Detection and Localization in Real-World Scenarios

Title: Objective-Free Local Learning and Emergent Language Structure in Thinking Machines

Title: Threshold Signatures for Central Bank Digital Currencies

Title: DiffFit: Disentangled Garment Warping and Texture Refinement for Virtual Try-On

Title: Securing AI Systems: A Guide to Known Attacks and Impacts

Title: Endo-4DGX: Robust Endoscopic Scene Reconstruction and Illumination Correction with Gaussian Splatting

Title: Interpretable by Design: MH-AutoML for Transparent and Efficient Android Malware Detection without Compromising Performance

Title: FastSeg: Efficient Training-Free Open-Vocabulary Segmentation via Hierarchical Attention Refinement Method

Title: IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering

Title: VALID-Mol: a Systematic Framework for Validated LLM-Assisted Molecular Design

Title: Information Loss in LLMs' Multilingual Translation: The Role of Training Data, Language Proximity, and Language Family

Title: ATGen: A Framework for Active Text Generation

Title: CycleVAR: Repurposing Autoregressive Model for Unsupervised One-Step Image Translation

Title: A case for data valuation transparency via DValCards

Title: GeoProg3D: Compositional Visual Reasoning for City-Scale 3D Language Fields

Title: Layer Decomposition and Morphological Reconstruction for Task-Oriented Infrared Image Enhancement

Title: Federated Timeline Synthesis: Scalable and Private Methodology For Model Training and Deployment

Title: OmniVCus: Feedforward Subject-driven Video Customization with Multimodal Control Conditions

Title: When Additive Noise Meets Unobserved Mediators: Bivariate Denoising Diffusion for Causal Discovery

Title: Perspective Dial: Measuring Perspective of Text and Guiding LLM Outputs

Title: Hierarchical Memory Organization for Wikipedia Generation

Title: Do LLMs Dream of Discrete Algorithms?

Title: Datasets for Fairness in Language Models: An In-Depth Survey

Title: BenchMake: Turn any scientific data set into a reproducible benchmark

Title: TuCo: Measuring the Contribution of Fine-Tuning to Individual Responses of LLMs

Title: Accurate Parameter-Efficient Test-Time Adaptation for Time Series Forecasting

Title: Pipelined Decoder for Efficient Context-Aware Text Generation

Title: PathDiff: Histopathology Image Synthesis with Unpaired Text and Mask Conditions

Title: Enhancing Insider Threat Detection Using User-Based Sequencing and Transformer Encoders

Title: Contrastive Learning with Diffusion Features for Weakly Supervised Medical Image Segmentation

Title: Time-variant Image Inpainting via Interactive Distribution Transition Estimation

Title: Can We Predict the Unpredictable? Leveraging DisasterNet-LLM for Multimodal Disaster Classification

Title: What to Keep and What to Drop: Adaptive Table Filtering Framework

Title: Sanitizing Manufacturing Dataset Labels Using Vision-Language Models

Title: AdFair-CLIP: Adversarial Fair Contrastive Language-Image Pre-training for Chest X-rays

Title: Interactive Interface For Semantic Segmentation Dataset Synthesis

Title: A Large-Scale Evolvable Dataset for Model Context Protocol Ecosystem and Security Analysis

Title: Evaluation of Geolocation Capabilities of Multimodal Large Language Models and Analysis of Associated Privacy Risks

Title: MTADiffusion: Mask Text Alignment Diffusion Model for Object Inpainting

Title: Thought-Augmented Planning for LLM-Powered Interactive Recommender Agent

Title: Qwen-GUI-3B: A Lightweight Vision-Language Model for Cross-Resolution GUI Grounding

Title: Sample Margin-Aware Recalibration of Temperature Scaling

Title: LLM-enhanced Action-aware Multi-modal Prompt Tuning for Image-Text Matching

Title: Improve Underwater Object Detection through YOLOv12 Architecture and Physics-informed Augmentation

Title: Reinforcement Fine-Tuning Enables MLLMs Learning Novel Tasks Stably

Title: ViewPoint: Panoramic Video Generation with Pretrained Diffusion Models

Title: FedWSQ: Efficient Federated Learning with Weight Standardization and Distribution-Aware Non-Uniform Quantization

Title: WAVE: Warp-Based View Guidance for Consistent Novel View Synthesis Using a Single Image

Title: Lightweight Temporal Transformer Decomposition for Federated Autonomous Driving

Title: NEU-ESC: A Comprehensive Vietnamese dataset for Educational Sentiment analysis and topic Classification toward multitask learning

Title: On Recipe Memorization and Creativity in Large Language Models: Is Your Model a Creative Cook, a Bad Cook, or Merely a Plagiator?

Title: Uncertainty-aware Diffusion and Reinforcement Learning for Joint Plane Localization and Anomaly Diagnosis in 3D Ultrasound

Title: Consistent Time-of-Flight Depth Denoising via Graph-Informed Geometric Attention

Title: Pyramidal Patchification Flow for Visual Generation

Title: A unified framework on the universal approximation of transformer-type architectures

Title: JAM-Flow: Joint Audio-Motion Synthesis with Flow Matching

Title: Metadata, Wavelet, and Time Aware Diffusion Models for Satellite Image Super Resolution

Title: Event-based Tiny Object Detection: A Benchmark Dataset and Baseline

Title: StackCLIP: Clustering-Driven Stacked Prompt in Zero-Shot Industrial Anomaly Detection

Title: Dataset Distillation via Vision-Language Category Prototype

Title: PBCAT: Patch-based composite adversarial training against physically realizable attacks on object detection

Title: Detect \& Score: Privacy-Preserving Misbehaviour Detection and Contribution Evaluation in Federated Learning

Title: Transition Matching: Scalable and Flexible Generative Modeling

Title: CAI: Caption-Sensitive Attention Intervention for Mitigating Object Hallucination in Large Vision-Language Models

Title: Cybersecurity AI: The Dangerous Gap Between Automation and Autonomy

Title: When Will It Fail?: Anomaly to Prompt for Forecasting Future Anomalies in Time Series

Title: Semantic-guided Diverse Decoding for Large Language Model

Title: SoK: Semantic Privacy in Large Language Models

Title: AI-Generated Lecture Slides for Improving Slide Element Detection and Retrieval

Title: SG-LDM: Semantic-Guided LiDAR Generation via Latent-Aligned Diffusion

Title: PGOV3D: Open-Vocabulary 3D Semantic Segmentation with Partial-to-Global Curriculum

Title: Evaluating the Simulation of Human Personality-Driven Susceptibility to Misinformation with LLMs

Title: AttentionGS: Towards Initialization-Free 3D Gaussian Splatting via Structural Attention

Title: TurboVSR: Fantastic Video Upscalers and Where to Find Them

Title: Privacy-Preserving Federated Learning Scheme with Mitigating Model Poisoning Attacks: Vulnerabilities and Countermeasures

Title: Revisiting Audio-Visual Segmentation with Vision-Centric Transformer

Title: A Nonlinear Low-rank Representation Model with Convolutional Neural Network for Imputing Water Quality Data

Title: Blending Concepts with Text-to-Image Diffusion Models

Title: gMBA: Expression Semantic Guided Mixed Boolean-Arithmetic Deobfuscation Using Transformer Architectures

Title: Unified Multimodal Understanding via Byte-Pair Visual Encoding

Title: VAP-Diffusion: Enriching Descriptions with MLLMs for Enhanced Medical Image Generation

Title: MReg: A Novel Regression Model with MoE-based Video Feature Mining for Mitral Regurgitation Diagnosis

Title: Towards Markerless Intraoperative Tracking of Deformable Spine Tissue

Title: Robustness of Misinformation Classification Systems to Adversarial Examples Through BeamAttack

Title: Zero-Shot Contextual Embeddings via Offline Synthetic Corpus Generation

Title: On the Domain Robustness of Contrastive Vision-Language Models

Title: L0: Reinforcement Learning to Become General Agents

Title: Pruning by Block Benefit: Exploring the Properties of Vision Transformer Blocks during Domain Adaptation

Title: A Unified Framework for Stealthy Adversarial Generation via Latent Optimization and Transferability Enhancement

Title: Learning Modular Exponentiation with Transformers

Title: Not quite a piece of CHERI-cake: Are new digital security by design architectures usable?

Title: Threadbox: Sandboxing for Modular Security

Title: SynMotion: Semantic-Visual Adaptation for Motion Customized Video Generation

Title: Single Image Test-Time Adaptation via Multi-View Co-Training

Title: Subjective Camera: Bridging Human Cognition and Visual Reconstruction through Sequence-Aware Sketch-Guided Diffusion

Title: System-Embedded Diffusion Bridge Models

Title: Proteus-ID: ID-Consistent and Motion-Coherent Video Customization

Title: Radioactive Watermarks in Diffusion and Autoregressive Image Generative Models

Title: AutoEvoEval: An Automated Framework for Evolving Close-Ended LLM Evaluation Data

Title: Positional Bias in Binary Question Answering: How Uncertainty Shapes Model Preferences

Title: Can We Challenge Open-Vocabulary Object Detectors with Generated Content in Street Scenes?

Title: Model-driven Stochastic Trace Clustering

Title: Mamba-FETrack V2: Revisiting State Space Model for Frame-Event based Visual Object Tracking

Title: Controllable Reference-Based Real-World Remote Sensing Image Super-Resolution with Generative Diffusion Priors

Title: MadCLIP: Few-shot Medical Anomaly Detection with CLIP

Title: Breaking Out from the TESSERACT: Reassessing ML-based Malware Detection under Spatio-Temporal Drift

Title: Interpretable Zero-Shot Learning with Locally-Aligned Vision-Language Model

Title: Flash-VStream: Efficient Real-Time Understanding for Long Video Streams

Title: Low-latency vision transformers via large-scale multi-head attention

Title: PointSSIM: A novel low dimensional resolution invariant image-to-image comparison metric

Title: Refine Any Object in Any Scene

Title: An ontological lens on attack trees: Toward adequacy and interoperability

Title: Use Sparse Autoencoders to Discover Unknown Concepts, Not to Act on Known Concepts

Title: Differentially Private Synthetic Data Release for Topics API Outputs

Title: VMoBA: Mixture-of-Block Attention for Video Diffusion Models

Title: Exploring Privacy and Security as Drivers for Environmental Sustainability in Cloud-Based Office Solutions

Title: Chain of Thought in Order: Discovering Learning-Friendly Orders for Arithmetic

Title: Spurious-Aware Prototype Refinement for Reliable Out-of-Distribution Detection

Title: Advancing Multi-Step Mathematical Reasoning in Large Language Models through Multi-Layered Self-Reflection with Auto-Prompting

Title: GroundingDINO-US-SAM: Text-Prompted Multi-Organ Segmentation in Ultrasound with LoRA-Tuned Vision-Language Models

Title: RawMal-TF: Raw Malware Dataset Labeled by Type and Family

Title: Three-dimensional end-to-end deep learning for brain MRI analysis

Title: The Trilemma of Truth in Large Language Models

Title: IMPACT: Inflectional Morphology Probes Across Complex Typologies

Title: Leveraging the Potential of Prompt Engineering for Hate Speech Detection in Low-Resource Languages

Title: Graft: Integrating the Domain Knowledge via Efficient Parameter Synergy for MLLMs

Title: Unveiling Decision-Making in LLMs for Text Classification : Extraction of influential and interpretable concepts with Sparse Autoencoders

Title: Bridging the Gap with Retrieval-Augmented Generation: Making Prosthetic Device User Manuals Available in Marginalised Languages

Title: ADReFT: Adaptive Decision Repair for Safe Autonomous Driving via Reinforcement Fine-Tuning

Title: Evaluating the Impact of Khmer Font Types on Text Recognition

Title: UMA: A Family of Universal Models for Atoms

Title: Visual and Memory Dual Adapter for Multi-Modal Object Tracking

Title: Toward Simple and Robust Contrastive Explanations for Image Classification by Leveraging Instance Similarity and Concept Relevance

Title: A Scalable Approach for Safe and Robust Learning via Lipschitz-Constrained Networks

Title: LLM Agents Are the Antidote to Walled Gardens

Title: TaP: A Taxonomy-Guided Framework for Automated and Scalable Preference Data Generation

Title: Lock Prediction for Zero-Downtime Database Encryption

Title: Auto-TA: Towards Scalable Automated Thematic Analysis (TA) via Multi-Agent Large Language Models with Reinforcement Learning

Title: The Illusion of Progress? A Critical Look at Test-Time Adaptation for Vision-Language Models

Title: Large Language Models Don't Make Sense of Word Problems. A Scoping Review from a Mathematics Education Perspective

Title: EXPERT: An Explainable Image Captioning Evaluation Metric with Structured Explanations

Title: Poisoning Attacks to Local Differential Privacy for Ranking Estimation

Title: Foundation Models for Zero-Shot Segmentation of Scientific Images without AI-Ready Data

Title: Faster Diffusion Models via Higher-Order Approximation

Title: A Survey on Vision-Language-Action Models for Autonomous Driving

Title: Logit-Gap Steering: Efficient Short-Suffix Jailbreaks for Aligned Large Language Models

Title: Continual Adaptation: Environment-Conditional Parameter Generation for Object Detection in Dynamic Scenarios

Title: STACK: Adversarial Attacks on LLM Safeguard Pipelines

Title: Imagine for Me: Creative Conceptual Blending of Real Images and Text via Blended Attention

Title: MotionGPT3: Human Motion as a Second Modality

Title: WaRA: Wavelet Low Rank Adaptation

Title: Development of Hybrid Artificial Intelligence Training on Real and Synthetic Data: Benchmark on Two Mixed Training Strategies

Title: MILo: Mesh-In-the-Loop Gaussian Splatting for Detailed and Efficient Surface Reconstruction

Title: DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World

Title: Epona: Autoregressive Diffusion World Model for Autonomous Driving

Title: Computational Detection of Intertextual Parallels in Biblical Hebrew: A Benchmark Study Using Transformer-Based Language Models

Title: Data Uniformity Improves Training Efficiency and More, with a Convergence Framework Beyond the NTK Regime

Title: TextMesh4D: High-Quality Text-to-4D Mesh Generation

Title: Calligrapher: Freestyle Text Image Customization

Title: Teaching Time Series to See and Speak: Forecasting with Aligned Visual and Textual Perspectives