2025-08-12

Title: Semi-automated Fact-checking in Portuguese: Corpora Enrichment using Retrieval with Claim extraction

Title: Med-GRIM: Enhanced Zero-Shot Medical VQA using prompt-embedded Multimodal Graph RAG

Title: Retrieval augmented generation based dynamic prompting for few-shot biomedical named entity recognition using large language models

Title: DiTalker: A Unified DiT-based Framework for High-Quality and Speaking Styles Controllable Portrait Animation

Title: Frequency Prior Guided Matching: A Data Augmentation Approach for Generalizable Semi-Supervised Polyp Segmentation

Title: CarbonScaling: Extending Neural Scaling Laws for Carbon Footprint in Large Language Models

Title: Large Language Models Facilitate Vision Reflection in Image Classification

Title: A Framework Combining 3D CNN and Transformer for Video-Based Behavior Recognition

Title: RMT-PPAD: Real-time Multi-task Learning for Panoptic Perception in Autonomous Driving

Title: What Makes "Good" Distractors for Object Hallucination Evaluation in Large Vision-Language Models?

Title: The Art of Breaking Words: Rethinking Multilingual Tokenizer Design

Title: Transfer Learning with EfficientNet for Accurate Leukemia Cell Classification

Title: MILD: Multi-Layer Diffusion Strategy for Complex and Precise Multi-IP Aware Human Erasing

Title: Statistical Confidence Rescoring for Robust 3D Scene Graph Generation from Multi-View Images

Title: Factor Augmented Supervised Learning with Text Embeddings

Title: Slice or the Whole Pie? Utility Control for AI Models

Title: Age-Diverse Deepfake Dataset: Bridging the Age Gap in Deepfake Detection

Title: On the effectiveness of multimodal privileged knowledge distillation in two vision transformer based diagnostic applications

Title: Grounding Emotion Recognition with Visual Prototypes: VEGA -- Revisiting CLIP in MERC

Title: Surformer v1: Transformer-Based Surface Classification Using Tactile and Vision Features

Title: Semi-Supervised Supply Chain Fraud Detection with Unsupervised Pre-Filtering

Title: GFlowNets for Learning Better Drug-Drug Interaction Representations

Title: Discerning minds or generic tutors? Evaluating instructional guidance capabilities in Socratic LLMs

Title: Hypergraph Neural Network with State Space Models for Node Classification

Title: A Federated Learning Framework for Handling Subtype Confounding and Heterogeneity in Large-Scale Neuroimaging Diagnosis

Title: Generative Artificial Intelligence Extracts Structure-Function Relationships from Plants for New Materials

Title: LLM Unlearning Without an Expert Curated Dataset

Title: BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent

Title: Deep Ignorance: Filtering Pretraining Data Builds Tamper-Resistant Safeguards into Open-Weight LLMs

Title: Local Diffusion Models and Phases of Data Distributions

Title: Generalizing Scaling Laws for Dense and Sparse Large Language Models

Title: Train It and Forget It: Merge Lists are Unnecessary for BPE Inference in Language Models

Title: ContextGuard-LVLM: Enhancing News Veracity through Fine-grained Cross-modal Contextual Consistency Verification

Title: VL-MedGuide: A Visual-Linguistic Large Model for Intelligent and Explainable Skin Disease Auxiliary Diagnosis

Title: CycleDiff: Cycle Diffusion Models for Unpaired Image-to-image Translation

Title: Using Imperfect Synthetic Data in Downstream Inference Tasks

Title: Segmented Confidence Sequences and Multi-Scale Adaptive Confidence Segments for Anomaly Detection in Nonstationary Time Series

Title: Rethinking Key-frame-based Micro-expression Recognition: A Robust and Accurate Framework Against Key-frame Errors

Title: Fractal Language Modelling by Universal Sequence Maps (USM)

Title: Privacy-Preserving Tabular Synthetic Data Generation Using TabularARGN

Title: Measuring Stereotype and Deviation Biases in Large Language Models

Title: Towards Robust Red-Green Watermarking for Autoregressive Image Generators

Title: Testing the Limits of Machine Translation from One Book

Title: Do Biased Models Have Biased Thoughts?

Title: Watermarking Kolmogorov-Arnold Networks for Emerging Networked Applications via Activation Perturbation

Title: Stabilizing Federated Learning under Extreme Heterogeneity with HeteRo-Select

Title: Learning More by Seeing Less: Line Drawing Pretraining for Efficient, Transferable, and Human-Aligned Vision

Title: MMFformer: Multimodal Fusion Transformer Network for Depression Detection

Title: Play Favorites: A Statistical Method to Measure Self-Bias in LLM-as-a-Judge

Title: Restage4D: Reanimating Deformable 3D Reconstruction from a Single Video

Title: Large Language Models for Oral History Understanding with Text Classification and Sentiment Analysis

Title: Mitigating Distribution Shift in Graph-Based Android Malware Classification via Function Metadata and LLM Embeddings

Title: Analysis of Schedule-Free Nonconvex Optimization

Title: Many-Turn Jailbreaking

Title: FoundBioNet: A Foundation-Based Model for IDH Genotyping of Glioma from Multi-Parametric MRI

Title: VOccl3D: A Video Benchmark Dataset for 3D Human Pose and Shape Estimation under real Occlusions

Title: SafePLUG: Empowering Multimodal LLMs with Pixel-Level Insight and Temporal Grounding for Traffic Accident Understanding

Title: Fed MobiLLM: Efficient Federated LLM Fine-Tuning over Heterogeneous Mobile Devices via Server Assisted Side-Tuning

Title: PANAMA: A Network-Aware MARL Framework for Multi-Agent Path Finding in Digital Twin Ecosystems

Title: DiffUS: Differentiable Ultrasound Rendering from Volumetric Imaging

Title: Zero-Direction Probing: A Linear-Algebraic Framework for Deep Analysis of Large-Language-Model Drift

Title: PROPS: Progressively Private Self-alignment of Large Language Models

Title: Label Inference Attacks against Federated Unlearning

Title: Towards Practical Data-Dependent Memory-Hard Functions with Optimal Sustained Space Trade-offs in the Parallel Random Oracle Model

Title: Hardness-Aware Dynamic Curriculum Learning for Robust Multimodal Emotion Recognition with Missing Modalities

Title: SEVADE: Self-Evolving Multi-Agent Analysis with Decoupled Evaluation for Hallucination-Resistant Irony Detection

Title: Edge Detection for Organ Boundaries via Top Down Refinement and SubPixel Upsampling

Title: Offline-to-Online Reinforcement Learning with Classifier-Free Diffusion Generation

Title: Annotating Errors in English Learners' Written Language Production: Advancing Automated Written Feedback Systems

Title: Technical Report: Full-Stack Fine-Tuning for the Q Programming Language

Title: DualResolution Residual Architecture with Artifact Suppression for Melanocytic Lesion Segmentation

Title: VesselRW: Weakly Supervised Subcutaneous Vessel Segmentation via Learned Random Walk Propagation

Title: Who's the Evil Twin? Differential Auditing for Undesired Behavior

Title: Low-Rank Expert Merging for Multi-Source Domain Adaptation in Person Re-Identification

Title: Towards Effective Prompt Stealing Attack against Text-to-Image Diffusion Models

Title: Hybrid Machine Learning Framework for Predicting Geometric Deviations from 3D Surface Metrology

Title: A Joint Sparse Self-Representation Learning Method for Multiview Clustering

Title: VSI: Visual Subtitle Integration for Keyframe Selection to enhance Long Video Understanding

Title: Sparsity-Driven Plasticity in Multi-Task Reinforcement Learning

Title: ESNERA: Empirical and semantic named entity alignment for named entity dataset merging

Title: NS-FPN: Improving Infrared Small Target Detection and Segmentation from Noise Suppression Perspective

Title: Score Before You Speak: Improving Persona Consistency in Dialogue Generation using Response Quality Scores

Title: Fusion-Based Brain Tumor Classification Using Deep Learning and Explainable AI, and Rule-Based Reasoning

Title: BASIC: Boosting Visual Alignment with Intrinsic Refined Embeddings in Multimodal Large Language Models

Title: eMotions: A Large-Scale Dataset and Audio-Visual Fusion Network for Emotion Analysis in Short-form Videos

Title: A Simple yet Powerful Instance-Aware Prompting Framework for Training-free Camouflaged Object Segmentation

Title: MultiRef: Controllable Image Generation with Multiple Visual References

Title: MMReID-Bench: Unleashing the Power of MLLMs for Effective and Versatile Person Re-identification

Title: Model-Agnostic Sentiment Distribution Stability Analysis for Robust LLM-Generated Texts Detection

Title: AR-GRPO: Training Autoregressive Image Generation Models via Reinforcement Learning

Title: CannyEdit: Selective Canny Control and Dual-Prompt Guidance for Training-Free Image Editing

Title: Class Unbiasing for Generalization in Medical Diagnosis

Title: AMFT: Aligning LLM Reasoners by Meta-Learning the Optimal Imitation-Exploration Balance

Title: SLRTP2025 Sign Language Production Challenge: Methodology, Results, and Future Work

Title: BoRA: Towards More Expressive Low-Rank Adaptation with Block Diversity

Title: Beyond Frequency: Seeing Subtle Cues Through the Lens of Spatial Decomposition for Fine-Grained Visual Classification

Title: Adversarial Video Promotion Against Text-to-Video Retrieval

Title: Can Multitask Learning Enhance Model Explainability?

Title: Two-Stage Quranic QA via Ensemble Retrieval and Instruction-Tuned Answer Extraction

Title: Rethinking 1-bit Optimization Leveraging Pre-trained Large Language Models

Title: Structure-Preserving Digital Twins via Conditional Neural Whitney Forms

Title: WeatherDiffusion: Weather-Guided Diffusion Model for Forward and Inverse Rendering

Title: UniMove: A Unified Model for Multi-city Human Mobility Prediction

Title: TADoc: Robust Time-Aware Document Image Dewarping

Title: A Comparative Study of Feature Selection in Tsetlin Machines

Title: OctreeNCA: Single-Pass 184 MP Segmentation on Consumer Hardware

Title: S2-UniSeg: Fast Universal Agglomerative Pooling for Scalable Segment Anything without Supervision

Title: Spatio-Temporal Conditional Diffusion Models for Forecasting Future Multiple Sclerosis Lesion Masks Conditioned on Treatments

Title: HiMat: DiT-based Ultra-High Resolution SVBRDF Generation

Title: Vec2Summ: Text Summarization via Probabilistic Sentence Embeddings

Title: DocRefine: An Intelligent Framework for Scientific Document Understanding and Content Optimization based on Multimodal Large Model Agents

Title: MV-CoRe: Multimodal Visual-Conceptual Reasoning for Complex Visual Question Answering

Title: Large Language Model Evaluated Stand-alone Attention-Assisted Graph Neural Network with Spatial and Structural Information Interaction for Precise Endoscopic Image Segmentation

Title: From Imitation to Optimization: A Comparative Study of Offline Learning for Autonomous Driving

Title: Trustworthy Medical Imaging with Large Language Models: A Study of Hallucinations Across Modalities

Title: A Stage-Aware Mixture of Experts Framework for Neurodegenerative Disease Progression Modelling

Title: 3DGS-VBench: A Comprehensive Video Quality Evaluation Benchmark for 3DGS Compression

Title: SPARE: Securing Progressive Web Applications Against Unauthorized Replications

Title: Membership and Memorization in LLM Knowledge Distillation

Title: SEADialogues: A Multilingual Culturally Grounded Multi-turn Dialogue Dataset on Southeast Asian Languages

Title: Surgical Knowledge Rewrite in Compact LLMs: An 'Unlearn-then-Learn' Strategy with ($IA^3$) for Localized Factual Modulation and Catastrophic Forgetting Mitigation

Title: Improving Real-Time Concept Drift Detection using a Hybrid Transformer-Autoencoder Framework

Title: ForeSight: Multi-View Streaming Joint Object Detection and Trajectory Forecasting

Title: BharatBBQ: A Multilingual Bias Benchmark for Question Answering in the Indian Context

Title: ScamDetect: Towards a Robust, Agnostic Framework to Uncover Threats in Smart Contracts

Title: Towards High-Order Mean Flow Generative Models: Feasibility, Expressivity, and Provably Efficient Criteria

Title: Investigating Intersectional Bias in Large Language Models using Confidence Disparities in Coreference Resolution

Title: AugLift: Boosting Generalization in Lifting-based 3D Human Pose Estimation

Title: Approaching Maximal Information Extraction in Low-Signal Regimes via Multiple Instance Learning

Title: From Nodes to Narratives: Explaining Graph Neural Networks with LLMs and Graph Context

Title: Multi-Level Service Performance Forecasting via Spatiotemporal Graph Neural Networks

Title: Pref-GUIDE: Continual Policy Learning from Real-Time Human Feedback via Preference-Based Learning

Title: How Effectively Can Large Language Models Connect SNP Variants and ECG Phenotypes for Cardiovascular Risk Prediction?

Title: Perceptual Evaluation of GANs and Diffusion Models for Generating X-rays

Title: A Stable and Principled Loss Function for Direct Language Model Alignment

Title: Strategic Incentivization for Locally Differentially Private Federated Learning

Title: A Real-Time, Self-Tuning Moderator Framework for Adversarial Prompt Detection

Title: CMAMRNet: A Contextual Mask-Aware Network Enhancing Mural Restoration Through Comprehensive Mask Guidance

Title: Fairness of Automatic Speech Recognition: Looking Through a Philosophical Lens

Title: Intention-Aware Diffusion Model for Pedestrian Trajectory Prediction

Title: SketchAnimator: Animate Sketch via Motion Customization of Text-to-Video Diffusion Models

Title: CoopDiff: Anticipating 3D Human-object Interactions via Contact-consistent Decoupled Diffusion

Title: Large-scale Multi-sequence Pretraining for Generalizable MRI Analysis in Versatile Clinical Applications

Title: Lightweight Multi-Scale Feature Extraction with Fully Connected LMF Layer for Salient Object Detection

Title: EventRR: Event Referential Reasoning for Referring Video Object Segmentation

Title: Gradient Surgery for Safe LLM Fine-Tuning

Title: Omni-SafetyBench: A Benchmark for Safety Evaluation of Audio-Visual Large Language Models

Title: Schema Lineage Extraction at Scale: Multilingual Pipelines, Composite Evaluation, and Language-Model Benchmarks

Title: DySK-Attn: A Framework for Efficient, Real-Time Knowledge Updating in Large Language Models via Dynamic Sparse Knowledge Attention

Title: Understanding NFTs from EIP Standards

Title: Adapting LLMs to Time Series Forecasting via Temporal Heterogeneity Modeling and Semantic Alignment

Title: What One Cannot, Two Can: Two-Layer Transformers Provably Represent Induction Heads on Any-Order Markov Chains

Title: Similarity Matters: A Novel Depth-guided Network for Image Restoration and A New Dataset

Title: Bridging Semantic Logic Gaps: A Cognition-Inspired Multimodal Boundary-Preserving Network for Image Manipulation Localization

Title: Neural Bridge Processes

Title: LLM-based Agents for Automated Confounder Discovery and Subgroup Analysis in Causal Inference

Title: HaDM-ST: Histology-Assisted Differential Modeling for Spatial Transcriptomics Generation

Title: How Does a Deep Neural Network Look at Lexical Stress?

Title: ASM-UNet: Adaptive Scan Mamba Integrating Group Commonalities and Individual Variations for Fine-Grained Segmentation

Title: Causal Negative Sampling via Diffusion Model for Out-of-Distribution Recommendation

Title: Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers

Title: SUIT: Spatial-Spectral Union-Intersection Interaction Network for Hyperspectral Object Tracking

Title: PySeizure: A single machine learning classifier framework to detect seizures in diverse datasets

Title: Fading the Digital Ink: A Universal Black-Box Attack Framework for 3DGS Watermarking Systems

Title: MAQuA: Adaptive Question-Asking for Multidimensional Mental Health Screening using Item Response Theory

Title: Representation Understanding via Activation Maximization

Title: "Pull or Not to Pull?'': Investigating Moral Biases in Leading Large Language Models Across Ethical Dilemmas

Title: Arce: Augmented Roberta with Contextualized Elucidations for Ner in Automated Rule Checking

Title: CCFQA: A Benchmark for Cross-Lingual and Cross-Modal Speech and Text Factuality Evaluation

Title: Revisiting Data Attribution for Influence Functions

Title: SynMatch: Rethinking Consistency in Medical Image Segmentation with Sparse Annotations

Title: BEVANet: Bilateral Efficient Visual Attention Network for Real-Time Semantic Segmentation

Title: DragonFruitQualityNet: A Lightweight Convolutional Neural Network for Real-Time Dragon Fruit Quality Inspection on Mobile Devices

Title: MCITlib: Multimodal Continual Instruction Tuning Library and Benchmark

Title: HealthBranches: Synthesizing Clinically-Grounded Question Answering Datasets via Decision Pathways

Title: DocR1: Evidence Page-Guided GRPO for Multi-Page Document Understanding

Title: RORPCap: Retrieval-based Objects and Relations Prompt for Image Captioning

Title: ObfusQAte: A Proposed Framework to Evaluate LLM Robustness on Obfuscated Factual Question Answering

Title: Efficient Edge LLMs Deployment via HessianAware Quantization and CPU GPU Collaborative

Title: Planner-Refiner: Dynamic Space-Time Refinement for Vision-Language Alignment in Videos

Title: Finite-Time Convergence Analysis of ODE-based Generative Models for Stochastic Interpolants

Title: ProteoKnight: Convolution-based phage virion protein classification and uncertainty analysis

Title: SODiff: Semantic-Oriented Diffusion Model for JPEG Compression Artifacts Removal

Title: GS4Buildings: Prior-Guided Gaussian Splatting for 3D Building Reconstruction

Title: DIP-GS: Deep Image Prior For Gaussian Splatting Sparse View Recovery

Title: Tight Bounds for Schrödinger Potential Estimation in Unpaired Image-to-Image Translation Problems

Title: LET-US: Long Event-Text Understanding of Scenes

Title: ForensicsSAM: Toward Robust and Unified Image Forgery Detection and Localization Resisting to Adversarial Attack

Title: CLUE: Leveraging Low-Rank Adaptation to Capture Latent Uncovered Evidence for Image Forgery Localization

Title: Grounding Multilingual Multimodal LLMs With Cultural Knowledge

Title: Lightning Prediction under Uncertainty: DeepLight with Hazy Loss

Title: Let's Revise Step-by-Step: A Unified Local Search Framework for Code Generation with LLMs

Title: Towards Unveiling Predictive Uncertainty Vulnerabilities in the Context of the Right to Be Forgotten

Title: MOTGNN: Interpretable Graph Neural Networks for Multi-Omics Disease Classification

Title: AURA: A Fine-Grained Benchmark and Decomposed Metric for Audio-Visual Reasoning

Title: Positional Biases Shift as Inputs Approach Context Window Limits

Title: ALOPE: Adaptive Layer Optimization for Translation Quality Estimation using Large Language Models

Title: N-BEATS-MOE: N-BEATS with a Mixture-of-Experts Layer for Heterogeneous Time Series Forecasting

Title: Enhancing Privacy in Decentralized Min-Max Optimization: A Differentially Private Approach

Title: SRAM-based Physically Unclonable Function using Lightweight Hamming-Code Fuzzy Extractor for Energy Harvesting Beat Sensors

Title: From Field to Drone: Domain Drift Tolerant Automated Multi-Species and Damage Plant Semantic Segmentation for Herbicide Trials

Title: Augmenting Bias Detection in LLMs Using Topological Data Analysis

Title: Word Clouds as Common Voices: LLM-Assisted Visualization of Participant-Weighted Themes in Qualitative Interviews

Title: FairDRL-ST: Disentangled Representation Learning for Fair Spatio-Temporal Mobility Prediction

Title: Exploring Multimodal Diffusion Transformers for Enhanced Prompt-based Image Editing

Title: From Trial-and-Error to Improvement: A Systematic Analysis of LLM Exploration Mechanisms in RLVR

Title: Physics-Informed Multimodal Bearing Fault Classification under Variable Operating Conditions using Transfer Learning

Title: Enhanced Generative Structure Prior for Chinese Text Image Super-resolution

Title: A DICOM Image De-identification Algorithm in the MIDI-B Challenge

Title: Domain Generalization of Pathological Image Segmentation by Patch-Level and WSI-Level Contrastive Learning

Title: CoT-Pose: Chain-of-Thought Reasoning for 3D Pose Generation from Abstract Prompts

Title: Adaptive Pseudo Label Selection for Individual Unlabeled Data by Positive and Unlabeled Learning

Title: Decoupled Functional Evaluation of Autonomous Driving Models via Feature Map Quality Scoring

Title: Uncertainty-Driven Reliability: Selective Prediction and Trustworthy Deployment in Modern Machine Learning

Title: Splat4D: Diffusion-Enhanced 4D Gaussian Splatting for Temporally and Spatially Consistent Content Creation

Title: Adaptive Cache Enhancement for Test-Time Adaptation of Vision-Language Models

Title: Towards Theoretical Understanding of Transformer Test-Time Computing: Investigation on In-Context Linear Regression

Title: Exploiting Layer Normalization Fine-tuning in Visual Transformer Foundation Models for Classification

Title: When and how can inexact generative models still sample from the data manifold?

Title: IBPS: Indian Bail Prediction System

Title: From Prediction to Explanation: Multimodal, Explainable, and Interactive Deepfake Detection Framework for Non-Expert Users

Title: LaVieID: Local Autoregressive Diffusion Transformers for Identity-Preserving Video Creation

Title: X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning

Title: A Trustworthy Method for Multimodal Emotion Recognition

Title: Efficient Approximate Posterior Sampling with Annealed Langevin Monte Carlo

Title: Extracting Complex Topology from Multivariate Functional Approximation: Contours, Jacobi Sets, and Ridge-Valley Graphs

Title: Beyond Single: A Data Selection Principle for LLM Alignment via Fine-Grained Preference Signals

Title: Multi-Turn Jailbreaks Are Simpler Than They Seem

Title: LaRender: Training-Free Occlusion Control in Image Generation via Latent Rendering

Title: Collaborative Learning of Scattering and Deep Features for SAR Target Recognition with Noisy Labels

Title: GLiClass: Generalist Lightweight Model for Sequence Classification Tasks

Title: AIS-LLM: A Unified Framework for Maritime Trajectory Prediction, Anomaly Detection, and Collision Risk Assessment with Explainable Forecasting

Title: Semantic Caching for Low-Cost LLM Serving: From Offline Learning to Online Adaptation

Title: Multi-Hop Privacy Propagation for Differentially Private Federated Learning in Social Networks

Title: MORE-CLEAR: Multimodal Offline Reinforcement learning for Clinical notes Leveraged Enhanced State Representation

Title: DiffVC-OSD: One-Step Diffusion-based Perceptual Neural Video Compression Framework

Title: TAR-TVG: Enhancing VLMs with Timestamp Anchor-Constrained Reasoning for Temporal Video Grounding

Title: LoSemB: Logic-Guided Semantic Bridging for Inductive Tool Retrieval

Title: Semantic-Enhanced Time-Series Forecasting via Large Language Models

Title: Make Your MoVe: Make Your 3D Contents by Adapting Multi-View Diffusion Models to External Editing

Title: What am I missing here?: Evaluating Large Language Models for Masked Sentence Prediction

Title: Training-Free ANN-to-SNN Conversion for High-Performance Spiking Transformer

Title: Detecting Mislabeled and Corrupted Data via Pointwise Mutual Information

Title: DoorDet: Semi-Automated Multi-Class Door Detection Dataset via Object Detection and Large Language Models

Title: A Registration-Based Star-Shape Segmentation Model and Fast Algorithms

Title: Robust Reinforcement Learning over Wireless Networks with Homomorphic State Representations

Title: Enhancing Small-Scale Dataset Expansion with Triplet-Connection-based Sample Re-Weighting

Title: Separation and Collaboration: Two-Level Routing Grouped Mixture-of-Experts for Multi-Domain Continual Learning

Title: Chimera: Harnessing Multi-Agent LLMs for Automatic Insider Threat Simulation

Title: Grouped Speculative Decoding for Autoregressive Image Generation

Title: Exploring Causal Effect of Social Bias on Faithfulness Hallucinations in Large Language Models

Title: Correspondence as Video: Test-Time Adaption on SAM2 for Reference Segmentation in the Wild

Title: Sparse Probabilistic Graph Circuits

Title: UniSVG: A Unified Dataset for Vector Graphic Understanding and Generation with Multimodal Large Language Models

Title: Pareto Multi-Objective Alignment for Language Models

Title: Dream4D: Lifting Camera-Controlled I2V towards Spatiotemporally Consistent 4D Generation

Title: Forecasting Continuous Non-Conservative Dynamical Systems in SO(3)

Title: Grove MoE: Towards Efficient and Superior MoE LLMs with Adjugate Experts

Title: Anatomy-Aware Low-Dose CT Denoising via Pretrained Vision Models and Semantic-Guided Contrastive Learning

Title: Boosting Active Defense Persistence: A Two-Stage Defense Framework Combining Interruption and Poisoning Against Deepfake

Title: Power Battery Detection

Title: MambaTrans: Multimodal Fusion Image Translation via Large Language Model Priors for Downstream Visual Tasks

Title: Pose-RFT: Enhancing MLLMs for 3D Pose Generation via Hybrid Action Reinforcement Fine-Tuning

Title: Can You Trick the Grader? Adversarial Persuasion of LLM Judges

Title: Topological Feature Compression for Molecular Graph Neural Networks

Title: EvoCoT: Overcoming the Exploration Bottleneck in Reinforcement Learning

Title: Evaluating Compositional Approaches for Focus and Sentiment Analysis

Title: DiTVR: Zero-Shot Diffusion Transformer for Video Restoration

Title: Segmenting and Understanding: Region-aware Semantic Attention for Fine-grained Image Quality Assessment with Large Language Models

Title: Architectural Co-Design for Zero-Shot Anomaly Detection: Decoupling Representation and Dynamically Fusing Features in CLIP

Title: Evaluating Large Language Models as Expert Annotators

Title: A Comparative Analysis of Lightweight Hash Functions Using AVR ATXMega128 and ChipWhisperer

Title: Learning Satellite Attitude Dynamics with Physics-Informed Normalising Flow

Title: Large Language Models for Czech Aspect-Based Sentiment Analysis

Title: EFU: Enforcing Federated Unlearning via Functional Encryption

Title: Not Yet AlphaFold for the Mind: Evaluating Centaur as a Synthetic Participant

Title: Stand-In: A Lightweight and Plug-and-Play Identity Control for Video Generation

Title: Tailored Emotional LLM-Supporter: Enhancing Cultural Sensitivity

Title: Diffusing the Blind Spot: Uterine MRI Synthesis with Diffusion Models

Title: Generative Video Matting

Title: RSVLM-QA: A Benchmark Dataset for Remote Sensing Vision Language Model-based Question Answering

Title: Safeguarding Generative AI Applications in Preclinical Imaging through Hybrid Anomaly Detection

Title: Score Augmentation for Diffusion Models

Title: Shapley-Inspired Feature Weighting in $k$-means with No Additional Hyperparameters

Title: Expert Preference-based Evaluation of Automated Related Work Generation

Title: Large Language Models for Subjective Language Understanding: A Survey

Title: VOIDFace: A Privacy-Preserving Multi-Network Face Recognition With Enhanced Security

Title: TrackOR: Towards Personalized Intelligent Operating Rooms Through Robust Tracking

Title: Understanding Syntactic Generalization in Structure-inducing Language Models

Title: WeChat-YATT: A Simple, Scalable and Balanced RLHF Trainer

Title: The Escalator Problem: Identifying Implicit Motion Blindness in AI for Accessibility

Title: Prompt-Guided Relational Reasoning for Social Behavior Understanding with Vision Foundation Models

Title: WideSearch: Benchmarking Agentic Broad Info-Seeking

Title: Progressive Depth Up-scaling via Optimal Transport

Title: Communication-Efficient Zero-Order and First-Order Federated Learning Methods over Wireless Networks

Title: Mitigating Biases in Surgical Operating Rooms with Geometry

Title: Robust Anomaly Detection in O-RAN: Leveraging LLMs against Data Manipulation Attacks

Title: IPBA: Imperceptible Perturbation Backdoor Attack in Federated Self-Supervised Learning

Title: Deep Learning-Based Analysis of Power Consumption in Gasoline, Electric, and Hybrid Vehicles

Title: TRIDE: A Text-assisted Radar-Image weather-aware fusion network for Depth Estimation

Title: BadPromptFL: A Novel Backdoor Threat to Prompt-based Federated Learning in Multimodal Models

Title: False Reality: Uncovering Sensor-induced Human-VR Interaction Vulnerability

Title: S^2VG: 3D Stereoscopic and Spatial Video Generation via Denoising Frame Matrix

Title: On Understanding of the Dynamics of Model Capacity in Continual Learning

Title: Investigating the Design Space of Visual Grounding in Multimodal Large Language Model

Title: Fully-Fluctuating Participation in Sleepy Consensus

Title: Information Bottleneck-based Causal Attention for Multi-label Medical Image Recognition

Title: Matrix-3D: Omnidirectional Explorable 3D World Generation

Title: MDD-Net: Multimodal Depression Detection through Mutual Transformer

Title: 3D Plant Root Skeleton Detection and Extraction

Title: Dual Information Speech Language Models for Emotional Conversations

Title: Assessing LLM Text Detection in Educational Contexts: Does Human Contribution Affect Detection?

Title: TBAC-UniImage: Unified Understanding and Generation by Ladder-Side Diffusion Tuning

Title: Grid2Guide: A* Enabled Small Language Model for Indoor Navigation

Title: Hyperspectral Imaging

Title: GRASPTrack: Geometry-Reasoned Association via Segmentation and Projection for Multi-Object Tracking

Title: Vision-Based Localization and LLM-based Navigation for Indoor Environments

Title: MemoryKT: An Integrative Memory-and-Forgetting Method for Knowledge Tracing

Title: A Physics-Driven Neural Network with Parameter Embedding for Generating Quantitative MR Maps from Weighted Images

Title: Czech Dataset for Complex Aspect-Based Sentiment Analysis Tasks

Title: Optimal Transport Regularization for Speech Text Alignment in Spoken Language Models

Title: FantasyStyle: Controllable Stylized Distillation for 3D Gaussian Splatting

Title: MuaLLM: A Multimodal Large Language Model Agent for Circuit Design Assistance with Hybrid Contextual Retrieval-Augmented Generation

Title: Can LLMs Detect Their Confabulations? Estimating Reliability in Uncertainty-Aware Language Models

Title: Data-Efficient Biomedical In-Context Learning: A Diversity-Enhanced Submodular Perspective

Title: Pindrop it! Audio and Visual Deepfake Countermeasures for Robust Detection and Fine Grained-Localization

Title: REX-RAG: Reasoning Exploration with Policy Correction in Retrieval-Augmented Generation

Title: FairFLRep: Fairness aware fault localization and repair of Deep Neural Networks

Title: Federated Learning for Epileptic Seizure Prediction Across Heterogeneous EEG Datasets

Title: ReconDreamer-RL: Enhancing Reinforcement Learning via Diffusion-based Scene Reconstruction

Title: Neural Logic Networks for Interpretable Classification

Title: CD-TVD: Contrastive Diffusion for 3D Super-Resolution with Scarce High-Resolution Time-Varying Data

Title: MedReasoner: Reinforcement Learning Drives Reasoning Grounding from Clinical Thought to Pixel-Level Precision

Title: PP-Motion: Physical-Perceptual Fidelity Evaluation for Human Motion Generation

Title: THAT: Token-wise High-frequency Augmentation Transformer for Hyperspectral Pansharpening

Title: KARMA: Efficient Structural Defect Segmentation via Kolmogorov-Arnold Representation Learning

Title: Reinforcement Learning in Vision: A Survey

Title: Differential Privacy for Regulatory Compliance in Cyberattack Detection on Critical Infrastructure Systems

Title: Efficient Speculative Decoding for Llama at Scale: Challenges and Solutions

Title: Spatial-ORMLLM: Improve Spatial Relation Understanding in the Operating Room with Multimodal Large Language Model

Title: Human-Alignment and Calibration of Inference-Time Uncertainty in Large Language Models

Title: SAEMark: Multi-bit LLM Watermarking with Inference-Time Scaling

Title: Cross-Subject and Cross-Montage EEG Transfer Learning via Individual Tangent Space Alignment and Spatial-Riemannian Feature Fusion

Title: SAGOnline: Segment Any Gaussians Online

Title: Learning User Preferences for Image Generation Model

Title: Multi-head Transformers Provably Learn Symbolic Multi-step Reasoning via Gradient Descent

Title: Capabilities of GPT-5 on Multimodal Medical Reasoning

Title: OMGSR: You Only Need One Mid-timestep Guidance for Real-World Image Super-Resolution

Title: Exploring Safety Alignment Evaluation of LLMs in Chinese Mental Health Dialogues via LLM-as-Judge

Title: Cut2Next: Generating Next Shot via In-Context Tuning

Title: StableAvatar: Infinite-Length Audio-Driven Avatar Video Generation

Title: ReferSplat: Referring Segmentation in 3D Gaussian Splatting