2025-03-17

Title: Text2Zinc: A Cross-Domain Dataset for Modeling Optimization and Satisfaction Problems in MiniZinc

Title: The Reliability of LLMs for Medical Diagnosis: An Examination of Consistency, Manipulation, and Contextual Awareness

Title: Hate Speech and Sentiment of YouTube Video Comments From Public and Private Sources Covering the Israel-Palestine Conflict

Title: AI Enabled User-Specific Cyberbullying Severity Detection with Explainability

Title: Evaluating Local and Cloud-Based Large Language Models for Simulating Consumer Choices in Energy Stated Preference Surveys

Title: Video Anomaly Detection with Structured Keywords

Title: Improving RAG Retrieval via Propositional Content Extraction: a Speech Act Theory Approach

Title: Language modelling techniques for analysing the impact of human genetic variation

Title: RouterEval: A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in LLMs

Title: LimTopic: LLM-based Topic Modeling and Text Summarization for Analyzing Scientific Articles limitations

Title: MARRO: Multi-headed Attention for Rhetorical Role Labeling in Legal Documents

Title: Text-to-3D Generation using Jensen-Shannon Score Distillation

Title: CeTAD: Towards Certified Toxicity-Aware Distance in Vision Language Models

Title: Evaluation of the Automated Labeling Method for Taxonomic Nomenclature Through Prompt-Optimized Large Language Model

Title: Semantic Wave Functions: Exploring Meaning in Large Language Models Through Quantum Formalism

Title: Small Vision-Language Models: A Survey on Compact Architectures and Techniques

Title: Green Prompting

Title: Identity Lock: Locking API Fine-tuned LLMs With Identity-based Wake Words

Title: UC-MOA: Utility-Conditioned Multi-Objective Alignment for Distributional Pareto-Optimality

Title: ZeroSumEval: An Extensible Framework For Scaling LLM Evaluation with Inter-Model Competition

Title: Enhancing Retrieval for ESGLLM via ESG-CID -- A Disclosure Content Index Finetuning Dataset for Mapping GRI and ESRS

Title: Beyond One-Size-Fits-All Summarization: Customizing Summaries for Diverse Users

Title: Fine-Tuning LLMs for Report Summarization: Analysis on Supervised and Unsupervised Data

Title: A Survey on Knowledge-Oriented Retrieval-Augmented Generation

Title: VRMDiff: Text-Guided Video Referring Matting Generation of Diffusion

Title: End-to-end Learning of Sparse Interventions on Activations to Steer Generation

Title: Understanding the Quality-Diversity Trade-off in Diffusion Language Models

Title: Open-World Skill Discovery from Unsegmented Demonstrations

Title: VFM-UDA++: Improving Network Architectures and Data Strategies for Unsupervised Domain Adaptive Semantic Segmentation

Title: MaskAttn-UNet: A Mask Attention-Driven Framework for Universal Low-Resolution Image Segmentation

Title: Context-guided Responsible Data Augmentation with Diffusion Models

Title: Learning to Contextualize Web Pages for Enhanced Decision Making by LLM Agents

Title: Battling Misinformation: An Empirical Study on Adversarial Factuality in Open-Source Large Language Models

Title: Reasoning is All You Need for Video Generalization: A Counterfactual Benchmark with Sub-question Evaluation

Title: Knowledge Consultation for Semi-Supervised Semantic Segmentation

Title: Medical Large Language Model Benchmarks Should Prioritize Construct Validity

Title: Zero-Shot Subject-Centric Generation for Creative Application Using Entropy Fusion

Title: TA-V2A: Textually Assisted Video-to-Audio Generation

Title: ClaimTrust: Propagation Trust Scoring for RAG Systems

Title: Harmonizing Large Language Models with Collaborative Behavioral Signals for Conversational Recommendation

Title: Error Analyses of Auto-Regressive Video Diffusion Models: A Unified Framework

Title: CALLM: Context-Aware Emotion Analysis in Cancer Survivors Using LLMs and Retrieval-Augmented Mobile Diaries

Title: ZeroMerge: Parameter-Free KV Cache Compression for Memory-Efficient Long-Context LLMs

Title: Team NYCU at Defactify4: Robust Detection and Source Identification of AI-Generated Images Using CNN and CLIP-Based Models

Title: Long-Video Audio Synthesis with Multi-Agent Collaboration

Title: TacticExpert: Spatial-Temporal Graph Language Model for Basketball Tactics

Title: RankPO: Preference Optimization for Job-Talent Matching

Title: Real-time Pollutant Identification through Optical PM Micro-Sensor

Title: Samoyeds: Accelerating MoE Models with Structured Sparsity Leveraging Sparse Tensor Cores

Title: Prototype-Guided Cross-Modal Knowledge Enhancement for Adaptive Survival Prediction

Title: Word-level Annotation of GDPR Transparency Compliance in Privacy Policies using Large Language Models

Title: DarkBench: Benchmarking Dark Patterns in Large Language Models

Title: Numerical and statistical analysis of NeuralODE with Runge-Kutta time integration

Title: Leveraging Vision-Language Embeddings for Zero-Shot Learning in Histopathology Images

Title: Visual Polarization Measurement Using Counterfactual Image Generation

Title: Subnet-Aware Dynamic Supernet Training for Neural Architecture Search

Title: Predicting Treatment Response in Body Dysmorphic Disorder with Interpretable Machine Learning

Title: Clothes-Changing Person Re-identification Based On Skeleton Dynamics

Title: HeightFormer: Learning Height Prediction in Voxel Features for Roadside Vision Centric 3D Object Detection via Transformer

Title: The Power of One: A Single Example is All it Takes for Segmentation in VLMs

Title: Byzantine-Resilient Federated Learning via Distributed Optimization

Title: HALURust: Exploiting Hallucinations of Large Language Models to Detect Vulnerabilities in Rust

Title: Fixed-Point RNNs: From Diagonal to Dense in a Few Iterations

Title: Attacking Multimodal OS Agents with Malicious Image Patches

Title: Thinking Machines: A Survey of LLM based Reasoning Strategies

Title: Dual Codebook VQ: Enhanced Image Reconstruction with Reduced Codebook Size

Title: Who Relies More on World Knowledge and Bias for Syntactic Ambiguity Resolution: Humans or LLMs?

Title: WAFFLED: Exploiting Parsing Discrepancies to Bypass Web Application Firewalls

Title: Towards Efficient Large Scale Spatial-Temporal Time Series Forecasting via Improved Inverted Transformers

Title: RI3D: Few-Shot Gaussian Splatting With Repair and Inpainting Diffusion Priors

Title: TAIJI: Textual Anchoring for Immunizing Jailbreak Images in Vision Language Models

Title: Convolutional Rectangular Attention Module

Title: SCE: Scalable Consistency Ensembles Make Blackbox Large Language Model Generation More Reliable

Title: Taxonomic Reasoning for Rare Arthropods: Combining Dense Image Captioning and RAG for Interpretable Classification

Title: HyperDAS: Towards Automating Mechanistic Interpretability with Hypernetworks

Title: Memory-Efficient 3D High-Resolution Medical Image Synthesis Using CRF-Guided GANs

Title: PolyRoof: Precision Roof Polygonization in Urban Residential Building with Graph Neural Networks

Title: OASST-ETC Dataset: Alignment Signals from Eye-tracking Analysis of LLM Responses

Title: Multi-Domain Biometric Recognition using Body Embeddings

Title: ChatGPT Encounters Morphing Attack Detection: Zero-Shot MAD with Multi-Modal Large Language Models and General Vision Models

Title: Phishsense-1B: A Technical Perspective on an AI-Powered Phishing Detection Model

Title: $(\varepsilon, δ)$ Considered Harmful: Best Practices for Reporting Differential Privacy Guarantees

Title: Predicting Stock Movement with BERTweet and Transformers

Title: OuroMamba: A Data-Free Quantization Framework for Vision Mamba Models

Title: FedOSAA: Improving Federated Learning with One-Step Anderson Acceleration

Title: From Dionysius Emerges Apollo -- Learning Patterns and Abstractions from Perceptual Sequences

Title: Unlocking Open-Set Language Accessibility in Vision Models

Title: Rethinking Rotation-Invariant Recognition of Fine-grained Shapes from the Perspective of Contour Points

Title: TigerLLM -- A Family of Bangla Large Language Models

Title: Taming Knowledge Conflicts in Language Models

Title: RONA: Pragmatically Diverse Image Captioning with Coherence Relations

Title: VA-AR: Learning Velocity-Aware Action Representations with Mixture of Window Attention

Title: Comparative Analysis of Advanced AI-based Object Detection Models for Pavement Marking Quality Assessment during Daytime

Title: Deep Incomplete Multi-view Clustering with Distribution Dual-Consistency Recovery Guidance

Title: EmoDiffusion: Enhancing Emotional 3D Facial Animation with Latent Diffusion Models

Title: FMNet: Frequency-Assisted Mamba-Like Linear Attention Network for Camouflaged Object Detection

Title: Weakly Supervised Contrastive Adversarial Training for Learning Robust Features from Semi-supervised Data

Title: ACMo: Attribute Controllable Motion Generation

Title: InverseBench: Benchmarking Plug-and-Play Diffusion Priors for Inverse Problems in Physical Sciences

Title: PSF-4D: A Progressive Sampling Framework for View Consistent 4D Editing

Title: Measuring Similarity in Causal Graphs: A Framework for Semantic and Structural Analysis

Title: Towards Privacy-preserved Pre-training of Remote Sensing Foundation Models with Federated Mutual-guidance Learning

Title: Flow to the Mode: Mode-Seeking Diffusion Autoencoders for State-of-the-Art Image Tokenization

Title: BannerAgency: Advertising Banner Design with Multimodal LLM Agents

Title: Generative Modelling for Mathematical Discovery

Title: Falcon: A Remote Sensing Vision-Language Foundation Model

Title: Harnessing Frequency Spectrum Insights for Image Copyright Protection Against Diffusion Models

Title: Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models

Title: Understanding Flatness in Generative Models: Its Role and Benefits

Title: Aerial Vision-and-Language Navigation with Grid-based View Selection and Map Construction

Title: OmniDiff: A Comprehensive Benchmark for Fine-grained Image Difference Captioning

Title: Open3DVQA: A Benchmark for Comprehensive Spatial Reasoning with Multimodal Large Language Model in Open Space

Title: A Novel Decomposed Feature-Oriented Framework for Open-Set Semantic Segmentation on LiDAR Data

Title: Quantifying Interpretability in CLIP Models with Concept Consistency

Title: Limits of KV Cache Compression for Tensor Attention based Autoregressive Transformers

Title: Solution for 8th Competition on Affective & Behavior Analysis in-the-wild

Title: UMB@PerAnsSumm 2025: Enhancing Perspective-Aware Summarization with Prompt Optimization and Supervised Fine-Tuning

Title: A Multi-Objective Evaluation Framework for Analyzing Utility-Fairness Trade-Offs in Machine Learning Systems

Title: DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation

Title: Context-Aware Rule Mining Using a Dynamic Transformer-Based Framework

Title: Don't Forget It! Conditional Sparse Autoencoder Clamping Works for Unlearning

Title: X-EcoMLA: Upcycling Pre-Trained Attention into MLA for Efficient and Extreme KV Compression

Title: SpaceSeg: A High-Precision Intelligent Perception Segmentation Method for Multi-Spacecraft On-Orbit Targets

Title: Minding Fuzzy Regions: A Data-driven Alternating Learning Paradigm for Stable Lesion Segmentation

Title: GaussianIP: Identity-Preserving Realistic 3D Human Generation via Human-Centric Diffusion Prior

Title: Layer-wise Update Aggregation with Recycling for Communication-Efficient Federated Learning

Title: Asynchronous Sharpness-Aware Minimization For Fast and Accurate Deep Learning

Title: Enabling Weak Client Participation via On-device Knowledge Distillation in Heterogenous Federated Learning

Title: Don't Take Things Out of Context: Attention Intervention for Enhancing Chain-of-Thought Reasoning in Large Language Models

Title: Unifying Perplexing Behaviors in Modified BP Attributions through Alignment Perspective

Title: Towards Extreme Pruning of LLMs with Plug-and-Play Mixed Sparsity

Title: Neurons: Emulating the Human Visual Cortex Improves Fidelity and Interpretability in fMRI-to-Video Reconstruction

Title: Uncertainty-Aware Normal-Guided Gaussian Splatting for Surface Reconstruction from Sparse Image Sequences

Title: Multi-Stage Generative Upscaler: Reconstructing Football Broadcast Images via Diffusion Models

Title: Palette of Language Models: A Solver for Controlled Text Generation

Title: Multimodal-Aware Fusion Network for Referring Remote Sensing Image Segmentation

Title: Align in Depth: Defending Jailbreak Attacks via Progressive Answer Detoxification

Title: FastVID: Dynamic Density Pruning for Fast Video Large Language Models

Title: Provenance Detection for AI-Generated Images: Combining Perceptual Hashing, Homomorphic Encryption, and AI Detection Models

Title: NF-SLAM: Effective, Normalizing Flow-supported Neural Field representations for object-level visual SLAM in automotive applications

Title: LLaVA-MLB: Mitigating and Leveraging Attention Bias for Training-Free Video LLMs

Title: Spatio-Temporal Graph Structure Learning for Earthquake Detection

Title: Cross-Platform Benchmarking of the FHE Libraries: Novel Insights into SEAL and Openfhe

Title: Optimal Transport and Adaptive Thresholding for Universal Domain Adaptation on Time Series

Title: Towards General Multimodal Visual Tracking

Title: MEET: A Million-Scale Dataset for Fine-Grained Geospatial Scene Classification with Zoom-Free Remote Sensing Imagery

Title: Toward Generalized Image Quality Assessment: Relaxing the Perfect Reference Quality Assumption

Title: Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Title: Non Line-of-Sight Optical Wireless Communication using Neuromorphic Cameras

Title: PrivacyScalpel: Enhancing LLM Privacy via Interpretable Feature Intervention with Sparse Autoencoders

Title: Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards

Title: Breaking Shallow Limits: Task-Driven Pixel Fusion for Gap-free RGBT Tracking

Title: Reasoning-Grounded Natural Language Explanations for Language Models

Title: Federated Koopman-Reservoir Learning for Large-Scale Multivariate Time-Series Anomaly Detection

Title: Noise Synthesis for Low-Light Image Denoising with Diffusion Models

Title: DynRsl-VLM: Enhancing Autonomous Driving Perception with Dynamic Resolution Vision-Language Models

Title: CyclePose -- Leveraging Cycle-Consistency for Annotation-Free Nuclei Segmentation in Fluorescence Microscopy

Title: High-Dimensional Interlingual Representations of Large Language Models

Title: OPTIMUS: Predicting Multivariate Outcomes in Alzheimer's Disease Using Multi-modal Data amidst Missing Values

Title: GMG: A Video Prediction Method Based on Global Focus and Motion Guided

Title: BriLLM: Brain-inspired Large Language Model

Title: GNNs as Predictors of Agentic Workflow Performances

Title: Are formal and functional linguistic mechanisms dissociated?

Title: Unlocking General Long Chain-of-Thought Reasoning Capabilities of Large Language Models via Representation Engineering

Title: MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens

Title: Leveraging Diffusion Knowledge for Generative Image Compression with Fractal Frequency-Aware Band Learning

Title: TransiT: Transient Transformer for Non-line-of-sight Videography

Title: Cardiomyopathy Diagnosis Model from Endomyocardial Biopsy Specimens: Appropriate Feature Space and Class Boundary in Small Sample Size Data

Title: APLA: A Simple Adaptation Method for Vision Transformers

Title: Rule-Guided Feedback: Enhancing Reasoning by Enforcing Rule Adherence in Large Language Models

Title: EgoSplat: Open-Vocabulary Egocentric Scene Understanding with Language Embedded 3D Gaussian Splatting

Title: AIstorian lets AI be a historian: A KG-powered multi-agent system for accurate biography generation

Title: PARIC: Probabilistic Attention Regularization for Language Guided Image Classification from Pre-trained Vison Language Models

Title: PBR3DGen: A VLM-guided Mesh Generation with High-quality PBR Texture

Title: Annotating Scientific Uncertainty: A comprehensive model using linguistic patterns and comparison with existing approaches

Title: Modeling Subjectivity in Cognitive Appraisal with Language Models

Title: Watch and Learn: Leveraging Expert Knowledge and Language for Surgical Video Understanding

Title: A Framework for a Capability-driven Evaluation of Scenario Understanding for Multimodal Large Language Models in Autonomous Driving

Title: Towards A Correct Usage of Cryptography in Semantic Watermarks for Diffusion Models

Title: LuSeg: Efficient Negative and Positive Obstacles Segmentation via Contrast-Driven Multi-Modal Feature Fusion on the Lunar

Title: Empowering Time Series Analysis with Synthetic Data: A Survey and Outlook in the Era of Foundation Models

Title: MTV-Inpaint: Multi-Task Long Video Inpainting

Title: Classifying Long-tailed and Label-noise Data via Disentangling and Unlearning

Title: From Generative AI to Innovative AI: An Evolutionary Roadmap

Title: TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation

Title: Text Compression for Efficient Language Generation

Title: FlowKac: An Efficient Neural Fokker-Planck solver using Temporal Normalizing flows and the Feynman Kac-Formula

Title: Combining Causal Models for More Accurate Abstractions of Neural Networks

Title: COIN: Confidence Score-Guided Distillation for Annotation-Free Cell Segmentation

Title: D3: Diversity, Difficulty, and Dependability-Aware Data Selection for Sample-Efficient LLM Instruction Tuning

Title: Remote Photoplethysmography in Real-World and Extreme Lighting Scenarios

Title: T2I-FineEval: Fine-Grained Compositional Metric for Text-to-Image Evaluation

Title: A Review of DeepSeek Models' Key Innovative Techniques

Title: Unicorn: A Universal and Collaborative Reinforcement Learning Approach Towards Generalizable Network-Wide Traffic Signal Control

Title: V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning

Title: Cloud2BIM: An open-source automatic pipeline for efficient conversion of large-scale point clouds into IFC format

Title: Leveraging Angle of Arrival Estimation against Impersonation Attacks in Physical Layer Authentication

Title: TikZero: Zero-Shot Text-Guided Graphics Program Synthesis

Title: HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models

Title: Exploring the Vulnerabilities of Federated Learning: A Deep Dive into Gradient Inversion Attacks

Title: Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models

Title: Bottom-up Iterative Anomalous Diffusion Detector (BI-ADD)

Title: AugGen: Synthetic Augmentation Can Improve Discriminative Models

Title: Similarity-Aware Token Pruning: Your VLM but Faster

Title: VERIFY: A Benchmark of Visual Explanation and Reasoning for Investigating Multimodal Reasoning Fidelity

Title: SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Title: Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers

Title: Do Construction Distributions Shape Formal Language Learning In German BabyLMs?

Title: Advancing 3D Gaussian Splatting Editing with Complementary and Consensus Information

Title: Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages

Title: From Denoising Score Matching to Langevin Sampling: A Fine-Grained Error Analysis in the Gaussian Setting

Title: Tit-for-Tat: Safeguarding Large Vision-Language Models Against Jailbreak Attacks via Adversarial Defense

Title: ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Title: VGGT: Visual Geometry Grounded Transformer

Title: Bring Your Rear Cameras for Egocentric 3D Human Pose Estimation