On Challenging Aspects of Reproducibility in Deep Anomaly Detection

Our companion paper, On Challenging Aspects of Reproducibility in Deep Anomaly Detection, has been accepted for presentation at the Fourth Workshop on Reproducible Research in Pattern Recognition (satellite event of ICPR 2022).

In it, we discuss aspects of reproducibility for our anomaly detection algorithm MCHAD, as well as anomaly detection with deep neural networks in general. In particular, we discussed the following challenges for the reproducibility:

Nondeterminism: conducting the same experiment with different random seeds might lead to significantly different outcomes.
Sensitivity to hyper-parameters: slight changes in hyper-parameters can drastically alter the outcomes.
Complexity: the more complex an algorithm, the more likely an implementation contains errors.
Dataset Selection: The performance of a method is going to depend on the dataset on which you evaluate it.
Resource Limitations: resource requirements can limit the number of individuals or institutions that are able to reproduce the training.
Dependencies: dependencies, in the form of data, pre-trained weights, or software libraries, might get taken down at some point.

The large number of dependencies in our experiments may harm the reproducibility of our exact numerical results. However, we argue that the reproducibility of conclusions should be prioritized over the reproducibility of exact numerical results since the former contributes to the advancement of scientific knowledge.

Our Paper Addressing Randomness in Evaluation Protocols for Out-of-Distribution Detection has been accepted at the ICJAI 2021 Workshop for Artificial Intelligence for Anomalies and Novelties. In summary, we investigated the following phenomenon: when you train neural networks several times, and then measure their performance on some task, there is a certain variance in the performance measurements, since the results of experiments may vary based on several factors (that are effectively …

Our paper, PyTorch-OOD: A library for Out-of-Distribution Detection based on PyTorch, has been presented at the CVPR 2022 Workshops. You can find the most recent version of the Python source code on GitHub. Abstract § Machine Learning models based on Deep Neural Networks behave unpredictably when presented with inputs that do not stem from the training distribution and sometimes make egregiously wrong predictions with high confidence. This property undermines the trustworthiness of systems …

On Challenging Aspects of Reproducibility in Deep Anomaly Detection

Related Posts