Methods

QML process flow

Data set

We use a data set of 108,948 frontal view real X-ray images to diagnose for eight diseases.

We use the popular open-source Chest X-ray¹ Dataset1 that contains 108,948 frontal view X-ray images of resolution 1024x1024 labeled with 14 diseases. Eight of the disease classes are used for training.

¹ Xiaosong Wang et al. “Chestx-ray8: Hospital-scale chest x-ray database …”. IEEE conf. on computer vision and pattern recognition (2017).https://nihcc.app.box.com/v/ChestXray-NIHCC

Methods

Quantum annealing is used as a next generation computing technology to implement our quantum machine learning algorithm.

Quantum annealing uses a physical system of qubits to simulate an instance of a quadratic binary problem (QUBO). After the problem is mapped to the architecture of the annealer, the physical system is given some time to settle (anneal) to its final state that often represent a good solution to the problem. This process can be significantly faster than a classical algorithm for a QUBO problem.

Currently available quantum annealers still have a limited size and qubit connectivity, hence only small problems can fit into them. They are also noisy and error prone. Therefore, we start with a classical pre-processing to reduce the problem size.

Classical machine learning is used to compress the input samples (images) to a significantly small set of features (only 160 bits per sample).

In order to embed the images on the D-Wave Advantage 4.1., we need to reduce the size of each sample to approximately 160 bits. A state-of-the-art unsupervised deep feature extractor based on the Solo Learn2 library and BYOL3 method is used to reduce the number of features to 80. Each feature is then quantized as two bits, compressing each image data down to 160 bits.

² See https://github.com/vturrisi/solo-learn
³ Jean-Bastien Grill et al. “Bootstrap your own latent...”. Advances in Neural Information Processing Systems 33 (2020).

Quantum restricted Boltzmann machine (QRBM) is a quantum machine leaning method that we used for classification of eight-class multi-label images in our dataset.

We implemented a state-of-the-art quantum restricted Boltzmann machine (QRBM) is a generative machine learning model. After training, this model can reconstruct not just the labels (image classes), but also some missing features. The model is used to classify medical images into eight classes (eight disease in the selected dataset).

We compared different classifiers based on the AUC metric, which measure the overall classification performance by combining sensitivity and specificity.We compared different classifiers based on the AUC metric, which measure the overall classification performance by combining sensitivity and specificity.

The performance of image classification (or any diagnosis test) is a trade-off between sensitivity or true positive rate (TPR)4 and specificity or false positive rate (FPR)5. For each value of FPR we calculate and plot the maximum TPR the classifier achieves. The larger the area under this curve (AUC), the better the classifier is performing, which varies between 0.5 (random classifier) and 1.0 (perfect classifier).

⁴ True positive rate (TPR) or sensitivity of the classifier is the proportion of images correctly classified/diagnosed as having a disease.
⁵ False positive rate (FPR) is the proportion of images incorrectly classified/diagnosed as having a disease. It is related to the specificity (TNR) by TNR = 1 – FPR.

Results

The results show that our algorithm using D-Wave annealer can learn to classify X-ray images and potentially outperform other methods in future.

Neither a machine nor a human can achieve 100% accuracy on medical image classification because some images do not have sufficient information for diagnosis. Figure 1a shows the mean AUC score for classification of different diseases using our QRBM model on D-Wave annealer. It is evident that the model can learn how to classify the images, while still having a long way to perfection. Figure 1b compares the training results of QRBM on D-Wave with classical training methods. The training is done in batches and each batch stopped after 10 epochs. The classical models were also stopped at epoch 10. The main sources of error for QRBM are: (a) compression of images to fit quantum device, (c) regularization to ensure that the weights of the model do not exceed the range allowed on the D-Wave device, and (d) fundamental limitations of the current generation D-Wave machines such as qubit count, connectivity and noise. With future scaling of D-Wave annealer technology, it is foreseeable that quantum annealing approach can potentially outperform classical sampling on some difficult tasks for which our classical compute power is not sufficient.

Figure 1: Mean AUC for the final models

Fig 1 — (a) Detailed results of the model trained on D-Wave

Fig 2 — (b) Comparison between the models

Conclusions

We demonstrated that X-ray images can be classified using quantum machine learning, but the technology needs further advancements to show the real advantage over classical methods.

Our implementation of QRBM on a D-Wave annealer and successful demonstration of its learning capability is a big step towards development of quantum machine learning technology for future AI. D-Wave annealers have the promise of being useful samplers for probability discussions that are difficult to handle classically. While today’s quantum annealer has many limitations as an emerging technology, the potential of huge speedup versus classical sampling methods can open doors to the problems that are not computable today. A combination of a D-Wave annealer and classical methods can deliver the best results – an approach we have started to explore in this work.