Impact of real-world annotations on the training and evaluation of deep learning algorithms in digital pathology
Thesis presented by Adrien FOUCART
with a view to obtaining the PhD Degree in biomedical engineering (“Docteur en ingénierie biomédicale”)
Academic year 2022-2023
Supervisor: Professor Christine DECAESTECKER
Co-supervisor: Professor Olivier DEBEIR
Laboratory of Image Synthesis and Analysis
Thesis jury:
Hughes BERSINI (Université Libre de Bruxelles, Chair)
Gauthier LAFRUIT (Université Libre de Bruxelles, Secretary)
Geert LITJENS (Computational Pathology Group, Radboud University Medical Center)
Raphaël MARÉE (Montefiore Institute, Université de Liège)
Isabelle SALMON (Hôpital Érasme)
Gianluca BONTEMPI (Université Libre de Bruxelles)
Olivier DEBEIR (Université Libre de Bruxelles)
Christine DECAESTECKER (Université Libre de Bruxelles)
Table of contents
Acknowledgments
This thesis is the product of seven years working in the LISA laboratory, as a researcher, a PhD student, and a teaching assistant. Many people have contributed, in one way of another, to the realisation of this work, and I would like to thank them here:
My supervisors, Pr. Christine Decaestecker and Pr. Olivier Debeir, for their support and their valuable feedback throughout the thesis, and for giving me the opportunity to start this thesis in the first place.
The past and present researchers, professors, and members of the lab that I had the pleasure to interact with during these years, for their ideas and inputs, and for the shared discussions, coffees, drinks, burgers, and all the things that make the work environment a better place. I particularly would like to thank Arlette Grave for her support through the various administrative processes of the university. My thanks also go to Isabelle Salmon of DIAPath, and the staff at Erasme Hospital and at the CMMI that contributed materials and feedback.
Being a teaching assistant was extremely stimulating and provided countless opportunities for procrastination, so I need to thank all students who kept me busy during these years. In particular, I would like to thank Élisabeth Gruwé and Alain Zheng for their contribution to this research through their excellent work during their master thesis.
I also want to thank Seyed Alireza Fatemi Jahromi and Amirezza Mahbod for sharing valuable data from their own research, and the NVIDIA Corporation for providing us with a Titan X GPU, which was used in most of the experimental work of this thesis.
I would not be where I am today without my parents Lucy Dever and Dominique Foucart, and the undoubtedly very effective education and support that they have provided to their children. My brothers François and Renaud must also get some thanks (and/or blame) for their contributions, as does my uncle Thierry Dever, without whom I probably wouldn’t have gotten as comfortable with computers.
Finally, for her support (even during the last months of this thesis), I thank my partner Céline Mathieu. And last (but certainly not least) my daughter Margot, for making sure that I didn’t stay too focused.
List of abbreviations
We provide here, in alphabetical order, a list of abbreviations used in this thesis.
ACC – Accuracy
AE – Auto-Encoder
AI – Artificial Intelligence
AP – Average Precision
ASSD – Average Symmetric Surface Distance
AUPRC – Area Under the PR Curve
AUROC – Area Under the ROC curve
BCC – Basal Cell Carcinoma
BP – Backpropagation
CE – Cross-Entropy
CM – Confusion Matrix
CNN – Convolutional Neural Network
CUDA – Compute Unified Device Architecture (software layer for GPU programming)
CYDAC – Cytophotometric Data Conversion System
DCNN – Deep Convolutional Neural Network
DA – Data Augmentation
DCNN – Deep Convolutional Neural Network
DL – Deep Learning
DNN – Deep Neural Network
DP – Digital Pathology
DQ – Detection Quality
DSC – Dice Similarity Coefficient
ER+ – Oestrogen receptor positive
FCN – Fully Convolutional Network
FN – False Negative
FNN – Feedforward Neural Network
FP – False Positive
FPR – False Positive Rate
GAN – Generative Adversarial Network
GlaS – Gland Segmentation challenge
GM – Geometric Mean
GMDH – Group Method of Data Handling
GPU – Graphical Processing Unit
H&E – Haematoxylin and Eosin
HD – Hausdorff’s Distance
HER2 – Human epidermal growth factor receptor 2 (also the name of a 2016 competition)
HSV – Hue-Saturation-Value colour space
ICPR – International Conference on Pattern Recognition
IDC – Invasive Ductal Carcinoma
IHC – Immunohistochemistry
ILSVRC – ImageNet Large Scale Visual Recognition Challenge
IoU – Intersection over Union
ISUP – International Society of Urological Pathology
ISBI – IEEE International Symposium on Biomedical Imaging
κ,κ_U,κ_L,κ_Q – Cohen’s kappa (in general); Unweighted, Linear and Quadratic Cohen’s kappa.
LSTM – Long Short-Term Memory
mAP – Mean Average Precision
MCC – Matthews Correlation Coefficient
MDS – Multi-Dimensional Scaling
MICCAI – Medical Image Computing and Computer Assisted Interventions (scientific society and conference)
MIL – Multiple Instance Learning
ML – Machine Learning
MLP – Multi-Layer Perceptron
MNIST – Modified National Institute of Standards and Technology (handwritten digits dataset)
MoNuSAC Multi-organ Nuclei Segmentation and Classification challenge
MP – Max Pooling
MRI – Magnetic Resonance Imaging
MSE – Mean-Squared Error
NCM – Normalized Confusion Matrix
NPV – Negative Predictive Value
NSD – Normalized Surface Distance
PCA – Principal Component Analysis
PPV – Positive Predictive Value
PQ – Panoptic Quality
PR curve – Precision-Recall curve
PRE – Precision
PR in HIMA – Pattern Recognition in Histological Image Analysis (2010 challenge)
R – Rate of agreement
REC – Recall
ReLU – Rectified Linear Units
ResNet – Network architecture based on “residual units”
RGB – Red-Green-Blue colour space
ROC – Receiver Operating Characteristic
SEN – Sensitivity
SGD – Stochastic Gradient Descent
SNOW – Semi-Supervised, NOisy and/or Weak
SPE – Specificity
SQ – Segmentation Quality
SSL – Semi-Supervised Learning
STAPLE – Simultaneous Truth and Performance Level Estimation
SVM – Support Vector Machine
TCGA – The Cancer Genome Atlas
TCIA – The Cancer Imaging Archive
TMA – Tissue Microarray
TN – True Negative
TNM – Tumour-Node-Metastasis (cancer staging system)
TNR – True Negative Rate
TP – True Positive
TPR – True Positive Rate
UL – Unsupervised Learning
U-Net – Network architecture based on “long-skip” connections
WSI – Whole-Slide Imaging / Whole-Slide Image
WSL – Weakly Supervised Learning
XML – Extensible Markup Language
Notations sheet
While ad-hoc mathematical conventions are sometimes adopted in the manuscript depending on the context (and are thus explained in the text), we reference here the main notations that we use through the thesis.
Input, output, target output of a system (algorithm, neuron, function…)
Bold letters generally denote vectors
Parameter (weight) of a model
Loss function
,
Sets of target and predicted items
Cardinality of a set
Confusion matrix, with the confusion between target class and predicted class
Number of classes/categories in a dataset, class index.
Number of samples in a dataset, sample index
Class imbalance parameter
Class proportionality parameter
Distance parameter
Normal distribution for random sampling with mean and standard deviation
Uniform distribution for random sampling with bounds