Semi-Automatic Quantification Of mRNA Expression In Whole-Slide Tissue Images
For complex diseases, commercially available anti- bodies directed again specific protein sometimes lack the specificity required for clinical purpose. Therefore, novel techniques have been proposed recently to better assess the expression status of molecules at the mRNA level, e.g. by in situ hybridization. With such a technique, single-molecule signals can then be quantified on a cell-by-cell basis in whole tissue slides. However, highly heterogeneous expression of mRNA is often observed at the tissue level, requiring the development of accurate and reproducible quantification methods.
In this work, our aim is to propose and evaluate a methodology for the quantification of mRNA expression in digital slides to help biomedical researchers to analyze their large-scale imaging data. In particular, we will work on a dataset of breast cancer tissue samples and evaluate the accuracy of such a methodology to quantify Breast cancer susceptibility gene 1 (BRCA1) mRNA expression levels. BRCA1 is a tumor suppressor gene associated with the triple negative breast cancer (TNBC) subtype. An accurate technique to determine BRCA1 protein tumoral expression status in TNBC would allow for informed decision and choosing adapted treatments.
Data: To design and evaluate our methodology, we use the data from . In that study, the BRCA1 mRNA expression was assessed by in situ hybridization using RNAscope technology  (ACD-Bioke) for formalin-fixed, paraffin-embedded tissue samples. 88 glass slides were scanned (Hamamatsu, 40X, 0.23Âµm/pixel) and transferred on a Cytomine server (http://www.cytomine.be/) . These slides correspond to 62 tumours (TNBC subtype). Using Cytomine web manual annotation tools, a total of more than 200 regions of interest were manually drawn by an expert in these whole-slide images. Then, independent assessment was performed by two observers in these ROI: they manually draw point annotations corresponding to tens of thousands of BRCA1 mRNA expression signals.
Algorithms: We have developed the Icytomine plugin to communicate between Icy and Cytomine. It allows the importation of the subtumor images at a desired resolution and the expert annotations from Cytomine and the exportation of detection results from Icy software (http://icy.bioimageanalysis.org) . In order to quantify mRNA signals, we combined several image processing routines in the Icy. Our workflow is composed of three steps: 1) color deconvolution  to extract the dark brown staining, 2) detection of spots based on undecimated wavelet transform , 3) post-processing to remove artifacts based on size, shape and color features. This transform is fast and ideal for representing isotropic objects. We will use a colocalisation coefficient to quantify the concordance between the experts. For a purpose of automation, we use scripts and protocols of Icy.
An illustration of our results is given in Figure Our ongoing large-scale empirical study stresses the need for more confident ground-truths hence more robust algorithms. Interestingly, our current strategy detects accurately spot locations. We believe it will help us to create a larger ground-truth dataset using expert proofreading. We will report at the conference our best quantitative results. Overall, we believe that our approach will speedup in a confident way the quantification of mRNA signals in whole-slide tissue samples.
 MarÃ©e R., et al., Collaborative analysis of multi-gigapixel imaging data using Cytomine, Bioinformatics 2016, 32(9):1395-401.
 Wang F., et al, RNAscope: a novel in situ RNA analysis platform for formalin-fixed, paraffin-embedded tissues, J Mol Diagn 2012,14(1):22-9.
 Ruifrok A.C., et al., Quantification of histochemical staining by color deconvolution, Anal Quant Cytol Histol 2001, 23(4):291-9.
 de Chaumont F., Icy: an open bioimage informatics platform for extended reproducible research, Nature methods 2012, 9(7):690-6.
 Olivo-Marin J.C., Extraction of spots in biological images using multiscale products, Pattern Recogn 2002, 35(9):1989-1996.
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with this journal agree to the following terms:
1. Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).
4. In case of virtual slide publication the authors agree to copy the article in a structural modified version to the journal's VS archive.