Classifying mixed patterns of proteins in microscopic images with deep neural networks

Authors

DOI:

https://doi.org/10.15276/hait.01.2019.3

Keywords:

deep learning, neural networks, classification, pattern recognition, stochastic gradient descent

Abstract

Nowadays, accurate diagnosis of diseases, their treatment and prognosis is a very acute problem of modern medicine. By studying information about human proteins, you can identify differentially expressed proteins. These proteins are potentially interesting biomarkers that can be used for an accurate diagnosis, prognosis, or selection of individual treatments, especially for cancer. A surprising finding from this research is that we have relatively few proteins that are tissue specific. Almost half of all proteins are categorized as housekeeping proteins, expressed in all cells. Only 2,300 proteins in the human body have been identified as tissue enriched, meaning they have elevated expression levels in certain tissues. Thanks to advances in high-throughput microscopy, images are generated too quickly for manual evaluation. Consequently, the need for automating the analysis of biomedical images is as great as ever to speed up the understanding of human cells and diseases. Historically, the classification of proteins was limited to individual patterns in one or more cell types, but in order to fully understand the complexity of a human cell, models must classify mixed patterns according to a number of different human cells. The article formulates the problem of image classification in medical research. In this area, classification methods using deep convolutional neural networks are actively used. Presented article gives a brief overview of the various approaches and methods of similar research. As a dataset was taken “The Human Protein Atlas”, that presents a tissue-based map of the human proteome, completed in 2014 after 11 years of research. All protein expression profiling data is publicly accessible in an interactive database, enabling tissue-based exploration of the human proteome. It was done an analysis of the work and the methods that were used during the research. To solve this problem, the deep neural network model is proposed taking into account the characteristics of the domain and the sample under study. The neural network model is based on Inception-v3 architecture. Optimization procedure contains combination of several tweaks for fast convergence: stochastic gradient descent with warm restarts (learning rate schedule for exploring different local minima), progressive image resizing (training starts from small resolution and sequentially increases each cycle of SGDR). We propose new method for threshold selection for F1 measure. Developed model can be used to create an instrument integrated into the medical system of intellectual microscopy to determine the location of the protein from a high-performance image.

Downloads

Download data is not yet available.

Author Biographies

Tymchenko, Borys I., Odessa National Polytechnic University, 1, Shevchenka ave., Odesa, 65044, Ukraine

PhD Student

Antoshchuk, Svitlana G., Odessa National Polytechnic University, 1, Shevchenka ave., Odesa, 65044, Ukraine

Doctor of Technical Sciences, Professor

Downloads

Published

2019-01-17

How to Cite

Tymchenko, B. I., Hramatik, A. A., Tulchiy, H. P., & Antoshchuk, S. G. (2019). Classifying mixed patterns of proteins in microscopic images with deep neural networks. Herald of Advanced Information Technology, 2(1), 29-36. https://doi.org/10.15276/hait.01.2019.3

Most read articles by the same author(s)