- Jian Zhang, Zhizhong Liu, Ribing Chen, Qingwei Ma, Qian Lyu, Shuhui Fu, Yufei He, Zijie Xiao, Zhi Luo, Jianming Luo, Xingyu Wang, Xiangyi Liu, Peng An, and Wei Sun.
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, China.
- Ann. Med. 2022 Dec 1; 54 (1): 293-301.
Background: Thalassaemia is one of the most common inherited monogenic diseases worldwide with a heavy global health burden. Considering its high prevalence in low and middle-income countries, a cheap, accurate and high-throughput screening test of thalassaemia prior to a more expensive confirmatory diagnostic test is urgently needed.

Methods: In this study, we constructed a machine learning model based on MALDI-TOF mass spectrometry quantification of haemoglobin chains in blood, and for the first time, evaluated its diagnostic efficacy in 674 thalassaemia (including both asymptomatic carriers and symptomatic patients) and control samples collected in three hospitals. Parameters related to haemoglobin imbalance (α-globin, β-globin, γ-globin, α/β and α-β) were used for feature selection before classification model construction with 8 machine learning methods in cohort 1 and further model efficiency validation in cohort 2.

Results: The logistic regression model with 5 haemoglobin peak features achieved good classification performance in validation cohort 2 (AUC 0.99, 95% CI 0.98-1, sensitivity 98.7%, specificity 95.5%). Furthermore, the logistic regression model with 6 haemoglobin peak features was also constructed to specifically identify β-thalassaemia (AUC 0.94, 95% CI 0.91-0.97, sensitivity 96.5%, specificity 87.8% in validation cohort 2).

Conclusions: For the first time, we constructed an inexpensive, accurate and high-throughput classification model based on MALDI-TOF mass spectrometry quantification of haemoglobin chains and demonstrated its great potential in rapid screening of thalassaemia in large populations.

Key messages:
- Thalassaemia is one of the most common inherited monogenic diseases worldwide with a heavy global health burden.
- We constructed a machine learning model based on MALDI-TOF mass spectrometry quantification of haemoglobin chains to screen for thalassaemia.
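As a rough illustration of the quantities the abstract reports, the sketch below derives the five imbalance parameters it names from three chain peak intensities and computes sensitivity, specificity and AUC from labels and classifier scores. It is not the authors' pipeline: the function names, the example intensities, the scores and the 0.5 cut-off are all hypothetical, and the abstract does not say which peak features the final models used.

```python
from typing import Dict, Sequence, Tuple


def imbalance_features(alpha: float, beta: float, gamma: float) -> Dict[str, float]:
    """Derive the five imbalance parameters named in the abstract.

    Inputs are assumed to be peak intensities for the three globin chains;
    the units and any normalisation are not specified in the source.
    """
    return {
        "alpha": alpha,
        "beta": beta,
        "gamma": gamma,
        "alpha_over_beta": alpha / beta,
        "alpha_minus_beta": alpha - beta,
    }


def sensitivity_specificity(
    labels: Sequence[int], predictions: Sequence[int]
) -> Tuple[float, float]:
    """Return (sensitivity, specificity) for binary labels (1 = thalassaemia)."""
    pairs = list(zip(labels, predictions))
    tp = sum(1 for y, p in pairs if y == 1 and p == 1)
    fn = sum(1 for y, p in pairs if y == 1 and p == 0)
    tn = sum(1 for y, p in pairs if y == 0 and p == 0)
    fp = sum(1 for y, p in pairs if y == 0 and p == 1)
    return tp / (tp + fn), tn / (tn + fp)


def auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a positive sample outscores a negative one.

    Ties count as half a win (the Mann-Whitney formulation).
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Made-up values for illustration only; these are not data from the study.
features = imbalance_features(alpha=2.0, beta=1.0, gamma=0.5)
labels = [1, 1, 0, 0]
scores = [0.9, 0.4, 0.6, 0.1]          # hypothetical classifier outputs
predictions = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 cut-off
sens, spec = sensitivity_specificity(labels, predictions)
area = auc(labels, scores)
```

In a study like this one, the scores would come from a model fitted on one cohort and the metrics would be computed on a separate validation cohort, as the abstract describes for cohorts 1 and 2.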