• Forensic Sci. Int. · Oct 2020

    Data mining for sex estimation based on cranial measurements.

    • Diana H Toneva, Silviya Y Nikolova, Gennady P Agre, Dora K Zlatareva, Vassil G Hadjidekov, and Nikolai E Lazarov.
    • Department of Anthropology and Anatomy, Institute of Experimental Morphology, Pathology and Anthropology with Museum, Bulgarian Academy of Sciences, 1113, Sofia, Bulgaria. Electronic address: ditoneva@abv.bg.
    • Forensic Sci. Int. 2020 Oct 1; 315: 110441.

    AbstractThe aim of the present study is to develop effective and understandable classification models for sex estimation and to identify the most dimorphic linear measurements in adult crania by means of data mining techniques. Furthermore, machine learning models and models developed through logistic regression analysis are compared in terms of performance. Computed tomography scans of 393 adult individuals were used in the study. A landmark-based approach was applied to collect the metric data. The three-dimensional coordinates of 47 landmarks were acquired and used for calculation of linear measurements. Two datasets of cranial measurements were assembled, including 37standard measurements and 1081 interlandmark distances, respectively. Three data mining algorithms were applied: the rule induction algorithms JRIP and Ridor, and the decision tree algorithm J48. Two advanced attribute selection methods (Weka BestFirst and Weka GeneticSearch) were also used. The best accuracy result (91.9 %) was achieved by a set of rules learnt by the JRIP algorithm from the dataset constructed by application of the GeneticSearch selection algorithm to the dataset of standard cranial measurements. The set consisted of five rules including seven cranial measurements. Its accuracy was even better than the classification rates achieved by the logistic regression models. Concerning the second dataset of nonstandard measurements, the best accuracy (88.3 %) was obtained by using classification models learnt by two algorithms - JRIP with a dataset preprocessed by the BestFirst selection algorithm and Ridor with preprocessing by the GeneticSearch selection algorithm. Our experiments show that for the two datasets mentioned above the rule-based models contain smaller sets of rules with shorter lists of measurements and achieve better classification accuracy results in comparison with decision tree-based models.Copyright © 2020 Elsevier B.V. All rights reserved.

      Pubmed     Full text   Copy Citation     Plaintext  

      Add institutional full text...

    Notes

     
    Knowledge, pearl, summary or comment to share?
    300 characters remaining
    help        
    You can also include formatting, links, images and footnotes in your notes
    • Simple formatting can be added to notes, such as *italics*, _underline_ or **bold**.
    • Superscript can be denoted by <sup>text</sup> and subscript <sub>text</sub>.
    • Numbered or bulleted lists can be created using either numbered lines 1. 2. 3., hyphens - or asterisks *.
    • Links can be included with: [my link to pubmed](http://pubmed.com)
    • Images can be included with: ![alt text](https://bestmedicaljournal.com/study_graph.jpg "Image Title Text")
    • For footnotes use [^1](This is a footnote.) inline.
    • Or use an inline reference [^1] to refer to a longer footnote elseweher in the document [^1]: This is a long footnote..

    hide…