• Curr Med Res Opin · Aug 2019

    Development of a classifier to identify patients with probable Lennox-Gastaut syndrome in health insurance claims databases via random forest methodology.

    • Francis Vekeman, Jesus Eric Piña-Garza, Wendy Y Cheng, Edward Tuttle, Philippe Giguère-Duval, Arman Oganisian, Joseph Damron, Mei Sheng Duh, Vivienne Shen, Timothy B Saurer, Georgia D Montouris, and Jouko Isojarvi.
    • a Groupe d'analyse Ltée, Montréal, QC, Canada.
    • c Analysis Group Inc., Boston, MA, USA.
    • Curr Med Res Opin. 2019 Aug 1; 35 (8): 1415-1420.

    Abstract

    Objective: Describe the development of a claims-based classifier utilizing machine learning to identify patients with probable Lennox-Gastaut syndrome (LGS) from six state Medicaid programs.

    Methods: Patients were included if they had ≥2 medical claims ≥30 days apart for specified or unspecified epilepsy, excluding those with ≥1 claim for petit mal status. The LGS classifier utilized a random forest algorithm, a compilation of thousands of binary decision trees in which machine-generated predictor variables split the data set into branches that predict the presence or absence of LGS. To construct the splitting rules, the importance of each candidate variable was determined by calculating the mean decrease in Gini impurity. Training and testing were performed on two data sets (30% and 70%) using a "true" LGS and non-LGS patient population. Performance was compared with logistic regression and single tree methodology.

    Results: Using a 60% probability threshold, which yielded the highest sensitivity (97.3%) and specificity (95.6%), the classifier identified approximately 4% of patients with epilepsy as probable LGS. The most important input variables included number of distinct antiepileptic drugs received, epilepsy-related outpatient/inpatient visits, electroencephalogram procedures and claims for delayed development. The random forest methodology outperformed logistic regression and single tree methodology. Most of the important LGS predictor characteristics identified by the classifier were statistically significantly associated with LGS status (p < .05).

    Conclusions: The claims-based LGS classifier showed high sensitivity and specificity, outperformed single tree and logistic regression methodologies and identified a prevalence of probable LGS that was similar to previously published estimates.
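The abstract names two quantities without spelling them out: the decrease in Gini impurity produced by a split, and sensitivity/specificity at a probability threshold. The Python sketch below illustrates both on toy data. It is not the authors' code; the function names, the labels, the probabilities, and the example split on "number of distinct antiepileptic drugs" are all hypothetical, and only the standard library is used.

```python
from typing import Sequence, Tuple


def gini_impurity(labels: Sequence[int]) -> float:
    """Gini impurity of binary labels (1 = LGS, 0 = non-LGS): 2 * p * (1 - p)."""
    n = len(labels)
    if n == 0:
        return 0.0
    p = sum(labels) / n  # proportion of LGS cases in this node
    return 2.0 * p * (1.0 - p)


def gini_decrease(parent: Sequence[int],
                  left: Sequence[int],
                  right: Sequence[int]) -> float:
    """Impurity of the parent node minus the size-weighted impurity of its children."""
    n = len(parent)
    weighted = (len(left) / n) * gini_impurity(left) \
        + (len(right) / n) * gini_impurity(right)
    return gini_impurity(parent) - weighted


def sensitivity_specificity(probs: Sequence[float],
                            truth: Sequence[int],
                            threshold: float) -> Tuple[float, float]:
    """Classify as LGS when the predicted probability is >= threshold.

    Assumes `truth` contains at least one positive and one negative case.
    """
    predicted = [1 if p >= threshold else 0 for p in probs]
    tp = sum(1 for y, yhat in zip(truth, predicted) if y == 1 and yhat == 1)
    fn = sum(1 for y, yhat in zip(truth, predicted) if y == 1 and yhat == 0)
    tn = sum(1 for y, yhat in zip(truth, predicted) if y == 0 and yhat == 0)
    fp = sum(1 for y, yhat in zip(truth, predicted) if y == 0 and yhat == 1)
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical node of 8 patients, split on an illustrative rule such as
# "3 or more distinct antiepileptic drugs" (left) versus fewer (right).
parent = [1, 1, 1, 0, 0, 0, 0, 0]
left, right = [1, 1, 1, 0], [0, 0, 0, 0]
split_gain = gini_decrease(parent, left, right)

# Hypothetical predicted probabilities and true labels for 6 test patients.
probs = [0.9, 0.7, 0.4, 0.65, 0.2, 0.1]
truth = [1, 1, 1, 0, 0, 0]
sens, spec = sensitivity_specificity(probs, truth, threshold=0.6)
```

In a random forest, a variable's importance is typically obtained by averaging such impurity decreases over every split that uses the variable, across all trees. Sweeping `threshold` over a grid and recomputing the two metrics is one way a probability cutoff might be compared.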
