Statistical methods in medical research
-
Stat Methods Med Res · Dec 2016
A cautionary note on the use of attributable fractions in cohort studies.
The attributable fraction is a widely used measure to quantify the public health impact of an exposure on an outcome. It was originally proposed for binary outcomes, but attributable fraction estimators have also been proposed for time-to-event outcomes. In this note, we consider an estimator which was proposed by Benichou (Stats Methods Med Res, 2001) and is supposed to estimate the cohort attributable fraction, i.e. the number of events that would have been prevented in the cohort during follow-up, if the exposure would hypothetically have been eliminated. ⋯ We further argue that the cohort attributable fraction may not be of substantial scientific interest in the first place. We propose a potentially more relevant measure of attributable fraction in cohort studies; the baseline attributable fraction. We show how the baseline attributable fraction can be conveniently estimated in Cox proportional hazards models.
-
Stat Methods Med Res · Oct 2016
Notes on testing equality and interval estimation in Poisson frequency data under a three-treatment three-period crossover trial.
When the frequency of event occurrences follows a Poisson distribution, we develop procedures for testing equality of treatments and interval estimators for the ratio of mean frequencies between treatments under a three-treatment three-period crossover design. Using Monte Carlo simulations, we evaluate the performance of these test procedures and interval estimators in various situations. ⋯ We also demonstrate that both interval estimators based on the WLS method and interval estimators based on Mantel-Haenszel (MH) approach can perform well, and are essentially of equal precision with respect to the average length. We use a double-blind randomized three-treatment three-period crossover trial comparing salbutamol and salmeterol with a placebo with respect to the number of exacerbations of asthma to illustrate the use of these test procedures and estimators.
-
Stat Methods Med Res · Oct 2016
Bayesian analysis of transformation latent variable models with multivariate censored data.
Transformation latent variable models are proposed in this study to analyze multivariate censored data. The proposed models generalize conventional linear transformation models to semiparametric transformation models that accommodate latent variables. ⋯ Simulation shows that the performance of the proposed methodology is satisfactory. The proposed method was applied to analyze a cardiovascular disease data set.
-
Stat Methods Med Res · Oct 2016
Comparative StudyA comparison of machine learning methods for classification using simulation with multiple real data examples from mental health studies.
Recent literature on the comparison of machine learning methods has raised questions about the neutrality, unbiasedness and utility of many comparative studies. Reporting of results on favourable datasets and sampling error in the estimated performance measures based on single samples are thought to be the major sources of bias in such comparisons. Better performance in one or a few instances does not necessarily imply so on an average or on a population level and simulation studies may be a better alternative for objectively comparing the performances of machine learning algorithms. ⋯ For smaller number of correlated features, number of features not exceeding approximately half the sample size, LDA was found to be the method of choice in terms of average generalisation errors as well as stability (precision) of error estimates. SVM (with RBF kernel) outperforms LDA as well as RF and kNN by a clear margin as the feature set gets larger provided the sample size is not too small (at least 20). The performance of kNN also improves as the number of features grows and outplays that of LDA and RF unless the data variability is too high and/or effect sizes are too small. RF was found to outperform only kNN in some instances where the data are more variable and have smaller effect sizes, in which cases it also provide more stable error estimates than kNN and LDA. Applications to a number of real datasets supported the findings from the simulation study.
-
Stat Methods Med Res · Feb 2016
A semi-parametric approach to the frequency of occurrence under a simple crossover trial.
To analyze the frequency of occurrence for an event of interest in a crossover design, we propose a semi-parametric approach. We develop two point estimators and four interval estimators in closed forms for the treatment effect under a random effects multiplicative risk model. Using Monte Carlo simulations, we evaluate these estimators and compare the four interval estimators with the classical interval estimator suggested elsewhere in a variety of situations. ⋯ We note that as long as the number of patients per group is large, all the four interval estimators developed here can perform well. We also note that the classical interval estimator derived under the commonly assumed Poisson distribution for the frequency data can be conservative and lose precision if the Poisson distribution assumption is violated. We use a double-blind randomized crossover trial comparing salmeterol with a placebo in exacerbations of asthma to illustrate the practical use of these estimators.