Journal of Intelligent Learning Systems and Applications, 2013, 5, 184-193 http://dx.doi.org/10.4236/jilsa.2013.53020 Published Online August 2013 (http://www.scirp.org/journal/jilsa)
Evaluation and Comparison of Different Machine Learning Methods to Predict Outcome of Tuberculosis Treatment Course Sharareh R. Niakan Kalhori1,2*, Xiao-Jun Zeng3 1 Department of Public Health, School of Health, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran; 2Social Determinants of Health Research Center, School of Health, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran; 3Department of Machine Learning & Optimization, School of Computer Science, University of Manchester, Manchester, England. Email: *
[email protected]
Received December 24th, 2012; revised January 24th, 2013; accepted February 4th, 2013 Copyright © 2013 Sharareh R. Niakan Kalhori, Xiao-Jun Zeng. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
ABSTRACT Tuberculosis treatment course completion is crucial to protect patients against prolonged infectiousness, relapse, lengthened and more expensive therapy due to multidrug resistance TB. Up to 50% of all patients do not complete treatment course. To solve this problem, TB treatment with patient supervision and support as an element of the “global plan to stop TB” was considered by the World Health Organization. The plan may require a model to predict the outcome of DOTS therapy; then, this tool may be used to determine how intensive the level of providing services and supports should be. This work applied and compared machine learning techniques initially to predict the outcome of TB therapy. After feature analysis, models by six algorithms including decision tree (DT), artificial neural network (ANN), logistic regression (LR), radial basis function (RBF), Bayesian networks (BN), and support vector machine (SVM) developed and validated. Data of training (N = 4515) and testing (N = 1935) sets were applied and models evaluated by prediction accuracy, F-measure and recall. Seventeen significantly correlated features were identified (P 1) denote that our variables are far from normality and decision tree and other symbolic methods (nonparametric JILSA
192
Evaluation and Comparison of Different Machine Learning Methods to Predict Outcome of Tuberculosis Treatment Course
schemes) tackle robustly with distributions with large kurtosis and skew [22]. In this research’s dataset, the average values of skew and kurtosis are 2.169 and 7.469 respectively (P < 0.05) confirming the non-normal distribution. Hence, the only available nonparametric symbolic learning algorithm in the current study is decision tree which performed well to partition the input space. In actuality, high skew (>1) or kurtosis (>7) along with the presence of binary/categorical variables, using relevant and correlated predictors without any missed instances or noised data have prepared the best opportunity for decision tree to predict more accurately than other applied algorithms. In view of the rank of other employed classifiers, BN outperforms others and four remaining classifiers work relatively similar with prediction accuracy percentage ranging from 53.74% to 57.82%. In a study [24], Tu has reviewed a number of researches concluding that LR & MLP perform closely; it is the case here where they performed with identical prediction accuracy (57.82). RBF is actually a type of neural network and it might be a postulation that based on their algorithm similarities and data type entity results are comparable. Decision tree with flowchart-type structure is more likely method to be understandable for general users with low level of specialized knowledge about TB. Produced results of decision tree can be simply interpretable and applicable; their rules can be understood either by doctors or health practitioners who implement DOTS in rural area and make decision alone.
no conflict of interest.
REFERENCES [1]
A. D. Harries and C. Dye, “Tuberculosis,” Annals of Tropical Medicine and Parasitology, Vol. 100, No. 5, 2006, pp. 415-443. doi:10.1179/136485906X91477
[2]
World Health Organization, “The Stop TB Strategy, Building on and Enhancing DOTS to Meet the TB-Related Millennium Development Goals,” 2006.
[3]
W. D. Cuneo and D. J. Snider, “Enhancing Patient Compliance with Tuberculosis Therapy,” Clinics in Chest Medicine, Vol. 10, No. 3, 1989, pp. 375-380.
[4]
H. G. Tangüis, J. A. Caylà, P. García, J. M. Jansà and M. T. Brugal, “Factors Predicting Non-Completion of Tuberculosis Treatment among HIV-Infected Patients in Barcelona (1987-1996),” The International Journal of Tuberculosis and Lung Disease, Vol. 4, No. 1, 2000, pp. 5560.
[5]
W. W. Yew, “Directly Observed Therapy Short-Course: The Best Way to Prevent Multidrug-Resistant Tuberculosis,” Chemotherapy, Vol. 45, No. 2, 1999, pp. 26-33. doi:10.1159/000048479
[6]
J. Legrand, A. Sanchez, F. Le Pont, L. Camacho and B. Larouze, “Modeling the Impact of Tuberculosis Control Strategies in Highly Endemic Overcrowded Prisons,” Plos One, Vol. 3, No. 5, 2008, Article ID: e2100. doi:10.1371/journal.pone.0002100
[7]
S. Thiam, A. M. Le Fevre and F. Hane, “Effectiveness of a Strategy to Improve Adherence to tuberculosis Treatment in a Resource-Poor Setting: A Cluster Randomized Controlled Trial,” Journal of the American Medical Association, Vol. 297, No. 4, 2007, pp. 380-386. doi:10.1001/jama.297.4.380
[8]
W. J. Burman, D. L. Cohn, C. A. Rietmeijer, F. N. Judson, J. A. Sbarbaro and R. R. Reves, “Noncompliance with Directly Observed Therapy for Tuberculosis. Epidemiology and Effect on the Outcome of Treatment” Chest, Vol. 111, No. 5, 1997, pp. 1168-1173. doi:10.1378/chest.111.5.1168
[9]
P. D. O. Davies, “The Role of DOTS in Tuberculosis Treatment and Control,” American Journal of Respiratory Medicine, Vol. 2, No. 3, 2003, pp. 203-209. doi:10.1007/BF03256649
5. Conclusion To sum up, available big body of real data related to TB patients has created an opportunity to generate accurate models which can predict outcome of DOTS therapy. This provides us information about outcome of treatment course for each patient and defines who needs high level of supervision and support; this is valuable as it is not possible to give every single of patients a full supervision and support distinctly. The decision tree model can be used to screen risky patients for fail in treatment course completion in population using general data gathering in routine general practice. This will help healthcare practitioners especially in rural regions to evaluate the risks of MDR-TB among their patients quickly, inexpensively, and noninvasively. TB control through totally implemented DOTS therapy is such a crucial stage in public health improvement and promotion.
6. Acknowledgements We would like to express our gratitude to Iranian Ministry of Health and Medical Education for funding and data access. Furthermore, the authors declare that they have Copyright © 2013 SciRes.
[10] A. V. Sitar-Taut, D. Zdrenghea, D. Pop and D. A. SitarTaut, “Using Machine Learning Algorithms in Cardiovascular Disease Risk Evaluation,” Journal of Applied Computer Science & Mathematics, Vol. 5, No. 3, 2009, pp. 29-32. [11] M. Lazarescu, A. Turpin and S. Venkatesh, “An Application of Machine Learning Techniques for the Classification of Glaucomatous Progression,” Vol. 2396, Springer-Verlag, Berlin, 2006. [12] J. I. Serrano, M. Tomécková and J. Zvárová, “Machine Learning Methods for Knowledge Discovery in Medical Data on Atherosclerosis,” European Journal of Biomedical Informatics, 2006. [13] I. Guyon and A. Elissee, “An Introduction to Variable and JILSA
Evaluation and Comparison of Different Machine Learning Methods to Predict Outcome of Tuberculosis Treatment Course
193
Feature Selection,” Journal of Machine Learning Research, Vol. 3, 2003, pp. 1157-1182.
[20] S. Marsland, “Machine Learning: An Algorithmic Perspective,” 1st Edition, Chapman and Hall, London, 2009.
[14] M. Dash and H. Liu, “Feature Selection for Classification,” Intelligent Data Analysis, Vol. 1, No. 3, 1997, pp. 131-156. doi:10.1016/S1088-467X(97)00008-5
[21] L. Olson and D. Delen, “Advanced Data Mining Techniques,” Springer, Berlin, 2008.
[15] A. Field, “Discovering Statistics Using SPSS,” 2nd Edition, SAGE Publication LTD, London, 2005. [16] J. Han and M. Kamber, “Data Mining: Concepts and Techniques,” 2nd Edition, Morgan Kaufmann Publishers, Burlington, 2006. [17] E. Alpaydin, “Introduction to Machine Learning,” 1th Edition, The MIT Press, Cambridge, 2004. [18] S. B. Kotsiantis, “Supervised Machine Learning: A Review of Classification Techniques,” Informatica, Vol. 31, No. 3, 2007, pp. 249-268. [19] E. Vittinghoff, S. C. Shiboski, D. V. Glidden and C. E. McCulloch, “Regression Methods in Biostatistics, Linear, Logistic, Survival, and Repeated Measures Models,” Springer, Berlin, 2005.
Copyright © 2013 SciRes.
[22] R. D. King, C. Feng and A. Sutherland, ”Statlog: Comparison of Classification Algorithms on Large RealWorld Problems,” Applied Artificial Intelligence, Vol. 9, No. 3, 1995, pp. 289-333. [23] I. Kurt, M. True and A. T. Kurum, “Comparing Performances of Logistic Regression, Classification and Regression Tree, and Neural Networks for Predicting Coronary Artery Disease,” Expert System Application, Vol. 34, 2008, pp. 366-374. doi:10.1016/j.eswa.2006.09.004 [24] J. V. Tu, “Advantages and Disadvantages of Using Artificial Neural Networks Versus Logistic Regression for Predicting Medical Outcomes,” Journal of Clinical Epidemiology, Vol. 49, No. 11, 1996, pp. 1225-1231. doi:10.1016/S0895-4356(96)00002-9
JILSA