Early prediction of reading disability using machine learning.

This paper presents application of machine learning methods on a 356 sample dataset for early prediction of reading disability among first graders. A wide array of classifiers consisting of Support Vector Machines, Decision Trees (CART and C4.5), Linear Discriminant Analysis, k Nearest Neighbor and Naïve Bayes Classifiers were used in this study. Markov Blanket based feature selection algorithms (HITON-PC and HITON-MB) and wrapper based feature selection algorithms (forward, backward, forward and backward wrapping algorithm and support vector machine recursive feature elimination) were used to select the most relevant features for classification. The results indicate that an AUC score greater than 0.9 can be achieved using SVM classifiers even with a small set of demographics and screening variables. Moreover, a method for generating expert interpretable decision tree models from the high accuracy SVM models is also presented.

VIVO