tailieunhanh - Comparing Association Rules and Decision Trees for Disease Prediction
Association rules represent a promising technique to find hidden patterns in a medical data set. The main issue about mining association rules in a medical data set is the large number of rules that are discovered, most of which are irrelevant. Such number of rules makes search slow and interpretation by the domain expert difficult. In this work, search constraints are introduced to find only medically significant association rules and make search more efficient. In medical terms, association rules relate heart perfusion measurements and patient risk factors to the degree of stenosis in four specific arteries. Association rule medical significance is evaluated with the usual support and confidence metrics, but also lift | Comparing Association Rules and Decision Trees for Disease Prediction Carlos Ordonez University of Houston Houston TX USA ABSTRACT Association rules represent a promising technique to find hidden patterns in a medical data set. The main issue about mining association rules in a medical data set is the large number of rules that are discovered most of which are irrelevant. Such number of rules makes search slow and interpretation by the domain expert difficult. In this work search constraints are introduced to find only medically significant association rules and make search more efficient. In medical terms association rules relate heart perfusion measurements and patient risk factors to the degree of stenosis in four specific arteries. Association rule medical significance is evaluated with the usual support and confidence metrics but also lift. Association rules are compared to predictive rules mined with decision trees a well-known machine learning technique. Decision trees are shown to be not as adequate for artery disease prediction as association rules. Experiments show decision trees tend to find few simple rules most rules have somewhat low reliability most attribute splits are different from medically common splits and most rules refer to very small sets of patients. In contrast association rules generally include simpler predictive rules they work well with user-binned attributes rule reliability is higher and rules generally refer to larger sets of patients. Categories and Subject Descriptors Database Management Database Applications Data Mining Computer Applications Life and Medical Sciences Health General Terms Algorithms Experimentation Keywords Association rule decision tree medical data Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full .
đang nạp các trang xem trước