TY - GEN
T1 - Predictive analytics for breast cancer survivability
T2 - 2nd International Conference on Information and Communication Technology for Competitive Strategies, ICTCS 2016
AU - Jhajharia, Smita
AU - Verma, Seema
AU - Kumar, Rajesh
N1 - Publisher Copyright:
© 2016 ACM.
PY - 2016/3/4
Y1 - 2016/3/4
N2 - Predicting the statistics of survival from the innumerable reports of breast cancer is a difficult job to perform, one that every researcher tackles in the field of medicals science. Since the birth stages of the related research, many new techniques have advanced. This point is evident from the groundbreaking biomedical technologies and discovery of better selfexplanatory predictive factors. Due to supply of cheaper computer hardware and easy availability of software technologies, large amount of qualitative data is being stored automatically. The developed ways of analyzing huge data quantities have bestowed upon this field efficient and effective means of processing. Five renowned data mining algorithms (Naive Bayes classification, decision trees, IBK, support vector machine (using SMO) and OneR,) were used for our prediction models applied to a substantiate dataset (more than 683cases). The10-fold cross-validation method was also taken into due consideration to determine the prediction accuracy of the five prognostic designs in comparing their credibility. Through a sensitivity analysis of support vector machines, we were able to gauge the prioritized importance of predictive factors in this work.
AB - Predicting the statistics of survival from the innumerable reports of breast cancer is a difficult job to perform, one that every researcher tackles in the field of medicals science. Since the birth stages of the related research, many new techniques have advanced. This point is evident from the groundbreaking biomedical technologies and discovery of better selfexplanatory predictive factors. Due to supply of cheaper computer hardware and easy availability of software technologies, large amount of qualitative data is being stored automatically. The developed ways of analyzing huge data quantities have bestowed upon this field efficient and effective means of processing. Five renowned data mining algorithms (Naive Bayes classification, decision trees, IBK, support vector machine (using SMO) and OneR,) were used for our prediction models applied to a substantiate dataset (more than 683cases). The10-fold cross-validation method was also taken into due consideration to determine the prediction accuracy of the five prognostic designs in comparing their credibility. Through a sensitivity analysis of support vector machines, we were able to gauge the prioritized importance of predictive factors in this work.
KW - Classification
KW - K-fold validation
KW - Predictive models
KW - UCI machine learning repository
UR - http://www.scopus.com/inward/record.url?scp=84988640464&partnerID=8YFLogxK
U2 - 10.1145/2905055.2905084
DO - 10.1145/2905055.2905084
M3 - Conference contribution
AN - SCOPUS:84988640464
T3 - ACM International Conference Proceeding Series
BT - Proceedings of the 2nd International Conference on Information and Communication Technology for Competitive Strategies, ICTCS 2016
PB - Association for Computing Machinery
Y2 - 4 March 2016 through 5 March 2016
ER -