Abstract
This paper presents a case study of using data mining techniques in the analysis of diagnosis and treatment events related to Breast Cancer disease. Data from over 16,000 patients has been pre-processed and several data mining techniques have been implemented by using Weka (Waikato Environment for Knowledge Analysis). In particular, Generalized Sequential Patterns mining has been used to discover frequent patterns from disease event sequence profiles based on groups of living and deceased patients. Furthermore, five models have been evaluated in Classification with the objective to classify the patients based on selected attributes. This research showcases the data mining process and techniques to transform large amounts of patient data into useful information and potentially valuable patterns to help understand cancer outcomes.
Original language | English |
---|---|
Title of host publication | 6th International Conference on IT in Bio- and Medical Informatics, September 2015, Valencia. |
Pages | 56-70 |
Number of pages | 15 |
Publication status | Published - 1 Sept 2015 |
Externally published | Yes |