ITC516 Data Mining and Visualisation for Business Intelligence – Assessment 3 – Weka Data Mining


Task: Weka Data Mining Practical and Report [15 marks]

There are two steps to complete this task:

Step 1: Perform a data mining task that evaluates different classification algorithms. Load the breast-cancer.arff data set into Weka and compare the performance of the following classification algorithms on it:

  • Naive Bayes
  • J48
  • Random Forest
  • Random Tree
  • REPTree

Step 2: Using the outputs from Step 1, write a report that presents the performance of each algorithm and comments on its accuracy, using the confusion matrix and the other performance metrics that Weka reports. In your report, consider:

  • Is there a difference in performance between the algorithms?
  • Which algorithm performs best?
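As an aid to reading the confusion matrix, the following minimal Python sketch shows how accuracy, precision, recall and F-measure can be derived from a two-class matrix laid out the way Weka prints it (rows are actual classes, columns are the "classified as" predictions). The function name, the classifier labels and all counts are hypothetical placeholders for illustration only; they are not results for breast-cancer.arff and should be replaced with the counts from your own Weka output.

```python
def metrics_from_confusion(matrix, positive=0):
    """Derive summary metrics from a 2x2 confusion matrix.

    matrix[i][j] = number of instances of actual class i predicted as class j
    (rows = actual, columns = predicted). `positive` is the index of the
    class treated as the positive class.
    """
    negative = 1 - positive
    tp = matrix[positive][positive]   # positives predicted as positive
    fn = matrix[positive][negative]   # positives predicted as negative
    fp = matrix[negative][positive]   # negatives predicted as positive
    tn = matrix[negative][negative]   # negatives predicted as negative

    total = tp + fn + fp + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f_measure = 2 * precision * recall / denom if denom else 0.0
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f_measure": f_measure,
    }


# Placeholder matrices (NOT real results); copy the real counts from Weka.
example_matrices = {
    "classifier_A": [[40, 10], [5, 45]],
    "classifier_B": [[35, 15], [10, 40]],
}

# Compute the metrics for each classifier and rank them by accuracy.
results = {name: metrics_from_confusion(m) for name, m in example_matrices.items()}
ranking = sorted(results, key=lambda name: results[name]["accuracy"], reverse=True)

for name in ranking:
    r = results[name]
    print(f"{name}: accuracy={r['accuracy']:.3f} "
          f"precision={r['precision']:.3f} recall={r['recall']:.3f} "
          f"f_measure={r['f_measure']:.3f}")
```

Accuracy alone can be misleading when the classes are unevenly sized, so comparing precision and recall for each class alongside it may give a fairer picture of which algorithm performs best.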

Your report should include the screenshots, tables, graphs, etc. needed to make it understandable to the reader.
