Kernel: Python 3 (Anaconda 5)
SIT307 Assignment 2
Students:
Mitchell Razga - 218232709
Madushi Menahari Jayasundara - 217206634
Mario Silva - 217425643
Load Modules and Packages
In [1]:
Import Data
In [2]:
Reading data from CSV....
Encode Data
In [11]:
Year | Semester | Hands Raised | Resources Visited | Announcements Viewed | Discussions Participated In | Grade | Gender_F | Gender_M | Nationality_Egypt | ... | Subject_Science | Subject_Spanish | Parent Responsible_Father | Parent Responsible_Mother | Parent Survey Completed_No | Parent Survey Completed_Yes | Parent School Satisfaction_Bad | Parent School Satisfaction_Good | Absence Days_Above-7 | Absence Days_Under-7 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 8 | 1 | 30 | 90 | 33 | 35 | Middle-Level | 1 | 0 | 0 | ... | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 1 |
1 | 8 | 1 | 35 | 80 | 50 | 70 | High-Level | 1 | 0 | 0 | ... | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 1 |
2 | 2 | 1 | 98 | 88 | 60 | 31 | High-Level | 1 | 0 | 0 | ... | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 1 | 0 | 1 |
3 | 2 | 1 | 10 | 20 | 22 | 97 | Low-Level | 1 | 0 | 0 | ... | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
4 | 2 | 1 | 11 | 20 | 20 | 98 | Low-Level | 1 | 0 | 0 | ... | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
5 | 2 | 1 | 89 | 92 | 40 | 28 | High-Level | 1 | 0 | 0 | ... | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 |
6 | 8 | 2 | 25 | 15 | 32 | 53 | Middle-Level | 1 | 0 | 0 | ... | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 1 |
7 | 8 | 2 | 80 | 71 | 52 | 51 | Middle-Level | 1 | 0 | 0 | ... | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 1 |
8 | 8 | 2 | 85 | 66 | 12 | 23 | Middle-Level | 1 | 0 | 0 | ... | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 1 |
9 | 8 | 2 | 45 | 58 | 52 | 43 | High-Level | 1 | 0 | 0 | ... | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 |
10 | 8 | 2 | 22 | 51 | 42 | 40 | Middle-Level | 1 | 0 | 0 | ... | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 1 |
11 | 8 | 2 | 72 | 51 | 42 | 24 | High-Level | 1 | 0 | 0 | ... | 0 | 0 | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 0 |
12 | 2 | 2 | 75 | 81 | 51 | 34 | High-Level | 1 | 0 | 0 | ... | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 1 | 0 | 1 |
13 | 2 | 2 | 5 | 9 | 19 | 98 | Low-Level | 1 | 0 | 0 | ... | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
14 | 2 | 2 | 10 | 12 | 29 | 93 | Low-Level | 1 | 0 | 0 | ... | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
15 | 2 | 2 | 79 | 93 | 49 | 23 | High-Level | 1 | 0 | 0 | ... | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 |
16 | 8 | 1 | 25 | 15 | 12 | 33 | Low-Level | 0 | 1 | 0 | ... | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
17 | 2 | 1 | 20 | 88 | 31 | 28 | Middle-Level | 0 | 1 | 0 | ... | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 1 | 1 | 0 |
18 | 2 | 1 | 90 | 98 | 41 | 38 | High-Level | 0 | 1 | 0 | ... | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 1 |
19 | 2 | 1 | 80 | 95 | 21 | 28 | High-Level | 0 | 1 | 0 | ... | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 1 |
20 | 2 | 1 | 10 | 18 | 71 | 38 | Middle-Level | 0 | 1 | 0 | ... | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 1 | 1 | 0 |
21 | 2 | 1 | 10 | 17 | 50 | 21 | Middle-Level | 0 | 1 | 0 | ... | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 1 |
22 | 2 | 1 | 10 | 10 | 40 | 51 | Low-Level | 0 | 1 | 0 | ... | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
23 | 2 | 1 | 20 | 90 | 50 | 61 | Middle-Level | 0 | 1 | 0 | ... | 0 | 0 | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 0 |
24 | 2 | 1 | 10 | 30 | 50 | 91 | Low-Level | 0 | 1 | 0 | ... | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 1 | 0 |
25 | 2 | 1 | 69 | 82 | 20 | 28 | High-Level | 0 | 1 | 0 | ... | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 |
26 | 2 | 1 | 15 | 90 | 21 | 97 | Middle-Level | 0 | 1 | 0 | ... | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 |
27 | 2 | 1 | 4 | 10 | 11 | 7 | Low-Level | 0 | 1 | 0 | ... | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 1 | 1 | 0 |
28 | 8 | 2 | 85 | 75 | 62 | 53 | High-Level | 0 | 1 | 0 | ... | 0 | 0 | 0 | 1 | 0 | 1 | 1 | 0 | 0 | 1 |
29 | 8 | 2 | 10 | 35 | 30 | 13 | Low-Level | 0 | 1 | 0 | ... | 0 | 0 | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 0 |
... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
450 | 4 | 2 | 32 | 14 | 32 | 29 | Middle-Level | 0 | 1 | 0 | ... | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 1 | 1 | 0 |
451 | 4 | 2 | 22 | 34 | 15 | 9 | Low-Level | 0 | 1 | 0 | ... | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
452 | 4 | 2 | 72 | 64 | 59 | 89 | High-Level | 0 | 1 | 0 | ... | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 |
453 | 4 | 2 | 82 | 84 | 79 | 79 | Middle-Level | 0 | 1 | 0 | ... | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 1 | 0 |
454 | 4 | 2 | 42 | 34 | 29 | 39 | Middle-Level | 0 | 1 | 0 | ... | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 1 | 1 | 0 |
455 | 8 | 2 | 87 | 88 | 40 | 10 | Middle-Level | 1 | 0 | 0 | ... | 0 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 1 |
456 | 11 | 2 | 10 | 51 | 40 | 40 | Low-Level | 0 | 1 | 0 | ... | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
457 | 8 | 2 | 17 | 21 | 42 | 14 | Middle-Level | 0 | 1 | 0 | ... | 0 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 1 |
458 | 8 | 2 | 27 | 41 | 49 | 14 | Middle-Level | 0 | 1 | 0 | ... | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 0 | 0 | 1 |
459 | 8 | 2 | 70 | 81 | 39 | 84 | Middle-Level | 0 | 1 | 0 | ... | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 1 |
460 | 8 | 2 | 27 | 90 | 82 | 14 | High-Level | 0 | 1 | 0 | ... | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 |
461 | 8 | 2 | 17 | 61 | 42 | 14 | Middle-Level | 0 | 1 | 0 | ... | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 1 |
462 | 8 | 2 | 87 | 81 | 42 | 19 | High-Level | 0 | 1 | 0 | ... | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 |
463 | 8 | 2 | 7 | 61 | 22 | 14 | Low-Level | 0 | 1 | 0 | ... | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
464 | 8 | 2 | 17 | 50 | 2 | 4 | Low-Level | 0 | 1 | 0 | ... | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
465 | 8 | 2 | 5 | 21 | 42 | 14 | Low-Level | 0 | 1 | 0 | ... | 0 | 1 | 1 | 0 | 1 | 0 | 0 | 1 | 1 | 0 |
466 | 8 | 2 | 27 | 41 | 32 | 61 | Middle-Level | 0 | 1 | 0 | ... | 0 | 1 | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 0 |
467 | 8 | 2 | 96 | 61 | 42 | 94 | High-Level | 0 | 1 | 0 | ... | 0 | 1 | 0 | 1 | 0 | 1 | 1 | 0 | 0 | 1 |
468 | 8 | 2 | 57 | 51 | 46 | 34 | Middle-Level | 0 | 1 | 0 | ... | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 |
469 | 8 | 2 | 77 | 69 | 41 | 13 | Middle-Level | 0 | 1 | 0 | ... | 0 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 1 |
470 | 8 | 2 | 80 | 51 | 40 | 24 | Middle-Level | 0 | 1 | 0 | ... | 0 | 1 | 1 | 0 | 1 | 0 | 0 | 1 | 0 | 1 |
471 | 8 | 2 | 62 | 61 | 82 | 40 | Middle-Level | 0 | 1 | 0 | ... | 0 | 1 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 1 |
472 | 8 | 2 | 72 | 83 | 12 | 90 | High-Level | 0 | 1 | 0 | ... | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 |
473 | 8 | 2 | 87 | 81 | 22 | 70 | High-Level | 0 | 1 | 0 | ... | 0 | 1 | 0 | 1 | 0 | 1 | 1 | 0 | 0 | 1 |
474 | 8 | 2 | 72 | 90 | 12 | 30 | Middle-Level | 0 | 1 | 0 | ... | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 0 | 0 | 1 |
475 | 8 | 2 | 2 | 11 | 62 | 30 | Low-Level | 0 | 1 | 0 | ... | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 1 |
476 | 8 | 2 | 5 | 3 | 2 | 10 | Low-Level | 0 | 1 | 0 | ... | 0 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 1 |
477 | 8 | 2 | 5 | 17 | 21 | 10 | Low-Level | 0 | 1 | 0 | ... | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 |
478 | 8 | 2 | 51 | 42 | 12 | 29 | Middle-Level | 0 | 1 | 0 | ... | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 0 |
479 | 8 | 2 | 9 | 7 | 21 | 20 | Low-Level | 0 | 1 | 0 | ... | 0 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 1 | 0 |
480 rows × 63 columns
Configure data for classifier
In [12]:
Generate Test Data - 50/50 Split
In [13]:
Training Data size: 240 240
Test Data size: 240 240
Generate Decision Tree - 50/50 Split
In [14]:
Accurracy: 0.6875
<bound method BaseEstimator.get_params of DecisionTreeClassifier(class_weight=None, criterion='gini', max_depth=None,
max_features=None, max_leaf_nodes=None,
min_impurity_decrease=0.0, min_impurity_split=None,
min_samples_leaf=1, min_samples_split=2,
min_weight_fraction_leaf=0.0, presort=False, random_state=2000,
splitter='best')>
Visualise
In [15]:
In [16]:
File "<ipython-input-16-297303b31446>", line 38
def PlotDecisionTreeAccuracy(x, y):
^
IndentationError: expected an indented block
In [0]:
In [0]:
In [0]:
In [0]:
In [0]:
In [0]:
In [0]:
In [0]:
Random Forests
In [0]:
In [0]:
Improving Accuracy
Remove Outliers
In [0]:
Remove less important columns
In [0]:
Show Correlation
In [0]:
In [0]:
Remove Columns
In [0]:
Increase Split
In [0]:
In [0]:
Change parameters
In [0]:
In [0]:
In [0]:
In [0]:
In [0]:
In [0]:
In [0]:
In [0]:
In [0]:
In [0]:
In [0]:
In [0]:
In [0]: