Click here to view the file: PyCitySchools_Challenge
The purpose of this analysis is to replace the inaccurate data for the 9th graders at Thomas High School while keeping the rest of the data intact and perform the same analysis we did our module. The metrics that we want to analyze our data by include: total school budget, per student capita, average math and reading score, the passing percent for math and reading, and the overall passing percentage.
Much of the metrics from the original analysis remained the same. This makes sense given that we had a large set of data, and removing only a small portion of the data (math and reading scores for 9th graders at Thomas High School) would not drastically skew the data. Below is the table from the original analysis, which includes the math and reading scores for 9th graders at Thomas High School.
The following table is from the most recent analysis, where the math and reading scores for 9th graders at Thomas High School was removed. We can see that the only differences are the averages and percent passing at Thomas High School. Even with the removal of certain data, the overall results remained fairly the same.
The four major changes that occurred is the number of total students, the number of students counted at Thomas High School, the average math and reading scores, and the overall percentages for math and reading at Thomas High School. The removal of data implies a decrease in the count of total students overall, and total students at Thomas High School, specifically for this analysis. And since the population amount was decreased, this leads to a change in average scores and score percentages. Due to the fact that these score and percentage changes were minimal, we can assume that the removal of math and reading scores of 9th graders at Thomas High School not as significant as we would imagine it to be.