Using Anaconda and Jupyter Notebook to analyze school performance measures.
This project focused on cleaning, formatting, and analyzing school district data in order to determine how the schools performed. The district needed to know how to allocate funding to the schools based on how they had performed academically. The individual school size, type of school, and percentages of passing and failing totals were analyzed. After the final results had been presented to the school district, the 9th grade scores from Thomas High School were indicated as having cheated. The district had to be able to remove that school's results from the analysis report without affecting the other schools. The process used to reach this goal was to replace the 9th grade Thomas High School scores with NaN (Not a Number).
- As we can see in the images below, when comparing the original district summary and the revised district summary we can make a few conclusions.
- The Average Math Score fell by 0.1
- The Percentage of Passing Math fell by 1.1
- The Percentage of Passing Reading fell by 1.3
- The Overall Passing Percentage fell by 0.9
- As the images below show, the school summary also had some changes after Thomas High School 9th graders were removed from the calculations. Only Thomas High School was affected by the NaN scores.
- The Percentage of Passing Math fell by 26.4
- The Percentage of Passing Reading fell by 27.6
- The Overall Passing Percentage fell by 26
- Thomas High School used to be in the top percent of passing schools, now is at the bottom percent.
- Thomas High School 9th grade was the only grade affected by the revision, so only their reading and math score percentages changed.
- The scores by school spending range $630-644 were changed after the revision:
- The percentage of those passing math fell by 6.0
- The percentage of those passing reading fell by 7.0
- The overall passing percentage fell by 7.0
Revised School Spending Summary:
- The scores by school size also changed. The medium school size range percentages for the passing math, reading, and the overall passing percentage all fell by 6.0
- The scores by school type decreased as well. Since Thomas High School is a charter school, only the charter numbers were affected.
- The percentage of those passing math fell by 4.0
- The percentage of those passing reading fell by 4.0
- The percentage of those overall passing fell by 3.0
In conclusion, when NaN was used to replace the 9th grade scores from Thomas High School, the overall passing percentages did drop when analyzing parameters that had Thomas High School 9th grade in it. The charter school passing percentages dropped, the medium school size numbers dropped, and the passing percentages for reading and math dropped. Using NaN allowed us to only affect the data for Thomas High School while still keeping the total student numbers the same so the overall data analysis would still be accurate when it came to percentages.