Skip to content

Using Anaconda and Jupyter Notebook to analyze school performance measures.

Notifications You must be signed in to change notification settings

YolandaCasica/School_District_Analysis

Repository files navigation

School_District_Analysis

Using Anaconda and Jupyter Notebook to analyze school performance measures.


Overview of the school district analysis:

This project focused on cleaning, formatting, and analyzing school district data in order to determine how the schools performed. The district needed to know how to allocate funding to the schools based on how they had performed academically. The individual school size, type of school, and percentages of passing and failing totals were analyzed. After the final results had been presented to the school district, the 9th grade scores from Thomas High School were indicated as having cheated. The district had to be able to remove that school's results from the analysis report without affecting the other schools. The process used to reach this goal was to replace the 9th grade Thomas High School scores with NaN (Not a Number).


Results:

  • As we can see in the images below, when comparing the original district summary and the revised district summary we can make a few conclusions.
  1. The Average Math Score fell by 0.1
  2. The Percentage of Passing Math fell by 1.1
  3. The Percentage of Passing Reading fell by 1.3
  4. The Overall Passing Percentage fell by 0.9

Original District Summary: Orig_Dist_Sum

Revised District Summary: Revis_Dist_Sum


  • As the images below show, the school summary also had some changes after Thomas High School 9th graders were removed from the calculations. Only Thomas High School was affected by the NaN scores.
  1. The Percentage of Passing Math fell by 26.4
  2. The Percentage of Passing Reading fell by 27.6
  3. The Overall Passing Percentage fell by 26

Original School Summary: Orig_Sch_Sum

Revised School Summary: Revis_Sch_Sum


  • Thomas High School used to be in the top percent of passing schools, now is at the bottom percent.

Original Top Schools Orig_Top_Schools

Revised Bottom Schools Revis_Bottom_Schs


  • Thomas High School 9th grade was the only grade affected by the revision, so only their reading and math score percentages changed.
  • The scores by school spending range $630-644 were changed after the revision:
  1. The percentage of those passing math fell by 6.0
  2. The percentage of those passing reading fell by 7.0
  3. The overall passing percentage fell by 7.0

Revised School Spending Summary: Revis_Spending_Sum


  • The scores by school size also changed. The medium school size range percentages for the passing math, reading, and the overall passing percentage all fell by 6.0

Revised School Size Summary Revis_Sch_Size_Sum


  • The scores by school type decreased as well. Since Thomas High School is a charter school, only the charter numbers were affected.
  1. The percentage of those passing math fell by 4.0
  2. The percentage of those passing reading fell by 4.0
  3. The percentage of those overall passing fell by 3.0

Revised School Type Summary Revis_Sch_Type_Sum


Summary:

In conclusion, when NaN was used to replace the 9th grade scores from Thomas High School, the overall passing percentages did drop when analyzing parameters that had Thomas High School 9th grade in it. The charter school passing percentages dropped, the medium school size numbers dropped, and the passing percentages for reading and math dropped. Using NaN allowed us to only affect the data for Thomas High School while still keeping the total student numbers the same so the overall data analysis would still be accurate when it came to percentages.

About

Using Anaconda and Jupyter Notebook to analyze school performance measures.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published