Skip to content

Data cleaning process before to do some kind of analysis

Notifications You must be signed in to change notification settings

NicolaRizzitello/Data-Cleaning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

This is a project about data cleaning process before to do some kind of analysis.

Language

alt text

About Datasets

Data_athlets

This dataset is composed by 271116 rows and 14 columns. Dataset contains both winter and summer Olympic Games from Athens 1896 to Rio 2016. Each row corresponds to an athlet who partecipated in that game. The variables are:

  • ID - Unique number for each athlete
  • Name - Athlete's name
  • Sex - M or F
  • Age - Integer
  • Height - In centimeters
  • Weight - In kilograms
  • Team - Team name
  • NOC - National Olympic Committee 3-letter code
  • Games - Year and season
  • City - Host city
  • Sport - Sport
  • Event - Event
  • Kind_medal - Gold, Silver, Bronze, or NA
  • Medal - Y or N

Summary of analysis

  • Populate Team data
  • Breaking out Games into Individual Columns (Year, Season)
  • Change Y and N to Yes and No in "Medal" field

About

Data cleaning process before to do some kind of analysis

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published