Fix age <1 in aaa_clean_linelist #1

Open
thibautjombart opened this issue Oct 17, 2019 · 1 comment

thibautjombart commented Oct 17, 2019

From Neale Batra:

I am with Chris Jarvis and our new data manager Luke in Geneva. We have noticed that the age variable in the cleaned Master Linelist is missing for all patients < 1 year. Our investigation of the cleaning scripts showed that the problem stems from scientific notation in the raw Excel import.

For children under one, ages are entered as formulas such as =36/365, which then comes through as 9.8E-2. When the data are read into R in aaa_clean_linelist.rmd, the clean_data function on line 110 replaces 9.8E-2 with 9_8E_2. On line 366, where age is converted to numeric, gsub replaces the two underscores with a ., and the value becomes missing. This means we lose the age of every child who is under one year.
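The failure described above can be reproduced in a few lines. The following is only a rough sketch in Python rather than the R code from aaa_clean_linelist.rmd: the regular expression is an assumed stand-in for what clean_data does to the string, and `to_numeric` is a hypothetical helper that mimics a conversion returning a missing value on failure.

```python
import re


def to_numeric(text):
    """Hypothetical helper: return None (missing) when the text is not a number."""
    try:
        return float(text)
    except ValueError:
        return None


raw = "9.8E-2"  # how =36/365 arrives from the Excel import

# Assumed stand-in for the cleaning step: non-alphanumeric characters become "_".
cleaned = re.sub(r"[^A-Za-z0-9]", "_", raw)

# Both underscores are turned back into dots before the numeric conversion.
restored = cleaned.replace("_", ".")

broken_age = to_numeric(restored)  # the minus sign is gone, so parsing fails
direct_age = to_numeric(raw)       # converting before cleaning keeps the value
```

The sketch suggests one possible direction for a fix: convert age to numeric before the text-cleaning step runs, or exclude the age column from that step, so the exponent's minus sign is never rewritten.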

@joewalker127

See pull request.
