Numeric fixes + combinators #99
(Haven't fixed the 7.10 build yet, ignore that failure)

      , "value" .= toJSON v
      ]
    fromMean NoMean = object [
        "type" .= String "no-mean"
The usual question, I'm assuming we're not worried about any live data for this yet?
Yep, the NumericSummary isn't in any live data.
Anyone mathsy feel like sanity-checking the mean/median combining? I would ping Huw or Aaron, but data scientists can't see the repo anymore. If not I can get an IRL review on Monday. /cc @jystic @markhibberd
@amosr maybe?
@olorin Can you give me a bit more time before you merge?
@markhibberd yeah, waiting on that review from you.
@olorin This looks good to me 👍 I have stepped through and can't see any issues. Possibly worth a chat sometime on how to make these easier to review for everyone. We need to get to the point where we provide enough context, and have enough of our own shared body of knowledge, that anyone can get in and desk-check these algorithms rather than fobbing them off. Having soft-references for TAoCP and stats books, and having a standard library and a way of referencing it for this type of code, would go a long way I think. I will have a think about whether there is anything easy we can do to facilitate this kind of thing, but ideas very welcome.
Agreed, this sounds good. Will have a think about ways we could implement it. I like the idea of having standard libraries for commonly-needed fiddly implementations, like some numeric things (this was the idea behind [...]). I was also wondering about including derivations for any non-obvious numeric stuff in either comments or associated documentation, so anyone interested can check the work months down the track. It's not the easiest thing to do in text format, but pandoc can handle markdown-embedded LaTeX and it could be built along with the haddocks.
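As a rough sketch of what such an embedded derivation could look like for combining a partial mean and variance: the symbols below are assumptions chosen for illustration, not names from this PR, and the formulas are the standard pairwise-combination identities rather than a transcription of the code under review. Here $A$ and $B$ are disjoint samples with counts $n_A, n_B$, means $\mu_A, \mu_B$, and sums of squared deviations $M_{2,A}, M_{2,B}$. The second line of the $M_2$ expansion relies on $\sum_{x \in A} (x - \mu_A) = 0$ (and likewise for $B$), which removes the cross terms.

```latex
\begin{align*}
n      &= n_A + n_B, \qquad \delta = \mu_B - \mu_A \\
\mu    &= \frac{n_A \mu_A + n_B \mu_B}{n} = \mu_A + \delta \, \frac{n_B}{n} \\
M_2    &= \sum_{x \in A} (x - \mu)^2 + \sum_{x \in B} (x - \mu)^2 \\
       &= M_{2,A} + n_A (\mu_A - \mu)^2 + M_{2,B} + n_B (\mu_B - \mu)^2 \\
       &= M_{2,A} + M_{2,B} + \delta^2 \, \frac{n_A n_B}{n} \\
\sigma &= \sqrt{M_2 / n} \qquad \text{(population standard deviation)}
\end{align*}
```

The last step of the $M_2$ expansion uses $\mu_A - \mu = -\delta \, n_B / n$ and $\mu_B - \mu = \delta \, n_A / n$, both of which follow from the expression for $\mu$.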
Turns out I wasn't paying attention when I wrote the stddev accumulator - it was totally borked, and there was a bug in the tests which let it go undetected. Redid a bunch of the numeric types in order to fix.
Also: implemented the functions for combining partial numeric results (for resolving intermediate results of the parallel folds).
/cc @charleso @tmcgilchrist @thumphries
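
For anyone reviewing without the diff open, here is a minimal sketch of the general shape of a single-pass stddev accumulator and a function for combining two partial results. It is not the code in this PR: the `Moments` type and the names `update`, `combine`, `stddev`, and `fromList` are hypothetical, and the approach shown (a Welford-style update plus the usual pairwise merge of count, mean, and sum of squared deviations) is an assumption about one reasonable way to do it.

```haskell
import Data.List (foldl')

-- | Hypothetical accumulator (not the PR's actual type): element count,
-- running mean, and sum of squared deviations from the mean (M2).
data Moments = Moments
  { mCount :: !Int
  , mMean  :: !Double
  , mM2    :: !Double
  } deriving (Eq, Show)

emptyMoments :: Moments
emptyMoments = Moments 0 0 0

-- | Welford-style single-pass update with one new observation.
update :: Moments -> Double -> Moments
update (Moments n mu m2) x =
  let n'    = n + 1
      delta = x - mu
      mu'   = mu + delta / fromIntegral n'
      -- Uses the old and new mean together; this avoids the
      -- cancellation problems of the naive sum-of-squares formula.
      m2'   = m2 + delta * (x - mu')
  in Moments n' mu' m2'

-- | Combine two partial results computed over disjoint chunks,
-- e.g. the intermediate results of a parallel fold.
combine :: Moments -> Moments -> Moments
combine a@(Moments na mua m2a) b@(Moments nb mub m2b)
  | na == 0   = b
  | nb == 0   = a
  | otherwise =
      let n     = na + nb
          fna   = fromIntegral na
          fnb   = fromIntegral nb
          fn    = fromIntegral n
          delta = mub - mua
          mu    = mua + delta * fnb / fn
          m2    = m2a + m2b + delta * delta * fna * fnb / fn
      in Moments n mu m2

-- | Population standard deviation; Nothing for an empty accumulator.
stddev :: Moments -> Maybe Double
stddev (Moments n _ m2)
  | n == 0    = Nothing
  | otherwise = Just (sqrt (m2 / fromIntegral n))

-- | Fold a list of observations into an accumulator.
fromList :: [Double] -> Moments
fromList = foldl' update emptyMoments
```

One way to guard against the kind of test bug mentioned above would be a property check that `combine (fromList xs) (fromList ys)` agrees with `fromList (xs ++ ys)` to within a floating-point tolerance, alongside a comparison against a naive two-pass computation.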