-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create a tableau workbook to identify outliers #3
Comments
Create a workbook for a boxplot for the entire data. The X axis should be the house id. The Y axis should be a sensor type (e.g. relative humidity, or temperature). We should create one boxplot for every type of sensor to identify outliers. Send a screenshot of the plot to slack tableau channel referring this issue link rather than post here. |
I just got word from Eileen that the second requested timestamp (see in the bottom the two clear sections of the requests is suspected to contain outliners in July and August. You should double check with her the precise time frame and use the time range to showcase the boxplots you have been working on for this issue, hopefully as a dashboard. Please post here once you get the precise time range. |
We decided to create boxplots for each PurposeID by "type" (power, luminosity, humidity, temperature), and create a Dashboard containing the 4 plots for each house. |
This is the most current version: I am also trying to work on creating a time-series version of this to display one month at a time that is easily changeable, which is important because the original boxplot shows all of the data at once: However, I'm having trouble getting one filter to apply independently to different dashboard on the same file (the building ID filter on image one also connects with the second image, meaning I can't look at 2 different buildings on separate dashboards at the same time. It will change both when I change one.) I will post an update once I figure that out. |
@eileenpeppard @ryantanaka @jygh98 Contrary to the missing data plot, this is an outlier plot. It is supposed to help us pick abnormal values. The main inspiration for this plot is the infamous egauge e792, We didn't realize there was something wrong with the PV always being 0 in this eGauge until 2 months later. In the dashboard above, this could have been easily spotted in the bottom-right corner of I wish I could show the boxplot of egaugee792 so you would actually see, but at the time not only we had an error in it, but also had a lock on it preventing to be accessed from the url, hence missing data, and therefore a problem for #6 workbook to solve. Although in this case, we would also notice the absence of the PV column if all data would be missing (but not just some). Notice the plots are intended to don't require another table. In Power, both appliances and purpose id are included. Future Work@kathrynparadis will be adding the room type for the purpose ids of the other plots in the dashboard (temperature, light, and humidity). @kathrynparadis p.s.: Remember to change the multiple choice box to a mutually exclusive choice box (aka radio buttons), as multiple choice here makes no sense. |
So here are my initial impressions of the plot
Also would it be possible to make the labels horizontal for the x-axis? Otherwise i think overall it looks pretty solid. |
|
Two-line plots, one for min and one for max for every sensor should be created. Use the dhhl database to prototype using the
readings
table, and then test it infrog_uhm
.Since this impact our ability to see if something is wrong, this issue is of high priority.
The text was updated successfully, but these errors were encountered: