Full repository review for VRE on-boarding of @benelot by @stepaza #1

benelot · 2019-08-12T12:12:06Z

This is an artificially created full repository review. If you ever need to do the same, I added the procedure to do it in on the wiki: https://github.com/IDSC-io/idsc-data-science-wiki/wiki/Git#conduct-a-full-repository-review-on-github

…itHub as master

benelot

I am done with my first review without having run the code. So this is very preliminary and I only have a limited understanding as of now.

benelot · 2019-08-12T12:14:49Z

.gitignore

@@ -0,0 +1,299 @@
+


Full code repository review

All comments to it are inline to the repository. Please scroll on.

I suggest a clean up of the code before anything else is done:

Migration into datascience cookiecutter template

Code compliance

Clean out unused code (it is still in the version control)

Switch to english wording (I am a bit irritated by the german words that leaked into the function names etc. I would name all entities in proper english and make a clean cut between the database names and the code names. Every idea of selling a product in my opinion is diminished by language-dependent code.)

How do I get access to the Atelier_DS? Assume I am a noob on all the CDWH stuff, tell me everything, I will tell you what I know.

Who designed the dataset? It seems to pull very specific data, so I would not know how to extend it on the fly.

How do we proceed from here?

benelot · 2019-08-12T12:16:57Z

spitalhygiene/Docs/build/html/.buildinfo

@@ -0,0 +1,4 @@
+# Sphinx build info version 1


I think we should remove the docs build from the repository to reduce the clutter

benelot · 2019-08-12T12:19:17Z

spitalhygiene/Docs/source/Patient_Test_Data.rst

@@ -0,0 +1,258 @@
+********************


Patient Test Dataset

Is this for development vs. the full dataset for validation?

benelot · 2019-08-12T12:28:14Z

spitalhygiene/Docs/source/resources.rst

+This folder contains important functions for loading data from SQL into CSV, thereby preparing the "raw" data used for
+building the actual network models.
+
+Most importantly, this folder also contains the file ``Update_Model.sh``, which is a bash script controlling `all steps`


I should probably review that script.

benelot · 2019-08-12T12:32:39Z

spitalhygiene/Unused/BWTYP-BWART.csv

@@ -0,0 +1,72 @@
+BWART,Text


@stepaza: Quickly explain the original idea of the content of 'Unused'.

benelot · 2019-08-12T12:58:25Z

spitalhygiene/vre/src/main/python/vre/model/Appointment.py

@@ -0,0 +1,133 @@
+# -*- coding: utf-8 -*-


Model classes

benelot · 2019-08-12T13:00:41Z

spitalhygiene/vre/src/main/python/vre/networkx_graph.py

+
+
+
+        # logging.info(f"##################################################################################")


@stepaza That is a lot of unused, commented code here.

benelot · 2019-08-12T13:01:44Z

spitalhygiene/vre/src/main/python/vre/quality_control/distribution_export.py

+        write_file.write(csv_sep.join(data_list) + '\n')
+    print('\nSuccessfully collected patient information !\n')
+
+


Many trailing line feeds....

benelot · 2019-08-12T13:02:08Z

spitalhygiene/vre/src/main/python/vre/test.py

+import itertools
+import random
+import json
+


@stepaza Can this be deleted?

benelot · 2019-08-12T13:02:59Z

spitalhygiene/vre/src/main/python/vre/test_data_loader.py

@@ -0,0 +1,22 @@
+# Small script to test the patient import defined in the HDFS_data_loader.py script
+


@stepaza Is this a user-driven unit test? Could we make a unit test out of it?

stepaza and others added 11 commits July 29, 2019 09:51

Initial commit to migrate VRE branch from Sqooba BitBucket to Insel G…

d227de5

…itHub as master

Added README.md and upddated .gitignore file

ef7968e

Update README.md

2771b81

updated .gitignore and adjuted docstrings

fbc8123

Merge branch 'master' of https://github.com/IDSC-io/vre-data-science

263aff5

removed unused .idea directory

25fd781

Updated data_compiler.py, preprocessor.py and Update_Model.sh scripts

42cd107

Updated Sphinx Documentation

f77dd8a

Updated SQL queries for data extraction

560a591

Adjusted VRE Screen data loading

250c26c

Merge branch 'master' into review

ea8a260

benelot changed the title ~~Full repository review for handover from @stepaza to @benelot within the DS team~~ Full repository review for VRE on-boarding of @benelot by @stepaza Aug 12, 2019

benelot commented Aug 12, 2019

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Full repository review for VRE on-boarding of @benelot by @stepaza #1

Full repository review for VRE on-boarding of @benelot by @stepaza #1

benelot commented Aug 12, 2019

benelot left a comment

benelot Aug 12, 2019

benelot Aug 12, 2019

benelot Aug 12, 2019

benelot Aug 12, 2019

benelot Aug 12, 2019

benelot Aug 12, 2019

benelot Aug 12, 2019

benelot Aug 12, 2019

benelot Aug 12, 2019

benelot Aug 12, 2019

benelot Aug 12, 2019

benelot Aug 12, 2019

benelot Aug 12, 2019

benelot Aug 12, 2019




		# logging.info(f"##################################################################################")

		write_file.write(csv_sep.join(data_list) + '\n')
		print('\nSuccessfully collected patient information !\n')

		@@ -0,0 +1,22 @@
		# Small script to test the patient import defined in the HDFS_data_loader.py script

Full repository review for VRE on-boarding of @benelot by @stepaza #1

Are you sure you want to change the base?

Full repository review for VRE on-boarding of @benelot by @stepaza #1

Conversation

benelot commented Aug 12, 2019

benelot left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Full code repository review

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Patient Test Dataset

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Model classes

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment