Remember to git add && git commit && git push each exercise!
We will execute your function with our own test(s); please DO NOT PROVIDE ANY TEST(S) in your file.
For each exercise, create a folder that contains the files holding your work. The folder name is given at the beginning of each exercise under "Submit directory", and the file names are given under "Submit files".
My M And A | |
---|---|
Submit directory | . |
Submit files | my_m_and_a.py - my_ds_babel.py |
You've worked as a Junior Data Engineer at Plastic Free Boutique for three months.
Your first mission was to build a strong, robust, and scalable customer database to support the exponential growth the company expects soon. Your manager is delighted.
We've just acquired a new company, Only Wood Box, which will be a perfect fit for our packaging department. They are experts in making light wooden packaging at a competitive, cheap price.
Experts in their own technology, they didn't believe in the digital world. Despite a decent number of customers, they never had to invest in their infrastructure. Before quitting, their engineer told us that at least we had stored all the information; I don't understand what he meant.
You should use import pandas as pd
Your mission will be to merge their three customer tables (yes, 3 :D) into ours.
Our database schema:
"gender" - 'string'
"firstname" - 'string'
"Lastname" - 'string'
"email" - 'string'
"age" - 'string'
"city" - 'string'
"country" - 'string'
"created_at" - 'string'
"referral" - 'string'
1# Your function will be called my_m_and_a and will receive the content of the 3 CSV files.
2# Import your function from my_ds_babel to save the CSV's content into SQL.
We want to move on after this merger & acquisition and we don't want to keep their .csv files; if they are seen in your repository during your 1-1 meeting (Peer Review), it will be considered a fail for this project.
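For step 1#, here is a minimal sketch of one way to align three CSV contents to the schema above and stack them with pandas. It rests on assumptions the exercise does not confirm: each content is comma-separated text with a header row, and header names match the schema apart from case. The real Only Wood Box files may need per-table cleanup first. SCHEMA and to_schema are hypothetical names introduced for illustration only.

```python
import io

import pandas as pd

# Column names copied from the schema above, spelling and case kept as written.
SCHEMA = ["gender", "firstname", "Lastname", "email", "age",
          "city", "country", "created_at", "referral"]


def to_schema(content):
    """Hypothetical helper: parse one CSV text and align it to SCHEMA."""
    # Assumption: comma-separated text with a header row.
    # dtype=str keeps every value as text, matching the all-string schema.
    df = pd.read_csv(io.StringIO(content), dtype=str)
    # Match header names to schema names, ignoring case and outer spaces.
    by_lower = {name.lower(): name for name in SCHEMA}
    df = df.rename(columns=lambda c: by_lower.get(str(c).strip().lower(), c))
    # reindex orders the schema columns, adds missing ones as NaN, drops extras.
    # The nullable "string" dtype keeps missing values missing.
    return df.reindex(columns=SCHEMA).astype("string")


def my_m_and_a(content_database_1, content_database_2, content_database_3):
    """Sketch: stack the three aligned tables into one DataFrame."""
    frames = [to_schema(content) for content in
              (content_database_1, content_database_2, content_database_3)]
    # ignore_index=True renumbers the rows across the three tables.
    return pd.concat(frames, ignore_index=True)


# Made-up inputs, for illustration only.
merged = my_m_and_a("FirstName,Age\nAda,36\n",
                    "firstname,city\nGrace,Paris\n",
                    "email,phone\nlinus@example.com,555\n")
print(merged[["firstname", "age", "city", "email"]])
```

Aligning each table before concatenating keeps the column order fixed, so a column that exists in only one source shows up as missing values in the other two rather than shifting the layout.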
Example00
merged_csv = my_m_and_a(content_database_1, content_database_2, content_database_3)
my_ds_babel.csv_to_sql(merged_csv, 'plastic_free_boutique.sql', 'customers')
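The csv_to_sql call above could be backed by something like the following sketch. It assumes the target is a SQLite database file and that the first argument is either a pandas DataFrame or CSV text; the exercise specifies neither, so treat both as assumptions. The temporary path and sample row are made up for the demonstration.

```python
import io
import os
import sqlite3
import tempfile

import pandas as pd


def csv_to_sql(csv_content, database, table_name):
    """Sketch: write CSV data into a table of a SQLite database file."""
    # Assumption: the input is a DataFrame or CSV text with a header row.
    if isinstance(csv_content, pd.DataFrame):
        df = csv_content
    else:
        df = pd.read_csv(io.StringIO(csv_content), dtype=str)
    connection = sqlite3.connect(database)
    try:
        # if_exists="replace" recreates the table; index=False skips the row index.
        df.to_sql(table_name, connection, if_exists="replace", index=False)
        connection.commit()
    finally:
        connection.close()


# Demonstration on a throwaway file, so nothing lands in the repository.
path = os.path.join(tempfile.mkdtemp(), "demo.sql")
csv_to_sql("firstname,age\nAda,36\n", path, "customers")

check = sqlite3.connect(path)
rows = check.execute("SELECT firstname, age FROM customers").fetchall()
check.close()
print(rows)
```

Using if_exists="replace" makes reruns idempotent; "append" would be the choice if the customers table already holds rows that must be kept.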
Tip: Google ".gitignore file" :-)