Skip to content

Commit

Permalink
updates
Browse files Browse the repository at this point in the history
  • Loading branch information
ogbinar committed Feb 19, 2024
1 parent b9d1612 commit 507d490
Show file tree
Hide file tree
Showing 5 changed files with 88 additions and 46 deletions.
10 changes: 6 additions & 4 deletions _quarto.yml
Original file line number Diff line number Diff line change
Expand Up @@ -24,16 +24,18 @@ website:
logo: images/favicon-32x32.png
href: https://dataengineering.ph/
right:
- text: "Home"
href: index.qmd
# - text: "Home"
# href: index.qmd
- text: "About"
href: about.qmd
- text: "Study Roadmap"
href: study-roadmap.qmd
- text: "Community"
href: community.qmd
href: community.qmd
- text: "Partnerships"
href: partners.qmd
- text: "Knowledgebase"
href: technical-knowledgebase.qmd
href: https://github.com/igorbarinov/awesome-data-engineering
- text: "Resources"
href: resources.qmd
- text: "Datasets"
Expand Down
57 changes: 33 additions & 24 deletions community.qmd
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,39 @@ description: |
Explore a rich collection of community-driven content, featuring insightful videos, blogs, and articles from data engineering experts. These resources offer valuable perspectives on various aspects of data engineering and analytics.
---


## People and Pages

As a community, we want to support our homegrown content creators and experts. Please check them out as they champion data advocacy:

- [Doc Ligot](https://docligot.com/)
- [Sherwin Pelayo](https://www.linkedin.com/in/sherwinpelayo/)
- [Kyle Escosia](https://linktr.ee/klescosia)
- [Sandy C. Lauguico](https://www.facebook.com/sandy.c.lauguico)
- [Josh Dev](https://www.facebook.com/profile.php?id=100087019650476)
- [Kuya Dev](https://www.facebook.com/KuyaDevDotCom)
- [Dev Stuff with JP](https://www.facebook.com/devstuffwithjp)
- [Power BI Pilipinas](https://www.facebook.com/powerbipilipinas)
- [Alex Gamboa](https://www.facebook.com/alexgamboa34/)
- [Aemyn Obinguar](https://www.facebook.com/aemyn.obinguar)
- [Furtim Dev](https://www.facebook.com/furtimdev)
- [Prof. Bob S.](https://www.youtube.com/@thetechedhelpphil4499)
- [Danielle Meer](https://www.linkedin.com/in/danielle-meer-7b7891209)
- [Atcha Abe](https://www.facebook.com/athenaaliafe)
- [The Agile Geek](https://theagilegeek.bio.link/)

## Blogs and Articles

- [Snowflake in the Philippines](https://medium.com/@kyle.escosia/the-rise-of-snowflake-in-the-philippines-why-its-the-hottest-thing-in-big-data-right-now-9b9d09c11e89) - Kyle Escosia's insights on Snowflake's rise in the Philippines.
- [UPSERTS and DELETS using AWS Glue and Delta Lake](https://dev.to/awscommunity-asean/sql-based-inserts-deletes-and-upserts-in-s3-using-aws-glue-3-0-and-delta-lake-42f0) - A guide on using AWS Glue with Delta Lake for data operations.
- [Table Indexes](https://github.com/ogbinar/DataEngineeringPilipinas/tree/main/content/table_index) - An educational piece on the importance and implementation of table indexes.

## Projects

- [Basic ETL project](https://github.com/ogbinar/DataEngineeringPilipinas/tree/main/content/projects/pipeline_basic)
- [ETL Sales](https://github.com/ogbinar/DataEngineeringPilipinas/tree/main/content/projects/ETL-SALES)
- [ETL Project with Azure Databricks](https://github.com/ogbinar/DataEngineeringPilipinas/tree/main/content/projects/ETL-Databricks-code) - A walkthrough of an ETL project using Azure Databricks.
- [Containerization](https://github.com/ogbinar/DataEngineeringPilipinas/tree/main/content/projects/containerization) - An exploration of containerization in data engineering.


## Videos and Presentations

Expand Down Expand Up @@ -70,26 +102,3 @@ The "Kwentuhan Meetup" is an online event for members of the Data Engineering Pi
{{< video https://www.youtube.com/watch?v=O1JXjUYwWc8 title="Jerel John Velarde's Mission in AI Startups for Democracy" >}}


## Blogs and Articles

- [Snowflake in the Philippines](https://medium.com/@kyle.escosia/the-rise-of-snowflake-in-the-philippines-why-its-the-hottest-thing-in-big-data-right-now-9b9d09c11e89) - Kyle Escosia's insights on Snowflake's rise in the Philippines.
- [UPSERTS and DELETS using AWS Glue and Delta Lake](https://dev.to/awscommunity-asean/sql-based-inserts-deletes-and-upserts-in-s3-using-aws-glue-3-0-and-delta-lake-42f0) - A guide on using AWS Glue with Delta Lake for data operations.
- [Table Indexes](https://github.com/ogbinar/DataEngineeringPilipinas/tree/main/content/table_index) - An educational piece on the importance and implementation of table indexes.

## Projects

- [Basic ETL project](https://github.com/ogbinar/DataEngineeringPilipinas/tree/main/content/projects/pipeline_basic)
- [ETL Sales](https://github.com/ogbinar/DataEngineeringPilipinas/tree/main/content/projects/ETL-SALES)
- [ETL Project with Azure Databricks](https://github.com/ogbinar/DataEngineeringPilipinas/tree/main/content/projects/ETL-Databricks-code) - A walkthrough of an ETL project using Azure Databricks.
- [Containerization](https://github.com/ogbinar/DataEngineeringPilipinas/tree/main/content/projects/containerization) - An exploration of containerization in data engineering.

## People and Pages

- [Doc Ligot](https://docligot.com/)
- [Sherwin Pelayo](https://www.linkedin.com/in/sherwinpelayo/)
- [Kyle Escosia](https://linktr.ee/klescosia)
- [Sandy C. Lauguico](https://www.facebook.com/sandy.c.lauguico)
- [Josh Dev](https://www.facebook.com/profile.php?id=100087019650476)
- [Kuya Dev](https://www.facebook.com/KuyaDevDotCom)
- [Dev Stuff with JP](https://www.facebook.com/devstuffwithjp)
- [Alex Gamboa](https://www.facebook.com/alexgamboa34/)
31 changes: 17 additions & 14 deletions datasets.qmd
Original file line number Diff line number Diff line change
Expand Up @@ -46,24 +46,23 @@ Explore various Philippine data sources for diverse data projects:
- [Google Research Datasets](https://datasetsearch.research.google.com/): Find datasets for research projects.
- [BigQuery Public Datasets](https://cloud.google.com/bigquery/public-data): Access a variety of public datasets through Google BigQuery.


### Specific Data Areas

* **World Health Organization (WHO) Collections:** Global health data: [https://www.who.int/data/collections](https://www.who.int/data/collections)
* **Healthcare Images:** Public datasets of healthcare images: [https://cloud.google.com/healthcare-api/docs/resources/public-datasets/idc](https://cloud.google.com/healthcare-api/docs/resources/public-datasets/idc)
* **Genome Datasets:** Data from the International Genome project: [https://www.internationalgenome.org/data](https://www.internationalgenome.org/data)
* **NOAA Data Products:** Datasets from the National Oceanic and Atmospheric Administration: [https://www.ncei.noaa.gov/products](https://www.ncei.noaa.gov/products)
* **Climate Data:** Comprehensive climate datasets: [https://www.climate.gov/maps-data/all](https://www.climate.gov/maps-data/all)
- [World Health Organization (WHO) Collections](https://www.who.int/data/collections): Global health data.
- [Healthcare Images](https://cloud.google.com/healthcare-api/docs/resources/public-datasets/idc): Public datasets of healthcare images.
- [Genome Datasets](https://www.internationalgenome.org/data): Data from the International Genome project.
- [NOAA Data Products](https://www.ncei.noaa.gov/products): Datasets from the National Oceanic and Atmospheric Administration.
- [Climate Data](https://www.climate.gov/maps-data/all): Comprehensive climate datasets.

### Additional Resources

* **Unicef Data Collections:** Data on global child welfare: [https://data.unicef.org/resources/dataset/sowc-2019-statistical-tables/](https://data.unicef.org/resources/dataset/sowc-2019-statistical-tables/)
* **US Population Data:** Population statistics from the US Bureau of Labor Statistics: [https://www.bls.gov/cps/tables.htm](https://www.bls.gov/cps/tables.htm)
* **Stanford Open Policing Project:** Data on police traffic stops: [https://openpolicing.stanford.edu/](https://openpolicing.stanford.edu/)
* **Scikit-Learn Datasets:** Datasets available in the Scikit-Learn library: [https://scikit-learn.org/stable/datasets/toy_dataset.html](https://scikit-learn.org/stable/datasets/toy_dataset.html)
* **PyTorch Datasets:** Datasets available in PyTorch: [https://pytorch.org/vision/stable/datasets.html](https://pytorch.org/vision/stable/datasets.html)
* **Huggingface Datasets:** A hub for various ML datasets: [https://huggingface.co/datasets](https://huggingface.co/datasets)
* **Wikipedia ML Research Datasets:** A list of datasets for machine learning research: [https://en.wikipedia.org/wiki/List_of_datasets_for_machine_learning_research](https://en.wikipedia.org/wiki/List_of_datasets_for_machine_learning_research)
- [Unicef Data Collections](https://data.unicef.org/resources/dataset/sowc-2019-statistical-tables/): Data on global child welfare.
- [US Population Data](https://www.bls.gov/cps/tables.htm): Population statistics from the US Bureau of Labor Statistics.
- [Stanford Open Policing Project](https://openpolicing.stanford.edu/): Data on police traffic stops.
- [Scikit-Learn Datasets](https://scikit-learn.org/stable/datasets/toy_dataset.html): Datasets available in the Scikit-Learn library.
- [PyTorch Datasets](https://pytorch.org/vision/stable/datasets.html): Datasets available in PyTorch.
- [Huggingface Datasets](https://huggingface.co/datasets): A hub for various ML datasets.
- [Wikipedia ML Research Datasets](https://en.wikipedia.org/wiki/List_of_datasets_for_machine_learning_research): A list of datasets for machine learning research.

### Specific Data Areas

Expand All @@ -82,4 +81,8 @@ Explore various Philippine data sources for diverse data projects:
- [PyTorch Datasets](https://pytorch.org/vision/stable/datasets.html): Datasets available through PyTorch.
- [Huggingface Datasets](https://huggingface.co/datasets): A platform offering various ML datasets.
- [Wikipedia ML Research Datasets](https://en.wikipedia.org/wiki/List_of_datasets_for_machine_learning_research): A comprehensive list of datasets for machine learning research.
- [OECD](https://data.oecd.org/)
- [OECD](https://data.oecd.org/)
- [Datahub](https://awesomedataengineering.com/): A free and open CKAN instance for hosting and sharing datasets.
- [Kaggle](https://www.kaggle.com/): A platform for data science competitions, and hosting, publishing, and analyzing data.
- [data.world](https://data.world/): Described as a "social network for data people," allows users to host and share data.
- [Academic Torrents](http://academictorrents.com/): A BitTorrent-based platform for the academic community to download and share large datasets for free.
20 changes: 20 additions & 0 deletions partners.qmd
Original file line number Diff line number Diff line change
@@ -0,0 +1,20 @@
---
title: "Partnerships"
description: |
We aim to connect and collaborate with like-minded communities that align with our values about tech, data, and community building.
---

## Partner Communities

- [Power BI Pilipinas](https://taplink.cc/powerbipilipinas) - A hub for Power BI users in the Philippines.
- [R Users Group - Philippines](https://www.facebook.com/rugph) - Facebook page for R programming language users.
- [Python Philippines](https://www.facebook.com/groups/570524223003384) - A community for Python enthusiasts in the Philippines.
- [Analytics Association of the Philippines](https://aap.ph/) - The official website for the Analytics Association of the Philippines.
- [Practical AI Philippines Meetup](https://www.meetup.com/practical-ai-philippines/) - A Meetup group for practical AI discussions.
- [Eudoxyz](https://discord.gg/XX7stU5u) - Eudoxyz is an online community dedicated to cater professionals and aspirants in their respective journey.
- [Tech Career Shifter](https://www.facebook.com/groups/techcareershifter) - A community for those looking to shift their careers into tech.
- [FWD Peers](https://www.facebook.com/groups/fwdpeers) - A group for forward-thinking individuals in various fields.
- [Student Developers Philippines](https://www.facebook.com/groups/studevph/) - A space for student developers in the Philippines to collaborate and share knowledge.
- [DataSense Analytics Group](https://www.facebook.com/groups/dataanalyticsph) - World-class training and development programs developed by top teachers and industry practitioners.
- [The SYNTAX Org](https://www.facebook.com/syntax.stimalolos) - Community for explorers of the tech's multiverse. Always #TowardsExcellence
- [Java User Group Philippines](https://linktr.ee/jugph) - Java User Group Philippines - A community for Java developers, students and enthusiasts in the Philippines to connect, share, and grow.
16 changes: 12 additions & 4 deletions resources.qmd
Original file line number Diff line number Diff line change
Expand Up @@ -127,6 +127,11 @@ Building a professional portfolio is crucial for showcasing your skills and proj
- [Open intro to Statistics](https://www.openintro.org/stat/?stat_book=os) - Provides a solid introduction to statistics.
- [Regression Analysis](https://sta210-s22.github.io/website/course-overview.html) - Learn approaches for analyzing multivariate data sets, emphasizing analysis of variance, linear regression, and logistic regression.
- [Introduction to Data Wrangling at the School of Data](http://schoolofdata.org/) - Focuses on data cleaning and preparation techniques.
- [Elements of Statistical Learning by Hastie, Tibshirani, and Friedman](https://hastie.su.domains/ElemStatLearn/)
- [Pattern Recognition and Machine Learning by Bishop](https://www.microsoft.com/en-us/research/publication/pattern-recognition-machine-learning/)
- [Mathematics for Machine Learning by Deisenroth, Faisal, Soon Ong](https://mml-book.github.io/)
- [Machine Learning: a Probabilistic Perspective by Murphy](https://probml.github.io/pml-book/book0.html)
- [Probabilistic Machine Learning: An Introduction by Murphy](https://probml.github.io/pml-book/book1.html)

## Programming for Data Science
- [Python for Data Science](https://cognitiveclass.ai/courses/python-for-data-science/) - Course by Cognitive Class on using Python for data science tasks.
Expand All @@ -141,11 +146,13 @@ Building a professional portfolio is crucial for showcasing your skills and proj
- [SQLZOO](https://sqlzoo.net/wiki/SQL_Tutorial) - Interactive SQL tutorials for hands-on learning.

## Additional Learning Resources
- [Learn Git Branching](https://learngitbranching.js.org/) - Learn about Git
- [GitHub](https://github.com) - Platform for code sharing and collaboration.
- [PLURALSIGHT](https://www.pluralsight.com/) - Offers video courses on a wide range of tech topics, including data science.
- [hackerRank](https://www.hackerrank.com/) - Platform for practicing coding and data science skills.
- [World Quant](https://www.worldquant.com/) - Provides resources for quantitative analysis.
- [Real World Data Science Use Cases](https://realworlddatascience.net/) - A showcase for data science in action
- [SeattleDataGuy’s Newsletter](https://seattledataguy.substack.com/) - Learn About End-To-End Data Flows (Data Engineering, MLOps, and Data Science)

## Training and Internships
- [The Sparks Foundation](https://www.linkedin.com/company/the-sparks-foundation/): A remote one-month internship in Data Science and Business Analytics.
Expand All @@ -169,17 +176,18 @@ providing free data science and technology training for women

## Cloud Resources

* AWS Training and Certification: [https://aws.amazon.com/training/](https://aws.amazon.com/training/)
* Google Cloud Training: [https://cloud.google.com/training](https://cloud.google.com/training)
* Microsoft Learn: [https://docs.microsoft.com/en-us/learn/](https://docs.microsoft.com/en-us/learn/)
* Cloud Free Tier Comparison [Cloud Free Tier Comparison](https://github.com/ogbinar/DataEngineeringPilipinas/tree/main/content/Cloud-Free-Tier-Comparison): Articles comparing free tier offers from AWS, Azure, GCP, and Oracle Cloud.
- [AWS Training and Certification](https://aws.amazon.com/training/): Training and certification resources for Amazon Web Services.
- [Google Cloud Training](https://cloud.google.com/training): Training resources for Google Cloud Platform.
- [Microsoft Learn](https://docs.microsoft.com/en-us/learn/): Learning resources for Microsoft technologies, including Azure.
- [Cloud Free Tier Comparison](https://github.com/ogbinar/DataEngineeringPilipinas/tree/main/content/Cloud-Free-Tier-Comparison): Articles comparing free tier offers from AWS, Azure, GCP, and Oracle Cloud.

## Open Courseware
- [Data Analysis with R](https://andrewproctor.github.io/rcourse/)
- [Data Engineering Zoomcamp](https://dezoomcamp.streamlit.app/)
- [Data Science in a Box w/ R](https://datasciencebox.org/)
- [The Open Source Data Science Masters](https://datasciencemasters.org/)
- [Fast.ai Courses](https://course.fast.ai/)
- [Dive into Deep Learning](https://d2l.ai/index.html)
- [GIS Programming Roadmap on GitHub](https://github.com/petedannemann/GIS-Programming-Roadmap/blob/master/README.md)

## Open Books
Expand Down

0 comments on commit 507d490

Please sign in to comment.