Skip to content
View hucodelab's full-sized avatar

Organizations

@turing-usp

Block or report hucodelab

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
hucodelab/README.md
  • 👋 Hi, I’m @hucodelab and my hobbies are: theater, reading, and studying.
  • 👀 I’m interested in Data Engineering, Analytics, Data Science, Business Intelligence, Logistics and Stock Markets.
  • 📫 How to reach me: [email protected]

Skills: Python, SQL, PySpark, Cloud Computing, AI.

Data Portfolio Projects

Data Engineering

A solution that empowers global investors to stay ahead of potential crises worldwide. The solution followed the Medallion Architecture and was built on Databricks and PySpark, it was used Databricks Workflows to orchestrate the pipelines. The solution was deployed as a web-based application using Streamlit, enabling users to interact with the data in near real-time and access the features. https://latamfusionapp.azurewebsites.net/

This project gathers data from multiple sources, including the Yahoo Finance API, web scraping of Twitch data, and collecting fundamental data on video game companies. We then organize and streamline this information through data pipelines. Finally, the data is processed using machine learning regression models to predict future stock prices, providing actionable insights for investors.

This project collects data from multiple sources, including web scraping of Reddit, macroeconomic data from Central Bank API, and the Yahoo Finance API (stock prices). We streamline and integrate this data through automated pipelines, then apply regression models to predict future stock prices of PETROBRAS, empowering investors with data-driven insights.

Machine Learning

This project shows the ETL and Machine Learning model building process to predict whether a client will accept an e-commerce offer. It was also made a deploy of the app.

This project shows the process of manipulating data and building a Machine Learning model to predict whether a US citizen has an income higher than $50,000 per year. The project compares the performance of 3 models: Random Forest, Random Forest with Grid Search (hyperparameter optimization), and logistic regression.

Deep Learning

This project shows the process of data manipulation and construction of a LSTM model to predict whether a stock it's going to increase its market price or not. It was built a feature selection model to select the best features for the LSTM model.

This project shows the process of data manipulation and construction of a Deep Learning regression model trained with climatological data from the 20 largest soybean-producing municipalities in the state of Paraná - BR to make predictions of soybean productivity.

This project shows the process of data manipulation and Deep Learning classification model building trained to recognize Street numbers. The project compares the performance of two neural network models: Multilayer Perceptron (MLP) and Convolutional Neural Network (CNN).

Data Analysis

This project shows metrics and indicators related to the twitter accounts of the main candidates of the Brazilian 2022 elections. The visualizations created in this project were deployed to a web application using DASH and REACT. This project was developed by Turing USP and I contributed to the data processing and visuals generation.

This project shows the process of extracting, manipulating, and analyzing data from the 2020 StackOverflow survey. The project contains visuals of the number of respondents by programming language, salaries, and developers' job satisfaction.

This project shows the scrapping (extraction) of Airbnb data from Airbnb Brazil. It was possible to make a comparative analysis between accommodation's prices three different brazilian cities as well. This software was developed using the python library: BeautifulSoup.

This project shows the scrapping (extraction) of Rotten Tomatoes data and it was possible to build a dataset using the data. The project was developed by using the python library: Selenium.

Popular repositories Loading

  1. hucodelab hucodelab Public

    Config files for my GitHub profile.

    3

  2. python python Public

    python

    1

  3. stackoverflow stackoverflow Public

    Forked from jjrunner/stackoverflow

    Findings from Stackoverflow 2017

    Jupyter Notebook 1

  4. StackOverflow_Survey_Study2020 StackOverflow_Survey_Study2020 Public

    This project is about programming languages and how they are related with some of the most important professional aspects like job satisfaction and annual salaries.

    Jupyter Notebook 1

  5. Economic_indicators_and_energy_cons_US Economic_indicators_and_energy_cons_US Public

    Relação entre o consumo de energia e os indicadores económicos (US)

    Jupyter Notebook 1

  6. datathon_USP_BRK datathon_USP_BRK Public

    dados de saneamento

    1 1