PetroNet, an AI-Powered Cognitive Search Web App that leverages Azure Cloud & GPT LLM (Microsoft Code Without Barriers Hackathon Winner)

fabecode/PetroNet-CWB-Hackathon-2023

PetroNet-CWB-Hackathon-2023

Microsoft Code Without Barriers Hackathon 2023 - Winner for PETRONAS Problem Statement

PETRONAS Problem Statement

There are 8 different types of publicly available PETRONAS reports (e.g. Integrated & Annual Reports, Financial Reports & Sustainability). These reports contain a wealth of information, but their complex format and large volume make it challenging for users to quickly identify key topics and generate insights. How can we use Microsoft AI-related services to develop a solution that automatically extracts and organizes relevant information from these PETRONAS reports, so that users can quickly find and understand the topics they are interested in?

  • Leverage Microsoft AI-related services to extract and categorize text and images from PETRONAS reports, and identify key topics within each report.
  • Develop a landing page with a search bar that uses Natural Language Understanding (NLU) to let users search for topics of interest within the reports.
  • Upon a search query, the tool should surface relevant documents related to the query and highlight specific keywords from the content across multiple reports.
  • The tool should also generate a visual representation of relevant entities in a knowledge-graph with their relationships to help users better understand the context of the topics they are interested in.
  • View the full problem statement here
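
The keyword-highlighting requirement above can be illustrated with a minimal sketch. This is not the PetroNet implementation: the `highlight` function, its parameters, and the `<em>` tags are hypothetical choices made for illustration, and a real deployment would more likely rely on the highlighting offered by the search service itself.

```python
import re


def highlight(text, terms, pre="<em>", post="</em>"):
    """Wrap whole-word, case-insensitive matches of each term in pre/post tags.

    Hypothetical helper for illustration only; not taken from the PetroNet code.
    """
    # Longest terms first so that "net zero" wins over a shorter term like "net".
    escaped = [re.escape(t) for t in sorted(terms, key=len, reverse=True) if t]
    if not escaped:
        return text
    pattern = re.compile(r"\b(" + "|".join(escaped) + r")\b", re.IGNORECASE)
    # Keep the original casing of the matched text inside the tags.
    return pattern.sub(lambda m: f"{pre}{m.group(0)}{post}", text)


if __name__ == "__main__":
    print(highlight("Net zero carbon emissions by 2050", ["carbon", "net zero"]))
```

Sorting the terms by length before building the pattern is a small design choice: it keeps multi-word phrases from being split by shorter overlapping terms.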

About PetroNet

PetroNet is an AI-powered search platform that uses Microsoft Azure cloud services and a GPT large language model to streamline the way you navigate PETRONAS reports and unlock key insights within them. Key functionalities include intelligent information extraction, key topic summarisation, robust search capability, and an interactive knowledge graph.

  • View PetroNet devpost here

Building the App

  1. Create the required Azure resources via the Azure Portal: Azure Cognitive Search, Azure Blob Storage, and Azure Functions
  2. Run preprocessing/extract_n_upload_pdf.py to extract pages from PDFs (to fit Cognitive Search Basic Tier limit) and upload to Azure Blob Storage
    • Update the 'STORAGE_ACCT_NAME', 'STORAGE_ACCT_KEY', 'STORAGE_CONTAINER_NAME' variables in a .env file
  3. Run topic-modelling/TopicModellingAzureFunction/__init__.py and publish the topic modelling Azure Function as an Azure Function App
  4. Run search-index-pipeline/create-search.cmd to create the Azure Cognitive Search index
  5. Open web-app-frontend/CognitiveSearch.Template.sln in Visual Studio and run it to load the front end
    • Update appsettings.json with your configurations
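
As a rough illustration of step 2, the sketch below reads the three storage variables from a .env file, works out page ranges for splitting a large PDF, and uploads one chunk to Blob Storage. It is a minimal sketch under stated assumptions, not the contents of extract_n_upload_pdf.py: the helper names (`load_env`, `page_chunks`, `connection_string`, `upload_chunk`) and the pages-per-chunk value are hypothetical, the actual PDF page extraction is left out, and the upload assumes the `azure-storage-blob` package is installed.

```python
from pathlib import Path


def load_env(path):
    """Parse simple KEY=VALUE lines from a .env file into a dict."""
    values = {}
    for raw in Path(path).read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        # Split on the first "=" only, since account keys can end in "=".
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip("'\"")
    return values


def page_chunks(total_pages, pages_per_chunk):
    """Return (start, end) page index pairs, end exclusive, covering the PDF."""
    if pages_per_chunk < 1:
        raise ValueError("pages_per_chunk must be at least 1")
    return [
        (start, min(start + pages_per_chunk, total_pages))
        for start in range(0, total_pages, pages_per_chunk)
    ]


def connection_string(env):
    """Build a storage connection string from the .env values."""
    return (
        "DefaultEndpointsProtocol=https;"
        f"AccountName={env['STORAGE_ACCT_NAME']};"
        f"AccountKey={env['STORAGE_ACCT_KEY']};"
        "EndpointSuffix=core.windows.net"
    )


def upload_chunk(env, blob_name, data):
    """Upload one chunk (bytes) to the configured container."""
    # Imported here so the helpers above work without the Azure SDK installed.
    from azure.storage.blob import BlobServiceClient

    service = BlobServiceClient.from_connection_string(connection_string(env))
    container = service.get_container_client(env["STORAGE_CONTAINER_NAME"])
    container.upload_blob(name=blob_name, data=data, overwrite=True)


if __name__ == "__main__":
    # Hypothetical numbers: a 230-page report split into 50-page chunks.
    print(page_chunks(230, 50))
```

The chunk size that actually fits the Cognitive Search Basic Tier limit depends on the documents, so the 50-page value here is only a placeholder.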

Online Resources

Azure Cognitive Search

Azure Blob Storage

Azure Functions
