Skip to content

Latest commit

 

History

History
31 lines (24 loc) · 4.72 KB

File metadata and controls

31 lines (24 loc) · 4.72 KB

Introduction

This is the final project of my Computer Science AP degree. And the title of the project is "Predictive Maintenance with Machine Learning". For more detailed descriptions of this project, please read the following sections.

Problem statement

Machines are critical to all business nowadays, and we expect them to operate at peak level for long time. Normally, we can use corrective maintenance strategy to make the best use of machines. But we could end up costing even higher due to downtime and labor.

Preventive maintenance comes in to reduce unplanned failures while businesses try to address problem in advance. However, the costs could still be high. By using predictive maintenance, we can prevent those unexpected problems more efficiently. We can fix the machines just in time as we monitor and predict the status of them.

In this project, predictive maintenance is the main concept. There are two aspects to do the predictive maintenance in this project, supervised and unsupervised learning. In supervised learning, predicting remaining useful life and failure prediction are the goals. While in unsupervised learning, detecting anomalies of the machine is the target. Python and its machine learning related libraries are the main tools in this project. The solution of this project could help industry to lower the chance of unexpected downtime and reduce costs.

Aims and objectives

The aim of this project is to propose machine learning solutions for predictive maintenance in responds to Industry 4.0. There are three tasks proposed:

  • Anomaly detection: The model should be able to detect the anomalies within data.
  • Remaining useful life prediction: The model should be able to predict the remaining useful life of a machine to help people be prepared for maintenance or replacement.
  • Failure prediction: The model should be able to predict whether there will be a failure.

Data source

There are two datasets used in this project as you can find in Datasets folder:

Solution approach

This project follows CRISP-DM methodology. Supervised and unsupervised algorithms are used to address set tasks. Please notice that artificial neuron network (ANN) is not explored in this project.

  • Anomaly detection: Local Outlier Factor (LOF), Elliptic Envelope, Isolation Forest and One-Class Support Vector Machine (SVM) are used.
  • Remaining useful life prediction: Support Vector Machine Regressor is used. For automated machine learning example, Tree-based Pipeline Optimization Tool (TPOT) is used. The output best model is evaluated together. Since the data is rather high-dimensional, feature importance analysis is performed to reduce the training size. The feature-reduced-version of dataset is also modeled to compare the computing time and results.
  • Failure prediction: Logistic Regression, KNeighbors Classifier, C-Support Vector Clas- sification with liner kernel, C-Support Vector Classification with RBF kernel, Gaussian Naive Bayes, Decision Tree and Random Forest Classifier are applied.

Summary of contributions and achievements