PyCaret 2.2

What is PyCaret?

PyCaret is an open-source, low-code machine learning library in Python that automates machine learning workflows. It is an end-to-end machine learning and model management tool that shortens the experiment cycle and makes you more productive.

Compared with other open-source machine learning libraries, PyCaret is a low-code alternative that can replace hundreds of lines of code with only a few. This makes experiments considerably faster and more efficient. PyCaret is essentially a Python wrapper around several machine learning libraries and frameworks, such as scikit-learn, XGBoost, LightGBM, CatBoost, spaCy, Optuna, Hyperopt, Ray, and many more.

The design and simplicity of PyCaret are inspired by the emerging role of citizen data scientists, a term first used by Gartner. Citizen data scientists are power users who can perform both simple and moderately sophisticated analytical tasks that would previously have required more expertise. Seasoned data scientists are often difficult to find and expensive to hire, but citizen data scientists can be an effective way to close this gap and address data-related challenges in a business setting.

PyCaret not only simplifies machine learning tasks for citizen data scientists but also helps new startups reduce the cost of investing in a team of data scientists. It is equally useful for individuals with no prior knowledge of the field who want to start exploring data science.

Official Website: https://www.pycaret.org
Documentation: https://pycaret.readthedocs.io/en/latest/

Current Release

PyCaret 2.2 is now available. See the 2.2 release notes. The easiest way to install PyCaret is with pip.

pip install pycaret

PyCaret's default installation is a slim version that installs only the hard dependencies listed in requirements.txt. To install the full version, use the following command:

pip install pycaret[full]
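
To confirm which version ended up in the environment after either command, a small check along these lines may help. This is only a sketch using the standard library's `importlib.metadata` (Python 3.8+); the helper name `installed_version` is hypothetical and not part of PyCaret.

```python
from importlib import metadata
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of `package`, or None if it is not installed."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    version = installed_version("pycaret")
    if version is None:
        print("pycaret is not installed; run: pip install pycaret")
    else:
        print(f"pycaret {version} is installed")
```

The same helper can be pointed at any dependency, for example `installed_version("lightgbm")`.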

Minor Release

[December 22, 2020] 2.2.3 released, fixing several bugs. catboost and pyod have major compatibility issues with scikit-learn 0.24 (released on Dec 22, 2020); other impacts are unknown as of now. As a temporary fix, requirements.txt requires scikit-learn 0.23.2 specifically. See the release notes.
[November 25, 2020] 2.2.2 released, fixing several bugs. See the release notes.
[November 9, 2020] 2.2.1 released, fixing several bugs. See the release notes.
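
The scikit-learn pin mentioned for 2.2.3 can be checked against a version string as sketched below. The function names are hypothetical, and the code assumes that every scikit-learn release from 0.24 onward is affected; the note above only names 0.24 itself.

```python
from typing import Tuple


def parse_release(version: str) -> Tuple[int, ...]:
    """Turn '0.23.2' into (0, 23, 2), stopping at the first non-numeric part."""
    parts = []
    for piece in version.split("."):
        if not piece.isdigit():
            break  # e.g. 'dev0' or '0rc1'
        parts.append(int(piece))
    return tuple(parts)


def is_affected(sklearn_version: str) -> bool:
    """Assumption: versions >= 0.24 show the catboost/pyod compatibility issues."""
    return parse_release(sklearn_version) >= (0, 24)


if __name__ == "__main__":
    for candidate in ("0.23.2", "0.24.0"):
        status = "affected" if is_affected(candidate) else "ok"
        print(f"scikit-learn {candidate}: {status}")
```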

PyCaret on GPU

PyCaret >= 2.2 provides the option to use a GPU for training and hyperparameter tuning of select models. The API does not change; however, in some cases additional libraries must be installed, because they come with neither the default slim version nor the full version. The following estimators can be trained on GPU:

Extreme Gradient Boosting (requires no further installation)
CatBoost (requires no further installation)
Light Gradient Boosting Machine (requires GPU installation: https://lightgbm.readthedocs.io/en/latest/GPU-Tutorial.html)
Logistic Regression, Ridge Classifier, Random Forest, K Neighbors Classifier, K Neighbors Regressor, Support Vector Machine, Linear Regression, Ridge Regression, Lasso Regression (requires cuML >= 0.15 https://github.com/rapidsai/cuml)
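
The list above can be captured in a small lookup, followed by a sketch of what GPU training might look like. The sketch assumes the PyCaret 2.2 classification module, where GPU use is requested through the `use_gpu` argument of `setup` and `silent=True` suppresses the interactive confirmation prompt. The names `EXTRA_GPU_REQUIREMENT`, `extra_gpu_requirement`, and `train_xgboost_on_gpu` are hypothetical helpers, not part of PyCaret.

```python
from typing import Optional

# Extra installation needed before GPU training, taken from the list above.
# None means "requires no further installation".
EXTRA_GPU_REQUIREMENT = {
    "Extreme Gradient Boosting": None,
    "CatBoost": None,
    "Light Gradient Boosting Machine": "LightGBM GPU installation",
    "Logistic Regression": "cuML >= 0.15",
    "Ridge Classifier": "cuML >= 0.15",
    "Random Forest": "cuML >= 0.15",
    "K Neighbors Classifier": "cuML >= 0.15",
    "K Neighbors Regressor": "cuML >= 0.15",
    "Support Vector Machine": "cuML >= 0.15",
    "Linear Regression": "cuML >= 0.15",
    "Ridge Regression": "cuML >= 0.15",
    "Lasso Regression": "cuML >= 0.15",
}


def extra_gpu_requirement(estimator: str) -> Optional[str]:
    """Return the extra requirement for GPU training, or None if there is none."""
    return EXTRA_GPU_REQUIREMENT[estimator]


def train_xgboost_on_gpu(data, target: str):
    """Train Extreme Gradient Boosting with GPU enabled (assumes PyCaret 2.2)."""
    # Imported lazily so this file still loads where PyCaret is not installed.
    from pycaret.classification import create_model, setup

    # use_gpu=True asks PyCaret to use the GPU where the estimator supports it.
    setup(data=data, target=target, use_gpu=True, silent=True)
    return create_model("xgboost")
```

Apart from `use_gpu=True`, the calls are the same as for CPU training, which matches the statement that the API does not change.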

If you are using Google Colab, you can install Light Gradient Boosting Machine for GPU, but you must first uninstall the CPU version of LightGBM. Use the command below to do that:

pip uninstall lightgbm -y

# install lightgbm GPU
pip install lightgbm --install-option=--gpu --install-option="--opencl-include-dir=/usr/local/cuda/include/" --install-option="--opencl-library=/usr/local/cuda/lib64/libOpenCL.so"

CatBoost is only enabled on GPU when the dataset has more than 50,000 rows.
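
The row threshold can be read as the rule sketched below. This illustrates the stated behavior only and is not PyCaret's internal code; the constant and function names are hypothetical.

```python
# Threshold from the note above: GPU is used only with MORE than 50,000 rows.
CATBOOST_GPU_MIN_ROWS = 50_000


def catboost_runs_on_gpu(n_rows: int, use_gpu: bool) -> bool:
    """Return True if CatBoost would be expected to train on GPU."""
    return use_gpu and n_rows > CATBOOST_GPU_MIN_ROWS


if __name__ == "__main__":
    for rows in (10_000, 50_000, 50_001):
        print(rows, catboost_runs_on_gpu(rows, use_gpu=True))
```

Note that a dataset with exactly 50,000 rows stays on CPU under this reading, because the condition is strictly "greater than".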

cuML >= 0.15 cannot be installed on Google Colab. Instead, use blazingSQL (https://blazingsql.com/), which comes pre-installed with cuML 0.15. Use the following command to install PyCaret:

# install pycaret on blazingSQL
!/opt/conda-environments/rapids-stable/bin/python -m pip install --upgrade pycaret

Important Links

Release notes: https://github.com/pycaret/pycaret/releases
Docs: https://pycaret.readthedocs.io/en/latest/
Tutorials: https://pycaret.readthedocs.io/en/latest/tutorials.html
Example Notebooks: https://github.com/pycaret/pycaret/tree/master/examples
Other Resources: https://github.com/pycaret/pycaret/tree/master/resources
Issue Logs: https://github.com/pycaret/pycaret/issues
Contribute: https://pycaret.readthedocs.io/en/latest/contribute.html
Join Slack Community: https://join.slack.com/t/pycaretworkspace/shared_invite/zt-kdoe7hee-yvNANPHXPM9VtK7R6Npx4Q

Who should use PyCaret?

PyCaret is an open-source library that anybody can use. In our view, the ideal target audience of PyCaret is:

Experienced Data Scientists who want to increase productivity.
Citizen Data Scientists who prefer a low code machine learning solution.
Data Science Students.
Data Science Professionals who want to build rapid prototypes.

Current Contributors

Made with contributors-img.

License

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
