Natalia's Machine Learning Utils

Introduction • Getting Started • Understand files •

Introduction

Repository to train & test AI and ML models.

Getting Started

Clone repository by SSH if you created and added your public key:

$ git clone [email protected]:nieszkodzi/ml-utils.git

Create virtual environment.

$ python3 -m venv ./ml-utils/venv

Install dependencies.

$ pip install -r requirements.txt

Understand files

experiments/train.py

script to train models,

required parameters: 'path_to_audio', 'path_to_csv', 'model_type', 'train_label',

additional_parameters: 'path_to_save_model', 'path_to_test_audio', 'path_to_test_csv', 'data_frac', ' negatives_to_positives_frac'

usage example: python3 experiments/train.py --path_to_audio '/home/{user}/Data/train_audio/' --path_to_test_audio ' /home/{user}/Data/test_audio' --train_label 'piano' --data_frac 1 --negatives_to_positives_frac 1 --model_type 'gpc' --path_to_csv '/home/{user}/Data/metadata.csv' --path_to_test_csv ' /home/{user}/Data/test_metadata.csv'

experiments/test.py

script to test model,

required parameters: 'test', 'path_to_audio', 'path_to_model', 'path_to_scaler', 'train_label', 'path_to_csv'

usage example: python3 experiments/test.py --test 'dataset' --path_to_audio '/home/{user}/Data/train_audio/' --path_to_model " saved_models/XGBoost_f1:0.833_2023-02-23.joblib" --path_to_scaler "saved_models/scaler_xgb_2023-02-23.joblib" --train_label 'piano' --path_to_csv '/home/{user}/Data/test_metadata.csv'

experiments/make_dataset.py

additional code used by train.py and test.py to create train or test datasets; feature extraction, splitting, cross validation etc.

experiments/models.py

additional code used by train.py and test.py with preparation of models to train and test and saving trained models

experiments/audio_utils.py

contains functions to convert m4a to wav (more will be added when needed), extract additional features

visualization/visualization.py

code to visualize audio features such as mfcc, wave forms etc. to choose the best candidates to training and code to extract audioset pretrained embeddings to improve models.

visualization/simple_video_save.py code to combine audio with its spectrogram as a simple video

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.idea		.idea
experiments		experiments
visualization		visualization
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Natalia's Machine Learning Utils

Introduction

Getting Started

Understand files

About

Releases

Packages

Languages

nieszkodzi/audio-ml-utils

Folders and files

Latest commit

History

Repository files navigation

Natalia's Machine Learning Utils

Introduction

Getting Started

Understand files

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages