Skip to content
/ wer Public

Word Error Rate computation using components from huggingface-evaluate and openai-whisper projects

License

Notifications You must be signed in to change notification settings

ins8ai/wer

Repository files navigation

WER

Word Error Rate (wer) is a metric used to assess and compare accuracy of transcripts generated. WER is used to express the differences between predicted text and the reference text. Differences include insertion, deletion and substitutions required to transform predicted text to reference text.

This repo contains sample codes for how the ins8.ai team is evaluating WER scores. Normalisation includes removal of punctuation, uppercase conversion to lowercase, standisation in numerical representation, translation of british to american spellings and etc. Normalisation is recommended to be enabled. You may enable it when running the script. Normalisation is done based on OpenAI's whisper release.

Install and Pre-requisites

  • after cloning the repo, create a new virtual environment and activate it
  • install the required packages via pip install -r requirements.txt
  • prepare pred.txt(predicted) and ref.txt(reference/ground truth) with lines of text. the number of lines in both files must match.

Run

  • make sure your virtual environment is activated and run main.py

python -m venv venv

source venv/bin/activate

pip install -r requirements.txt

python main.py pred.txt ref.txt True

deactivate

About

Word Error Rate computation using components from huggingface-evaluate and openai-whisper projects

Topics

Resources

License

Stars

Watchers

Forks