if: CNN classifier
- get the bbox.
- crop the bbox with OpenCV.
- run OCR for that region.
else:
- Get the important keywords coordinates.
- Run OCR for the increased (let say 20% of width) bbox area to get the numerical data.
python train.py --img 1200 --batch 4 --epochs 500 --data train/data.yaml --cfg models/yolov5l.yaml --weights models/yolov5l.pt