License Number Extraction - YOLO v4, Vision API (GCP)

The objective is to extract the registration/license plate number from different vehicle images. The task is divided into two parts. First, detect the possible ROI(region of interest) - in this case, it is the license plate of vehicles. Second, extract the letters and numbers from the detected region. To separate ROI from the images, object detection model - YOLO v4 is used. YOLO v4 (You only look once) is a family of one-stage object detectors that are fast (i.e., able to give real-time results) and accurate. To support license plate detection, this model is trained with a custom dataset. The trained model can separate multiple ROI’s(i.e., license plates) from the image. The detected license plate images are then processed for OCR (i.e., Optical Character Recognition).

1000 custom dataset is collected from the Open Images Dataset V6 to train the YOLO v4 model.
The performance achieved from the model is Mean Average Precision, mAP = 91%.
The model can detect multiple ROI’s (license plate images).
Google Vision API is used to extract the numbers/letters from the detected license plates.

This Project’s GitHub Repository