One of our Fortune 500 clients in the logistics industry wanted to extract various product-related information by scanning images through a machine learning model. This scanned information had to then be supplied to a custom web application for further utilization and analysis.
The image scanning and detection had to happen on the below aspects
Our data scientists used the Faster-RCNN algorithm to solve the problem statement. We followed the below procedure to achieve the desired results.
Faster-RCNN works quicker because we pass the activation map through a few more layers to find the bounding box (interesting regions). This forward pass continuously takes place and during this training phase, the ML model continues to learn. Errors (if any) are captured at this stage and with continuous learning, the model becomes efficient in predicting the classes and bounding box coordinates.
For calculating the height and width of each object we continued to iterate every object in the image and calculated values using OpenCV.
To perform this image scanning process, we had a well-annotated object in each of the images in the dataset. We had around 1000 labels for each object.
The models were then exported to Microsoft Azure’s GPU for better performance. The expected inference time for a given image is ~0.2 seconds.
The technology stack used to implement this image scanning ML model was Python, Pytorch, OpenCV, Microsoft Azure.
Are you looking for a partner who can build advanced AI/ML technologies for your business and make every interaction of your business intelligent? You are at the right place.
We love data and we are problem solvers. Our expert team of data scientists dives deep into solving and automating complex business problems. From Automobile to Fintech, Logistics, Retail, and Healthcare, GoodWorkLabs can help you build a custom solution catered for your business.
Leave us a short message with your requirements.