The newcomer this week is Tesseract.js, a JavaScript port of Tesseract, a famous OCR (Optical Character Recognition) project written in C++ whose development started in the 1980s!
The online demo lets you upload an image from your computer and get its text content after a few seconds.
I tried it with a simple screenshot (from Best of JavaScript, of course) and it worked: I got my text!
It works in the browser or in Node.js, supports more than 100 languages, and handles several image formats, including .JPG and .PNG files.
It works by downloading the trained data for a given language and storing it in the browser’s IndexedDB, a key-value store available in modern browsers, so the data does not have to be downloaded again the next time.
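To make the caching idea concrete, here is a minimal sketch of the "check the store first, download only if missing" pattern. It is an illustration, not the actual Tesseract.js code: the names `loadTrainedData`, `KeyValueStore` and `Downloader` are hypothetical, and a plain in-memory `Map` stands in for IndexedDB (whose real API is asynchronous and event-based).

```typescript
// Hypothetical names throughout: this is NOT the Tesseract.js API.
type TrainedData = Uint8Array;

// Assumed downloader signature: fetches the trained data for one language code.
type Downloader = (lang: string) => Promise<TrainedData>;

// Minimal key-value interface. A Map satisfies it; a wrapper around
// IndexedDB could play the same role in a browser.
interface KeyValueStore {
  get(key: string): TrainedData | undefined;
  set(key: string, value: TrainedData): unknown;
}

// Return the trained data for `lang`, downloading it only on a cache miss.
async function loadTrainedData(
  lang: string,
  store: KeyValueStore,
  download: Downloader
): Promise<TrainedData> {
  const cached = store.get(lang);
  if (cached !== undefined) {
    return cached; // cache hit: no network request needed
  }
  const data = await download(lang); // cache miss: fetch once...
  store.set(lang, data); // ...and keep it for next time
  return data;
}
```

With this shape, calling `loadTrainedData("eng", store, download)` twice with the same store triggers a single download; the second call is served from the store.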
Any questions or feedback? Reach us on GitHub.
Best of JavaScript is a project created by Michael Rambeau, made in Osaka, Japan.