Hello everyone.

I haven’t had any need for OCR software in probably 15 years, but I have a client who has 7 document boxes worth of forms filled out by hand that they need digitized. They’re scanning them into PDFs this week, but want to recover FirstName, LastName, Phone, Email and then a hand written feed back box and load those all into a database.

ChatGPT recommended ABBYY, but it looks like it might be overkill for a one time need like this.

I told them that a couple teenagers doing data entry might be more accurate and cheaper. IDK if that’s really true though. I’m not at all an expert on OCR software.

Does anyone have any suggestions?

  • naevaTheRat@lemmy.dbzer0.com
    link
    fedilink
    arrow-up
    4
    ·
    edit-2
    1 year ago

    Depending on the quality of the scan a quick python script using tesseract might be enough. Probs examples online

    the handwriting will be full of errors but spell check and an editing pass should be fine? I imagine you’ll have to do that anyway