Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

https://www.dece.com.tr/geodi-moduller#ocr

The GEODI OCR module can work not only on scanned documents, but also on images and even videos. It makes text and Barcode/QRcodes in these data sources searchable.

...

  • You need to complete the following settings for each source you want to OCR.

    • You can convert scanned documents to SPDF (Searchable PDF). SPDF creation requires additional space and time. When the result is a PDF, the word you are looking for is marked on the PDF.

    • OCR of very large documents (such as Scanned Project) is also optional.

    • Barcode and QRCode recognition can be provided.

    • Generating SPDF can increase the total time by around 50%.

  • On the last page of the project wizard general settings for OCR should be made. These settings affect all resources.

    • You can specify which engine will be used for OCR (GEODI or ABBYY).

    • You can add additional languages according to the languages in your documents.

    • With Fast OCR you can save 50%-70% of your time. With Fast OCR, the performance decreases slightly but saves a lot of time

OCR Settings for Source

...

OCR setting for the whole project

...

OCR those that have not been OCRed

This command allows you to activate OCR settings for the scanned project at a later time. It initiates the Rescan service for OCR. Since OCR can significantly increase the scanning time of the project in use, you can scan the project without OCR initially and then activate OCR settings, allowing you to save time by OCR-ing the unprocessed items in the background. This command does not perform OCR repeatedly, focusing on not increasing unnecessary file versions. You can execute this command through the user interface or via DCC.

In the user interface, you can narrow down the content to be OCR-ed by specifying query criteria in the "OCR Unprocessed" field.

...

If OCR settings are not enabled in the project

...

, the OCR command does not allow non-OCR'd files to be OCR'd. It does not allow starting a new OCR process until an OCR repair process is finished.

When you run the command, a message window opens. "... content will be examined for OCR. The process will be done in the background... Do you want to continue? Yes/No"

A few things to be aware of

...

Scope of Training

For GEODI User

  1. What is OCR, search impact of OCR performance, expectations

  2. Going over document samples to be OCRed

    1. Good document, Dirty document, document taken with cell phone, between the books, Barcode and QRCode sample

    2. Photo

    3. (question) Video

  3. Explaining that the process will happen automatically with drag and drop or other ways of adding data

  4. barcode recognition

  5. Awareness of OCR process performance

https://decesw.atlassian.net/wiki/spaces/geodien/pages/resumedraft.action?draftId=3973841411&draftShareId=6bd781e3-dd74-4c9b-b03b-f1feea72fcae3973841411/Module+OCR

For GEODI Admin

  1. GEODI-OCR - Difference of ABBY and Why GEODI OCR?

    1. Better

    2. No price per transaction..

    3. This document covers GEODI OCR

  2. OCR requires processing power

    1. If you want to OCR everything, a long process has begun.

  3. OCR Setup

    1. https://decesw.atlassian.net/wiki/spaces/geodien/pages/

...

    1. 3973841411/Module+OCR

    2. Activate OCR for a source in the GEODI Project

      1. Activate OCR

      2. Activate barcode recognition

    3. General OCR settings in GEODI Project

      1. What is the FastOCR setting? → Speed

        1. How to make

        2. Impact

      2. What is SPDF setting? → Capability

        1. How to make

        2. SPDF benefit, what happens without it?

        3. Impact

      3. Why we delete/don't delete TIFFs? → Saving

        1. They are not necessary after OCR, they take up space.

        2. Be sure to ask the user?

  1. Barcode recognition

  2. OCR with videos

    1. Mask application, why?

      1. Removing camera traces

      2. Elimination of edges

      3. Which details can we capture with video OCR?

  3. If we want to OCR all images

    1. SPDF of tif files containing geometric information, GeoTIFF, does not occur.

    2. SPDF directory and metafile awareness.

    3. OCR in different languages.

  4. Questions and Answers

**