Search Duplicate or Similar Content

GEODI searches for given criteria across all sources connected to your project.

 

Find Similar Content

GEODI detects similarity in both text and image content. It lists the documents or text passages that are similar to the input you provide.

Copied and similar documents are also shown within the GEODI search interface and viewers.

In similarity search, you can also use the following expressions:

  • maxcount:<n> - limits the number of similar results to n.

  • minsimilarity:0.7 - sets the minimum similarity. The default is 0.7.

  • excludeDuplicates:true - excludes copies. The default is false, which means copies are listed among the similar results.

similar:(doc:a.pdf) (finds documents similar to a.pdf)

similar:"Georgia Aquarium" (finds similar documents containing the words)
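The expressions above can be assembled programmatically. The following is a minimal Python sketch, not part of GEODI: the helper name `similar_query` and its parameters are hypothetical, and joining the options to the `similar:` expression with spaces is an assumption that should be checked against your GEODI version.

```python
def similar_query(target, *, is_doc=False, maxcount=None,
                  minsimilarity=None, exclude_duplicates=None):
    """Build a similarity query string (hypothetical helper, not a GEODI API)."""
    # Document references use similar:(doc:name); free text uses similar:"text".
    if is_doc:
        parts = [f"similar:(doc:{target})"]
    else:
        parts = [f'similar:"{target}"']

    # Optional expressions are only added when explicitly requested,
    # so GEODI's own defaults (e.g. minsimilarity 0.7) apply otherwise.
    if maxcount is not None:
        parts.append(f"maxcount:{maxcount}")
    if minsimilarity is not None:
        parts.append(f"minsimilarity:{minsimilarity}")
    if exclude_duplicates is not None:
        flag = "true" if exclude_duplicates else "false"
        parts.append(f"excludeDuplicates:{flag}")

    # ASSUMPTION: expressions are combined with single spaces.
    return " ".join(parts)


if __name__ == "__main__":
    print(similar_query("a.pdf", is_doc=True))
    print(similar_query("Georgia Aquarium", maxcount=5,
                        minsimilarity=0.8, exclude_duplicates=True))
```

The resulting string would be pasted into the GEODI search box; the sketch does not send anything to GEODI itself.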

 

 

Find Duplicates

Typically, 40% of the documents in an organization are duplicates. Duplicates cause confusion and make searches harder. GEODI finds them and helps you eliminate them.

Typing "duplicate" will find all documents that have copies. Using "-duplicate" will find those without copies.

Duplicates and similar documents are also shown in the GEODI search interface and viewers.

duplicate (content with copies)

-duplicate (content without any copy, i.e. unique documents)

duplicate:(doc:a.pdf) (finds copies of a.pdf)

duplicate:"Georgia Aquarium" (finds duplicated documents containing the words)
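The four query forms above can be generated by a small helper. This is a minimal Python sketch, not part of GEODI: the function name `duplicate_query` and its parameters are hypothetical. It only produces the forms documented here, so combining "-duplicate" with a target is rejected rather than guessed.

```python
def duplicate_query(target=None, *, is_doc=False, unique_only=False):
    """Build a duplicate-search query string (hypothetical helper, not a GEODI API)."""
    if target is None:
        # Bare forms: all content with copies, or all content without copies.
        return "-duplicate" if unique_only else "duplicate"

    if unique_only:
        # Only the bare "-duplicate" form is documented, so no targeted form is invented.
        raise ValueError("unique_only cannot be combined with a target")

    # Document references use duplicate:(doc:name); free text uses duplicate:"text".
    if is_doc:
        return f"duplicate:(doc:{target})"
    return f'duplicate:"{target}"'


if __name__ == "__main__":
    print(duplicate_query())                      # content with copies
    print(duplicate_query(unique_only=True))      # unique content
    print(duplicate_query("a.pdf", is_doc=True))  # copies of a.pdf
    print(duplicate_query("Georgia Aquarium"))    # copies containing the words
```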