...
Info |
---|
Monitoring IndexingGEODI informs you about the progress of indexing. Please be careful about progress bar, which is not lineer. That is GEODI can not know how much time will require to index for future documents, so the progress bar is only an estimate using the previous document indexing time. |
Sampling
Sampling is possible for both, structured and unstructured data. Each data source asks you the sampling values. Sampling saves great time for discovery projects. We suggest you always use sampling for DB discovery. For unstructured data sampling is also a good starting point. Start with sampled mode and see what is in data, are there any unnecessary types or are there any permission problem.
Filecontent Filtering
Any corpus contains various file types. Some may not be necessary for the project scope, and some may be too large to disrupt the network or unwanted at all.
...
Expand | ||
---|---|---|
| ||
GEODI discovery engine is one of the fastest among other discovery engines. Slow indexing may depend on machine, settings or enviroment.
|
Expand | ||
---|---|---|
| ||
High CPU usage for an engine like GEODI should be expected. CPU usage of GEODI never goes to unresponsive machine state. GEODI always leaves one core to other tasks.
|
...