Data Sources & Integrations

Data Sources & Integrations

 

Before You Start Explore the Overview.png

 

36a809c9-2f39-472c-93bf-155ddb01f6ac.png

 

 

 

 

 

📂 Adding Data Sources

GEODI can connect to many different data sources. These sources can be indexed and discovered within a single project or across multiple projects. Data sources are defined through the Project Wizard.

Project Wizard


Apart from source-specific settings, the following features apply to all sources:

  • 📑 File format support → File formats are processed in the same way regardless of the source.

    • Example: A PDF located in a folder, a PDF attached to a web page, or a PDF embedded in a database record are all processed in the same way.

    • Copies and similar files may exist across sources. For example, a file attached to an email may be a duplicate of a file stored in a folder.

  • 📊 Sampling discovery → Available for all sources. For instance, you may process one out of every N files, or M records per table.

  • 🛡️ Data Remediation Workflows → Actions such as deletion, quarantine, or classification are supported across many cloud and on-premise sources.

    • To enable this, you must allow it on a per-source basis and provide a user/credential with the necessary permissions.

  • 🔎 OCR → OCR processes can be activated per source.

  • 🔒 Permissions → You can define who can access or download data on a per-source basis. You should check Read content permissions for the source. ( For GDE, you must enable it from the GDE settings. )


⚠️ Risk Score

  • 📌 Risk score value should be set for each source.
    This score is used in reporting and helps prioritize actions such as deletion or quarantine after discovery. The risk score ranges between 0–100.

    Higher risk scores indicate higher priority.
    Findings with higher risk scores should be addressed first.
    Example: Shared or public areas typically have higher risk scores.

    📑 Risk scores are displayed in Content List reports.

 


ℹ️ Some data sources may not be included in your license.