Discovery
GEODI Discovery empowers organizations to perform deep data discovery across a wide range of industries and use cases. This page outlines a typical discovery process, step by step.
With GEODI, you can scan rich and diverse data sources, supporting local ID formats and regulations for many countries. High accuracy and speed are among our top priorities.
🧭 Discovery Workflow Overview
These are the key stages of a discovery project. Each data source should ideally follow this process independently, making it easier to manage progress. Once all relevant sources are covered, the project is complete. If new sources emerge over time, simply repeat the same process.
✅ Step-by-Step Discovery Process
🎯 1. Define the Objective
Start with a clear objective. It could be compliance with:
KVKK, GDPR, HIPAA, SAMA, PDPL, CMA, DPDPA, LGPD, Fraud Prevention, etc.
Through the DECE-STORE, GEODI provides ready-to-use recognizers, reports, dashboards, and actions aligned with various regulations. Once GEODI is installed, you can easily install the relevant DECE-STORE modules for your needs.
📂 2. Identify Sources & Launch Discovery
Use the Project Wizard to define data sources, set permissions, and configure discovery settings.
Depending on the environment, GEODI supports various data sources, whether agentless or agent-based. You can scan single or multiple sources simultaneously.
🔗 [GEODI Data Sources]
🔗 [GEODI Project Wizard]
The discovery phase identifies personal, financial, and other defined types across the selected scope.
Discovery duration depends on:
Source size
Scope
Options selected
💡 We recommend starting with a sample-based discovery, which saves time and helps you decide whether to proceed with a full scan.
📊 3. Analyze Results
Learn how to:
Examine discovery results
Detect risky findings
Take appropriate action
🔗 [Reviewing Discovery Results and Taking Action]
⚙️ 4. Take Action (Remediation)
Based on findings, GEODI enables you to remediate risks using actions such as:
Secure Delete
Quarantine
Masking
Classification
Anonymization
These actions are reflected in your dashboards and reports—helping you iterate until all risks are addressed.
🔗 [GEODI Discovery: Actions]
🔔 5. Stay Informed
Discovery is an ongoing task. Once the environment is configured, GEODI can alert you in real time about new risks—such as a CV or contract placed in a shared folder.
These alerts help you strengthen access policies and data protection strategies.
📈 Big Data at Scale
GEODI processes both structured and unstructured data at an average speed of 0.5–1.5 TB/day. However, with very large datasets (hundreds of TB or more), this speed alone may not be sufficient.
To scale effectively:
Deploy GEODI on multiple servers to form a GEODI Cluster
Cluster architecture supports centralized reporting and management
You can manage multiple GEODI instances as if they were one
Example:
To scan 50 TB of data in 3 weeks, you may need 2 to 4 GEODI servers.
🔗 GEODI 302 – System Administration will cover cluster configuration in detail.