Discovery

Discovery

GEODI Discovery empowers organizations to perform deep data discovery across a wide range of industries and use cases. This page outlines a typical discovery process, step by step.

With GEODI, you can scan rich and diverse data sources, supporting local ID formats and regulations for many countries. High accuracy and speed are among our top priorities.


Before You Start Explore the Overview.png
Before You Start Watch the Overview.png

๐Ÿงญ Discovery Workflow Overview

These are the key stages of a discovery project. Each data source should ideally follow this process independently, making it easier to manage progress. Once all relevant sources are covered, the project is complete. If new sources emerge over time, simply repeat the same process.

ย 

KeลŸif.png

โœ… Step-by-Step Discovery Process

๐ŸŽฏ 1. Define the Objective

Start with a clear objective. It could be compliance with:

  • KVKK, GDPR, HIPAA, SAMA, PDPL, CMA, DPDPA, LGPD, Fraud Prevention, etc.

Through the DECE-STORE, GEODI provides ready-to-use recognizers, reports, dashboards, and actions aligned with various regulations. Once GEODI is installed, you can easily install the relevant DECE-STORE modules for your needs.


๐Ÿ“‚ 2. Identify Sources & Launch Discovery

Use the Project Wizard to define data sources, set permissions, and configure discovery settings.

Depending on the environment, GEODI supports various data sources, whether agentless or agent-based. You can scan single or multiple sources simultaneously.

๐Ÿ”— https://support.decesoftware.com/space/geodien/3973808335/GEODI+Data+Sources
๐Ÿ”— https://support.decesoftware.com/space/geodien/3973840897/Project+Wizard

The discovery phase identifies personal, financial, and other defined types across the selected scope.
Discovery duration depends on:

  • Source size

  • Scope

  • Options selected

๐Ÿ’ก We recommend starting with a sample-based discovery, which saves time and helps you decide whether to proceed with a full scan.


๐Ÿ“Š 3. Analyze Results

Learn how to:

  • Examine discovery results

  • Detect risky findings

  • Take appropriate action

๐Ÿ”— https://support.decesoftware.com/space/geodien/5136908311/Reviewing+Results+After+a+Discovery


โš™๏ธ 4. Take Action (Remediation)

Based on findings, GEODI enables you to remediate risks using actions such as:

  • Secure Delete

  • Quarantine

  • Masking

  • Classification

  • Anonymization

These actions are reflected in your dashboards and reportsโ€”helping you iterate until all risks are addressed.

๐Ÿ”— Automation and Remediations


๐Ÿ”” 5. Stay Informed

Discovery is an ongoing task. Once the environment is configured, GEODI can alert you in real time about new risksโ€”such as a CV or contract placed in a shared folder.

These alerts help you strengthen access policies and data protection strategies.


๐Ÿ“ˆ Big Data at Scale

GEODI processes both structured and unstructured data at an average speed of 0.5โ€“1.5 TB/day. However, with very large datasets (hundreds of TB or more), this speed alone may not be sufficient.

To scale effectively:

  • Deploy GEODI on multiple servers to form a GEODI Cluster

  • Cluster architecture supports centralized reporting and management

  • You can manage multiple GEODI instances as if they were one

Example:
To scan 50 TB of data in 3 weeks, you may need 2 to 4 GEODI servers.

๐Ÿ”— GEODI 300 - Deployment & Architecture & Performance Design will cover cluster configuration in detail.

ย 


Quick Knowledge Check

They provide ready-to-use configurations aligned with regulations, enabling fast and standardized setup.

Structured, unstructured, cloud, on-prem, endpoints,emails and data lakes.

Agent-based uses installed agents (e.g., GDE) for deeper control; agentless connects directly without installation.

To identify who can access which data, detect over-permissioned users, and reveal risky access patterns.

QuickScan scans a subset of data or reduces indexing depth to provide faster initial insights. It is useful for PoC, large environments, or quick risk visibility.

Because data is constantly changing and new risks must be detected automatically.

Because data is constantly changing and new risks must be detected automatically.

By distributing GDE clients across multiple GEODI servers using NLB or built-in server assignment methods.

Because risks can reappear as new data is created; alerts ensure continuous control beyond initial discovery. Alerts enable continuous monitoring by notifying when predefined risk conditions occur. Triggering an alert when predefined:PII appears in a shared folder.

ย 

ย