GEODI can mask or anonymize files or databases based on discovery results.
Masking means blacking out sensitive data, and anonymization means replacing data with fake but real-looking values. Both functions allow you to share content with less concern. Masking completely hides sensitive information, and you can tell whether a document is masked. Anonymization, in contrast, creates a completely real-looking version of the content, which makes it more suitable for testing or data science.
There are three options:
Persistent File Masking/Anonymization: Selected files are masked or anonymized persistently. Keeping the original values is optional. You may choose to replace all findings, only PII, or only financial values.
Dynamic Masking/Anonymization: The dynamic option lets you set the visibility of sensitive information by permissions. The same content (a PDF, a Word file, ...) looks different to different users based on their permissions.
Database Masking/Anonymization: You can mask or anonymize a database for software development teams. This capability should not be confused with products that perform dynamic database masking.
Masking Details
Masking follows a set of rules. Each discovery result (an IBAN, a name, or any other entity) may be replaced using any of the following rules.
Simple replacement with a character (like **)
Replace a name or phone number with a label such as [NAME] or [TELNO]
Keep the first 2 and last 4 characters for IBAN, credit card, and similar entities
Content names (file names, e-mail subjects, ...) are not masked
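The three replacement rules above can be illustrated with a minimal Python sketch. This is not GEODI code: the function names, the choice to mask every character, and the default mask character are assumptions made only for illustration.

```python
def mask_with_char(value: str, char: str = "*") -> str:
    """Rule 1 (assumed behavior): replace every character with a mask character."""
    return char * len(value)


def mask_with_label(entity_type: str) -> str:
    """Rule 2: replace the value with a bracketed label such as [NAME] or [TELNO]."""
    return f"[{entity_type.upper()}]"


def mask_keep_edges(value: str, first: int = 2, last: int = 4, char: str = "*") -> str:
    """Rule 3: keep the first `first` and last `last` characters, mask the rest."""
    if len(value) <= first + last:
        # Too short to keep the edges without revealing the whole value.
        return char * len(value)
    middle = char * (len(value) - first - last)
    return value[:first] + middle + value[-last:]
```

For example, `mask_keep_edges` applied to an IBAN keeps the country code and the last four digits visible, which is often enough to recognize an account without exposing it.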
Anonymization Details
Each finding is replaced with a real-looking value. Currently, the following entities are anonymized:
Name → Name
Money → Money
Dictionaries (place name, part number, or other) → random value from the same dictionary
Credit card → Credit card
IBAN → IBAN
Tel → Tel
e-Mail → e-Mail
Date → Date
Anonymization has 2 modes. In the default mode, the same input always maps to the same output. That is, a name (John Smith) is converted to the same name everywhere. In the other mode, conversion is always random, and you will not see the same result even within a single session.
To use the second mode, set ANO.SameToSame:False in the project generic settings.
...
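The difference between the two modes can be sketched in Python as follows. This is an illustration only, not how GEODI implements it: the `NameAnonymizer` class, the `same_to_same` parameter (named after the ANO.SameToSame setting), and the list of fake names are all hypothetical.

```python
import random

# Hypothetical pool of replacement values; a real system would use a larger source.
FAKE_NAMES = ["Alice Brown", "David Green", "Maria Lopez", "Omar Kaya"]


class NameAnonymizer:
    """Illustrates consistent (same-to-same) versus fully random replacement."""

    def __init__(self, same_to_same: bool = True, seed=None):
        self.same_to_same = same_to_same
        self._rng = random.Random(seed)
        self._mapping = {}  # remembers original -> fake value in same-to-same mode

    def anonymize(self, value: str) -> str:
        if not self.same_to_same:
            # Random mode: every call draws a new value, even for the same input.
            return self._rng.choice(FAKE_NAMES)
        if value not in self._mapping:
            self._mapping[value] = self._rng.choice(FAKE_NAMES)
        return self._mapping[value]
```

The consistent mode keeps relationships between records intact (the same person stays the same person across files), while the random mode gives up that consistency in exchange for making it harder to link values back to an original.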
Info |
---|
Profiles define which findings are anonymized. You may anonymize all findings, only PII, or only financial data. The functions ask you to choose from the existing profiles. |
Info |
---|
Anonymization choices
|
Persistent File Anonymization
GEODI performs the process on a copy of the content being masked. The operation works in a "Mask and Download" manner.
Note |
---|
You may also use the batch anonymization options in the actions menu. The batch process allows you to convert thousands of files at the same time. |
Dynamic
...
Anonymization
With dynamic masking, discovery results appear in masked form in screens such as the viewer, words, network graph, and summary.
Masking in the viewer applies to the following file types regardless of their source, whether they are stored on a File Server, in SharePoint, or in a Database.
Office Files (Word, Excel, PowerPoint)
Open Office (ODT, ODS, ...)
PDF
TXT, XPS
To enable it, select the "Dynamic Masking" box on the last page of the project wizard and configure the settings.
Once adjustments are made, files in the project are displayed to users in a masked format.
It operates with group-based authorization.
Masking metadata is defined for each group.
When group members open a file from the project, it is displayed in a masked form according to the defined masking metadata.
...
The "No Masking" setting displays the results without masking for the defined group.
Groups and members without authorization see all discovery results in masked form.
Any group or user not matched with a profile is assumed to use the MaskALL profile.
Downloaded files are also delivered in masked form.
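The group-based behavior described above can be sketched as a simple lookup with a fallback. This is a hedged illustration, not GEODI's implementation: the group names, the `NoMasking` identifier, the `resolve_profile` function, and the rule that the first matching group wins are all assumptions. Only the MaskALL fallback comes from the text.

```python
NO_MASKING = "NoMasking"  # hypothetical identifier for the "No Masking" setting
MASK_ALL = "MaskALL"      # fallback profile for unmatched groups and users

# Hypothetical mapping from group name to masking profile.
GROUP_PROFILES = {
    "Finance": NO_MASKING,
    "Support": "PersonalData_01",
}


def resolve_profile(user_groups, group_profiles=GROUP_PROFILES):
    """Return the masking profile for a user, given the groups they belong to.

    Assumption: the first group with a defined profile wins. Any user whose
    groups match no profile falls back to MaskALL.
    """
    for group in user_groups:
        if group in group_profiles:
            return group_profiles[group]
    return MASK_ALL
```

Defaulting to the most restrictive profile means a missing or misconfigured group never exposes unmasked data by accident.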
Database
...
Anonymization
Database masking performs permanent masking on the given database. If you want the original data to remain intact, you can also work on a copy.
...
Use it when you need to share your database with software or testing teams. Masking permanently removes sensitive data from your database for this use case.
When you share your database for data analysis, masking permanently removes sensitive data so that you can share the database safely.
...
Permissions
The authorization for masking is the same as the document download permission. Those with download permission can use masking.
Dynamic masking allows authorization at the user and discovery layer levels.
License
A MASKING license is required for masking.
The number of users using masking is equal to the number of GEODI users.
Masking is meaningful with Corporate Search.
It is recommended to have a Discovery license for masking. Otherwise, masking will be limited to information discovered within the scope of STD.
Batch Masking
To create a script for Bulk Masking, go to the Reports menu. For this script, specify the masking profile and the directory where result files will be located. The generated script can also be edited to modify the original files.
Profiles
Profiles determine which discovery results will be masked or anonymized. There are default profiles, which you can customize.
You will need the profile ID values for batch operations like Bulk Masking. Below, you will find the default profiles:
...
ID: Finance01 -> [$.tr:Financial Data;en:Financial Data] → Identifiers labeled IsFinancial and IsMoney (IBAN, Credit Card Number (Visa, Mastercard, American, JCB...), Currencies)
...
ID: MoneyData01 -> [$.tr:Monetary Data;en:Monetary Data] → Identifiers labeled IsMoney (Euro(€), Dollar($), Turkish Lira(₺), Pound/GBP(£), ...)
...
ID: PersonalData_01 -> [$.tr:Personal Data Only;en:Personal Data Only] → Identifiers labeled PII (SSN, and identifiers for ID and Passport numbers of different countries)
Name
ID Number (Turkish: TCKN)
Tax ID (Turkish: VKN)
Email
Address Block Identifier
...
ID: All -> All
Used to mask all information recognized by the identifiers in your project.
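One way to picture how a profile selects findings is as a label filter. The sketch below is an assumption-laden illustration, not GEODI code: the `PROFILES` table, the shape of a finding (a dict with `value` and `labels`), and the `findings_to_mask` function are hypothetical, and it assumes that a profile listing several labels matches a finding carrying any one of them.

```python
# Profile ID -> labels it covers. None means "no filter", as for the All profile.
PROFILES = {
    "Finance01": {"IsFinancial", "IsMoney"},
    "MoneyData01": {"IsMoney"},
    "PersonalData_01": {"PII"},
    "All": None,
}


def findings_to_mask(findings, profile_id):
    """Return the findings that the given profile would mask.

    Each finding is assumed to be a dict with a "value" string and a
    "labels" set describing the identifier that produced it.
    """
    labels = PROFILES[profile_id]
    if labels is None:
        return list(findings)
    # Keep a finding if it shares at least one label with the profile.
    return [f for f in findings if labels & f["labels"]]
```

With this model, a finding labeled IsMoney is selected by both Finance01 and MoneyData01, while a finding labeled PII is selected only by PersonalData_01 and All.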
...
and DISCOVERY license is required.