GEODI Classifier is a set of tools for manual and automatic classification. This page is about configuring central management of the tools.
Classifier Module Configuration
Use “Activate Classification Tools” on the last page of Project Wizard to open the management dialog. The classes, rules, header, footer, and watermark grans are all set here. There are ready project templates; we suggest you start with one to have ready-to-use settings.
As a best practice, create a separate project for classification other than search and/or discovery. The project for classification does not need to have data.
Classes
In this tab, we will define the class labels, their rules, and their appearance in plugins (Microsoft Office, others). Classes should be created with the most important at the top.
ID: You must give a unique value for the class.
Name: The text of the class that you want to appear in the interfaces.
Category: You can create a classification tree by giving different categories. It is useful if you have many classes. By default, it can be empty.
Description: With the description, you write down what you need to know about the class. This text will be a guideline for the users of the plugins. Descriptions will be displayed in the classification interface of the users.
Query: Documents that match the query automatically take the corresponding class. GEODI Query Rules are valid. There are predefined queries (
predefined:ClassPII
,predefined:ClassSecret
,...) to help you use the same queries in classes, panels, and other places. This is a good way to simplify configuration and management:Tags: Tags are the key and value pairs to write In MS Word, PDF, or ADS files. If you plan to switch to GEODI from another classification solution, GEODI adapts the existing schema. The transition will be seamless. Please check GEODI Classifier Label/Tag Definitions
Header, Footer, and Watermark values are valid only for MS Office software. If the user chooses this class, the text will be embedded into the document.
You may use %User% to have a user name in the values.
You can use \n to move to the next line for multiple lines of text. For example, “Personally\nIdentifialbe\nInformation”.
Auto classification chooses the last class if the content does not match any other query.
Behaviour
Default Classification rules are set here. These rules are automatically updated for all clients in about 10+ minutes. You may override/change rules by user, group, IP, or classification tools using the “Customize” tab.
Auto Classify Behaviour: Determines how auto-classification works.
Use as a suggestion → The user may or may not use auto.
Disabled → Automatic classification is off
Do not select the class under auto → Users can not choose classes lower than the auto.
Ask Classes on Save: MS Office add-ins open a dialog to choose classes when saving/close or printing. This option determines when the dialog opens.
Show when necessary → If auto-classification is possible or the document already has a class, the dialog does not open.
Always → The dialog opens after each change.
Never - Manuel Only → User can open the dialog manually.
Use OS Meta: Microsoft Office, Libre Office, and PDF formats can be tagged with ADS. These tags are only created by the Shell/Desktop classification tool.
Allow Class Lowering: Determines if a user can choose a lesser class for an already classified document. This is an important setting and you may override it by user, group, or IP.
Classifying internal emails: When the feature is used, if the sending domain address and the target domain address are the same, the email can be sent without asking for classification.
Log Format: By default, all classified documents are logged. This log is stored on the server side. We can deactivate this process or change the logging format. The logs are in the same location as other GEODI logs.
Customize
With customization, the default rules can be changed based on User, Group, IP, or the application.
For example, you can limit some groups to only auto-classification and allow some other class-lowering.
Each rule is a customization and you can add as many as you want. If rules overlap then the first rule is valid.
Active: You can use customization on/off for end users.
Name and Description: If you have a lot of customizations, we recommend that you write a systematic naming and description for ease of management.
Users: The person and group information that will be affected by the customization is written here. More than one person/group can be selected.
Macro: You can create advanced options with a macro. It is possible to create IP or, time-based rules.
Macro Examples
c.Client!=null && c.Client.GetClientIPAddress()=="192.168.1.14"
The IP address to which the customization will be valid can be selected by typing.DateTime.Now.Hour>8 && DateTime.Now.Hour < 18
Allows the use of customization between certain hours.
You can access the objects that can be used via the link. https://service.decesoftware.com/api
Applications: The applications that will be affected by the customization are selected. More than one can be selected. If none is selected, the customization applies to all applications.
Auto Classify Behaviour, Ask Classes on Save, OS Meta , Classifying internal emails , Allow Class Lowering : Choose the settings you want to customize/override.
Disable Classification: You can disable classification for selected users, groups, or IPs.
Available Classes: You may want some classes to be offered only by certain groups or departments. In this case, you can specify which classes are available for each customization.
Pop-Up Texts Settings
Use the terminology by your preferences.
Auto Button Text : This is the name of the automatic button in the classification interface.
Category: It adds a general title to the left side of the upper category classes in the open pop-up interface.
Subcategory: It is used to add titles to the left side of Subcategory classes in the open pop-up interface. If the category name in the class definitions is to be used as the title, "Use Class Category" must be checked. These fields should be left blank if the categorical structure is not used.
Form Title: The dialog's title that opens during saving/printing for MS Office can be changed.
Description: Description can be added/changed to the dialog opened during saving/printing for MS Office.
Class Not Selected Message: Message displayed on the dialog opened during saving/closing/printing for MS Office if the class is not selected for the document.
OK Button: The text of the OK button on the dialog opened during saving/closing/printing for MS Office can be changed.
Unknown Class Text : The class indicating that the document that will appear in unclassified documents is unclassified.
Top Bg, Text Color, Bottom Color: It changes the color combinations on the pop-up window in the GEODI Classifier Add-In screen. It works in our default colors when it receives an incorrect or missing value.
Class Not Defined Icon: The icon of the class indicating that the document to appear in unclassified documents is unclassified.
Setup
The software automatically generates the token and MSI parameters required for installation. Please refer to the client pages and OWA installation page for details.
Ignoring e-mail signatures
E-mail signatures contain the sender's PII information. To ignore this, you must either change e-mail server settings or prepare a dictionary.
The E-mail server should add a signature after the classification/or just before sending. The method for exchange is https://learn.microsoft.com/en-us/exchange/security-and-compliance/mail-flow-rules/disclaimers-signatures-footers-or-headersthe link. Your e-mail server may provide different methods.
If you do not have or prefer such a method, a GEODI dictionary will also solve the problem. To do that
You must have a senders list that contains at least the values used in signatures (name, phone, e-mail, etc.).
GEODI can use Excel or a Table as a dictionary. You must generate the signature using Excel formula or SQL. Excel should be updated manually.
Add this dictionary to the discoveries list for the classification project.
That is all.
The method works only if the dictionary and mail signatures match exactly. So before release, we suggest running a few tests for sample senders.
This dictionary can also be used for e-mail discovery to avoid signatures recognized as PII for old emails.
Sender’s actual PII data, in e-mail body, contracts, medical records, etc., will still be recognized.
Sample Excel