GEODI can also index databases, i.e., structured data. All tables and rows can be indexed, together with the files a database references or embeds. With simple rules, you can limit the tables, rows, or fields that are indexed and set how rows are displayed to the user.


Database Clients

Some DBs may require a driver on the server where GEODI is located. Use Project Wizard/Database connection to define the connection.

...

Use “New Connection” to choose the database and define the connection string. (Note: this dialog only opens if you are logged into the server.)

Some databases may require an extra driver to be installed. Please check the list below.

DBMS

Client

PostgreSQL

No extra installation is required.

Microsoft SQL Server

No extra installation is required.

SQLite

No extra installation is required.

Shape File

No extra installation is required.

CSV File

No extra installation is required.

KML File

No extra installation is required.

Oracle (OleDB)

Oracle ODAC driver must be installed. The appropriate Oracle Client depends on the Oracle version you connect to and can be found by searching for ODAC. Oracle requires an account for the download; after you complete the required steps, a download link is provided.

https://www.oracle.com/database/technologies/odac-downloads.html

Excel

Access Database Engine 2010 must be installed.

https://www.microsoft.com/en-us/download/details.aspx?id=13255

ODBC

No extra installation is required.

MS Access

Access Database Engine 2010 must be installed.

https://www.microsoft.com/en-us/download/details.aspx?id=13255

SQLCE

No extra installation is required.

SQL Express

No extra installation is required.

DB2

OLE DB driver must be installed.

https://www.microsoft.com/en-us/download/details.aspx?id=100917

Kafka

No extra installation is required.

Oracle BigData

Microsoft Hive ODBC Driver must be installed.

https://www.microsoft.com/en-us/download/details.aspx?id=40886

Cassandra

An OLEDB or ODBC driver is required. CData Cassandra drivers must be installed.

https://www.cdata.com/drivers/cassandra/download/

MongoDB

An OLEDB or ODBC driver is required. Devart ODBC Driver for MongoDB must be installed.

https://www.devart.com/odbc/mongodb/

How to Connect Database Source

  1. Start by selecting the Database source in Project Wizard/Database.

  2. On the screen that opens, use Define Connection to select the database you want to define.

How to Define Tabular Data

There are multiple ways:

...

Alternative Connection Methods

The following methods are available in addition to the Project Wizard:

  1. File-based databases such as SQLite, MDB, and AccDB are indexed automatically. Rule files/DBMeta are taken into account if defined.

  2. Excel files are not recognized as structured data by default, but this can be enabled with the appropriate meta definitions and settings.

  3. With Project Wizard/Database source you can connect to all listed DBMSs. Rule files/DBMeta are taken into account if defined. Here you can select the desired tables and columns and write SQL statements. Views are supported.

  4. Settings made with Project Wizard/Database can be saved. A file in *.xDeceConnection format is then treated like a file-based source. This has several benefits.

    1. It is portable. You can use the same file in other GEODI installations or in other projects of the same GEODI.

    2. It is secure. Depending on the security level in the system settings, the username and password inside the connection cannot be read.

    3. Some DBMeta capabilities mentioned on this page only work this way.

How to Index?

Databases can consist of many tables. By default, GEODI traverses all tables and indexes all rows. System administrators can define restrictions on tables and rows through GEODI. If desired, views can be defined to control how GEODI sees the database. Many different systems and formats are supported, such as Oracle, SQL Server, Postgres, MySQL, SQLite, Access, Shapefile, and Excel.
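Views are ordinary database objects, so they can be prepared before GEODI connects. The following is a minimal sketch using Python's built-in sqlite3 module. The table, column, and view names (documents, v_public_documents) are hypothetical, and pointing GEODI at the view is done in GEODI itself and is not shown here.

```python
import sqlite3


def create_public_view(conn: sqlite3.Connection) -> None:
    """Create a view exposing only selected columns and rows of a hypothetical table."""
    conn.executescript(
        """
        CREATE TABLE IF NOT EXISTS documents (
            id INTEGER PRIMARY KEY,
            title TEXT,
            body TEXT,
            is_public INTEGER
        );
        -- The view hides the is_public flag and filters out private rows.
        CREATE VIEW IF NOT EXISTS v_public_documents AS
            SELECT id, title, body FROM documents WHERE is_public = 1;
        """
    )


conn = sqlite3.connect(":memory:")
create_public_view(conn)
conn.executemany(
    "INSERT INTO documents (title, body, is_public) VALUES (?, ?, ?)",
    [("Manual", "How to install", 1), ("Draft", "Internal notes", 0)],
)
rows = conn.execute("SELECT title FROM v_public_documents").fetchall()
print(rows)  # [('Manual',)]
```

Restricting content in a view keeps the filtering logic inside the database, where it can be reviewed and tested with ordinary SQL tools.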

No connection definition is needed for file-based databases such as SQLite and Access. These files are indexed automatically when found in any source.

You can also store the connection definitions you create with GEODI as *.dececonnection and let GEODI scan this content like any other file.

Indexed tables and rows are skipped during the crawl for changes as long as they have not changed. Changed rows are versioned automatically.

  1. Tables to be crawled must have a primary key. You can remove this constraint with a setting, but the crawl for changes and versioning features will not work for tables without a primary key.

  2. If you do not want GEODI Table objects in the search results, but only records, set DisableIndexTables to True in the Enumerator Detail settings.

  3. Files embedded in tables or linked within tables can also be included in indexing. How these features work is explained on this page.

  4. GEODI lets you overcome the limited search capabilities of the software that created the database.
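The primary-key requirement in item 1 can be checked ahead of time. The sketch below is not part of GEODI. It uses Python's built-in sqlite3 module and SQLite's PRAGMA table_info to list tables that declare no primary key in a file-based SQLite database. The table names in the demo are hypothetical.

```python
import sqlite3


def tables_without_primary_key(conn: sqlite3.Connection) -> list[str]:
    """Return names of user tables that declare no primary key column."""
    tables = [
        row[0]
        for row in conn.execute(
            "SELECT name FROM sqlite_master "
            "WHERE type = 'table' AND name NOT LIKE 'sqlite_%' ORDER BY name"
        )
    ]
    missing = []
    for table in tables:
        # PRAGMA table_info returns one row per column; the last field (pk)
        # is non-zero for columns that belong to the primary key.
        quoted = '"' + table.replace('"', '""') + '"'
        columns = conn.execute(f"PRAGMA table_info({quoted})").fetchall()
        if not any(col[5] for col in columns):
            missing.append(table)
    return missing


conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE audit_log (message TEXT, created_at TEXT);
    """
)
print(tables_without_primary_key(conn))  # ['audit_log']
```

Tables reported by such a check either need a primary key added or need the constraint relaxed in GEODI, with the loss of change crawling and versioning described above.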

Changing Table Indexing Settings

  • You can change the default behavior with jsettings files under Settings\Reader\DBMeta. Within the same settings file you can apply different settings to different tables or projects, as shown in the example.

Warning: each setting type requires its own __type key.

Example:

...

...

  1. File-based databases such as SQLite, MDB, and AccDB are indexed automatically when they are found in a directory and do not have a password. You may define DBMeta for them.

  2. You must use Project Wizard for Excel files to be indexed as structured content.

  3. Settings made with Project Wizard/Database can be saved in *.xDeceConnection format. These files are automatically processed. It is a portable and secure way of defining connections.

Indexing DBs

  1. The default behavior is to crawl all tables and rows.

  2. Tables must have a primary key by default. You may also choose to index tables without one.

  3. You may choose a sampling mode, in which only a selected number of rows is indexed in each table.

  4. You may choose a subset of tables.

  5. You may limit columns by their names.

  6. You may define SQL statements to change the content to be indexed.

  7. Embedded files are indexed with proper definitions.

  8. File paths are processed with proper definitions.

  9. If rows carry permission information, you may use it with proper definitions.

  10. When you search in GEODI, each row is a separate content item. You may define the name and appearance of a record per database or per table.

DBMeta lets you change all of the behavior above per database and/or per table. This page contains the necessary information on how DBMeta files are defined.

DBMeta files are jsettings files under “Settings\Reader\DBMeta”.

Column filters

Limit indexing by table or column.

Key

Description

WorkspaceName

WorkSpace(s) to which the settings will apply.

TableFilter

Tables to which the settings will apply.

ColumnFilter

Tables that contain the given columns (with a "-" prefix, tables that do not contain the given column are matched instead). If more than one column is given, the conditions are combined with AND.

Code Block
languagejson
{
   "Defines":[
      {
         "__type":"Geodi.Database.Meta.DBView, Geodi.Database",
         "WorkspaceName":"0000-Promotional and Educational Videos",
         "TableFilter":"*",
         "ColumnFilter":"*",
         "Columns":"-FILEPATH,-ID,DATE,GEODIFILELINK"
      }
   ]
}

With the values in the table you can limit the scope of the settings. For all settings, you can select everything with "*", exclude with "-", and list multiple values with ",".
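To make the "*", "-", and "," conventions concrete, here is a small Python sketch of one plausible way such a filter string could be read. It only illustrates the notation. The function name select_names and the exact precedence (exclusions win, and with no positive terms everything not excluded is kept) are assumptions, and GEODI's actual matching rules may differ.

```python
from fnmatch import fnmatchcase


def select_names(filter_spec: str, names: list[str]) -> list[str]:
    """Apply a '*' / '-' / ',' style filter to a list of names (assumed semantics)."""
    terms = [t.strip() for t in filter_spec.split(",") if t.strip()]
    excludes = [t[1:] for t in terms if t.startswith("-")]
    includes = [t for t in terms if not t.startswith("-")]
    selected = []
    for name in names:
        if any(fnmatchcase(name, pattern) for pattern in excludes):
            continue  # assumption: "-" terms always win
        # Assumption: with no positive terms, everything not excluded is kept.
        if not includes or any(fnmatchcase(name, p) for p in includes):
            selected.append(name)
    return selected


columns = ["ID", "TITLE", "DATE", "FILEPATH", "GEODIFILELINK"]
print(select_names("*", columns))
print(select_names("-FILEPATH,-ID", columns))
print(select_names("-FILEPATH,-ID,DATE,GEODIFILELINK", columns))
```

Under these assumptions the three calls keep all five columns, then TITLE, DATE, and GEODIFILELINK, then only DATE and GEODIFILELINK.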

How rows appear in GEODI

Each database record appears in GEODI as a separate item. The default title is the value of the first text column. You can change this for each table individually using DisplayNameMacro, and the macro can combine several columns. Macro rules are given at the end of the page.

Code Block
languagejson
{
   "Defines":[
      {
         "__type":"Geodi.Database.Meta.DBRowDisplayName, Geodi.Database",
         "DisplayNameMacro":"[TEXT4]/[TEXT2]-[TEXT3]"
      }
   ]
}
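The effect of a macro such as "[TEXT4]/[TEXT2]-[TEXT3]" can be pictured with a short Python sketch. It assumes only plain [COLUMN] substitution, since the full macro rules are not part of this section. The function name expand_display_name and the sample row are hypothetical, and this is not GEODI's own implementation.

```python
import re


def expand_display_name(macro: str, row: dict[str, object]) -> str:
    """Replace each [COLUMN] placeholder with the row's value for that column."""

    def substitute(match: re.Match) -> str:
        value = row.get(match.group(1))
        # Assumption: missing or NULL columns render as an empty string.
        return "" if value is None else str(value)

    return re.sub(r"\[([^\[\]]+)\]", substitute, macro)


row = {"TEXT2": "Ankara", "TEXT3": "2021", "TEXT4": "Report"}
print(expand_display_name("[TEXT4]/[TEXT2]-[TEXT3]", row))  # Report/Ankara-2021
```

Text outside the brackets, such as "/" and "-", is copied unchanged, so the macro acts as a title template for every row of the table.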

...

Indexing files embedded in tables

GEODI can scan files that are referenced by file path in the database or embedded in it as blobs. This document explains how to make the necessary settings.

...