GEODI can also index databases, i.e., structured data. All tables and rows can be indexed, together with the files they reference or embed. With simple rules, you can limit tables, rows, or fields and set how rows are displayed to the user.
Database Clients
Some databases may require a driver on the server where GEODI is located. Use Project Wizard/Database connection to define the connection.
...
Use “New Connection” to choose the database and define the connection string. (This dialog only opens if you are logged into the server.)
Some databases may require an extra driver to be installed. Please check the list below.
DBMS | Client |
---|---|
PostGres | No extra installation is required. |
Microsoft SQL Server | No extra installation is required. |
SqLite | No extra installation is required. |
Shape File | No extra installation is required. |
CSV File | No extra installation is required. |
KML File | No extra installation is required. |
Oracle (OleDB) | Oracle ODAC driver must be installed. Choose the Oracle Client download that matches the Oracle version you connect to; Oracle requires an account for the download. https://www.oracle.com/database/technologies/odac-downloads.html |
Excel | Access Database Engine 2010 must be installed. https://www.microsoft.com/en-us/download/details.aspx?id=13255 |
MS Access | Access Database Engine 2010 must be installed. https://www.microsoft.com/en-us/download/details.aspx?id=13255 |
ODBC | No extra installation is required. |
SQLCE | No extra installation is required. |
SQL Express | No extra installation is required. |
DB2 | OLEDB driver must be installed. |
Kafka | No extra installation is required. |
Oracle BigData | Microsoft Hive ODBC Driver must be installed. https://www.microsoft.com/en-us/download/details.aspx?id=40886 |
Cassandra | An OLEDB or ODBC driver is required, for example the CData Cassandra drivers. |
MongoDB | An OLEDB or ODBC driver is required, for example the Devart ODBC Driver for MongoDB. |
How to Connect Database Source
Start by selecting the Database Source in Project Wizard/Database.
On the screen that opens, use Define Connection to select the database you want to connect to.
How to Define Tabular Data
There are multiple ways:
...
Alternative Connection Methods
In addition to the Project Wizard steps above, the following options exist:
File-based databases such as SQLite, MDB, and AccDB are indexed automatically. Rule files/DBMeta are considered if defined.
Excel files are not recognized as structured data by default, but this can be enabled with appropriate meta and settings.
With Project Wizard/Database source, you can connect to all listed DBMSs. Rule files/DBMeta are taken into account if defined. Here you can select the desired tables and columns and write SQL statements. Views are supported.
Settings made with Project Wizard/Database can be stored in a file in *.xDeceConnection format, which GEODI recognizes like any other file-based source. This approach has several benefits:
It is portable. You can use the same file in other GEODI installations or in other projects of the same GEODI.
It is secure. Depending on the system security level settings, the username and password inside the connection cannot be read.
Some DBMeta capabilities mentioned on this page only work this way.
How to Index?
Databases can consist of many tables. By default, GEODI traverses all tables and indexes all rows. System administrators can define rules in GEODI to restrict tables and rows. If desired, views can be defined to control how GEODI sees the database. Many different systems and formats are supported, such as Oracle, SQL Server, Postgres, MySQL, SQLite, Access, Shapefile, and Excel.
No connection definition is needed for file-based databases such as SQLite and Access. These files are indexed automatically when found in any source.
You can also store the connection definitions you create with GEODI as *.dececonnection and let GEODI scan this content like any other file.
Indexed tables and rows that have not changed are skipped when crawling for changes. Changed rows are versioned automatically.
Tables to be crawled must have a primary key. You can remove this constraint with a setting, but change crawling and versioning will not work for tables without a primary key.
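Because change crawling and versioning depend on primary keys, it can help to check a database for tables that lack one before indexing. The following is a minimal sketch for a SQLite file using Python's standard `sqlite3` module. It is not part of GEODI; the function name `tables_without_primary_key` and the demo tables are hypothetical.

```python
import sqlite3

def tables_without_primary_key(conn):
    """Return the names of user tables that declare no primary key column."""
    cur = conn.execute(
        "SELECT name FROM sqlite_master "
        "WHERE type = 'table' AND name NOT LIKE 'sqlite_%' ORDER BY name"
    )
    missing = []
    for (name,) in cur.fetchall():
        # PRAGMA table_info returns one row per column; field 5 ("pk") is
        # non-zero for columns that belong to the primary key.
        quoted = '"' + name.replace('"', '""') + '"'
        columns = conn.execute(f"PRAGMA table_info({quoted})").fetchall()
        if not any(col[5] for col in columns):
            missing.append(name)
    return missing

# Demo with an in-memory database (hypothetical tables)
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE videos (id INTEGER PRIMARY KEY, title TEXT)")
conn.execute("CREATE TABLE log (message TEXT)")
print(tables_without_primary_key(conn))  # ['log']
```

Tables reported by such a check would need either a primary key or the setting that lifts the constraint, with the limitations noted above.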
If you do not want GEODI Table objects in the search results, but only records, set DisableIndexTables to True in the Enumerator Detail settings.
Files embedded in tables or linked within tables can also be included in indexing. How these features work is explained on this page.
This way, you overcome the limited search capabilities offered by the software that created the database.
Changing Table Indexing Settings
You can change the default behavior with jsettings files under Settings\Reader\DBMeta. Within the same settings file, you can apply different settings for different tables or projects, as shown in the example.
Each setting type requires its own __type key.
Example:
...
...
File-based databases such as SQLite, MDB, and AccDB are indexed automatically when found in a directory, provided they do not have a password. You may define DBMeta for them.
You must use Project Wizard for Excel files to be indexed as structured content.
Settings made with Project Wizard/Database can be saved in *.xDeceConnection format. These files are automatically processed. It is a portable and secure way of defining connections.
Indexing DBs
The default behavior is to crawl all tables and rows.
Tables must have a primary key by default. You may also choose to index tables without a primary key.
You may choose a sampling mode, in which only a selected number of rows is indexed in each table.
You may choose a subset of tables.
You may limit columns by their names.
You may define SQL statements to change the content to be indexed.
Embedded files are indexed with proper definitions.
File paths are processed with proper definitions.
If rows carry permission information, you may utilize it with proper definitions.
When you search in GEODI, each row is a separate content item. You may define the name and appearance of a record per database or per table.
DBMeta is a way to change all of this behavior per database and/or per table. This page contains the necessary information about how DBMeta files are defined.
DBMeta files are jsettings files under “Settings\Reader\DBMeta”.
Column filters
Limit indexing by table or column.
Key | Description |
---|---|
WorkspaceName | WorkSpace(s) to which the settings will apply. |
TableFilter | Tables to which the settings will apply. |
ColumnFilter | Tables that contain the given columns (if - is given, tables that do not contain the relevant columns are considered). If more than one is given, the relationship is AND. |
```json
{
  "Defines": [
    {
      "__type": "Geodi.Database.Meta.DBView, Geodi.Database",
      "WorkspaceName": "Educational Videos",
      "TableFilter": "*",
      "ColumnFilter": "*",
      "Columns": "-FILEPATH,-ID,DATE,GEODIFILELINK"
    }
  ]
}
```
With the values in the table, you can limit the scope of the settings. For all filters, you can match everything with "*", exclude a value with "-", and list multiple values separated by ",".
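The ColumnFilter rule described above can be illustrated with a short Python sketch. This is not GEODI's implementation; the function `matches_column_filter` and the sample tables are hypothetical. It only models the stated rule: "*" places no restriction, a plain name requires the column, a "-" prefix requires its absence, and comma-separated entries are combined with AND. Case-insensitive matching is an assumption.

```python
def matches_column_filter(table_columns, column_filter):
    """Return True if a table with the given columns satisfies the filter."""
    # Assumption: column names are compared case-insensitively.
    available = {c.upper() for c in table_columns}
    for entry in column_filter.split(","):
        entry = entry.strip()
        if not entry or entry == "*":
            continue  # "*" places no restriction
        if entry.startswith("-"):
            # Excluded column: the table must NOT contain it.
            if entry[1:].upper() in available:
                return False
        elif entry.upper() not in available:
            # Required column: the table must contain it.
            return False
    return True

# Hypothetical tables and their columns
tables = {
    "VIDEOS": ["ID", "TITLE", "FILEPATH"],
    "NOTES": ["ID", "TITLE"],
}
selected = [name for name, cols in tables.items()
            if matches_column_filter(cols, "TITLE,-FILEPATH")]
print(selected)  # ['NOTES']
```

Here the settings would apply only to NOTES, because VIDEOS contains the excluded FILEPATH column.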
How rows appear in GEODI
Each database record appears in GEODI as a separate item. The default title is the value of the first text column. You can change this for each table individually using DisplayNameMacro, and the macro can reference other columns as well. Macro rules are given at the end of the page.
```json
{
  "Defines": [
    {
      "__type": "Geodi.Database.Meta.DBRowDisplayName, Geodi.Database",
      "DisplayNameMacro": "[TEXT4]/[TEXT2]-[TEXT3]"
    }
  ]
}
```
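To make the effect of a title macro concrete, the sketch below substitutes `[COLUMN]` placeholders with values from a row. It is only an illustration in Python, assuming that each bracketed name refers to a column of the row. The function `expand_display_name` and the sample row are hypothetical, and GEODI's actual macro rules may support more than simple substitution.

```python
import re

def expand_display_name(macro, row):
    """Replace each [COLUMN] placeholder with the row's value for that column."""
    def substitute(match):
        # Assumption: unknown or empty columns become an empty string.
        value = row.get(match.group(1))
        return "" if value is None else str(value)
    return re.sub(r"\[([^\[\]]+)\]", substitute, macro)

# Hypothetical row from an indexed table
row = {"TEXT2": "Intro", "TEXT3": "Part 1", "TEXT4": "Training"}
title = expand_display_name("[TEXT4]/[TEXT2]-[TEXT3]", row)
print(title)  # Training/Intro-Part 1
```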
...
Indexing files embedded in tables
GEODI can scan files that are referenced by file paths in the database or are embedded in the database as BLOBs. This document explains how to make the necessary settings.
...