Source:Databases
GEODI can also evaluate databases, i.e., structured data. Databases and their file information (including embedded) can be indexed (all tables and rows). With simple rules, you can limit tables, rows, or fields and set how rows are displayed to the user.
Use Project Wizard/Database connection to define the connection.
Conditions for connection
Connection parameters (connection string) to the DB for read-only access
Find the DB name on the table below if an additional driver is required (like ODAC for Oracle)
Sampled data discovery decision (N record per table)
The decision for Indexing and discovery of Embedded or linked files
Use “New Connection” to choose the database and define the connection string.
Views can now be excluded if preferred. Since Views often contain data already present in other tables, excluding them will help reduce duplicate data discovery and improve performance.
Supported DBMS
Some databases may require an extra driver to be installed. Please check the list below.
Supported DBMS | Client |
---|---|
OLEDB | It may be possible to connect to database management systems or data sources that are not listed below using OLEDB. For this purpose, you can use a small utility included with the GEODI installation. The utility located at: For sources not listed, you may need to install an appropriate OLEDB driver. Please refer to the technical documentation provided by the data source. |
ODBC | For ODBC sources, you can check whether the driver is registered using the ODBC Data Source Administrator. For sources not listed, you may need to install the appropriate ODBC driver. Please refer to the technical documentation provided by the data source. |
|
|
Elasticsearch |
Connection String for GEODI Dsn=my_elasticsearch_dsn;Driver={Elasticsearch Driver};description={my description};uid=username;pwd=password;server=your_es_host;port=9200 |
PostGres | No extra installation is required. |
Microsoft SQL Server | No extra installation is required. |
Sybase | ODBC driver installation is required. |
Oracle (OleDB) | Additional Oracle .NET and Visual Studio ODAC Downloads ODAC OUI and ODAC 12.1.0.2.4 can be downloaded from the following link: The downloaded ODAC 12.1.0.2.4 package should be extracted to a directory. Run CMD as Administrator, navigate to the directory where ODAC 12.1.0.2.4 is located, and execute the following command:
|
DB2 | Oledb Driver must be installed. |
Oracle BigData | Microsoft Hive ODBC Driver must be installed. https://www.microsoft.com/en-us/download/details.aspx?id=40886 |
SqLite | No extra installation is required. |
Shape File | No extra installation is required. |
CSV File | No extra installation is required. |
KML File | No extra installation is required. |
SQLCE, SQL express | No extra installation is required. |
Kafka | No extra installation is required. |
Excel | Access Database Engine 2010 must be installed. https://www.microsoft.com/en-us/download/details.aspx?id=13255 |
MS Access | Access Database Engine 2010 must be installed. https://www.microsoft.com/en-us/download/details.aspx?id=13255 |
Cassandra | OLEDB or ODBC driver is required |
MongoDB | OLEDB or ODBC driver is required ODBC Driver for MongoDB - Fast Access From ODBC-Compliant Tools and Languages |
Alternative Connection Methods
Other than using the project wizard
File-based ones such as SQLite, MDB, and AccDB are indexed automatically when found in a directory and don't have a password. You may define DBMeta for them.
You must use Project Wizard for Excel files to be indexed as structured content.
Settings made with Project Wizard/Database can be saved in *.xDeceConnection format. These files are automatically processed. It is a portable and secure way of defining connections.
Indexing DBs
The default behavior is to crawl all tables and rows.
Tables must have primaryKey by default. You may choose to index tables w/o a primary key also.
You may choose a sampling mode, as only the selected number of rows is indexed in each table.
You may choose a subset of tables.
You may limit columns by their names.
You may define SQLs to change content to be indexed
Embedded files are indexed with proper definitions
File paths are processed with proper definitions
If rows have some permission information you may utilize it with proper definitions. (There are also different ways to define applications like CRM and ERP as sources.)
When you search in GEODI, Each row is content. You may define the name and appearance of a record database or table-wise.
Trouble Shooting