Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Version 1 Current »


The canonical phrase recognizer, as the name suggests, acts according to a rule that we define. It allows us to select and separate parts of text that are described using a formal language. We determine what the result will be. In general, this usage is referred to as "Regex" and is a general usage.

You can use the .NET Regex Tester site to get information about it and test your regexes.

Since Geodi capitalizes words, you should be careful to use capital letters in the regexes you prepare. Example usages are as follows.

[0-9,\,]+X[0-9,\,]+X[0-9,\,]*[ ]*CM|[0-9,\,]+X[0-9,\,]+[ ]*CM

In the example above you can find size expressions in documents. For example, a regular expression to find expressions like "25X25cm" or "5x10x12.5 cm".


[0-9]+[ ]*SAYFA|[0-9]+[ ]*PIECE

*  In this example, you can find the number of pages and pieces in the documents. For example "10 Pages, 20 Pieces".


MIL-STD-[0-9]*[A-Z]*

* A term used worldwide to refer to military standards. Examples you can capture with this regex: "MIL-STD-1376" "MIL-STD-1246C".


(HTTPS?://)?(WWW[.])?([Iİ]NSTAGRAM[.]COM)(([/?#][^\s]*)|\s+)

* Recognizes Instagram addresses. Supports different spellings. https://www.instagram.com/deceyazilim/ www.instagram.com/deceyazilim/ instagram.com/deceyazilim/.


\b[A-Z]{2}[0-9]{2}(?:[ ]?[0-9]{4}){4,5}(?!(?:[ ]?[0-9]){3})(?:[ ]?[0-9]{1,2})?\b

* Recognizes Iban numbers. Recognizes the version without spaces used worldwide and the version with spaces used in our country.


(0[1-9]|[1-7][0-9]|8[01])\s?((([A-Z])\s?(\d{4,5}))|(([A-Z]{2})\s?(\d{3,4}))|(([A-Z]{3})\s?(\d{2,3}))) 

* Recognizes license plates that comply with the Republic of Türkiye license plate standards.


Below you will find the canonical expressions of some card types. They may seem random, but they all have a standard.

* Mastercard : (?:(5[1-5][0-9]{2})|(222[1-9])|(22[3-9][0-9])|(2[3-6][0-9]{2})|(27[01][0-9])|(2720))[0-9]{12}

* Visa : 4[0-9]{12}(?:[0-9]{3})

* American Express : 3[47][0-9]{13}

* JCB : (?:2131|1800|35\d{3})\d{11}


You can use the Words page to better analyze the results.

You should make sure that there are no spaces at the beginning and end of the regex expression.

You must have Geodi Pro license and module to use this recognizer.


  • No labels