Recognizer:Regular Expression
The canonical phrase recognizer, as the name suggests, acts according to a rule that we define. It allows us to select and separate parts of text that are described using a formal language. We determine what the result will be. In general, this usage is referred to as "Regex" and is a general usage.
You can use the .NET Regex Tester site to get information about it and test your regexes.
Since Geodi capitalizes words, you should be careful to use capital letters in the regexes you prepare. Example usages are as follows.
[0-9,\,]+X[0-9,\,]+X[0-9,\,]*[ ]*CM|[0-9,\,]+X[0-9,\,]+[ ]*CM
In the example above you can find size expressions in documents. For example, a regular expression to find expressions like "25X25cm" or "5x10x12.5 cm".
[0-9]+[ ]*SAYFA|[0-9]+[ ]*PIECE
* In this example, you can find the number of pages and pieces in the documents. For example "10 Pages, 20 Pieces".
MIL-STD-[0-9]*[A-Z]*
* A term used worldwide to refer to military standards. Examples you can capture with this regex: "MIL-STD-1376" "MIL-STD-1246C".
You may also use near keywords and validation algorithms like Luhn, Verhoef, or custom ones.
You can use the Words page to analyze the results better.
You should ensure there are no spaces at the beginning and end of the regex expression.
You must have a Geodi Pro license and module to use this recognizer.