...
Expand | ||
---|---|---|
| ||
You can provide a single address or multiple addresses. Domain restriction settings will work independently for each address. |
Expand | ||
---|---|---|
| ||
With level=0 only the given page is indexed. The level must be large enough to access all pages. For cases with paging, the level value can be 1000+. |
Expand | ||
---|---|---|
| ||
Many web pages use URL parameters. By default, GEODI creates content for each unique URL. But in some cases, a parameter may not change the content of the page. In such situations, you may ignore such parameters to get a better index result. For example: |
Expand | ||
---|---|---|
| ||
Some websites may have social media links, advertising pages, or similar pages that you don't want in to the index.
|
Expand | ||||
---|---|---|---|---|
|
| |||
GEODI has rules on a per-web-page basis. Some rules come pre-configured. For example, only the "info box" containing content is processed on Wikipedia pages. Pagination controls found on some web pages (such as links appearing as 1, 2, 3,..., 10 and determining the pages) are automatically processed. |
Expand | ||
---|---|---|
| ||
Some pages are generated using JavaScript. In such cases, the HTML content of the page may not provide the necessary information. In these situations, the "Render like Browsers" option should be checked. Indexing will be slower but will yield the desired results. For this option to work, an alternative web browser module must be installed for the option to work. |
Expand | ||
---|---|---|
| ||
Page names are formed using the order of og:title → title → page URL You can query pages using |
...