{{type2count[type]}} |
{{item2group[igk].catalogs.length}} |
{{item2group[igk].catalogs.length}} |
{{g.catalogs.split(',').length}} |
{{catalog.desc}} |
This catalog appears to be empty, maybe the initial scraping is still running
{{catalog.username}} | ({{catalog.last_scrape}}) |
{{catalog.manual}} | ||
{{catalog.autoq}} | ||
{{catalog.na}} | ||
{{catalog.unmatched}} | ||
{{catalog.total}} |
|
{{type.cnt}} |
---|
{{ym.ym.substr(0,4)+'-'+ym.ym.substr(4,2)}} |
|
{{ym.cnt}} |
---|
{{u.cnt}} |
{{s.articles}} |
| |||
{{entry.ext_id}} {{entry.ext_id}} | |||
|
|||
{{entry.ext_desc|decodeEntities|removeTags|miscFixes}}
| |||
|
|||
| |||
{{entry.born}} – {{entry.died}} | |||
{{entry.lat}}/{{entry.lon}} | |||
|
|||
|
|||
| |||
{{e.title|decodeEntities|removeTags|miscFixes}} | {{e.snippet|decodeEntities|removeTags|miscFixes}} |
|
This page helps to create an automated web page scraper, to generate and update Mix'n'match catalogs.
The goal is to create a list of URLs, iterate through them, and scrape the respective pages to generate Mix'n'match entries.
[See example]
A URL can be constructed from a static part, and one or more variables, here called levels. Each level can be a defined list of keys (e.g., letters), a range (numeric from-to, plus step size),
or follow (get URLs listed on a page and follow them). The last level with be run through, before the level above it (next lower level) ticks ahead, and the higher level resets.
So, if the first level is keys A-Z, and the second is range 1-100 (step size 1), URLs will use A/1, A/2,... A/100, B/1, B/2,... Z/100.
ID | Name | Description | Type |
---|---|---|---|
{{r.id}} {{r.id}} | {{r.name}} | {{r.desc}} | {{r.type}} |