Seed URLs List
Domain Processor Control
Running
Background Processor:
Active and monitoring for new domains
Check Interval:
30
seconds
Domain Processor: This service automatically
monitors the TempSeedUrl table for new domains. When a new domain
is detected, it validates the website, checks if it's active, uses
the region from TempSeedUrl, analyzes JavaScript usage, and
inserts valid domains into the GenericWebsite table.
Pending domains: 3228
Auto-restart enabled - processor will automatically start when new domains are added.
Pending domains: 3228
Auto-restart enabled - processor will automatically start when new domains are added.
Total Websites
23076
Crawled
626
JS Heavy
12276
Total PDFs
37653393
Generic Website Data
Status:
Region:
Date Range:
| Domain URL | Status | Region | Spider Name | Crawled | Last Crawl Ended | Total PDFs Found | JS Heavy | Actions |
|---|---|---|---|---|---|---|---|---|
|
https://meinhartje.com
Query: Sicherheitsdatenblatt 15/01/2024 filetype:pdf
|
NEW | DE | Not Set | No | Never | 0 | Yes | |
|
https://bezreg-muenster.de
|
NEW | DE | Not Set | No | Never | 0 | Yes | |
|
https://welte-glasuren.com
|
NEW | DE | Not Set | No | Never | 0 | Yes | |
|
https://correiobraziliense.com.br
Query: "Sicherheitsdatenblatt" "01/09/2024"
|
NEW | BR | Not Set | No | Never | 0 | Yes | |
|
https://baustoffshop.de
|
NEW | DE | Not Set | No | Never | 0 | Yes | |
|
https://facts.net
Query: "Sicherheitsdatenblatt" "01/03/24"
|
NEW | US | Not Set | No | Never | 0 | Yes | |
|
https://petec.de
|
NEW | DE | Not Set | No | Never | 0 | Yes | |
|
https://glutoclean.de
|
NEW | DE | Not Set | No | Never | 0 | No | |
|
https://maerkl-gmbh.de
Query: Sicherheitsdatenblatt 01/19/24 filetype:pdf
|
NEW | DE | Not Set | No | Never | 0 | Yes | |
|
https://theweather.com
Query: "Sicherheitsdatenblatt" "01/09/24"
|
NEW | ES | Not Set | No | Never | 0 | Yes | |
|
https://istanbeautiful.com
Query: "Sicherheitsdatenblatt" "05/01/24"
|
NEW | US | Not Set | No | Never | 0 | Yes | |
|
https://classengroup.com
|
NEW | DE | Not Set | No | Never | 0 | No | |
|
https://theatersonline.com
Query: "Sicherheitsdatenblatt" "08/01/2024"
|
NEW | GB | Not Set | No | Never | 0 | Yes | |
|
https://linde.ch
|
NEW | CH | Not Set | No | Never | 0 | Yes | |
|
https://alfresa-pharma-global.com
|
NEW | JP | Not Set | No | Never | 0 | No | |
|
https://uniti.de
|
NEW | DE | Not Set | No | Never | 0 | No | |
|
https://igk-facility.at
|
NEW | AT | Not Set | No | Never | 0 | Yes | |
|
https://hostrs.com
Query: "Sicherheitsdatenblatt" "01/08/2024"
|
NEW | IN | Not Set | No | Never | 0 | Yes | |
|
https://sakret-sachsen.de
|
NEW | DE | Not Set | No | Never | 0 | No | |
|
https://abionova.at
|
NEW | AT | Not Set | No | Never | 0 | Yes |
Showing 19481
to 19500
of 23076 websites
Show: