Seed URLs List
Domain Processor Control
Running
Background Processor:
Active and monitoring for new domains
Check Interval:
30
seconds
Domain Processor: This service automatically
monitors the TempSeedUrl table for new domains. When a new domain
is detected, it validates the website, checks if it's active, uses
the region from TempSeedUrl, analyzes JavaScript usage, and
inserts valid domains into the GenericWebsite table.
Pending domains: 3277
Auto-restart enabled - processor will automatically start when new domains are added.
Pending domains: 3277
Auto-restart enabled - processor will automatically start when new domains are added.
Total Websites
22999
Crawled
567
JS Heavy
12253
Total PDFs
24258968
Generic Website Data
Status:
Region:
Date Range:
| Domain URL | Status | Region | Spider Name | Crawled | Last Crawl Ended | Total PDFs Found | JS Heavy | Actions |
|---|---|---|---|---|---|---|---|---|
|
https://buyat.ppg.com/ehsdocumentmanagerpublic/documentSearchInnerFrame.aspx?Language=es-MX
Query: safety data sheet pdf
|
DONE | MX | buyat_ppg_spider | Yes |
2025-12-19
07:24:53
|
62756
|
No | |
|
https://myenginespecs.com
Query: "safety data sheet" "02/02/2025"
|
NEW | Global | Not Set | No | Never | 0 | No | |
|
https://cvfsa.org
|
NEW | Global | Not Set | No | Never | 0 | Yes | |
|
https://weststaraviation.com
Query: "safety data sheet" "05/30/2025"
|
NEW | Global | Not Set | No | Never | 0 | Yes | |
|
https://brillux.ch
|
NEW | CH | Not Set | No | Never | 0 | No | |
|
https://caranddriver.com
Query: "Sikkerhetsdatablad" "02/12/24"
|
NEW | Global | Not Set | No | Never | 0 | Yes | |
|
https://ocsarts.net
|
NEW | Global | Not Set | No | Never | 0 | Yes | |
|
https://ghpage.com
Query: "Sikkerhetsdatablad" "29/04/2024"
|
NEW | GB | Not Set | No | Never | 0 | Yes | |
|
https://clemco.no
Query: Sikkerhetsdatablad 30/06/24 filetype:pdf
|
NEW | NO | Not Set | No | Never | 0 | No | |
|
https://saudeamericas.com.br
Query: "Sikkerhetsdatablad" "30/12/2024"
|
NEW | BR | Not Set | No | Never | 0 | Yes | |
|
https://bokepindonesia.me
Query: "Sikkerhetsdatablad" "15/04/2024"
|
NEW | ME | Not Set | No | Never | 0 | No | |
|
http://download.rockwool.no
|
DONE | Global | generic_crawler_one_domain | Yes |
2025-12-17
06:05:59
|
0 | No | |
|
https://www.brandenburg.com
|
DONE | Global | generic_crawler_one_domain | Yes |
2025-12-17
06:06:03
|
6
|
No | |
|
https://finishmaster.com
|
DONE | Global | generic_crawler_one_domain | Yes |
2025-12-17
06:07:11
|
0 | No | |
|
https://tiava.com
Query: "Sikkerhetsdatablad" "15/04/2024"
|
NEW | US | Not Set | No | Never | 0 | Yes | |
|
https://liberato.com.au
|
DONE | Global | generic_crawler_one_domain | Yes |
2025-12-17
06:21:51
|
1068
|
No | |
|
http://www2.uwstout.edu
|
DONE | Global | generic_crawler_one_domain | Yes |
2025-12-17
06:21:57
|
0 | No | |
|
https://sds.diversey.com
|
DONE | Global | generic_crawler_one_domain | Yes |
2025-12-17
06:27:09
|
0 | No | |
|
https://galco.com
|
DONE | Global | comprehensive_site_spider | Yes |
2025-12-17
07:11:29
|
0 | No | |
|
https://cdn.simplegreen.com
|
DONE | Global | threaded_playwright_spider | Yes |
2025-12-17
07:33:49
|
0 | No |
Showing 22701
to 22720
of 22999 websites
Show: