Seed URLs List

Domain Processor Control

Running
Background Processor: Active and monitoring for new domains
Check Interval: 30 seconds
Domain Processor: This service automatically monitors the TempSeedUrl table for new domains. When a new domain is detected, it validates the website, checks if it's active, uses the region from TempSeedUrl, analyzes JavaScript usage, and inserts valid domains into the GenericWebsite table.
Pending domains: 3277
Auto-restart enabled - processor will automatically start when new domains are added.

Total Websites

22999

Crawled

567

JS Heavy

12253

Total PDFs

24258968

Generic Website Data

Domain URL Status Region Spider Name Crawled Last Crawl Ended Total PDFs Found JS Heavy Actions
https://buyat.ppg.com/ehsdocumentmanagerpublic/documentSearchInnerFrame.aspx?Language=es-MX
Query: safety data sheet pdf
DONE MX buyat_ppg_spider Yes
2025-12-19
07:24:53
62756
No
https://myenginespecs.com
Query: "safety data sheet" "02/02/2025"
NEW Global Not Set No Never 0 No
https://cvfsa.org
NEW Global Not Set No Never 0 Yes
https://weststaraviation.com
Query: "safety data sheet" "05/30/2025"
NEW Global Not Set No Never 0 Yes
https://brillux.ch
NEW CH Not Set No Never 0 No
https://caranddriver.com
Query: "Sikkerhetsdatablad" "02/12/24"
NEW Global Not Set No Never 0 Yes
https://ocsarts.net
NEW Global Not Set No Never 0 Yes
https://ghpage.com
Query: "Sikkerhetsdatablad" "29/04/2024"
NEW GB Not Set No Never 0 Yes
https://clemco.no
Query: Sikkerhetsdatablad 30/06/24 filetype:pdf
NEW NO Not Set No Never 0 No
https://saudeamericas.com.br
Query: "Sikkerhetsdatablad" "30/12/2024"
NEW BR Not Set No Never 0 Yes
https://bokepindonesia.me
Query: "Sikkerhetsdatablad" "15/04/2024"
NEW ME Not Set No Never 0 No
http://download.rockwool.no
DONE Global generic_crawler_one_domain Yes
2025-12-17
06:05:59
0 No
https://www.brandenburg.com
DONE Global generic_crawler_one_domain Yes
2025-12-17
06:06:03
6
No
https://finishmaster.com
DONE Global generic_crawler_one_domain Yes
2025-12-17
06:07:11
0 No
https://tiava.com
Query: "Sikkerhetsdatablad" "15/04/2024"
NEW US Not Set No Never 0 Yes
https://liberato.com.au
DONE Global generic_crawler_one_domain Yes
2025-12-17
06:21:51
1068
No
http://www2.uwstout.edu
DONE Global generic_crawler_one_domain Yes
2025-12-17
06:21:57
0 No
https://sds.diversey.com
DONE Global generic_crawler_one_domain Yes
2025-12-17
06:27:09
0 No
https://galco.com
DONE Global comprehensive_site_spider Yes
2025-12-17
07:11:29
0 No
https://cdn.simplegreen.com
DONE Global threaded_playwright_spider Yes
2025-12-17
07:33:49
0 No
Showing 22701 to 22720 of 22999 websites
Show: