Seed URLs List

Domain Processor Control

Running
Background Processor: Active and monitoring for new domains
Check Interval: 30 seconds
Domain Processor: This service automatically monitors the TempSeedUrl table for new domains. When a new domain is detected, it validates the website, checks if it's active, uses the region from TempSeedUrl, analyzes JavaScript usage, and inserts valid domains into the GenericWebsite table.
Pending domains: 3277
Auto-restart enabled - processor will automatically start when new domains are added.

Total Websites

22999

Crawled

562

JS Heavy

12253

Total PDFs

24010331

Generic Website Data

Domain URL Status Region Spider Name Crawled Last Crawl Ended Total PDFs Found JS Heavy Actions
www.pecosales.com
PROCESSING Global Not Set No Never 0 No
www.quidelta.com.mx
DONE Global generic_crawler_one_domain Yes
2025-12-24
01:21:28
0 No
https://ohmynews.com
Query: "Sicherheitsdatenblatt" "11/01/2025"
NEW KR Not Set No Never 0 Yes
https://seegene.com
NEW KR Not Set No Never 0 No
https://www.albert-roller.de/sicherheitsdatenblaetter.aspx
DONE Global generic_crawler_one_domain Yes
2025-12-24
09:59:56
0 No
https://safetysheets.business.xerox.com/en-us/?_gl=1*uqvkcr*_ga*MjAzMTE0OTA4OC4xNjEwNDUxOTQw*_ga_XBMQ3R9MZE*MTY1ODE1MzgyNi44OS4xLjE2NTgxNTQzNTcuMA..&_ga=2.185843360.809565812.1658093466-2031149088.1610451940
DONE Global generic_crawler_one_domain Yes
2025-12-24
10:00:05
4
No
https://mypolycc.edu.my
NEW MY Not Set No Never 0 Yes
https://sensorstechforum.com
Query: "Sikkerhetsdatablad" "09/02/24"
NEW BG Not Set No Never 0 No
https://webpirs.nch.com
DONE Global generic_crawler_one_domain Yes
2025-12-25
00:09:31
0 No
https://d3qi0qp55mx5f5.cloudfront.net
DONE Global generic_crawler_one_domain Yes
2025-12-25
00:13:48
0 No
http://www.stonercarcare.hk
DONE Global generic_crawler_one_domain Yes
2025-12-25
01:24:22
368
No
https://urban.org
Query: "Sicherheitsdatenblatt" "11/20/24"
NEW Global Not Set No Never 0 No
https://cwhaydenonline.com
Query: safety data sheet 02/26/24 filetype:pdf
NEW Global Not Set No Never 0 No
https://www.exdron.co.il
DONE Global generic_crawler_one_domain Yes
2025-12-26
07:17:13
58031
No
https://pure-chemical.com
Query: Sicherheitsdatenblatt 01/05/25 filetype:pdf
NEW US Not Set No Never 0 Yes
http://www.wakefield-vette.com
DONE Global threaded_playwright_spider Yes
2025-12-27
03:24:14
266
No
https://store.safety-kleen.ca
DONE Global generic_crawler_one_domain Yes
2025-12-27
10:28:32
2733
No
http://h22235.www2.hp.com
DONE Global generic_crawler_one_domain Yes
2025-12-27
10:35:20
0 No
https://hepf.com
PROCESSING Global Not Set No Never 0 No
Showing 22981 to 22999 of 22999 websites
Show: