Seed URLs List
Domain Processor Control
Running
Background Processor:
Active and monitoring for new domains
Check Interval:
30
seconds
Domain Processor: This service automatically
monitors the TempSeedUrl table for new domains. When a new domain
is detected, it validates the website, checks if it's active, uses
the region from TempSeedUrl, analyzes JavaScript usage, and
inserts valid domains into the GenericWebsite table.
Pending domains: 3277
Auto-restart enabled - processor will automatically start when new domains are added.
Pending domains: 3277
Auto-restart enabled - processor will automatically start when new domains are added.
Total Websites
22999
Crawled
562
JS Heavy
12253
Total PDFs
24010331
Generic Website Data
Status:
Region:
Date Range:
| Domain URL | Status | Region | Spider Name | Crawled | Last Crawl Ended | Total PDFs Found | JS Heavy | Actions |
|---|---|---|---|---|---|---|---|---|
|
www.pecosales.com
|
PROCESSING | Global | Not Set | No | Never | 0 | No | |
|
www.quidelta.com.mx
|
DONE | Global | generic_crawler_one_domain | Yes |
2025-12-24
01:21:28
|
0 | No | |
|
https://ohmynews.com
Query: "Sicherheitsdatenblatt" "11/01/2025"
|
NEW | KR | Not Set | No | Never | 0 | Yes | |
|
https://seegene.com
|
NEW | KR | Not Set | No | Never | 0 | No | |
|
https://www.albert-roller.de/sicherheitsdatenblaetter.aspx
|
DONE | Global | generic_crawler_one_domain | Yes |
2025-12-24
09:59:56
|
0 | No | |
|
https://safetysheets.business.xerox.com/en-us/?_gl=1*uqvkcr*_ga*MjAzMTE0OTA4OC4xNjEwNDUxOTQw*_ga_XBMQ3R9MZE*MTY1ODE1MzgyNi44OS4xLjE2NTgxNTQzNTcuMA..&_ga=2.185843360.809565812.1658093466-2031149088.1610451940
|
DONE | Global | generic_crawler_one_domain | Yes |
2025-12-24
10:00:05
|
4
|
No | |
|
https://mypolycc.edu.my
|
NEW | MY | Not Set | No | Never | 0 | Yes | |
|
https://sensorstechforum.com
Query: "Sikkerhetsdatablad" "09/02/24"
|
NEW | BG | Not Set | No | Never | 0 | No | |
|
https://webpirs.nch.com
|
DONE | Global | generic_crawler_one_domain | Yes |
2025-12-25
00:09:31
|
0 | No | |
|
https://d3qi0qp55mx5f5.cloudfront.net
|
DONE | Global | generic_crawler_one_domain | Yes |
2025-12-25
00:13:48
|
0 | No | |
|
http://www.stonercarcare.hk
|
DONE | Global | generic_crawler_one_domain | Yes |
2025-12-25
01:24:22
|
368
|
No | |
|
https://urban.org
Query: "Sicherheitsdatenblatt" "11/20/24"
|
NEW | Global | Not Set | No | Never | 0 | No | |
|
https://cwhaydenonline.com
Query: safety data sheet 02/26/24 filetype:pdf
|
NEW | Global | Not Set | No | Never | 0 | No | |
|
https://www.exdron.co.il
|
DONE | Global | generic_crawler_one_domain | Yes |
2025-12-26
07:17:13
|
58031
|
No | |
|
https://pure-chemical.com
Query: Sicherheitsdatenblatt 01/05/25 filetype:pdf
|
NEW | US | Not Set | No | Never | 0 | Yes | |
|
http://www.wakefield-vette.com
|
DONE | Global | threaded_playwright_spider | Yes |
2025-12-27
03:24:14
|
266
|
No | |
|
https://store.safety-kleen.ca
|
DONE | Global | generic_crawler_one_domain | Yes |
2025-12-27
10:28:32
|
2733
|
No | |
|
http://h22235.www2.hp.com
|
DONE | Global | generic_crawler_one_domain | Yes |
2025-12-27
10:35:20
|
0 | No | |
|
https://hepf.com
|
PROCESSING | Global | Not Set | No | Never | 0 | No |
Showing 22981
to 22999
of 22999 websites
Show: