You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Oct 28, 2022. It is now read-only.
At the moment strings like www.domain.com, domain.com, subdomain.domain.com are treated as individual jobs with independent results. This can lead to some data collisions since we use document URL as a key of document tabel.
We should:
investigate if www.domain.com and domain.com are treated as the same address in Heritrix
for subdomains we can have independent Heritrix jobs, but we should link them to already existing documents if they were downloaded and checked before within another job.
The text was updated successfully, but these errors were encountered:
At the moment strings like www.domain.com, domain.com, subdomain.domain.com are treated as individual jobs with independent results. This can lead to some data collisions since we use document URL as a key of document tabel.
We should:
The text was updated successfully, but these errors were encountered: