What's Changed
New repositories:
- Add SeaNoe support: #84 by @micafer in #85
- Add B2Share support #88 by @micafer in #89
- Add data.europa.eu support by @micafer in #87
Other changes:
- Raise for status in more places by @J535D165 in #80
- Add dynamic test matrix by @J535D165 in #90
- Remove OSF storage integration test (temporarily) by @J535D165 in #91
- [pre-commit.ci] pre-commit autoupdate by @pre-commit-ci in #81
Full Changelog: v0.12...v0.13
Coverage report
The following benchmark was run on 1000 randomly selected records from DataCite.
Percentages
Percentage of datasets supported: 26.3%
Percentage of datasets not supported: 69.8%
Percentage of datasets with error: 3.9%
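As a rough sanity check, the three percentages can be reproduced from raw outcome counts. The sketch below is only illustrative: the counts (263, 698, 39) are inferred from the reported percentages and the sample size of 1000, and `outcome_percentages` is a hypothetical helper, not part of the project.

```python
from collections import Counter


def outcome_percentages(outcomes):
    """Return the share of each outcome label as a percentage (one decimal)."""
    counts = Counter(outcomes)
    total = sum(counts.values())
    return {label: round(100 * n / total, 1) for label, n in counts.items()}


# Assumption: counts inferred from the reported percentages on 1000 records.
outcomes = ["supported"] * 263 + ["not supported"] * 698 + ["error"] * 39
percentages = outcome_percentages(outcomes)
print(percentages)
```

The 39 error outcomes match the number of rows in the table of unexpected errors.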
Table with unexpected errors
| | id | type | url | service | error |
---|---|---|---|---|---|
47 | 10.58100/ibcr0302rx67ws2 | dois | http://dis.iodp.pangaea.de/BCRDIS/webview/CORES_INFO.aspx?SKEY=21038&SAM=IBCR0302RX67WS2 | nan | 503 Server Error: Service Unavailable for url: https://dis.iodp.pangaea.de/BCRDIS/webview/CORES_INFO.aspx?SKEY=21038&SAM=IBCR0302RX67WS2 |
52 | 10.18730/v7c2= | dois | https://glis.fao.org/glis/doi/10.18730/V7C2= | nan | '10.18730/v7c2=' is not a correct resource identifier (e.g. a URL, DOI, Handle) |
73 | 10.20345/digitue.1029.61 | dois | http://idb.ub.uni-tuebingen.de/opendigi/litrdsch_1902#p=141 | nan | 500 Server Error: Internal Server Error for url: https://opendigi.ub.uni-tuebingen.de/opendigi/litrdsch_1902#p=141 |
96 | 10.17876/plate/dr.2/plates/201_33742 | dois | https://www.plate-archive.org/objects/dr.2/plates/201_33742 | nan | 500 Server Error: Internal Server Error for url: https://www.plate-archive.org/objects/dr.2/plates/201_33742/ |
119 | 10.18430/m3.irrmc.4168 | dois | https://proteindiffraction.org/project/SETDB1-x122 | nan | 'NoneType' object has no attribute 'find' |
129 | 10.58100/ibcr0310rxocku2 | dois | http://dis.iodp.pangaea.de/BCRDIS/webview/CORES_INFO.aspx?SKEY=22618&SAM=IBCR0310RXOCKU2 | nan | 503 Server Error: Service Unavailable for url: https://dis.iodp.pangaea.de/BCRDIS/webview/CORES_INFO.aspx?SKEY=22618&SAM=IBCR0310RXOCKU2 |
133 | 10.14469/ch/8676 | dois | https://spectradspace.lib.imperial.ac.uk:8443/dspace/handle/10042/to-8701 | nan | HTTPSConnectionPool(host='spectradspace.lib.imperial.ac.uk', port=8443): Max retries exceeded with url: /dspace/handle/10042/to-8701 (Caused by ConnectTimeoutError(<urllib3.connection.HTTPSConnection object at 0x7f3577e3d8b0>, 'Connection to spectradspace.lib.imperial.ac.uk timed out. (connect timeout=3)')) |
136 | 10.17614/q4h70857g | dois | http://pqr.pitt.edu/mol/KFKSYDSVYUWMHK-UHFFFAOYSA-N | nan | HTTPConnectionPool(host='pqr.pitt.edu', port=80): Max retries exceeded with url: /mol/KFKSYDSVYUWMHK-UHFFFAOYSA-N (Caused by ConnectTimeoutError(<urllib3.connection.HTTPConnection object at 0x7f3583a2eb10>, 'Connection to pqr.pitt.edu timed out. (connect timeout=3)')) |
256 | 10.17171/1-8-2854 | dois | http://repository.edition-topoi.org/collection/ICG/object/3675 | nan | HTTPConnectionPool(host='repository.edition-topoi.org', port=80): Max retries exceeded with url: /collection/ICG/object/3675 (Caused by ConnectTimeoutError(<urllib3.connection.HTTPConnection object at 0x7f3577e66930>, 'Connection to repository.edition-topoi.org timed out. (connect timeout=3)')) |
261 | 10.24411/2312-8089-2020-10902 | dois | http://cyberdoi.ru/doi/10.24411/2312-8089-2020-10902 | nan | ('Connection aborted.', RemoteDisconnected('Remote end closed connection without response')) |
267 | 10.17614/q4td9p06n | dois | http://pqr.pitt.edu/mol/HJQMFSDWWCLFTC-TWJUVVLDSA-N | nan | HTTPConnectionPool(host='pqr.pitt.edu', port=80): Max retries exceeded with url: /mol/HJQMFSDWWCLFTC-TWJUVVLDSA-N (Caused by ConnectTimeoutError(<urllib3.connection.HTTPConnection object at 0x7f35833f6d80>, 'Connection to pqr.pitt.edu timed out. (connect timeout=3)')) |
362 | 10.14469/ch/1303 | dois | https://spectradspace.lib.imperial.ac.uk:8443/dspace/handle/10042/to-1328 | nan | HTTPSConnectionPool(host='spectradspace.lib.imperial.ac.uk', port=8443): Max retries exceeded with url: /dspace/handle/10042/to-1328 (Caused by ConnectTimeoutError(<urllib3.connection.HTTPSConnection object at 0x7f3577e38b90>, 'Connection to spectradspace.lib.imperial.ac.uk timed out. (connect timeout=3)')) |
383 | 10.14456/scitechasia.2022.12 | dois | http://doi.nrct.go.th/?page=resolve_doi&resolve_doi=10.14456/scitechasia.2022.12 | nan | HTTPSConnectionPool(host='doi.nrct.go.th', port=443): Max retries exceeded with url: /?page=resolve_doi&resolve_doi=10.14456/scitechasia.2022.12 (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1000)'))) |
397 | 10.17876/plate/dr.2/envelopes/201_50873 | dois | https://www.plate-archive.org/objects/dr.2/envelopes/201_50873 | nan | 500 Server Error: Internal Server Error for url: https://www.plate-archive.org/objects/dr.2/envelopes/201_50873/ |
400 | 10.23725/akhp-6959 | dois | https://ors.datacite.org/doi:/10.23725/akhp-6959 | nan | HTTPSConnectionPool(host='ors.datacite.org', port=443): Max retries exceeded with url: /doi:/10.23725/akhp-6959 (Caused by NameResolutionError("<urllib3.connection.HTTPSConnection object at 0x7f358383e450>: Failed to resolve 'ors.datacite.org' ([Errno -2] Name or service not known)")) |
403 | 10.58100/ibcr0381exz5001 | dois | http://dis.iodp.pangaea.de/BCRDIS/webview/CORES_INFO.aspx?SKEY=26882&SAM=IBCR0381EXZ5001 | nan | 503 Server Error: Service Unavailable for url: https://dis.iodp.pangaea.de/BCRDIS/webview/CORES_INFO.aspx?SKEY=26882&SAM=IBCR0381EXZ5001 |
434 | 10.58100/ibcr0364exxoa01 | dois | http://dis.iodp.pangaea.de/BCRDIS/webview/CORES_INFO.aspx?SKEY=26567&SAM=IBCR0364EXXOA01 | nan | 503 Server Error: Service Unavailable for url: https://dis.iodp.pangaea.de/BCRDIS/webview/CORES_INFO.aspx?SKEY=26567&SAM=IBCR0364EXXOA01 |
452 | 10.14469/ch/129258 | dois | https://spectradspace.lib.imperial.ac.uk:8443/dspace/handle/10042/134211 | nan | HTTPSConnectionPool(host='spectradspace.lib.imperial.ac.uk', port=8443): Max retries exceeded with url: /dspace/handle/10042/134211 (Caused by ConnectTimeoutError(<urllib3.connection.HTTPSConnection object at 0x7f3577e378c0>, 'Connection to spectradspace.lib.imperial.ac.uk timed out. (connect timeout=3)')) |
458 | 10.14469/ch/41814 | dois | https://spectradspace.lib.imperial.ac.uk:8443/dspace/handle/10042/48213 | nan | HTTPSConnectionPool(host='spectradspace.lib.imperial.ac.uk', port=8443): Max retries exceeded with url: /dspace/handle/10042/48213 (Caused by ConnectTimeoutError(<urllib3.connection.HTTPSConnection object at 0x7f3577e347a0>, 'Connection to spectradspace.lib.imperial.ac.uk timed out. (connect timeout=3)')) |
483 | 10.18730/12n7m$ | dois | https://glis.fao.org/glis/doi/10.18730/12N7M$ | nan | '10.18730/12n7m$' is not a correct resource identifier (e.g. a URL, DOI, Handle) |
496 | 10.14457/cmu.the.2009.132 | dois | http://doi.nrct.go.th/?page=resolve_doi&resolve_doi=10.14457/CMU.the.2009.132 | nan | HTTPSConnectionPool(host='doi.nrct.go.th', port=443): Max retries exceeded with url: /?page=resolve_doi&resolve_doi=10.14457/CMU.the.2009.132 (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1000)'))) |
501 | 10.14456/stj.2019.4 | dois | http://doi.nrct.go.th/?page=resolve_doi&resolve_doi=10.14456/stj.2019.4 | nan | HTTPSConnectionPool(host='doi.nrct.go.th', port=443): Max retries exceeded with url: /?page=resolve_doi&resolve_doi=10.14456/stj.2019.4 (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1000)'))) |
503 | 10.14457/kmutt.res.2010.25 | dois | http://doi.nrct.go.th/?page=resolve_doi&resolve_doi=10.14457/KMUTT.res.2010.25 | nan | HTTPSConnectionPool(host='doi.nrct.go.th', port=443): Max retries exceeded with url: /?page=resolve_doi&resolve_doi=10.14457/KMUTT.res.2010.25 (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1000)'))) |
505 | 10.14469/ch/175982 | dois | https://spectradspace.lib.imperial.ac.uk:8443/dspace/handle/10042/180406 | nan | HTTPSConnectionPool(host='spectradspace.lib.imperial.ac.uk', port=8443): Max retries exceeded with url: /dspace/handle/10042/180406 (Caused by ConnectTimeoutError(<urllib3.connection.HTTPSConnection object at 0x7f3583bcbe30>, 'Connection to spectradspace.lib.imperial.ac.uk timed out. (connect timeout=3)')) |
551 | 10.17876/plate/dr.2/plates/201_35722 | dois | https://www.plate-archive.org/objects/dr.2/plates/201_35722 | nan | 500 Server Error: Internal Server Error for url: https://www.plate-archive.org/objects/dr.2/plates/201_35722/ |
557 | 10.14457/mu.the.1999.140 | dois | http://doi.nrct.go.th/?page=resolve_doi&resolve_doi=10.14457/MU.the.1999.140 | nan | HTTPSConnectionPool(host='doi.nrct.go.th', port=443): Max retries exceeded with url: /?page=resolve_doi&resolve_doi=10.14457/MU.the.1999.140 (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1000)'))) |
625 | 10.17182/hepdata.60582.v1/t187 | dois | https://www.hepdata.net/record/61173 | nan | HTTPSConnectionPool(host='www.hepdata.net', port=443): Read timed out. (read timeout=10) |
639 | 10.58100/ibcr0364exf0601 | dois | http://dis.iodp.pangaea.de/BCRDIS/webview/CORES_INFO.aspx?SKEY=26688&SAM=IBCR0364EXF0601 | nan | 503 Server Error: Service Unavailable for url: https://dis.iodp.pangaea.de/BCRDIS/webview/CORES_INFO.aspx?SKEY=26688&SAM=IBCR0364EXF0601 |
680 | 10.58108/csrwa19919 | dois | https://rockstore.csiro.au/arrc/#/browsesamples/CSRWA19919 | nan | HTTPSConnectionPool(host='rockstore.csiro.au', port=443): Max retries exceeded with url: /arrc/ (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1000)'))) |
683 | 10.20379/dbaud-1041 | dois | http://webdatenbank.grass-medienarchiv.de/receive/ggrass_mods_00001019 | nan | 503 Server Error: Service Unavailable for url: https://webdatenbank.grass-medienarchiv.de/receive/ggrass_mods_00001019 |
690 | 10.17188/1312407 | dois | http://www.osti.gov/servlets/purl/1312407/ | nan | HTTPSConnectionPool(host='www.osti.gov', port=443): Max retries exceeded with url: /servlets/purl/1312407/ (Caused by ConnectTimeoutError(<urllib3.connection.HTTPSConnection object at 0x7f35837dcbf0>, 'Connection to www.osti.gov timed out. (connect timeout=3)')) |
708 | 10.48550/arxiv.2003.01181 | dois | https://arxiv.org/abs/2003.01181 | nan | HTTPSConnectionPool(host='arxiv.org', port=443): Read timed out. (read timeout=10) |
757 | 10.18730/q3s0= | dois | https://glis.fao.org/glis/doi/10.18730/Q3S0= | nan | '10.18730/q3s0=' is not a correct resource identifier (e.g. a URL, DOI, Handle) |
782 | 10.20372/nadre:1554185535.13 | dois | https://nadre.ethernet.edu.et/record/3238?ln=en | nan | HTTPSConnectionPool(host='nadre.ethernet.edu.et', port=443): Max retries exceeded with url: /record/3238?ln=en (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: certificate has expired (_ssl.c:1000)'))) |
816 | 10.14469/ch/90617 | dois | https://spectradspace.lib.imperial.ac.uk:8443/dspace/handle/10042/97675 | nan | HTTPSConnectionPool(host='spectradspace.lib.imperial.ac.uk', port=8443): Max retries exceeded with url: /dspace/handle/10042/97675 (Caused by ConnectTimeoutError(<urllib3.connection.HTTPSConnection object at 0x7f35834319a0>, 'Connection to spectradspace.lib.imperial.ac.uk timed out. (connect timeout=3)')) |
821 | 10.14456/apsr.2022.3 | dois | http://doi.nrct.go.th/?page=resolve_doi&resolve_doi=10.14456/apsr.2022.3 | nan | HTTPSConnectionPool(host='doi.nrct.go.th', port=443): Max retries exceeded with url: /?page=resolve_doi&resolve_doi=10.14456/apsr.2022.3 (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1000)'))) |
823 | 10.17171/1-9-1799-5 | dois | http://repository.edition-topoi.org/collection/MRMD/single/0047/13 | nan | HTTPConnectionPool(host='repository.edition-topoi.org', port=80): Max retries exceeded with url: /collection/MRMD/single/0047/13 (Caused by ConnectTimeoutError(<urllib3.connection.HTTPConnection object at 0x7f3583431700>, 'Connection to repository.edition-topoi.org timed out. (connect timeout=3)')) |
852 | 10.17171/1-13-16687 | dois | http://repository.edition-topoi.org/collection/ANCM/object/9341 | nan | HTTPConnectionPool(host='repository.edition-topoi.org', port=80): Max retries exceeded with url: /collection/ANCM/object/9341 (Caused by ConnectTimeoutError(<urllib3.connection.HTTPConnection object at 0x7f358383ecf0>, 'Connection to repository.edition-topoi.org timed out. (connect timeout=3)')) |
894 | 10.5287/bodleianjpcy.2 | dois | https://databank.ora.ox.ac.uk/ww1archives/datasets/ww1-3945?version=2 | nan | HTTPSConnectionPool(host='databank.ora.ox.ac.uk', port=443): Max retries exceeded with url: /ww1archives/datasets/ww1-3945?version=2 (Caused by ConnectTimeoutError(<urllib3.connection.HTTPSConnection object at 0x7f3583b33320>, 'Connection to databank.ora.ox.ac.uk timed out. (connect timeout=3)')) |
Table with unsupported repositories
netloc | count |
---|---|
pid.geoscience.gov.au | 103 |
app.geosamples.org | 79 |
doi.plutof.ut.ee | 60 |
www.gbif.org | 57 |
glis.fao.org | 30 |
www.e-periodica.ch | 26 |
ba.e-pics.ethz.ch | 22 |
dlc.library.columbia.edu | 19 |
bacdive.dsmz.de | 18 |
rgdoi.net | 16 |
digitallibrary.usc.edu | 14 |
www.ccdc.cam.ac.uk | 14 |
www.lfi.ch | 11 |
nakala.fr | 9 |
catalog.paradisec.org.au | 8 |
www.osti.gov | 8 |
sage.figshare.com | 8 |
digital.ucd.ie | 7 |
www.plate-archive.org | 7 |
doi.library.ubc.ca | 7 |
spectradspace.lib.imperial.ac.uk:8443 | 6 |
doi.nrct.go.th | 6 |
ntnu.tind.io | 6 |
www.die-bonn.de | 6 |
architekturmuseum.ub.tu-berlin.de | 6 |
dadosdepesquisa.fiocruz.br | 5 |
dis.iodp.pangaea.de | 5 |
publikationen.bibliothek.kit.edu | 5 |
digi.ub.uni-heidelberg.de | 5 |
straininfo.dsmz.de | 5 |
hdl.handle.net | 4 |
era.library.ualberta.ca | 4 |
data.neotomadb.org | 4 |
www.rvdata.us | 4 |
repositories.lib.utexas.edu | 3 |
apex.ipk-gatersleben.de | 3 |
www.boldsystems.org | 3 |
epos.myesr.org | 3 |
statisticaldatasets.data-planet.com | 3 |
journals.ub.uni-heidelberg.de | 3 |
ageconsearch.umn.edu | 3 |
doi.ala.org.au | 3 |
sr.ethz.ch | 3 |
www.hepdata.net | 3 |
repository.edition-topoi.org | 3 |
147.156.5.176:8080 | 2 |
ikee.lib.auth.gr | 2 |
biosys.e-pics.ethz.ch | 2 |
gdac.broadinstitute.org | 2 |
search.rads-doi.org | 2 |
d.lib.msu.edu | 2 |
cyberleninka.ru | 2 |
cocoon.huma-num.fr | 2 |
www.e-manuscripta.ch | 2 |
scholarworks.wm.edu | 2 |
pqr.pitt.edu | 2 |
bib-pubdb1.desy.de | 2 |
springernature.figshare.com | 2 |
doi.roper.center | 2 |
classiques-garnier.com | 2 |
viurrspace.ca | 2 |
core.tdar.org | 2 |
hasp.ub.uni-heidelberg.de | 2 |
www.e-gs.ethz.ch | 2 |
www.psycharchives.org | 1 |
underline.io | 1 |
www.sozialpolitik.ch | 1 |
proteindiffraction.org | 1 |
idb.ub.uni-tuebingen.de | 1 |
publica.fraunhofer.de | 1 |
ads.nipr.ac.jp | 1 |
data.caltech.edu | 1 |
www.worldpop.org.uk | 1 |
nsidc.org | 1 |
didomena.ehess.fr | 1 |
archaeologydataservice.ac.uk | 1 |
www.elibrary.ru | 1 |
cyberdoi.ru | 1 |
spiral.imperial.ac.uk | 1 |
opus.bibliothek.uni-wuerzburg.de | 1 |
www.tib.eu | 1 |
resolver.tudelft.nl | 1 |
daac.ornl.gov | 1 |
doi.ciser.cornell.edu | 1 |
journals.open.tudelft.nl | 1 |
tuprints.ulb.tu-darmstadt.de | 1 |
academiccommons.columbia.edu | 1 |
www.archaeolog.ru | 1 |
bl.iro.bl.uk | 1 |
dataservices.gfz-potsdam.de | 1 |
drops.dagstuhl.de | 1 |
boris.unibe.ch | 1 |
ruor.uottawa.ca | 1 |
encyclopedia.1914-1918-online.net | 1 |
theses.gla.ac.uk | 1 |
www.jamstec.go.jp | 1 |
epub.uni-regensburg.de | 1 |
www.icpsr.umich.edu | 1 |
ors.datacite.org | 1 |
campagnes.flotteoceanographique.fr | 1 |
www.e-rara.ch | 1 |
elib.spbstu.ru | 1 |
www.zora.uzh.ch | 1 |
archive.materialscloud.org | 1 |
ascomycete.org | 1 |
www.openagrar.de | 1 |
ojs.utlib.ee | 1 |
esdcdoi.esac.esa.int | 1 |
archiv.ub.uni-heidelberg.de | 1 |
archiviostorico.fondazione1563.it | 1 |
deepblue.lib.umich.edu | 1 |
www.repository.cam.ac.uk | 1 |
dlc.mpg.de | 1 |
rockstore.csiro.au | 1 |
webdatenbank.grass-medienarchiv.de | 1 |
arxiv.org | 1 |
mdsoar.org | 1 |
depositonce.tu-berlin.de | 1 |
rucore.libraries.rutgers.edu | 1 |
dataverse.callisto.calmip.univ-toulouse.fr | 1 |
www.openaccessrepository.it | 1 |
ap.elte.hu | 1 |
www.crd.york.ac.uk | 1 |
qatest.labarchives.com | 1 |
tecnoscienza.unibo.it | 1 |
databank.ora.ox.ac.uk | 1 |
data.oceannetworks.ca | 1 |
nadre.ethernet.edu.et | 1 |
ad.e-pics.ethz.ch | 1 |
resume.uni.lu | 1 |
www.bindingdb.org | 1 |
cdr.lib.unc.edu | 1 |
resolver.caltech.edu | 1 |
digitalcollection.zhaw.ch | 1 |
figshare.com | 1 |
cwm-archiv.gbv.de | 1 |
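A table like the one above could be produced by grouping the resolved URLs by their network location. The following is a minimal sketch of that idea, assuming the URLs are available as plain strings; `count_netlocs` is a hypothetical helper and not necessarily how the report was generated. The sample URLs are taken from the error table.

```python
from collections import Counter
from urllib.parse import urlsplit


def count_netlocs(urls):
    """Count URLs per network location (host plus optional port), most common first."""
    counts = Counter(urlsplit(url).netloc for url in urls)
    return counts.most_common()


# Sample URLs copied from the table of unexpected errors.
urls = [
    "https://spectradspace.lib.imperial.ac.uk:8443/dspace/handle/10042/to-8701",
    "https://spectradspace.lib.imperial.ac.uk:8443/dspace/handle/10042/to-1328",
    "http://pqr.pitt.edu/mol/KFKSYDSVYUWMHK-UHFFFAOYSA-N",
]

# Print rows in the same "netloc | count |" layout as the table.
for netloc, count in count_netlocs(urls):
    print(f"{netloc} | {count} |")
```

Note that `netloc` keeps an explicit port, which is why `spectradspace.lib.imperial.ac.uk:8443` appears with its port in the table.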