Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Inconsistent IRIs Between eccode and rhea Sources #349

Open
jplfaria opened this issue Feb 14, 2025 · 2 comments
Open

Inconsistent IRIs Between eccode and rhea Sources #349

jplfaria opened this issue Feb 14, 2025 · 2 comments

Comments

@jplfaria
Copy link
Contributor

Description:

I have observed that the eccode and the rhea sources use different IRI schemes for referencing Uniprot resources:

  • eccode example:
    Uses a Bioregistry URL, for example:
    https://bioregistry.io/uniprot:F4I907

  • rhea example:
    Uses a PURL URL, for example:
    http://purl.uniprot.org/uniprot/F4I907

Is the difference in Uniprot IRI usage between eccode and rhea intentional? I realize this isn't primarily a ROBOT discussion, but while merging multiple ontologies (including eccode and rhea) using ROBOT, I've noticed that ROBOT merge relies on exact IRI matching. This discrepancy might lead to duplicate or misaligned entities in the merged ontology. I’d appreciate any clarification on the cause of the discrepancy.

@cthoyt
Copy link
Member

cthoyt commented Feb 14, 2025

This might be because of several recent improvements to automatic URI assignment. I guess you're referring to some specific files in obo-db-ingest?

@jplfaria
Copy link
Contributor Author

jplfaria commented Feb 14, 2025

Your guess is correct – I should have made that clearer. I am working with the following sources:

  • rhea: https://w3id.org/biopragmatics/resources/rhea/rhea.owl.gz
  • eccode: https://w3id.org/biopragmatics/resources/eccode/eccode.owl

In these files, I also observed a discrepancy regarding EC URIs. For example, in ECCODE:

    <!-- https://bioregistry.io/eccode:1.1.1.n10 -->
    <owl:Class rdf:about="https://bioregistry.io/eccode:1.1.1.n10">
        <rdfs:subClassOf>
            <owl:Restriction>
                <owl:property rdf:resource="http://purl.obolibrary.org/obo/IAO_0100001"/>
                <owl:someValuesFrom rdf:resource="https://bioregistry.io/eccode:1.1.1.301"/>

Whereas in Rhea:

    <!-- https://www.rhea-db.org/rhea/10016 -->
    <owl:Restriction>
        <owl:onProperty rdf:resource="http://purl.obolibrary.org/obo/RO_0002333"/>
        <owl:someValuesFrom rdf:resource="https://www.ebi.ac.uk/intenz/query?cmd=SearchEC&amp;ec=3.1.1.49"/>
    </owl:Restriction>

Notably, the latter IRI (from Rhea) does not resolve. I understand there have been recent discussions regarding EC ontology issues in the Bioregistry GitHub, and while I’m not sure if this discrepancy falls within that scope, I’m reporting my findings since I actively work with these ontologies on my science project.

I appreciate all the hard done in maintaining these resources, they have become crucial to my research.

@jplfaria jplfaria changed the title Inconsistent Uniprot IRIs Between eccode and rhea Sources Inconsistent IRIs Between eccode and rhea Sources Feb 14, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants