-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GPAD 1.2 to GPAD 2.0 seems to make an error with some relations #397
Comments
https://github.com/biolink/ontobio/blob/4abdd9fb83c0789f3f7400170969431f528c82a1/ontobio/rdfgen/relations.py#L128 -- has the reference to the outdated RO map we use. It could be happening when we parse GAF and use it to create GPAD: https://github.com/biolink/ontobio/blob/4abdd9fb83c0789f3f7400170969431f528c82a1/ontobio/io/gafparser.py#L522 Or somewhere else that makes association objects for GPAD: https://github.com/biolink/ontobio/blob/4abdd9fb83c0789f3f7400170969431f528c82a1/ontobio/model/association.py#L445 |
short term, an update to relations.py hard-coded dict might fix this? longer term, we may want to consider storing the mapping outside the code. |
@sierra-moxon I would agree with both of those points. (I missed the @dustine32 , it looks like you've been most active in there. Have you been handling that manually, or have you had some other process? |
@kltm - also, can you confirm your example is using the same "row"/annotation in both the 1.2 and 2.0 format? I put in a PR, and am writing a test for it. |
@sierra-moxon Apologies--copy-paste mistake. I've updated the initial issue above to the correct lines: #397 (comment) |
@kltm Yes, I've just been manually updating this hard-coding. It would be nice to eventually read in RO and do simple label->ID lookups. I think the initial impetus for this I'm guessing there must've been an update to these relations in RO and @sierra-moxon's PR brings it back up-to-date. |
Unrelated, but FYI - Ontobio also repairs relations via GORULE61 in qc.py |
The thing is, this relation is valid in RO, and the inference chain is correct. However we have decided that we dont want to use this in GO. I had added subsets in RO to express which relations were valid for GO (annotations, extensions, ontology...) but @cmungall remarked that we shouldn't have information specific to a project in a 'general'/broader application ontology, which makes sense. My question is, where can we put this information, since RO is not sufficient? I see a few options:
There must be a documented source of truth for what's hard-coded in the code. Thanks, Pascale |
For my understanding, is GOREL curated or otherwise automatically maintained as the source of truth for relations applicable in GO? From this file: https://github.com/geneontology/go-ontology/blob/master/src/ontology/extensions/gorel.obo I see that all terms in GOREL have an xref to RO. Can I use that with confidence to do this same work in the code (but use the GOREL ontology instead of the map in the code)? If that is the case, I don't think it would be terribly hard to convert. |
Okay, I want to separate this into two issues: 1, here) getting the immediate fix in, which @sierra-moxon has done with the PR above; 2) a long-term solution to make sure these are in sync in an automated way. The latter conversation should be had here now: #398 |
GPAD "1.2" input:
Produces the GPAD 2.0 output:
Of specific interest to us is that, in the extensions, it looks like
directly_negatively_regulates
is getting translated toRO:0002449
, instead of the expectedRO:0002630
. Taking a quick look around the ontobio code, while I could find some hardwired stuff in rdfgen/, I didn't see anything that would obviously produce this.My instinct is that there would either be an old hard-coded cache or the use of a stale ontology file. I'm also thinkging there might some composition logic going on with enables/.
This is possibly going to be the GO "shadow ticket" for a future ontobio ticket.
Tagging @dustine32 @mugitty @sierra-moxon , as people who probably have the most recent familiarity with the extensions transformation.
Also looping in reporters: @balhoff @vanaukenk @pgaudet
The text was updated successfully, but these errors were encountered: