SequenceRange definition

Open AlasdairGray opened this issue 6 years ago • 3 comments

The new type SequenceRange needs a descriptive definition.

AlasdairGray avatar Feb 08 '19 14:02 AlasdairGray

TL;DR: I strongly suggest simply copying the definition of faldo:Region.

SequenceRange needs to be redefined: it conflates a number of properties and concepts, so the data requires human parsing and cannot be integrated automatically. The proposed modelling relies on parsing literal text values, which means tools such as reasoners cannot use this data. One has to combine an integer property and a text property to express a single piece of knowledge.

Secondly, it assumes that every annotation is a region, and it cannot describe features on a single amino acid or nucleotide (or features that lie between two of them).

Thirdly, because the position and end are plain integers, one cannot state very important things such as "this active site is at the same position as that PTM". Deducing this requires reasoning, and human reasoning at that. The model makes it impossible to state this explicitly.
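To illustrate the point, here is a minimal sketch in plain Python of what changes when a position is a node in its own right rather than a bare integer. The `ex:` identifiers and the `features_at` helper are hypothetical and exist only for this example; the `faldo:` terms (`ExactPosition`, `position`, `reference`, `location`) are taken from the FALDO vocabulary. It is not a proposal for the final Bioschemas modelling.

```python
# Triples are (subject, predicate, object) tuples; all ex: names are made up.
FALDO = "http://biohackathon.org/resource/faldo#"

triples = {
    # One position node, described once.
    ("ex:pos42", "rdf:type", FALDO + "ExactPosition"),
    ("ex:pos42", FALDO + "position", 42),
    ("ex:pos42", FALDO + "reference", "ex:proteinP1"),
    # Two annotations point at the very same node.
    ("ex:activeSite", FALDO + "location", "ex:pos42"),
    ("ex:ptm", FALDO + "location", "ex:pos42"),
}


def features_at(position_node, graph):
    """Return the features whose faldo:location is the given position node."""
    return sorted(
        s for (s, p, o) in graph
        if p == FALDO + "location" and o == position_node
    )


colocated = features_at("ex:pos42", triples)
```

Because both annotations share `ex:pos42`, the statement "same position" is explicit in the data and needs no comparison of integers or knowledge of which sequence each integer refers to.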

Also, strandedness, which is critical for using this with DNA, is lost. Nor can we describe binding between sequences, and it is not possible to attach evidence to an endpoint annotation.
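As a rough sketch of how strandedness can be carried by the position itself, FALDO lets a position be typed as `faldo:ForwardStrandPosition` or `faldo:ReverseStrandPosition` in addition to `faldo:ExactPosition`. The `ex:` identifiers, the coordinate, and the `strand_of` helper below are assumptions made up for this example.

```python
# Triples are (subject, predicate, object) tuples; all ex: names are made up.
FALDO = "http://biohackathon.org/resource/faldo#"

# Map the FALDO strand classes to the conventional +/- symbols.
STRAND_CLASSES = {
    FALDO + "ForwardStrandPosition": "+",
    FALDO + "ReverseStrandPosition": "-",
}

triples = {
    ("ex:geneStart", "rdf:type", FALDO + "ExactPosition"),
    ("ex:geneStart", "rdf:type", FALDO + "ReverseStrandPosition"),
    ("ex:geneStart", FALDO + "position", 1200),
    ("ex:geneStart", FALDO + "reference", "ex:chr1"),
}


def strand_of(node, graph):
    """Return '+' or '-' if the node has a strand type, otherwise None."""
    for s, p, o in graph:
        if s == node and p == "rdf:type" and o in STRAND_CLASSES:
            return STRAND_CLASSES[o]
    return None


strand = strand_of("ex:geneStart", triples)
```

A pair of plain integers has nowhere to put this information, whereas a typed position node carries it without any extra text property.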

In general I am of the (biased) opinion that you should just have a reduced set of FALDO here, and that we can state in the schemata document how the two logically relate.

The use cases of FALDO extend beyond what Bioschemas wants to do, so cutting it down would be a reasonable approach. The current modelling does not solve what our community needs at a basic level, so major adaptations are needed.

JervenBolleman avatar Feb 14 '19 08:02 JervenBolleman

This is related to #468

AlasdairGray avatar Jun 11 '21 15:06 AlasdairGray

I would suggest releasing the first draft as it is right now, since that is what was agreed by those working on sequence annotation at the time (maybe the first or second BioHackathon Europe). We can mimic FALDO later. I also suggest getting some feedback from the community; if there is not much feedback, we will go with @JervenBolleman's suggestions only.

ljgarcia avatar Jun 30 '21 15:06 ljgarcia