hgvs implement method to identify/describe transcripts regions for variant

implement method to identify/describe transcripts regions for variant

Open reece opened this issue 9 years ago • 1 comments

Originally reported by: Reece Hart (Bitbucket: reece, GitHub: reece)

e.g., given c. variant, return something like "exon 3", "intron 2", "exon 1..2", "5' UTR", {'exons': [1,2,3]}.

This needs a spec before implementing. The spec should provide a data model for representing the spans, then a way to turn that into something humans would read.

Considerations:

comparison overlaps v. contained in
output type: need human and machine usable (e.g., "exon 5..6" and {exon_start: 5, exon_end: 6})
Perhaps this is better implemented after refining the hdp interface types?
general function (e.g., which_region(var) -> dict) v. specific output (which_exon(var) -> int)? v. boolean (is_3p_UTR?) v. SO?

Bitbucket: https://bitbucket.org/biocommons/hgvs/issue/242

Jul 02 '15 18:07 reece

Thats would be useful information!

Our use case is information about all regions (exons introns and UTRs), which intersects with variant, so maybe something like that would work:

{
    "exons": [1,2,3],
    "introns": [1,2]
}

We just need the whole information about the regions, by specific transcript of a variant.

Feb 15 '22 11:02 Melodeiro

This issue is stale because it has been open 90 days with no activity. Remove stale label or comment or this will be closed in 7 days.

Feb 29 '24 01:02 github-actions[bot]

This issue was closed because it has been stalled for 7 days with no activity.

Mar 08 '24 01:03 github-actions[bot]

hgvs hgvs copied to clipboard

implement method to identify/describe transcripts regions for variant

hgvs
hgvs copied to clipboard