New term - totalReadCount

Open tucotuco opened this issue 3 months ago • 2 comments

Proposed attributes of the new term:

  • Term name (in lowerCamelCase for properties, UpperCamelCase for classes): totalReadCount
  • Term label (English, not normative): Total Read Count
  • Organized in Class (e.g., Occurrence, Event, Location, Taxon): NucleotideAnalysis
  • Definition of the term (normative): A total number of reads in a dwc:NucleotideAnalysis.
  • Usage comments (recommendations regarding content, etc., not normative):
  • Examples (not normative):
  • Refines (identifier of the broader term this term refines; normative):
  • Replaces (identifier of the existing term that would be deprecated and replaced by this term; normative):
  • ABCD 2.06 (XPATH of the equivalent term in ABCD or EFG; not normative):

tucotuco, Sep 11 '25 12:09

The term "read count" is too ambiguous and needs clarification. The value will vary significantly depending on the stage of data processing, and we've encountered this same issue previously when trying to record read count in organismQuantity. Please specify one of the following:

  • Raw reads – If this refers to reads as returned directly from the sequencing platform, state that explicitly.

  • Processed reads – If this refers to reads after quality assurance/quality control (QAQC), provide a clear reference to where the QAQC methodology is described. This is essential because QAQC procedures vary widely, and the endpoint chosen can dramatically affect the read count.

sformel, Nov 25 '25 13:11

Splitting this term into two terms (rawTotalReadCount and processedTotalReadCount) with clear definitions would address the above comments.
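
To illustrate how far the two counts can diverge, here is a minimal Python sketch. It assumes a plain four-line-per-record FASTQ with Phred+33 quality encoding and uses a mean-quality cutoff as a stand-in for QA/QC; the filter, the threshold values, and the helper names are hypothetical and are not part of the proposal. Only the two term names come from the suggestion above.

```python
from io import StringIO


def read_fastq(handle):
    """Yield (header, sequence, quality) from a 4-line-per-record FASTQ stream."""
    while True:
        header = handle.readline().rstrip("\n")
        if not header:
            return
        sequence = handle.readline().rstrip("\n")
        handle.readline()  # skip the '+' separator line
        quality = handle.readline().rstrip("\n")
        yield header, sequence, quality


def mean_phred(quality, offset=33):
    """Mean Phred score of a quality string (assumes Phred+33 encoding)."""
    return sum(ord(char) - offset for char in quality) / len(quality)


def count_reads(fastq_text, min_mean_quality=20):
    """Count reads before and after a hypothetical mean-quality filter."""
    records = list(read_fastq(StringIO(fastq_text)))
    raw = len(records)
    processed = sum(
        1
        for _, _, quality in records
        if quality and mean_phred(quality) >= min_mean_quality
    )
    return {"rawTotalReadCount": raw, "processedTotalReadCount": processed}


# Three toy reads: high quality ('I' = Q40), low quality ('!' = Q0), and mixed.
EXAMPLE = (
    "@read1\nACGT\n+\nIIII\n"
    "@read2\nACGT\n+\n!!!!\n"
    "@read3\nACGT\n+\nII!!\n"
)

print(count_reads(EXAMPLE, min_mean_quality=20))
print(count_reads(EXAMPLE, min_mean_quality=30))
```

The raw count stays the same while the processed count changes with the cutoff, which is why a processed count is only interpretable alongside a reference to the QA/QC method that produced it.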

miwa582, Nov 25 '25 14:11