grammark icon indicating copy to clipboard operation
grammark copied to clipboard

Batch utility

Open yannis1962 opened this issue 7 years ago • 1 comments

dear Mr. Fullmer, would it be possible to use grammark as a batch utility placing a symbol (for example an asterisk) at every location where a grammatical error or imperfection has been spotted in a text? Or maybe a code indicating the type of error/imperfection? I would like to apply grammark to a large corpus of texts and this excludes the interactive approach. Thank you very very much in advance, Yannis Haralambous

yannis1962 avatar May 09 '18 06:05 yannis1962

Hi @yannis1962 , that's definitely possible, though if you're talking large corpus-sized processing, it might be better to do this in a non-web interface. Regardless, I think the main thing would be deciding things like:

  • What would be the preferred interface for importing texts?
  • How would any given researcher be able to define what symbols/codes would be associated with which items?
  • What would be the best output format for the processed texts? A single file? Multiple files?
  • What other kind of corpus-type statistics on the processed text should be provided?

I think the first step would be to add an "API" that abstracts the output of a single processed text from what is currently HTML markup, and instead provides the processed text in, say, JSON or XML format. Multiple files could then be concatenated together subsequently.

Would you be able to delineate in more detail answers to some of the questions, above?

markfullmer avatar May 09 '18 13:05 markfullmer