crowsetta
crowsetta copied to clipboard
A tool to work with any format for annotating vocalizations
as in https://github.com/NickleDave/vak/issues/458 script will need to parse yaml and then update: * version number * commit associated with release * DOI ```yaml identifiers: - type: doi value: 10.5281/zenodo.5828090 ......
https://www.sciencedirect.com/science/article/pii/S1574954115000151 http://taylor0.biology.ucla.edu/birdDBQuery/
https://buckeyecorpus.osu.edu/php/faq.php
Our using pandera to validata dataframes really adds to the number of things that get installed when you install crowsetta, largely because pandera depends on pydantic This makes it it...
We should add support for [chipper](https://github.com/CreanzaLab/chipper) from Creanza lab Looks like pykanto implements a function to parse, adapt and cite: https://github.com/nilomr/pykanto/blob/3ecaeb9747720d19afa10d1183112d06ea721680/pykanto/utils/custom.py#L104
I want to be able to just write `crowsetta.list_formats`, a top-level function ("flat is better than nested"). And the output should be a table with columns ("shorthand string name", "full...
Would be good to explain the logic of representing a set of annotations loaded from a file as a (data)class, and how each class that corresponds to an annotation format...
Right now if we just do something like ```python >>> import crowsetta >>> birdsongrec = crowsetta.example('Annotation.xml') >>> birdsongrec ``` the auto-generated `__repr__` provided by `attrs` will dump out every annotation...
Both the [tutorial](https://crowsetta.readthedocs.io/en/latest/tutorial.html) and this [walkthrough](https://crowsetta.readthedocs.io/en/latest/howto/munging-annotations-textgrid.html) use a number of annotation files from different datasets that we don't want to add to the package and increase its size But it's...