
In-progress implementation of a GATO-style generalist multimodal model capable of image, text, RL, and robotics tasks

Results 47 NEKO issues

For the text task, once we have multiple datasets, the concatenation strategy could be replaced with more sophisticated logic based on Hugging Face dataset concatenation. Further, we may wish to change the...

enhancement
good first issue
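
To illustrate the concatenation question raised in the text-task issue above, here is a minimal sketch in plain Python of two common ways to combine several text datasets: back-to-back concatenation and round-robin interleaving. The function names and sample data are hypothetical and are not taken from the NEKO codebase. The Hugging Face `datasets` library offers `concatenate_datasets` and `interleave_datasets` for the same purpose; this sketch does not call them and may differ from their behavior, for example in how it treats datasets of unequal length.

```python
from itertools import chain, zip_longest


def concatenate(*datasets):
    """Back-to-back concatenation: all of the first dataset, then the next, and so on."""
    return list(chain.from_iterable(datasets))


def interleave(*datasets):
    """Round-robin interleaving; shorter datasets drop out once exhausted."""
    missing = object()  # sentinel so that None stays a legal sample value
    return [
        sample
        for group in zip_longest(*datasets, fillvalue=missing)
        for sample in group
        if sample is not missing
    ]


# Hypothetical stand-ins for two tokenized text datasets.
wiki = ["w0", "w1", "w2"]
code = ["c0", "c1"]

print(concatenate(wiki, code))  # ['w0', 'w1', 'w2', 'c0', 'c1']
print(interleave(wiki, code))   # ['w0', 'c0', 'w1', 'c1', 'w2']
```

Concatenation keeps each source contiguous, so a shuffle step is usually needed afterwards. Interleaving mixes sources from the first sample onward.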

Building on #40, capture:
- original analysis document
- updated analysis of storage and compute requirements
- process across Manifold for managing resources (can be project driven or org driven)

Context: We currently know what datasets we are using, but we are trying to centralize this knowledge into the survey [spreadsheet](https://docs.google.com/spreadsheets/d/1bvoS75q101-uUYBiOWZRZvPDlYM8EegaItjkFkZrUwQ/edit?usp=sharing). Output: Redact and know what datasets we are currently...

Context: We want to profile existing multimodal models we picked in #62 and evaluate them on the benchmark we are proposing. Output: Reported model performance on benchmark

@jsjung00 is catching up on context from the control modality work with help from @daniellawson9999. We want to rapidly understand which datasets are needed next, as we are finalizing the v0...

Similar to #59, we want to understand next steps: both brief context on small-scale issues (e.g. bugs) and the next major steps.

As mentioned in [Source MiniGrid/BabyAI Dataset#14](https://github.com/ManifoldRG/NEKO/issues/14), once a dataset is sourced, it needs to be converted to Minari. GoToLocal expert trajectories have already been uploaded to [Google Drive](https://drive.google.com/drive/u/0/folders/1630PKrrNtVKAzd5rF_mabsDasVe2hrRc). Now it...

Audio may be a more generally useful modality to train a large multimodal model on. We want to understand what datasets are available for this modality. Outcome: Dataset Investigation Analysis,...