Open-Assistant
Open-Assistant copied to clipboard
Evaluate Detoxify to filter out unwanted prompts
Evaluate if unitaryai/detoxify could be used to automatically filter prompts (e.g. compute for all posts submitted to the db). Or whether it maybe could be used in a security-layer that filters input and output to a live assistant bot in production.
Please write a short report about your findings (or generate a ipynb), including the model sizes, GPU memory-requirements, inference performance, subjective opinion about the filtering quality (if possible provide some examples). Check if their license would allow us to use their model. Check how we could host the model (e.g. huggingface?).
I would like to take on this issue, my plan is to provide a ipynb notebook containing comparison of different models that includes:
- Inference speed and memory usage
- Training speed and memory usage
- Tests of detoxify on some inputs with different levels and kinds of toxic language
- The name of the license and main bulletpoints of it
- Information about hosting options
Maybe it seems that ideally, this would run on nvidia triton for fast inference right?
The notebook and a readme were posted in this pr: https://github.com/LAION-AI/Open-Assistant/pull/176 If anyone has any feedback or ideas how to expand this work feel free to contact me