Gareth Davidson

Results 115 comments of Gareth Davidson

I would guess there's several parts, which might be best done as separate tickets to keep the work nice and small 1. Developers 2. GitHub contributors 3. Data contributors 4....

It may be worth looking through the data for `# comments` and fixing those. I don't think we lose much by doing that do we?

I guess looking through the data when it's out and writing something that will identify how big the issue is. Another option would be to add "media-type" to the metadata...

Sounds good :) Steps to add a dataset are documented here: https://github.com/LAION-AI/Open-Assistant/blob/main/openassistant/datasets/README.md How are your Python skills? The tricky part if you don't have Python skills will be taking the...

I can't see any spreadsheet attached here. Can you attach it so I can have a peek at it please?

> In the meantime, can someone help me to format the data into a usable dataset? Okay so I downloaded it and had a look through it. I guess we...

Should this apply to other pages in general? Like we can't view individual messages without signing in.

Converted to draft for now, since some functions have been removed. I'll update in the next day or two

I changed the tests to do a simple test of the new cosine similarity functions, they're not great but still better than nothing.

It'd be pretty tough and depend on the context more than anything. Lots of languages are based on C and so look pretty similar, there are tells but the context...