Jose Lopez
Same problem here. The documentation doesn't say anything about the JSON format expected when using CHAT_CONVERSATIONAL_REACT_DESCRIPTION, or at least it isn't obvious where to find it. Any...
I have generated a .csv with the first 1K rows of alpaca_data.json using gpt-3.5. This is an example of the table generated after checking the first 1K rows of...
@tloen, please let me know if this is relevant here. The main goal is to improve the quality of the data and measure the impact of that "automatic" task on...
I understand. I had my doubts as well, but this was a good opportunity to check if an automatic script could work. If you know where it can be useful,...
That could work: make several attempts for the same input and take the most common answer. In my script, I used temp = 0, but...
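The "several attempts, keep the most common answer" idea can be sketched roughly as below. This is a minimal illustration, not the script from the repo: `most_common_answer` and the `ask` callable are hypothetical names, and `ask` stands in for whatever function sends the prompt to the model and returns its reply. Voting like this is only likely to help when sampling with a temperature above 0, since at 0 repeated calls tend to return the same answer.

```python
from collections import Counter
from typing import Callable


def most_common_answer(ask: Callable[[str], str], prompt: str, attempts: int = 5) -> str:
    """Call `ask` several times with the same prompt and return the majority reply.

    `ask` is a hypothetical stand-in for the real model call.
    """
    # Collect one reply per attempt, trimming whitespace so that
    # "correct" and "correct\n" count as the same answer.
    answers = [ask(prompt).strip() for _ in range(attempts)]
    # most_common(1) returns [(answer, count)] for the top answer;
    # on a tie, the answer seen first wins.
    return Counter(answers).most_common(1)[0][0]
```

An odd number of attempts reduces the chance of ties when the model is choosing between two labels.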
The price was really low. Yesterday I tested thousands of rows several times (I can't say exactly how many because I was testing and debugging) and it cost $1.29 in...
Regarding GPT-4 prices: it is x30 the price for the completion (which is the heavy part here) and x10 the price for the prompt. For the 50K samples, I think it...
Ok, I'll close this Pull Request, then. If anyone is interested, the repo is https://github.com/josemlopez/check-with-gpt. I'll continue the cleaning there to measure the impact on the performance with...
I'm working on adding more Spanish samples to the dataset to improve performance in Spanish. Is there any rule of thumb for the number of samples to include for having...
Thanks, Daniel! This is very interesting. I'll follow your leads and share some of my insights here.