Chengcheng Han
Chengcheng Han
Hello, I see in your paper that for datasets like MultiArith/SVAMP, you randomly sampled 500 data points to serve as a validation set, with the rest as the test set....
Thank you for your excellent project. I have conducted an evaluation of Flan-Alpaca-Base/Large/XL on the gsm8k/SVAMP/MultiArith datasets, and the evaluation results are as follows: | Model | gsm8k | MultiArith...
### Is your feature request related to a problem? Please describe. Currently, OS-Copilot **does not** support personalization based on user preferences or usage history. ### Describe the solution you'd like...
### Is your feature request related to a problem? Please describe. Currently, OS-Copilot **does not** support deploying different models for different modules. ### Describe the solution you'd like We hope...
### Is your feature request related to a problem? Please describe. The current OS-Copilot **cannot** run on Windows systems. ### Describe the solution you'd like We hope to extend OS-Copilot...
### Is your feature request related to a problem? Please describe. Currently, OS-Copilot **does not** support multi-turn conversations. ### Describe the solution you'd like We hope OS-Copilot can add an...
### Is your feature request related to a problem? Please describe. Currently, OS-Copilot **does not** support human-in-the-loop interactions, which are necessary for confirming potentially dangerous operations and for entering user...