Chengcheng Han issues

Results: 7 issues of Chengcheng Han

Regarding the validation and test set split of datasets like MultiArith/SVAMP

Hello, I see in your paper that for datasets like MultiArith/SVAMP, you randomly sampled 500 data points to serve as a validation set, with the rest as the test set....
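
The split described in this issue (a random sample of 500 examples as the validation set, the remainder as the test set) can be sketched as follows. This is only an illustrative sketch, not the paper's actual code: the function name `split_validation_test`, the fixed seed, and the placeholder dataset of 600 items are all assumptions made for the example.

```python
import random


def split_validation_test(examples, n_validation=500, seed=0):
    """Randomly hold out n_validation examples; the rest form the test set.

    Hypothetical helper: the name, the default seed, and the use of
    random.Random are assumptions, not taken from the paper's code.
    """
    if n_validation > len(examples):
        raise ValueError("n_validation exceeds the dataset size")
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    indices = list(range(len(examples)))
    rng.shuffle(indices)
    validation = [examples[i] for i in indices[:n_validation]]
    test = [examples[i] for i in indices[n_validation:]]
    return validation, test


# Placeholder data standing in for a real dataset (size chosen arbitrarily).
data = [f"problem-{i}" for i in range(600)]
validation, test = split_validation_test(data)
print(len(validation), len(test))
```

Fixing the seed matters here: without it, every run would draw a different validation set, and reported test numbers would not be comparable across runs.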

Performance of the model on gsm8k/SVAMP/MultiArith.

Thank you for your excellent project. I have conducted an evaluation of Flan-Alpaca-Base/Large/XL on the gsm8k/SVAMP/MultiArith datasets, and the evaluation results are as follows: | Model | gsm8k | MultiArith...

[Feature]: Add Personalization Based on User Preferences and Usage History

### Is your feature request related to a problem? Please describe.
Currently, OS-Copilot **does not** support personalization based on user preferences or usage history.
### Describe the solution you'd like...

enhancement

[Feature]: Enable Deployment of Different Models for Different Modules

### Is your feature request related to a problem? Please describe.
Currently, OS-Copilot **does not** support deploying different models for different modules.
### Describe the solution you'd like
We hope...

enhancement

[Feature]: Extend OS-Copilot Compatibility to Windows Systems

### Is your feature request related to a problem? Please describe.
The current OS-Copilot **cannot** run on Windows systems.
### Describe the solution you'd like
We hope to extend OS-Copilot...

enhancement

good first issue

[Feature]: Add Interactive Mode for Multi-Turn Conversations in OS-Copilot

### Is your feature request related to a problem? Please describe.
Currently, OS-Copilot **does not** support multi-turn conversations.
### Describe the solution you'd like
We hope OS-Copilot can add an...

enhancement

[Feature]: Add Human-in-the-Loop Functionality to OS-Copilot

### Is your feature request related to a problem? Please describe.
Currently, OS-Copilot **does not** support human-in-the-loop interactions, which are necessary for confirming potentially dangerous operations and for entering user...

enhancement