gpt-2-output-dataset

Dataset of GPT-2 outputs for research in detection, biases, and more

31 gpt-2-output-dataset issues, sorted by most recently updated

I get the following error after running:
```
pip install -r requirements.txt
python -m detector.server detector-base.pt
```
Error:
```
RuntimeError: Error(s) in loading state_dict for RobertaForSequenceClassification: Missing key(s)...
```
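Missing-key errors like this often come from loading the checkpoint with a `transformers` version newer than the one it was saved with. A minimal diagnostic sketch, assuming the checkpoint stores its weights under a `model_state_dict` key (an assumption about the checkpoint layout; adjust if yours differs):

```python
# Diagnostic sketch, not the repository's code: load the checkpoint
# non-strictly and print which keys failed to match, which usually
# points to a transformers version mismatch.
import torch
from transformers import RobertaForSequenceClassification

model = RobertaForSequenceClassification.from_pretrained('roberta-base')
data = torch.load('detector-base.pt', map_location='cpu')
# 'model_state_dict' is an assumed key; fall back to the raw dict.
state_dict = data.get('model_state_dict', data)
missing, unexpected = model.load_state_dict(state_dict, strict=False)
print('missing keys:', missing)
print('unexpected keys:', unexpected)
```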

Can I use this project for commercial purposes? I want to create a website that classifies whether a given text was generated by the GPT-2 model or written by...

I modified the script to use data classes, JSON serialization, and the `tqdm` library, ensuring a seamless and informative data download process. It also offers options to specify data sizes,...
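As a rough illustration of the pattern this describes (the dataclass fields, file names, and manifest layout here are hypothetical, not the actual PR's code):

```python
# Hypothetical sketch: a dataclass per downloadable file, a JSON-serialized
# manifest, and a tqdm progress bar during download.
import json
from dataclasses import dataclass, asdict

import requests
from tqdm import tqdm

@dataclass
class DatasetFile:
    name: str
    size: int  # expected size in bytes

def download(file: DatasetFile, base_url: str) -> None:
    resp = requests.get(f'{base_url}/{file.name}', stream=True)
    resp.raise_for_status()
    with open(file.name, 'wb') as out, tqdm(
        total=file.size, unit='B', unit_scale=True, desc=file.name
    ) as bar:
        for chunk in resp.iter_content(chunk_size=8192):
            out.write(chunk)
            bar.update(len(chunk))

# The manifest round-trips through JSON for reproducibility.
manifest = [DatasetFile('webtext.test.jsonl', 0)]
print(json.dumps([asdict(f) for f in manifest]))
```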

There is an assignment error in the train.py script: after the assignment, the loss and logits end up with type 'str' and hence have to be updated....
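This symptom typically appears when a newer `transformers` version returns a `ModelOutput` (a dict subclass) from the forward pass: tuple-unpacking it iterates over its string keys, so `loss` and `logits` become the literal strings `'loss'` and `'logits'`. A minimal sketch of the fix, under that assumption:

```python
# Minimal sketch, assuming the 'str' types come from tuple-unpacking a
# ModelOutput (a dict subclass whose iteration yields its string keys).
import torch
from transformers import RobertaForSequenceClassification, RobertaTokenizer

tokenizer = RobertaTokenizer.from_pretrained('roberta-base')
model = RobertaForSequenceClassification.from_pretrained('roberta-base')

batch = tokenizer(['example text'], return_tensors='pt')
labels = torch.tensor([0])

output = model(**batch, labels=labels)
# Buggy pattern: `loss, logits = model(...)` yields the strings
# 'loss' and 'logits' on recent transformers versions.
loss, logits = output.loss, output.logits
```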

Error text includes: OpenAI error. That model is currently overloaded with other requests. You can retry your request, or contact us through our help center at help.openai.com if the error...
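Since the error message itself suggests retrying, one common workaround is retrying with exponential backoff. A minimal sketch; `request_fn` is a hypothetical stand-in for whatever call is hitting the API:

```python
# Retry-with-backoff sketch for transient 'model overloaded' errors.
# request_fn is a hypothetical placeholder, not part of this repository.
import random
import time

def with_retries(request_fn, max_attempts: int = 5):
    for attempt in range(max_attempts):
        try:
            return request_fn()
        except Exception:
            if attempt == max_attempts - 1:
                raise
            # Exponential backoff with jitter before the next attempt.
            time.sleep((2 ** attempt) + random.random())
```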

Tested with a second ChatGPT sample, and the detection result does not match the server's. The test result of https://openai-openai-detector.hf.space/ ![image](https://user-images.githubusercontent.com/54778084/211030406-85c0330f-a52b-45e3-8b4d-97a22e8c132d.png) Test result with the `roberta-base` model on localhost ![image](https://user-images.githubusercontent.com/54778084/211030544-70d68d76-1676-483f-a482-e29d7682fecf.png) Test...
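To compare apples to apples, it helps to inspect the local server's raw JSON response for the exact same input string. A minimal sketch; the query-string interface mirrors the repository's detector/server.py, but the port and exact response fields are assumptions:

```python
# Sketch: fetch the local detector's raw JSON output so it can be
# compared against the hosted demo on the same input. Port 8080 and
# the response schema are assumptions; adjust for your setup.
import json
import urllib.parse
import urllib.request

text = 'paste the same sample you gave the hosted demo here'
url = 'http://localhost:8080/?' + urllib.parse.quote(text)
with urllib.request.urlopen(url) as resp:
    print(json.load(resp))
```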

This PR:
* makes sure the download script doesn't clobber any existing files if they seem correct enough (same size as the remote)
* ensures 404 and similar errors don't get written...
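A sketch of the two behaviors described above (my own illustration, not the PR's actual code): skip files whose size already matches the remote, and fail on HTTP errors instead of writing an error page to disk.

```python
# Illustration of the PR's described behaviors, not its actual diff.
import os

import requests

def safe_download(url: str, dest: str) -> None:
    head = requests.head(url, allow_redirects=True)
    head.raise_for_status()
    remote_size = int(head.headers.get('Content-Length', -1))
    # Don't clobber a file that already matches the remote size.
    if os.path.exists(dest) and os.path.getsize(dest) == remote_size:
        return
    resp = requests.get(url, stream=True)
    resp.raise_for_status()  # a 404 raises here instead of being saved
    with open(dest, 'wb') as f:
        for chunk in resp.iter_content(chunk_size=8192):
            f.write(chunk)
```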

Some of the training data (specifically, the GPT-2-generated datasets) contains texts of length 0. This causes training (and would cause inference) to error out. Is this expected? Please see...
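A minimal workaround sketch, assuming the dataset's JSONL layout with a `text` field per record: filter out zero-length texts before they reach training.

```python
# Workaround sketch: skip empty texts when reading the JSONL files.
# The 'text' field name is an assumption about the record layout.
import json

def iter_nonempty(path: str):
    with open(path) as f:
        for line in f:
            record = json.loads(line)
            if record.get('text'):  # drop empty or missing texts
                yield record
```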

I see that GPT-2 was trained on WebText, but I'm not sure how the datasets here were generated. Specifically, what prompt was used with GPT-2 to generate the "fake" datasets?