personality-prediction

Where the essays dataset came from

Open MiaZF opened this issue 3 years ago • 2 comments

Hi, thanks for sharing this. It helped me a lot.

As stated in your paper "In psychometric personality trait assessments, personality is measured in continuous scores, yet the available benchmark datasets mostly provide personality traits scores in artificially binned form only. Future studies should aim to use datasets that provide continuous scores on personality traits".

Following the citation in your paper, I read another paper, "Linguistic styles: Language use as an individual difference". However, that paper does not provide the original data. According to its description, personality is supposed to be measured in continuous scores, yet the essays dataset records each personality trait only as yes or no. I was wondering where the essays dataset came from and whether the original dataset uses continuous scores to measure personality.

Thank you so much! Looking forward to your reply!

MiaZF, Dec 08 '21 08:12

Glad to hear that the code was of use, and apologies for our delayed response to your questions. The essays dataset was downloaded from mypersonality.org: https://sites.google.com/michalkosinski.com/mypersonality. However, they have now decided to stop sharing the dataset with scholars. The way to obtain the dataset now would be to get in touch with the original authors, i.e., Pennebaker and King [1999].

yashsmehta, May 13 '22 22:05

Thank you so much.

MiaZF, Jun 21 '22 02:06

Closing this issue now! Please re-open if anything is unclear.

yashsmehta, Mar 18 '23 16:03