self-instruct
self-instruct copied to clipboard
[How to] Generate dataset from pdf ?
I have my data in a bundle of pdf, documents, etc. Is there any way to extract data from them and generate instruction dataset using self-instruct?