Phishpedia
test data
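Hi Lin, there is only one test site in the test_sites folder. Where can I get more test data? [image: image] https://user-images.githubusercontent.com/113650779/278920380-388ac487-974c-4c03-b869-d0641b3923f0.png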
Hi Tjing, you can get live phishing feeds from OpenPhish: https://openphish.com/. Then crawl each site and save its screenshot as shot.png, its URL in info.txt, and its HTML source code in html.txt.
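For anyone following along, the file layout described above could be written out roughly like this. This is only a sketch: the helper name `save_test_site` is hypothetical and not part of Phishpedia, and naming each folder after the URL's hostname is an assumption. Fetching the page and taking the screenshot are left to whatever crawler you use.

```python
from pathlib import Path
from urllib.parse import urlparse


def save_test_site(root: str, url: str, html: str, screenshot_png: bytes) -> Path:
    """Write one crawled site into its own folder under `root`.

    Hypothetical helper; only the three file names come from the thread.
    """
    # Assumption: one folder per site, named after the URL's hostname.
    host = urlparse(url).hostname or "unknown_site"
    site_dir = Path(root) / host
    site_dir.mkdir(parents=True, exist_ok=True)

    # File names as given in the reply: info.txt, html.txt, shot.png.
    (site_dir / "info.txt").write_text(url, encoding="utf-8")
    (site_dir / "html.txt").write_text(html, encoding="utf-8")
    (site_dir / "shot.png").write_bytes(screenshot_png)
    return site_dir
```

The screenshot bytes and HTML could come from a headless browser, for example Selenium's `driver.get_screenshot_as_png()` and `driver.page_source`. Phishing URLs tend to go offline quickly, so it is probably best to crawl a feed soon after downloading it.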
I got it, thanks! I have another question: in the paper's section "Comparing Phishpedia with state-of-the-art baselines (RQ1)", which dataset was used for that evaluation?
Hi Tjing, I think we used the phish30k ( https://drive.google.com/file/d/12ypEMPRQ43zGRqHGut0Esq2z5en0DH4g/view?usp=sharing ) and benign30k ( https://drive.google.com/file/d/1yORUeSrF5vGcgxYrsCoqXcpOUHt-iHq_/view?usp=sharing ) datasets.