bdaca
bdaca copied to clipboard
Course Materials Big Data and Automated Content Analysis
https://github.com/RaRe-Technologies/gensim/blob/develop/docs/notebooks/gensim_news_classification.ipynb
http://nbviewer.jupyter.org/github/JosPolfliet/pandas-profiling/blob/master/examples/meteorites.ipynb
als voorbeeld voor advanced statistics: causal inference package DoWhy https://github.com/Microsoft/dowhy
https://www.goodreads.com/book/show/16179186-scraping-for-journalists bradshaw book "scraping for journalists"
statsmodels tutorial ijntegreren maar ook dit hier voor unevenly spaced time series: https://traces.readthedocs.io/en/latest/examples.html#basic-analysis
for example https://becominghuman.ai/building-an-image-classifier-using-deep-learning-in-python-totally-from-a-beginners-perspective-be8dbaf22dd8
https://nlpforhackers.io/topic-modeling/
benoemen dat linux steeds populairder wordt en inmiddels ook binnen windows kan: https://blogs.msdn.microsoft.com/commandline/2017/10/11/whats-new-in-wsl-in-windows-10-fall-creators-update/
misschien leuke corpus voor oefening: https://freedom-to-tinker.com/2016/09/14/all-the-news-thats-fit-to-change-insights-into-a-corpus-of-2-5-million-news-headlines/
editors - [ ] emacs - [ ] vi - [ ] geany (iets als textwrangler/notepad++)