learning
learning copied to clipboard
A log of things I'm learning
learning
Developing T-shaped skills by building strong core skills and adjacent skills a little bit everyday.
Strengthen generic core skills
Databases
- [X] Udacity: Intro to relational database
- [ ] Udacity: Database Systems Concepts & Design
- [ ] Datacamp: Database Design
- [X] Codecademy: SQL Track
- [X] Datacamp: Intro to SQL for Data Science
- [ ] Datacamp: Intermediate SQL
- [X] Datacamp: Querying with TransactSQL
- [ ] Datacamp: Joining Data in PostgreSQL
- [ ] Udacity: SQL for Data Analysis
- [ ] Datacamp: Exploratory Data Analysis in SQL
- [ ] Datacamp: Applying SQL to Real-World Problems
- [ ] Datacamp: Analyzing Business Data in SQL
- [ ] Datacamp: Reporting in SQL
- [ ] Datacamp: Data-Driven Decision Making in SQL
- [ ] Datacamp: Introduction to MongoDB in Python
- [ ] Datacamp: Introduction to Databases in Python
Linux
- [X] Article: Streamline your projects using Makefile
- [X] Article: Understand Linux Load Averages and Monitor Performance of Linux
- [X] Article: Command-line Tools can be 235x Faster than your Hadoop Cluster
- [X] Calmcode: makefiles
- [X] Calmcode: entr
- [X] Codecademy: Learn the Command Line
- [X] Datacamp: Introduction to Shell for Data Science
- [X] Datacamp: Introduction to Bash Scripting
- [X] Datacamp: Data Processing in Shell
- [ ] MIT: The Missing Semester of CS Education
- [X] Lecture 1: Course Overview + The Shell (2020)
0:48:16 - [ ] Lecture 2: Shell Tools and Scripting (2020)
0:48:55 - [ ] Lecture 3: Editors (vim) (2020)
0:48:26 - [ ] Lecture 4: Data Wrangling (2020)
0:50:03 - [ ] Lecture 5: Command-line Environment (2020)
0:56:06 - [ ] Lecture 6: Version Control (git) (2020)
1:24:59 - [ ] Lecture 7: Debugging and Profiling (2020)
0:54:13 - [ ] Lecture 8: Metaprogramming (2020)
0:49:52 - [ ] Lecture 9: Security and Cryptography (2020)
1:00:59 - [ ] Lecture 10: Potpourri (2020)
0:57:54 - [ ] Lecture 11: Q&A (2020)
0:53:52
- [X] Lecture 1: Course Overview + The Shell (2020)
- [ ] Thoughtbot: Mastering the Shell
- [ ] Thoughtbot: tmux
- [X] Udacity: Linux Command Line Basics
- [X] Udacity: Shell Workshop
- [X] Udacity: Configuring Linux Web Servers
- [ ] Web Bos: Command Line Power User
- [ ] Youtube: GNU Parallel
Project Structure
- [X] Article: pydantic
- [ ] Article: Organizing machine learning projects: project management guidelines
- [ ] Article: Logging and Debugging in Machine Learning - How to use Python debugger and the logging module to find errors in your AI application
- [ ] Article: Best practices to write Deep Learning code: Project structure, OOP, Type checking and documentation
- [X] Article: Configuring Google Colab Like A Pro
- [X] Article: Stop using print, start using loguru in Python
- [ ] Article: Hypermodern Python
- [ ] Article: Hypermodern Python Chapter 2: Testing
- [ ] Article: Hypermodern Python Chapter 3: Linting
- [ ] Article: Hypermodern Python Chapter 4: Typing
- [ ] Article: Hypermodern Python Chapter 5: Documentation
- [ ] Article: Hypermodern Python Chapter 6: CI/CD
- [ ] Article: Push and pull: when and why to update your dependencies
- [ ] Article: Reproducible and upgradable Conda environments: dependency management with conda-lock
- [ ] Article: Options for packaging your Python code: Wheels, Conda, Docker, and more
- [X] Article: Making model training scripts robust to spot interruptions
- [X] Calmcode: logging
- [X] Calmcode: tqdm
- [X] Calmcode: virtualenv
- [ ] Coursera: Structuring Machine Learning Projects
- [ ] Doc: Python Lifecycle Training
- [X] Datacamp: Introduction to Data Engineering
- [X] Datacamp: Conda Essentials
- [ ] Datacamp: Conda for Building & Distributing Packages
- [X] Datacamp: Software Engineering for Data Scientists in Python
- [X] Datacamp: Designing Machine Learning Workflows in Python
- [X] Datacamp: Object-Oriented Programming in Python
- [ ] Datacamp: Command Line Automation in Python
- [X] Datacamp: Creating Robust Python Workflows
- [X] Developing Python Packages
- [X] Treehouse: Object Oriented Python
- [X] Treehouse: Setup Local Python Environment
- [ ] Udacity: Writing READMEs
- [X] Youtube: Lecture 1: Introduction to Deep Learning
- [X] Youtube: Lecture 2: Setting Up Machine Learning Projects
- [X] Youtube: Lecture 3: Introduction to the Text Recognizer Project
- [X] Youtube: Lecture 4: Infrastructure and Tooling
- [X] Youtube: Hydra configuration
- [X] Youtube: Continuous integration
- [X] Youtube: Data Engineering + ML + Software Engineering // Satish Chandra Gupta // MLOps Coffee Sessions #16
- [X] Youtube: OO Design and Testing Patterns for Machine Learning with Chris Gerpheide
- [X] Youtube: Tutorial: Sebastian Witowski - Modern Python Developer's Toolkit
- [ ] Youtube: Lecture 13: ML Teams (Full Stack Deep Learning - Spring 2021)
0:58:13 - [ ] Youtube: Lecture 5: ML Projects (Full Stack Deep Learning - Spring 2021)
1:13:14 - [ ] Youtube: Lecture 6: Infrastructure & Tooling (Full Stack Deep Learning - Spring 2021)
1:07:21
Version Control
- [ ] Article: Mastering Git Stash Workflow
- [X] Article: How to Become a Master of Git Tags
- [X] Article: How to track large files in Github / Bitbucket? Git LFS to the rescue
- [ ] Article: Keep your git directory clean with
git cleanandgit trash - [X] Codecademy: Learn Git
- [X] Code School: Git Real
- [X] Datacamp: Introduction to Git for Data Science
- [ ] Thoughtbot: Mastering Git
- [X] Udacity: GitHub & Collaboration
- [X] Udacity: How to Use Git and GitHub
- [X] Udacity: Version Control with Git
- [X] Youtube: 045 Introduction to Git LFS
- [X] Youtube: Git & Scripting
Software Testing
- [X] Article: Effective testing for machine learning systems
- [X] Article: Unit Testing for Data Scientists
- [ ] Article: Testing in Production, the safe way
- [X] Article: How to cheat at unit tests with pytest and Black
- [X] Article: 4 Lesser-Known Yet Awesome Tips for Pytest
- [ ] Article: How to Unit Test Deep Learning: Tests in TensorFlow, mocking and test coverage
- [X] Article: Test-Driven Machine Learning Development (Deployment Series: Guide 07)
- [X] Datacamp: Unit Testing for Data Science in Python
- [X] Pluralsight: Test-driven Development: The Big Picture
- [ ] Test Driven Development with Python
- [ ] Thoughtbot: Fundamentals of TDD
- [ ] Udacity: Software Analysis & Testing
- [ ] Udacity: Software Testing
- [ ] Udacity: Software Debugging
- [X] Youtube: Beyond Accuracy: Behavioral Testing of NLP Models with CheckList | AISC
- [ ] Youtube: Lecture 10: ML Testing & Explainability (Full Stack Deep Learning - Spring 2021)
1:41:12 - [X] Youtube: Lab 8: Testing and Continuous Integration (Full Stack Deep Learning - Spring 2021)
0:13:26
Programming (Python)
- [X] Article: A reverse chronology of some Python features
- [ ] Article: No Really, Python's Pathlib is Great
- [ ] Article: When to switch to Python 3.9
- [X] Article: A deep dive on Python type hints
- [X] Article: I wish I knew these things when I learned Python
- [X] Article: Python Concurrency: The Tricky Bits
- [ ] Article: The Complete Python Development Guide
- [ ] Article: Speeding Up Python with Concurrency, Parallelism, and asyncio
- [ ] Article: Speed Up Your Python Program With Concurrency
- [ ] Article: A Python prompt into a running process: debugging with Manhole
- [X] Regex For Noobs (like me!) - An Illustrated Guide
- [X] Book: A Byte of Python
- [X] Book: Learn Python The Hard way
- [ ] Book: Python 201
- [ ] Book: The Python 3 Standard Library By Example
- [ ] Book: Writing Idiomatic Python 3
- [X] Calmcode: ray
- [X] Codecademy: Learn Python
- [X] Cognitiveclass.ai: Python for Data Science
- [X] Datacamp: Python for R Users
- [X] Datacamp: Python for Spreadsheet Users
- [X] Datacamp: Importing Data in Python (Part 1)
- [X] Datacamp: Intermediate Python for Data Science
- [X] Datacamp: Python Data Science Toolbox (Part 1)
- [X] Datacamp: Python Data Science Toolbox (Part 2)
- [X] Datacamp: Intro to Python for Finance
- [X] Datacamp: Writing Efficient Python Code
- [X] Datacamp: Writing Functions in Python
- [ ] Datacamp: Working with Dates and Times in Python
- [X] edX: Introduction to Python for Data Science
- [X] edX: Programming with Python for Data Science
- [X] Google's Python Class
- [X] Treehouse: Python Basics
- [X] TheNewBoston: Python Programming Tutorials
- [X] Youtube: Python 3 Programming Tutorial - Regular Expressions / Regex with re
- [X] Youtube: Python Tutorial: re Module - How to Write and Match Regular Expressions (Regex)
- [ ] Youtube: Python Concurrency and Multithreading
- [ ] Youtube: Aaron Richter- Parallel Processing in Python| PyData Global 2020
- [ ] Youtube: The Clean Architecture in Python
Code Editors
- [X] Article: Work remotely with PyCharm, TensorFlow and SSH
- [X] Article: Python remote debugging with PyCharm, CUDA, and Conda
- [X] Article: How To Use Visual Studio Code for Remote Development via the Remote-SSH Plugin
- [X] Article: Docker as Remote Interpreter for PyCharm Professional
- [X] Youtube: Getting Started with Python in Visual Studio Code
- [X] Youtube: VSCode Keyboard Shortcuts For Productivity
- [X] Youtube: Getting Started with Jupyter Notebooks in VS Code
- [X] Youtube: Notebooks in VS Code Are Getting Revamped!
- [X] Youtube: Getting Started with PyTorch in VS Code
- [X] Youtube: What every GitHub user should know about VS Code - GitHub Satellite 2020
- [X] VS Code and GitHub
- [X] Visual Studio Code Crash Course
Docker and Containerization
- [X] Article: How To Pass Environment Info During Docker Builds
- [X] Article: Pass Docker Environment Variables During The Image Build
- [X] Article: Setting Default Docker Environment Variables During Image Build
- [X] Article: Docker Explained Visually, For Non-Technical Folks
- [X] Article: Tensorflow in Docker
- [X] Article: Enough Docker to be Dangerous
- [X] Article: How Docker Can Help You Become A More Effective Data Scientist
- [ ] Article: Deploying conda environments in (Docker) containers - how to do it right
- [X] Article: Configuring Gunicorn for Docker
- [X] Article: How to scale services using Docker Compose
- [X] Article: A Beginner-Friendly Introduction to Containers, VMs and Docker
- [X] Article: Smaller Docker images with Conda
- [X] Pluralsight: Docker and Containers: The Big Picture
- [X] Article: Docker for Machine Learning – Part I
- [X] Article: Docker for Machine Learning – Part II
- [X] Article: Docker for Machine Learning – Part III
- [X] Article: Using Docker to Generate Machine Learning Predictions in Real Time
- [ ] Article: Connection refused? Docker networking and how it impacts your image
- [ ] Article: Faster or slower: the basics of Docker build caching
- [ ] Article: Where’s your code? Debugging ImportError and ModuleNotFoundErrors in your Docker image
- [ ] Article: A tableau of crimes and misfortunes: the ever-useful docker history
- [ ] Article: Broken by default: why you should avoid most Dockerfile examples
- [ ] Article: A review of the official Dockerfile best practices: good, bad, and insecure
- [ ] Article: The best Docker base image for your Python application (February 2021)
- [ ] Article: A deep dive into the official Docker image for Python
- [ ] Article: Using Alpine can make Python Docker builds 50× slower
- [ ] Article: Building on solid ground: ensuring reproducible Docker builds for Python
- [ ] Article: Installing system packages in Docker with minimal bloat
- [ ] Article: Less capabilities, more security: minimizing privilege escalation in Docker
- [ ] Article: Avoiding insecure images from Docker build caching
- [ ] Article: Build secrets in Docker and Compose, the secure way
- [ ] Article: Security scanners for Python and Docker: from code to dependencies
- [ ] Article: The high cost of slow Docker builds
- [ ] Article: Faster Docker builds with pipenv, poetry, or pip-tools
- [ ] Article: Elegantly activating a virtualenv in a Dockerfile
- [ ] Article: Poetry vs. Docker caching: Fight!
- [ ] Article: Speed up pip downloads in Docker with BuildKit’s new caching
- [ ] Article: Multi-stage builds #1: Smaller images for compiled code
- [ ] Article: Multi-stage builds #2: Python specifics—virtualenv, –user, and other methods
- [ ] Article: Multi-stage builds #3: Why your build is surprisingly slow, and how to speed it up
- [ ] Article: Configuring Gunicorn for Docker
- [ ] Article: Activating a Conda environment in your Dockerfile
- [ ] Article: Shrink your Conda Docker images with conda-pack
- [ ] Article: What’s running in production? Making your Docker images identifiable
- [ ] Article: Your Docker build needs a smoke test
- [ ] Article: Docker BuildKit: faster builds, new features, and now it’s stable
- [ ] Article: Docker vs. Singularity for data processing: UIDs and filesystem access
- [ ] Article: Where’s that log file? Debugging failed Docker builds
- [ ] Article: An Introduction to Kubernetes for Data Scientists
- [ ] Article: How to Use Kubernetes Pods for Machine Learning
- [ ] Article: Kubernetes Jobs for Machine Learning
- [ ] Article: Kubernetes CronJobs for Machine Learning
- [ ] Article: Kubernetes Deployments for Machine Learning
- [ ] Article: Kubernetes Services for Machine Learning
- [ ] “Let’s use Kubernetes!” Now you have 8 problems
- [ ] Article: Kubernetes for Python Developers: Part 1
- [X] Doc: Environment variables in Compose
- [X] Pluralsight: Docker and Kubernetes: The Big Picture
- [ ] Udacity: Scalable Microservices with Kubernetes
- [X] Youtube: Docker
- [X] Youtube: Why Your Web Server Should Log to Stdout (Especially with Docker)
Cloud Computing
- [X] Article: A deep dive into AWS spot instance interruptions
- [ ] Article: Getting started with large-scale ETL jobs using Dask and AWS EMR
- [X] Datacamp: Cloud Computing for Everyone
- [X] Pluralsight: AWS Developer: The Big Picture
- [X] Pluralsight: AWS Networking Deep Dive: Virtual Private Cloud (VPC)
- [X] Pluralsight: AWS VPC Operations
- [X] Pluralsight: Building Applications Using Elastic Beanstalk
- [X] Udemy: AWS Concepts
- [X] Udemy: AWS Certified Developer - Associate 2018
- [X] Whitepaper: Architecting for the Cloud AWS Best Practices
- [X] Whitepaper: AWS Well-Architected Framework
- [X] Whitepaper: AWS Security Best Practices
- [X] Whitepaper: Blue/Green Deployments on AWS
- [X] Whitepaper: Microservices on AWS
- [X] Whitepaper: Optimizing Enterprise Economics with Serverless Architectures
- [X] Whitepaper: Practicing Continuous Integration and Continuous Delivery on AWS
- [X] Whitepaper: Running Containerized Microservices on AWS
- [X] Udemy: Serverless Concepts
- [ ] Udacity: Networking for Web Developers
- [X] Whitepaper: Serverless Architectures with AWS Lambda
- [X] Youtube: Deploying a machine learning model to the cloud using AWS Lambda
Monitoring
- [X] Article: Production Machine Learning Monitoring: Outliers, Drift, Explainers & Statistical Performance
- [X] Article: How to Monitor Models
- [ ] Article: The Playbook to Monitor Your Model’s Performance in Production
- [X] Article: Monitoring your Machine Learning Model
- [X] Article: Preventing model drift with continuous monitoring and deployment using Github Actions and Algorithmia Insights
- [ ] Article: Continuous monitoring for data projects
- [X] Article: Lessons Learned from 15 Years of Monitoring Machine Learning in Production
- [X] Article: Why is it Important to Monitor Machine Learning Models?
- [ ] Article: Using Statistical Distances for Machine Learning Observability
- [ ] Article: The Model’s Shipped; What Could Possibly go Wrong?
- [ ] Article: Quality assurance in data science
- [ ] Youtube: Instrumentation, Observability & Monitoring of Machine Learning Models
- [ ] Article: Incident Management in Machine Learning Systems
- [ ] Article: ML Infrastructure Tools — ML Observability
- [ ] Youtube: MLOps #24 Monitoring the ML stack // Lina Weichbrodt
0:55:32 - [X] Youtube: Josh Wills: Visibility and Monitoring for Machine Learning Models
- [X] Youtube: Lecture 11B: Monitoring ML Models (Full Stack Deep Learning - Spring 2021)
0:36:55 - [ ] Youtube: OpML '20 - How ML Breaks: A Decade of Outages for One Large ML Pipeline
- [ ] Youtube: MLOps #28 ML Observability // Aparna Dhinakaran - Chief Product Officer at Arize AI
0:55:04 - [ ] Youtube: MLOps #29 Continuous Evaluation & Model Experimentation // Danny Ma - Founder of Sydney Data Science
1:00:46 - [ ] Youtube: SE4AI: Quality Assessment in Production
1:18:45 - [ ] Youtube: SE4AI: Infrastructure Quality, Deployment and Operations
1:04:54
Data Structures and Algorithms
Frontend Technology
- [ ] Book: Refactoring UI
- [X] Codecademy: Learn HTML
- [X] Codecademy: Learn SASS
- [X] Codecademy: Make a website
- [X] Codecademy: Learn ReactJS: Part I
- [X] Codecademy: Learn ReactJS: Part II
- [X] Codecademy: Learn JavaScript
- [X] Codecademy: Jquery Track
- [X] Codecademy: Learn Ruby
- [X] Code School: Fundamentals of Design
- [X] Code School: Blasting Off with Bootstrap
- [X] (ES6) - Beau teaches JavaScript
- [X] Pluralsight: UX Fundamentals
- [X] Pluralsight: HTML, CSS, and JavaScript: The Big Picture
- [X] Pluralsight: CSS Positioning
- [X] Pluralsight: Introduction to CSS
- [X] Pluralsight: CSS: Specificity, the Box Model, and Best Practices
- [X] Pluralsight: CSS: Using Flexbox for Layout
- [X] Pluralsight: Using The Chrome Developer Tools
- [ ] Thoughtbot: Design for Developers
- [X] Treehouse: HTML
- [X] Treehouse: Javascript Booleans
- [X] Udacity: ES6 - JavaScript Improved
- [X] Udacity: Intro to Javascript
- [X] Udacity: Object Oriented JS 1
- [X] Udacity: Object Oriented JS 2
- [X] Udemy: Understanding Typescript
Infrastructure and System Design
- [ ] Article: The implications of pickling ML models
- [X] Article: Deploy a Keras Deep Learning Project to Production with Flask
- [X] Article: Deploy BERT for Sentiment Analysis as REST API using PyTorch, Transformers by Hugging Face and FastAPI
- [ ] Article: Microservice in Python using FastAPI
- [X] Article: Selecting gunicorn worker types for different python web applications.
- [X] Article: Better performance by optimizing Gunicorn config
- [X] Article: Exponential Backoff And Jitter
- [X] Article: How to Serve Models
- [X] Article: MLOps concepts for busy engineers: model serving
- [X] Article: MLOps concepts for busy engineers: model serving
- [ ] Article: Understanding TensorFlow Serving
- [ ] Article: Serving models using Tensorflow Serving and Docker
- [X] Article: Batch Inference vs Online Inference
- [X] Article: Online batching with Spell serving
- [ ] Article: Machine Learning System Design: Real-time processing
- [ ] Article: Machine Learning System Design: Models-as-a-service
- [X] Article: What Does it Mean to Deploy a Machine Learning Model? (Deployment Series: Guide 01)
- [X] Article: Software Interfaces for Machine Learning Deployment (Deployment Series: Guide 02)
- [X] Article: Batch Inference for Machine Learning Deployment (Deployment Series: Guide 03)
- [X] Article: The Challenges of Online Inference (Deployment Series: Guide 04)
- [X] Article: Online Inference for ML Deployment (Deployment Series: Guide 05)
- [X] Article: Model Registries for ML Deployment (Deployment Series: Guide 06)
- [X] Video: You trained a machine learning model. Now what?
- [X] Article: How Data Leakage Impacts Machine Learning Models
- [X] Article: 5 Challenges to Running Machine Learning Systems in Production
- [ ] Article: Enabling Machine-Learning-as-a-Service Through Privacy Preserving Machine Learning
- [X] Article: Shadow mode deployments
- [X] Cortex Blog
- [X] Django Best Practices
- [ ] Udacity: Authentication & Authorization: OAuth
- [ ] Udacity: HTTP & Web Servers
- [ ] Udacity: Designing RESTful APIs
- [ ] Udacity: Client-Server Communication
- [X] Youtube: PyConBY 2020: Sebastian Ramirez - Serve ML models easily with FastAPI
- [X] Youtube: FastAPI from the ground up
- [X] Youtube: Python pydantic Introduction – Give your data classes super powers
- [X] Youtube: PyData Vancouver meetup: cortex.dev : Serving machine learning models in production
- [X] Youtube: Lecture 11A: Deploying ML Models (Full Stack Deep Learning - Spring 2021)
0:53:25 - [ ] Youtube: Hands-on serving models using KFserving // Theofilos Papapanagiotou // MLOps Meetup #40
0:57:40 - [X] Youtube: Shawn Scully: Production and Beyond: Deploying and Managing Machine Learning Models
- [X] Article: Celery Execution Pools: What is it all about?
- [X] Article: Distill: Why do we need Flask, Celery, and Redis? (with McDonalds in Between)
- [X] Article: Celery: an overview of the architecture and how it works
- [ ] Article: Unit Testing Celery Tasks
- [ ] Article: Testing Celery Chains
- [ ] Article: Task Routing in Celery
- [ ] Article: Dynamic Task Routing in Celery
- [ ] Article: Dockerize a Celery app with Django and RabbitMQ
- [ ] Article: How to call a Celery task from another app
- [ ] Article: Distributed Monte Carlo with Celery chords
- [ ] Article: An incredibly simple no-frills Celery setup
- [ ] Article: 3 Strategies to Customise Celery logging handlers
- [ ] Article: Celery task exceptions and automatic retries
- [X] Article: Concurrency and Parallelism
- [ ] Article: Celery, docker and the missing startup banner
- [ ] Article: Monitoring a Dockerized Celery Cluster with Flower
- [ ] Article: Quick Guide: Custom Celery Task Logger
- [ ] Article: Celery on Docker: From the Ground up
- [ ] Article: Auto-reload Celery on code changes
- [X] Youtube: Loading Testing with Python
- [X] Article: Architecting a Machine Learning Pipeline
- [X] Article: Combining rule engines and machine learning
- [X] Article: Deploying Machine Learning Models: A Checklist
- [X] Article: Getting machine learning to production
- [X] Article: How to build scalable Machine Learning systems — Part 1/2
- [X] Article: How to properly ship and deploy your machine learning model
- [X] Article: How to put machine learning models into production
- [X] Article: Key Concepts for Deploying Machine Learning Models to Mobile
- [X] Article: Machine Learning to Production
- [X] Article: Machine learning is going real-time
- [ ] Article: ML Infrastructure Tools for Model Building
- [ ] Article: ML Infrastructure Tools for Production (Part 1)
- [ ] Article: ML Infrastructure Tools for Production
- [X] Article: How to Deploy a Machine Learning Model
- [X] Article: Building a feature store
- [X] Article: Model artifacts: the war stories
- [ ] Article: Machine learning system design
- [ ] Article: What is ML model governance?
- [ ] Article: Building scalable and efficient ML Pipelines
- [ ] Article: What are common dataset challenges at scale?
- [ ] Article: How to technically distinguish among data projects?
- [ ] Article: Securing ML applications
- [ ] Article: Data Pipelines — Agile considerations
- [ ] Article: Data Lineage — An Operational perspective
- [ ] Article: The Ultimate Guide to Model Retraining
- [ ] Book: Machine Learning Systems Design
- [ ] Doc: Lecture 3: Data engineering
- [X] Datacamp: Data Engineering for Everyone
- [ ] Youtube: Applied ML in Production
- [X] Objective · Applied ML in Production
0:02:38 - [X] Solution · Applied ML in Production
0:08:53 - [X] Evaluation · Applied ML in Production
0:04:02 - [X] Iteration · Applied ML in Production
0:04:35 - [X] Annotation · Applied ML in Production
0:14:34 - [X] Exploratory Data Analysis (EDA) · Applied ML in Production
0:09:02
- [X] Objective · Applied ML in Production
- [ ] Youtube: SE4AI: Software Architecture of AI-Enabled Systems
1:14:24 - [ ] Youtube: SE4AI: Invited Talk Molham Aref "Business Systems with Machine Learning"
0:47:53 - [ ] Youtube: MLOps #4: Shubhi Jain - Building an ML Platform @SurveyMonkey
0:55:42 - [ ] Youtube: MLOps Meetup #6: Mid-Scale Production Feature Engineering with Dr. Venkata Pingali
1:01:35 - [ ] Youtube: MLOps meetup #5 High Stakes ML with Flavio CLesio
0:55:27 - [ ] Youtube: MLOps meetup #7 Alex Spanos // TrueLayer 's MLOps Pipeline
0:56:17 - [ ] Youtube: #11 Machine Learning at scale in Mercado Libre with Carlos de la Torre
0:59:28 - [ ] Youtube: MLOps #14: Kubeflow vs MLflow with Byron Allen
0:54:57 - [ ] Youtube: MLOps #15 - Scaling Human in the Loop Machine Learning with Robert Munro
0:55:04 - [ ] Youtube: MLOps #18 // Nubank - Running a fintech on ML
0:53:19 - [ ] Youtube: Feature Stores: An essential part of the ML stack to build great data / Kevin Stumpf - CTO at Tecton
1:05:46 - [ ] Youtube: MLOps #31 Path to Production and Monetizing Machine Learning // Vin Vashishta - Data Scientist
0:56:35 - [ ] Youtube: MLOps #35: Streaming Machine Learning with Apache Kafka and Tiered Storage // Kai Waehner, Confluent
0:52:50 - [ ] Youtube: Luigi in Production // MLOps Coffee Sessions #18 // Luigi Patruno ML in Production
0:47:23 - [ ] Youtube: The Current MLOps Landscape // Nathan Benaich & Timothy Chen // MLOps Meetup #43
0:58:31 - [ ] Youtube: MLSys Seminars Fall 2020
- [X] Stanford MLSys Seminar Episode 0: ML + Systems
0:11:49 - [ ] Stanford MLSys Seminar Episode 1: Marco Tulio Ribeiro
1:00:38 - [ ] Stanford MLSys Seminar Episode 2: Matei Zaharia
0:59:44 - [ ] Stanford MLSys Seminar Episode 3: Virginia Smith
1:00:55 - [ ] Stanford MLSys Seminar Episode 4: Alex Ratner
1:13:34 - [X] Stanford MLSys Seminar Episode 5: Chip Huyen
1:06:44
- [X] Stanford MLSys Seminar Episode 0: ML + Systems
- [ ] Youtube: Xavier Amatriain on Practical Deep Learning Systems (Full Stack Deep Learning - November 2019)
- [ ] Article: Decoding Netflix: Metaflow
- [ ] Youtube: Avoid cascading failures in a distributed system
- [ ] Youtube: How databases scale writes: The power of the log
- [ ] Youtube: How to avoid a single point of failure in distributed systems
- [ ] Youtube: How to start with distributed systems? Beginner's guide to scaling systems.
- [ ] Youtube: What is Distributed Caching? Explained with Redis!
- [X] Youtube: Why do Databases fail? AntiPatterns to avoid!
- [ ] Youtube: A friendly introduction to System Design
- [ ] Youtube: Designing Instagram: System Design of News Feed
- [ ] Youtube: Introduction to NoSQL databases
- [ ] Youtube: System Design Basics: Horizontal vs. Vertical Scaling
- [ ] Youtube: What is an API and how do you design it?
- [ ] Youtube: Service discovery and heartbeats in micro-services
- [ ] Youtube: System Design: Tinder as a microservice architecture
- [X] Youtube: What is Load Balancing?
- [X] Youtube: What is a microservice architecture and it's advantages?
- [X] Youtube: What is Consistent Hashing and Where is it used?
- [X] Youtube: What is a Message Queue and Where is it used?
- [ ] Youtube: 5 Tips for System Design Interviews
- [ ] Youtube: Whatsapp System Design: Chat Messaging Systems for Interviews
- [ ] Youtube: Capacity Estimation: How much data does YouTube store daily?
- [ ] Youtube: What is Database Sharding?
- [ ] Youtube: How Netflix onboards new content: Video Processing at scale
- [ ] Youtube: What is the Publisher Subscriber Model?
- [ ] Youtube: Distributed Consensus and Data Replication strategies on the server
- [ ] Youtube: System design : Design Autocomplete or Typeahead Suggestions for Google search
- [ ] Youtube: Relational database index vs. NoSQL index
- [ ] Youtube: What's an Event Driven System?
- [ ] Youtube: Distributed Consensus and Data Replication strategies on the server
- [ ] Youtube: What is an API and how do you design it?
- [ ] Youtube: What is Distributed Caching? Explained with Redis!
- [ ] Youtube: Service discovery and heartbeats in micro-services
- [ ] Youtube: Relational database index vs. NoSQL index
- [ ] Youtube: How Netflix onboards new content: Video Processing at scale
- [ ] Youtube: How to start with distributed systems? Beginner's guide to scaling systems.
- [ ] Youtube: Capacity Estimation: How much data does YouTube store daily?
- [ ] Youtube: How databases scale writes: The power of the log
- [ ] Youtube: System design : Design Autocomplete or Typeahead Suggestions for Google search
- [X] Youtube: Human-Centric Machine Learning Infrastructure @Netflix
Mathematics
- [ ] 3Blue1Brown: Essence of Calculus
- [ ] The Essence of Calculus, Chapter 1
0:17:04 - [ ] The paradox of the derivative | Essence of calculus, chapter 2
0:17:57 - [ ] Derivative formulas through geometry | Essence of calculus, chapter 3
0:18:43 - [ ] Visualizing the chain rule and product rule | Essence of calculus, chapter 4
0:16:52 - [ ] What's so special about Euler's number e? | Essence of calculus, chapter 5
0:13:50 - [ ] Implicit differentiation, what's going on here? | Essence of calculus, chapter 6
0:15:33 - [ ] Limits, L'Hôpital's rule, and epsilon delta definitions | Essence of calculus, chapter 7
0:18:26 - [ ] Integration and the fundamental theorem of calculus | Essence of calculus, chapter 8
0:20:46 - [ ] What does area have to do with slope? | Essence of calculus, chapter 9
0:12:39 - [ ] Higher order derivatives | Essence of calculus, chapter 10
0:05:38 - [ ] Taylor series | Essence of calculus, chapter 11
0:22:19 - [ ] What they won't teach you in calculus
0:16:22
- [ ] The Essence of Calculus, Chapter 1
- [ ] 3Blue1Brown: Essence of linear algebra
- [ ] Vectors, what even are they? | Essence of linear algebra, chapter 1
0:09:52 - [ ] Linear combinations, span, and basis vectors | Essence of linear algebra, chapter 2
0:09:59 - [ ] Linear transformations and matrices | Essence of linear algebra, chapter 3
0:10:58 - [ ] Matrix multiplication as composition | Essence of linear algebra, chapter 4
0:10:03 - [ ] Three-dimensional linear transformations | Essence of linear algebra, chapter 5
0:04:46 - [ ] The determinant | Essence of linear algebra, chapter 6
0:10:03 - [ ] Inverse matrices, column space and null space | Essence of linear algebra, chapter 7
0:12:08 - [ ] Nonsquare matrices as transformations between dimensions | Essence of linear algebra, chapter 8
0:04:27 - [ ] Dot products and duality | Essence of linear algebra, chapter 9
0:14:11 - [ ] Cross products | Essence of linear algebra, Chapter 10
0:08:53 - [ ] Cross products in the light of linear transformations | Essence of linear algebra chapter 11
0:13:10 - [ ] Cramer's rule, explained geometrically | Essence of linear algebra, chapter 12
0:12:12 - [ ] Change of basis | Essence of linear algebra, chapter 13
0:12:50 - [ ] Eigenvectors and eigenvalues | Essence of linear algebra, chapter 14
0:17:15 - [ ] Abstract vector spaces | Essence of linear algebra, chapter 15
0:16:46
- [ ] Vectors, what even are they? | Essence of linear algebra, chapter 1
- [X] 3Blue1Brown: Neural networks
- [X] Article: A Visual Tour of Backpropagation
- [X] Article: Entropy, Cross Entropy, and KL Divergence
- [ ] Article: Interview Guide to Probability Distributions
- [ ] Article: Introduction to Linear Algebra for Applied Machine Learning with Python
- [ ] Article: Entropy of a probability distribution — in layman’s terms
- [ ] Article: KL Divergence — in layman’s terms
- [ ] Article: Probability Distributions
- [ ] Article: Relearning Matrices as Linear Functions
- [ ] Article: You Could Have Come Up With Eigenvectors - Here's How
- [ ] Article: PageRank - How Eigenvectors Power the Algorithm Behind Google Search
- [ ] Article: Interactive Visualization of Why Eigenvectors Matter
- [ ] Article: Cross-Entropy and KL Divergence
- [ ] Article: Why Randomness Is Information?
- [ ] Article: Basic Probability Theory
- [X] Article: Math You Need to Succeed In ML Interviews
- [ ] Book: Basics of Linear Algebra for Machine Learning
- [X] Datacamp: Introduction to Statistics
- [X] Datacamp: Introduction to Statistics in Python
- [X] Datacamp: Foundations of Probability in Python
- [X] Datacamp: Hypothesis Testing in Python
- [X] Datacamp: Statistical Thinking in Python (Part 1)
- [X] Datacamp: Statistical Thinking in Python (Part 2)
- [X] Datacamp: Experimental Design in Python
- [ ] Datacamp: Statistical Simulation in Python
- [X] edX: Essential Statistics for Data Analysis using Excel
- [ ] Computational Linear Algebra for Coders
- [ ] Khan Academy: Precalculus
- [ ] Khan Academy: Probability
- [ ] Khan Academy: Differential Calculus
- [ ] Khan Academy: Multivariable Calculus
- [ ] Khan Academy: Linear Algebra
- [ ] MIT: 18.06 Linear Algebra (Professor Strang)
- [X] 1. The Geometry of Linear Equations
0:39:49 - [X] 2. Elimination with Matrices.
0:47:41 - [X] 3. Multiplication and Inverse Matrices
0:46:48 - [X] 4. Factorization into A = LU
0:48:05 - [X] 5. Transposes, Permutations, Spaces R^n
0:47:41 - [X] 6. Column Space and Nullspace
0:46:01 - [X] 9. Independence, Basis, and Dimension
0:50:14 - [X] 10. The Four Fundamental Subspaces
0:49:20 - [X] 11. Matrix Spaces; Rank 1; Small World Graphs
0:45:55 - [X] 14. Orthogonal Vectors and Subspaces
0:49:47 - [X] 15. Projections onto Subspaces
0:48:51 - [X] 16. Projection Matrices and Least Squares
0:48:05 - [X] 17. Orthogonal Matrices and Gram-Schmidt
0:49:09 - [X] 21. Eigenvalues and Eigenvectors
0:51:22 - [ ] 22. Diagonalization and Powers of A
0:51:50 - [ ] 24. Markov Matrices; Fourier Series
0:51:11 - [ ] 25. Symmetric Matrices and Positive Definiteness
0:43:52 - [ ] 27. Positive Definite Matrices and Minima
0:50:40 - [ ] 29. Singular Value Decomposition
0:40:28 - [ ] 30. Linear Transformations and Their Matrices
0:49:27 - [ ] 31. Change of Basis; Image Compression
0:50:13 - [ ] 33. Left and Right Inverses; Pseudoinverse
0:41:52
- [X] 1. The Geometry of Linear Equations
- [ ] StatQuest: Statistics Fundamentals
- [ ] StatQuest: Histograms, Clearly Explained
0:03:42 - [ ] StatQuest: What is a statistical distribution?
0:05:14 - [ ] StatQuest: The Normal Distribution, Clearly Explained!!!
0:05:12 - [ ] Statistics Fundamentals: Population Parameters
0:14:31 - [ ] Statistics Fundamentals: The Mean, Variance and Standard Deviation
0:14:22 - [ ] StatQuest: What is a statistical model?
0:03:45 - [ ] StatQuest: Sampling A Distribution
0:03:48 - [X] Hypothesis Testing and The Null Hypothesis
0:14:40 - [ ] Alternative Hypotheses: Main Ideas!!!
0:09:49 - [ ] p-values: What they are and how to interpret them
0:11:22 - [ ] How to calculate p-values
0:25:15 - [ ] p-hacking: What it is and how to avoid it!
0:13:44 - [ ] Statistical Power, Clearly Explained!!!
0:08:19 - [ ] Power Analysis, Clearly Explained!!!
0:16:44 - [ ] Covariance and Correlation Part 1: Covariance
0:22:23 - [ ] Covariance and Correlation Part 2: Pearson's Correlation
0:19:13 - [ ] StatQuest: R-squared explained
0:11:01 - [ ] The Central Limit Theorem
0:07:35 - [ ] StatQuickie: Standard Deviation vs Standard Error
0:02:52 - [ ] StatQuest: The standard error
0:11:43 - [ ] StatQuest: Technical and Biological Replicates
0:05:27 - [ ] StatQuest - Sample Size and Effective Sample Size, Clearly Explained
0:06:32 - [ ] Bar Charts Are Better than Pie Charts
0:01:45 - [ ] StatQuest: Boxplots, Clearly Explained
0:02:33 - [ ] StatQuest: Logs (logarithms), clearly explained
0:15:37 - [ ] StatQuest: Confidence Intervals
0:06:41 - [ ] StatQuickie: Thresholds for Significance
0:06:40 - [ ] StatQuickie: Which t test to use
0:05:10 - [ ] StatQuest: One or Two Tailed P-Values
0:07:05 - [ ] The Binomial Distribution and Test, Clearly Explained!!!
0:15:46 - [ ] StatQuest: Quantiles and Percentiles, Clearly Explained!!!
0:06:30 - [ ] StatQuest: Quantile-Quantile Plots (QQ plots), Clearly Explained
0:06:55 - [ ] StatQuest: Quantile Normalization
0:04:51 - [X] StatQuest: Probability vs Likelihood
0:05:01 - [ ] StatQuest: Maximum Likelihood, clearly explained!!!
0:06:12 - [ ] Maximum Likelihood for the Exponential Distribution, Clearly Explained! V2.0
0:09:39 - [ ] Why Dividing By N Underestimates the Variance
0:17:14 - [ ] Maximum Likelihood for the Binomial Distribution, Clearly Explained!!!
0:11:24 - [ ] Maximum Likelihood For the Normal Distribution, step-by-step!
0:19:50 - [ ] StatQuest: Odds and Log(Odds), Clearly Explained!!!
0:11:30 - [ ] StatQuest: Odds Ratios and Log(Odds Ratios), Clearly Explained!!!
0:16:20 - [ ] Live 2020-04-20!!! Expected Values
0:33:00
- [ ] StatQuest: Histograms, Clearly Explained
- [X] Udacity: Eigenvectors and Eigenvalues
- [ ] Udacity: Linear Algebra Refresher
- [ ] Udacity: Statistics
- [ ] Udacity: Intro to Descriptive Statistics
- [X] Udacity: Intro to Inferential Statistics
Specialize in Machine Learning
Have basic business understanding
- [X] Book: Delivering Happiness
- [X] Book: Good to Great: Why Some Companies Make the Leap...And Others Don't
- [X] Book: Hello, Startup: A Programmer's Guide to Building Products, Technologies, and Teams
- [X] Book: How Google Works
- [X] Book: Learn to Earn: A Beginner's Guide to the Basics of Investing and Business
- [X] Book: Rework
- [X] Book: The Airbnb Story
- [X] Book: The Personal MBA
- [X] Facebook: Digital marketing: get started
- [X] Facebook: Digital marketing: go further
- [X] Google Analytics for Beginners
- [X] Moz: The Beginner's Guide to SEO
- [X] Smartly: Marketing Fundamentals
- [X] Treehouse: SEO Basics
- [ ] Udacity: App Monetization
- [ ] Udacity: App Marketing
- [ ] Udacity: How to Build a Startup
Be familiar with how ML is applied at other companies
- [X] Article: How Facebook uses super-efficient AI models to detect hate speech
- [X] Article: Recent Advances in Google Translate
- [X] Article: Cannes: How ML saves us $1.7M a year on document previews
- [X] Article: Machine Learning @ Monzo in 2020
- [X] Article: How image search works at Dropbox
- [X] Real-world AI Case Studies
- [X] Andrej Karpathy on AI at Tesla (Full Stack Deep Learning - August 2018)
- [X] Jai Ranganathan at Data Science at Uber (Full Stack Deep Learning - August 2018)
- [X] John Apostolopoulos of Cisco discusses "Machine Learning in Networking"
0:48:44 - [X] Joaquin Candela, Director of Applied Machine Learning, Facebook in conversation with Esteban Arcaute
0:52:27 - [X] Eric Colson, Chief Algorithms Officer, Stitch Fix
0:53:57 - [X] Claudia Perlich, Advisor to Dstillery and Adjunct Professor NYU Stern School of Business
0:51:59 - [X] Jeff Dean, Google Senior Fellow and SVP Google AI - Deep Learning to Solve Challenging Problems
0:58:45 - [X] James Parr, Director of Frontier Development Lab (NASA), FDL Europe & CEO, Trillium Technologies
0:55:46 - [X] Daphne Koller, Founder & CEO of Insitro - In Conversation with Carlos Bustamante
0:49:29 - [X] Eric Horvitz, Microsoft Research - AI in the Open World: Advances, Aspirations, and Rough Edges
0:56:11 - [X] Tony Jebara, Netflix - Machine Learning for Recommendation and Personalization
0:55:20
- [X] Datacamp: Analyzing Police Activity with pandas
- [X] Datacamp: HR Analytics in Python: Predicting Employee Churn
- [X] Datacamp: Predicting Customer Churn in Python
- [X] Youtube: How does YouTube recommend videos? - AI EXPLAINED!
0:33:53 - [X] Youtube: How does Google Translate's AI work?
0:15:02 - [X] Youtube: Data Science in Finance
0:17:52 - [X] Youtube: The Age of AI
- [X] Youtube: Using Intent Data to Optimize the Self-Solve Experience
- [X] Youtube: Trillions of Questions, No Easy Answers: A (home) movie about how Google Search works
- [X] Youtube: Google Machine Learning System Design Mock Interview
- [X] Youtube: Netflix Machine Learning Mock Interview: Type-ahead Search
- [X] Youtube: Machine Learning design: Search engine for Q&A
- [X] Youtube: Engineering Systems for Real-Time Predictions @DoorDash
- [X] Youtube: How Gmail Uses Iterative Design, Machine Learning and AI to Create More Assistive Features
- [X] Youtube: Wayfair Data Science Explains It All: Human-in-the-loop Systems
- [X] Youtube: Leaving the lab: Building NLP applications that real people can use
- [X] Youtube: Machine Learning at Uber (Natural Language Processing Use Cases)
- [X] Youtube: Google Wave: Natural Language Processing
- [X] Youtube: Natural Language Understanding in Alexa
- [X] Youtube: The Machine Learning Behind Alexa’s AI Systems
- [X] Youtube: Ines Montani Keynote - Applied NLP Thinking
- [X] Youtube: Lecture 9: Lukas Biewald
- [X] Youtube: Lecture 13: Research Directions
- [X] Youtube: Lecture 14: Jeremy Howard
- [X] Youtube: Lecture 15: Richard Socher
- [X] Youtube: Machine learning across industries with Vicki Boykis
0:34:02 - [X] Youtube: Rachael Tatman - Conversational A.I. and Linguistics
0:36:51 - [X] Youtube: Nicolas Koumchatzky - Machine Learning in Production for Self Driving Cars
0:44:56 - [X] Youtube: Brandon Rohrer - Machine Learning in Production for Robots
0:34:31 - [X] Youtube: [CVPR'21 WAD] Keynote - Andrej Karpathy, Tesla
Be able to frame an ML problem
- [X] AWS: Types of Machine Learning Solutions
- [X] Article: Apply Machine Learning to your Business
- [X] Article: Resilience and Vibrancy: The 2020 Data & AI Landscape
- [X] Article: Software 2.0
- [X] Article: Highlights from ICML 2020
- [X] Article: A Peek at Trends in Machine Learning
- [X] Article: How to deliver on Machine Learning projects
- [X] Article: Data Science as a Product
- [X] Article: Customer service is full of machine learning problems
- [X] Article: Choosing Problems in Data Science and Machine Learning
- [X] Article: Why finance is deploying natural language processing
- [X] Article: The Last 5 Years In Deep Learning
- [X] Article: Always start with a stupid model, no exceptions.
- [X] Article: Most impactful AI trends of 2018: the rise of ML Engineering
- [X] Article: Building machine learning products: a problem well-defined is a problem half-solved.
- [X] Article: Simple considerations for simple people building fancy neural networks
- [X] Article: Maximizing Business Impact with Machine Learning
- [X] Book: AI Superpowers: China, Silicon Valley, and the New World Order
- [X] Book: A Human's Guide to Machine Intelligence
- [X] Book: The Future Computed
- [X] Book: Machine Learning Yearning by Andrew Ng
- [X] Book: Prediction Machines: The Simple Economics of Artificial Intelligence
- [X] Book: Building Machine Learning Powered Applications: Going from Idea to Product
- [X] Coursera: AI For Everyone
- [X] Datacamp: Data Science for Everyone
- [X] Datacamp: Machine Learning with the Experts: School Budgets
- [X] Datacamp: Machine Learning for Everyone
- [X] Datacamp: Data Science for Managers
- [X] Facebook: Field Guide to Machine Learning
- [X] Google: Introduction to Machine Learning Problem Framing
- [X] Pluralsight: How to Think About Machine Learning Algorithms
- [X] State of AI Report 2020
- [X] Youtube: Vincent Warmerdam: The profession of solving (the wrong problem) | PyData Amsterdam 2019
- [X] Youtube: Hugging Face, Transformers | NLP Research and Open Source | Interview with Julien Chaumond
- [X] Youtube: Vincent Warmerdam - Playing by the Rules-Based-Systems | PyData Eindhoven 2020
- [X] Youtube: Building intuitions before building models
- [X] Youtube: Recent Breakthroughs in AI with Andrej Karpathy and Lex Fridman
Be familiar with data ethics
- [X] Article: How to Detect Bias in AI
- [X] Netflix: Coded Bias
- [X] Netflix: The Great Hack
- [X] Netflix: The Social Dilemma
- [ ] Practical Data Ethics
- [X] Lesson 1: Disinformation
- [ ] Lesson 2: Bias & Fairness
- [ ] Lesson 3: Ethical Foundations & Practical Tools
- [ ] Lesson 4: Privacy and surveillance
- [ ] Lesson 4 continued: Privacy and surveillance
- [ ] Lesson 5.1: The problem with metrics
- [ ] Lesson 5.2: Our Ecosystem, Venture Capital, & Hypergrowth
- [ ] Lesson 5.3: Losing the Forest for the Trees, guest lecture by Ali Alkhatib
- [ ] Lesson 6: Algorithmic Colonialism, and Next Steps
- [ ] Youtube: Lecture 9: Ethics (Full Stack Deep Learning - Spring 2021)
1:04:50 - [ ] Youtube: SE4AI: Ethics and Fairness
1:18:37 - [ ] Youtube: SE4AI: Security
1:18:24 - [ ] Youtube: SE4AI: Safety
1:17:37
Be able to import data from multiple sources
- [X] Docs: Beautiful Soup Documentation
- [X] Datacamp: Importing Data in Python (Part 2)
- [ ] Datacamp: Web Scraping in Python
Be able to setup data annotation efficiently
- [X] Article: Create A Synthetic Image Dataset — The “What”, The “Why” and The “How”
- [X] Article: We need Synthetic Data
- [X] Article: Weak Supervision for Online Discussions
- [X] Article: ML Infrastructure Tools for Data Preparation
- [X] Article: Exploring the Role of Human Raters in Creating NLP Datasets
- [X] Article: Inter-Annotator Agreement (IAA)
- [X] Article: How to compute inter-rater reliability metrics (Cohen’s Kappa, Fleiss’s Kappa, Cronbach Alpha, Krippendorff Alpha, Scott’s Pi, Inter-class correlation) in Python
- [X] Article: The Pitfalls of Inter-Rater Reliability in Data Labeling and Machine Learning
- [X] Youtube: Snorkel: Dark Data and Machine Learning - Christopher Ré
- [X] Youtube: Training a NER Model with Prodigy and Transfer Learning
- [X] Youtube: Training a New Entity Type with Prodigy – annotation powered by active learning
- [X] Youtube: ECCV 2020 WSL tutorial: 4. Human-in-the-loop annotations
- [X] Youtube: Active Learning: Why Smart Labeling is the Future of Data Annotation | Alectio
- [X] Youtube: Lecture 8: Data Management (Full Stack Deep Learning - Spring 2021)
0:59:42 - [X] Youtube: Lab 6: Data Labeling (Full Stack Deep Learning - Spring 2021)
0:05:06 - [X] Youtube: Lecture 6: Data Management
- [X] Youtube: SE4AI: Data Quality
1:07:15 - [X] Youtube: SE4AI: Data Programming and Intro to Big Data Processing
0:33:04 - [X] Youtube: SE4AI: Managing and Processing Large Datasets
1:21:27
Be able to manipulate data with Numpy
- [X] Article: A Visual Intro to NumPy and Data Representation
- [X] Article: Good practices with numpy random number generators
- [ ] Article: NumPy Illustrated: The Visual Guide to NumPy
- [X] Article: NumPy Fundamentals for Data Science and Machine Learning
- [X] Datacamp: Intro to Python for Data Science
- [X] Pluralsight: Working with Multidimensional Data Using NumPy
Be able to manipulate data with Pandas
- [X] Article: Visualizing Pandas' Pivoting and Reshaping Functions
- [X] Article: A Gentle Visual Intro to Data Analysis in Python Using Pandas
- [X] Article: Comprehensive Guide to Grouping and Aggregating with Pandas
- [X] Article: 8 Python Pandas Value_counts() tricks that make your work more efficient
- [X] Datacamp: pandas Foundations
- [X] Datacamp: Pandas Joins for Spreadsheet Users
- [X] Datacamp: Manipulating DataFrames with pandas
- [X] Datacamp: Merging DataFrames with pandas
- [X] Datacamp: Data Manipulation with pandas
- [X] Datacamp: Optimizing Python Code with pandas
- [X] Datacamp: Streamlined Data Ingestion with pandas
- [X] Datacamp: Analyzing Marketing Campaigns with pandas
- [X] edX: Implementing Predictive Analytics with Spark in Azure HDInsight
- [X] Article: Modern Pandas
Be able to manipulate data in spreadsheets
- [X] Datacamp: Spreadsheet basics
- [ ] Datacamp: Data Analysis with Spreadsheets
- [ ] Datacamp: Intermediate Spreadsheets for Data Science
- [ ] Datacamp: Pivot Tables with Spreadsheets
- [ ] Datacamp: Data Visualization in Spreadsheets
- [ ] Datacamp: Introduction to Statistics in Spreadsheets
- [ ] Datacamp: Conditional Formatting in Spreadsheets
- [ ] Datacamp: Marketing Analytics in Spreadsheets
- [ ] Datacamp: Error and Uncertainty in Spreadsheets
- [X] edX: Analyzing and Visualizing Data with Excel
Be able to perform feature selection and engineering
- [X] Article: Tips for Advanced Feature Engineering
- [ ] Article: Preparing data for a machine learning model
- [ ] Article: Feature selection for a machine learning model
- [ ] Article: Learning from imbalanced data
- [ ] Article: Hacker's Guide to Data Preparation for Machine Learning
- [ ] Article: Practical Guide to Handling Imbalanced Datasets
- [ ] Datacamp: Analyzing Social Media Data in Python
- [X] Datacamp: Dimensionality Reduction in Python
- [X] Datacamp: Preprocessing for Machine Learning in Python
- [X] Datacamp: Data Types for Data Science
- [X] Datacamp: Cleaning Data in Python
- [X] Datacamp: Feature Engineering for Machine Learning in Python
- [ ] Datacamp: Importing & Managing Financial Data in Python
- [ ] Datacamp: Manipulating Time Series Data in Python
- [ ] Datacamp: Working with Geospatial Data in Python
- [ ] Datacamp: Analyzing IoT Data in Python
- [ ] Datacamp: Dealing with Missing Data in Python
- [ ] Datacamp: Exploratory Data Analysis in Python
- [X] edX: Data Science Essentials
- [ ] Udacity: Creating an Analytical Dataset
- [X] Youtube: Applied ML 2020 - 04 - Preprocessing
1:07:40 - [X] Youtube: Applied ML 2020 - 11 - Model Inspection and Feature Selection
1:15:15
Be able to experiment in a notebook
- [X] Article: Securely storing configuration credentials in a Jupyter Notebook
- [X] Article: Automatically Reload Modules with %autoreload
- [ ] Calmcode: ipywidgets
- [X] Documentation: Jupyter Lab
- [X] Pluralsight: Getting Started with Jupyter Notebook and Python
- [X] Youtube: William Horton - A Brief History of Jupyter Notebooks
- [X] Youtube: I Like Notebooks
- [X] Youtube: I don't like notebooks.- Joel Grus (Allen Institute for Artificial Intelligence)
- [X] Youtube: Ryan Herr - After model.fit, before you deploy| JupyterCon 2020
- [X] Youtube: nbdev live coding with Hamel Husain
- [X] Youtube: How to Use JupyterLab
Be able to visualize data
- [X] Article: Creating a Catchier Word Cloud Presentation
- [X] Article: Effectively Using Matplotlib
- [ ] Article: Which color scale to use when visualizing data
- [ ] Datacamp: Introduction to Data Visualization with Python
- [X] Datacamp: Introduction to Seaborn
- [X] Datacamp: Introduction to Matplotlib
- [ ] Datacamp: Intermediate Data Visualization with Seaborn
- [ ] Datacamp: Visualizing Time Series Data in Python
- [ ] Datacamp: Improving Your Data Visualizations in Python
- [ ] Datacamp: Visualizing Geospatial Data in Python
- [ ] Datacamp: Interactive Data Visualization with Bokeh
- [X] Youtube: Applied ML 2020 - 02 Visualization and matplotlib
1:07:30
Be able to do literature review using research papers
- [X] Paper: A Neural Probabilistic Language Model
- [ ] Paper: Efficient Estimation of Word Representations in Vector Space
- [X] Paper: Sequence to Sequence Learning with Neural Networks
- [X] Paper: Neural Machine Translation by Jointly Learning to Align and Translate
- [X] Paper: Attention Is All You Need
- [X] Paper: BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
- [X] Paper: XLNet: Generalized Autoregressive Pretraining for Language Understanding
- [X] Paper: Synonyms Based Term Weighting Scheme: An Extension to TF.IDF
- [ ] Paper: RoBERTa: A Robustly Optimized BERT Pretraining Approach
- [ ] Paper: GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
- [X] Paper: Amazon.com Recommendations Item-to-Item Collaborative Filtering
- [X] Paper: Collaborative Filtering for Implicit Feedback Datasets
- [ ] Paper: BPR: Bayesian Personalized Ranking from Implicit Feedback
- [X] Paper: Factorization Machines
- [X] Paper: Wide & Deep Learning for Recommender Systems
- [X] Paper: Neural Factorization Machines for Sparse Predictive Analytics
- [X] Paper: Multiword Expressions: A Pain in the Neck for NLP
- [X] Paper: PyTorch: An Imperative Style, High-Performance Deep Learning Library
- [X] Paper: ALBERT: A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS
- [X] Paper: Self-supervised Visual Feature Learning with Deep Neural Networks: A Survey
- [X] Paper: A Simple Framework for Contrastive Learning of Visual Representations
- [X] Paper: Self-Supervised Learning of Pretext-Invariant Representations
- [X] Paper: FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence
- [X] Paper: Self-Labelling via Simultalaneous Clustering and Representation Learning
- [X] Paper: A survey on Semi-, Self- and Unsupervised Techniques in Image Classification
- [X] Paper: Multi-document Summarization by using TextRank and Maximal Marginal Relevance for Text in Bahasa Indonesia
- [X] Paper: Train Once, Test Anywhere: Zero-Shot Learning for Text Classification
- [X] Paper: Zero-shot Text Classification With Generative Language Models
- [X] Paper: How to Fine-Tune BERT for Text Classification?
- [X] Paper: Universal Sentence Encoder
- [X] Paper: Enriching Word Vectors with Subword Information
- [X] Paper: Beyond Accuracy: Behavioral Testing of NLP models with CheckList
- [X] Paper: Pseudo-Label : The Simple and Efficient Semi-Supervised Learning Method for Deep Neural Networks
- [X] Paper: Temporal Ensembling for Semi-Supervised Learning
- [X] Paper: Boosting Self-Supervised Learning via Knowledge Transfer
- [X] Paper: Follow-up Question Generation
- [X] Paper: The Hardware Lottery
- [X] Paper: Question Generation via Overgenerating Transformations and Ranking
- [X] Paper: Good Question! Statistical Ranking for Question Generation
- [ ] Paper: Towards ML Engineering: A Brief History Of TensorFlow Extended (TFX)
- [X] Paper: Neural Text Generation: A Practical Guide
- [ ] Paper: Pest Management In Cotton Farms: An AI-System Case Study from the Global South
- [ ] Paper: BERT2DNN: BERT Distillation with Massive Unlabeled Data for Online E-Commerce Search
- [ ] Paper: On the surprising similarities between supervised and self-supervised models
- [X] Paper: All-but-the-Top: Simple and Effective Postprocessing for Word Representations
- [X] Paper: Simple and Effective Dimensionality Reduction for Word Embeddings
- [X] Paper: AutoCompete: A Framework for Machine Learning Competitions
- [X] Paper: Cost-effective Deployment of BERT Models in Serverless Environment
- [X] Paper: Evaluating Large Language Models Trained on Code
- [X] Paper: What Does BERT Learn about the Structure of Language?
- [X] Paper: What do RNN Language Models Learn about Filler–Gap Dependencies?
- [X] Paper: Symbol Grounding and its Implications for Artificial Intelligence
- [X] Paper: Is this a wampimuk? Cross-modal mapping between distributional semantics and the visual world
- [X] Paper: MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding
- [X] Paper: Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-Language BERTs
- [X] Paper: Show and Tell: A Neural Image Caption Generator
- [X] Paper: The Curious Case of Neural Text Degeneration
- [X] Paper: Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
- [X] Paper: ‘Calling on the classical phone’: a distributional model of adjective-noun errors in learners’ English
- [X] Youtube: mixup: Beyond Empirical Risk Minimization (Paper Explained)
Be able to version control data
- [ ] Youtube: DVC Basics
- [ ] Article: ML Ops: Data Science Version Control
- [X] Youtube: Data versioning in machine learning projects - Dmitry Petrov
0:34:44 - [X] Zoom: Data versioning with DVC Part 1
- [X] Zoom: Data versioning with DVC Part 2
Be able to use experiment management tools
- [X] Article: Supercharge your Training with Pytorch Lightning + Weights & Biases
- [X] Article: Storing Metadata from Machine Learning Experiments
- [X] Youtube: Weights and Biases Tutorial
- [X] Youtube: Integrate Weights & Biases with PyTorch
- [X] Youtube: Log (Almost) Anything with Weights & Biases
- [X] Youtube: Lab 5: Experiment Management (Full Stack Deep Learning - Spring 2021)
0:30:41 - [X] Youtube: Lecture 5: Tracking Experiments
- [ ] Youtube: Weight & Biases
- [ ] Youtube: SE4AI: Versioning, Provenance, and Reproducibility
1:18:29
Be able to setup model validation
- [ ] Article: Evaluating a machine learning model
- [X] Article: Validating your Machine Learning Model
- [ ] Article: Measuring Performance: AUPRC and Average Precision
- [ ] Article: Measuring Performance: AUC (AUROC)
- [ ] Article: Measuring Performance: The Confusion Matrix
- [ ] Article: Measuring Performance: Accuracy
- [ ] Article: ROC Curves: Intuition Through Visualization
- [X] Article: Precision, Recall, Accuracy, and F1 Score for Multi-Label Classification
- [ ] Article: The Complete Guide to AUC and Average Precision: Simulations and Visualizations
- [X] Article: Best Use of Train/Val/Test Splits, with Tips for Medical Data
- [X] Article: The correct way to evaluate online machine learning models
- [X] Article: Proxy Metrics
- [X] Youtube: Accuracy as a Failure
- [X] Youtube: Applied ML 2020 - 09 - Model Evaluation and Metrics
1:18:23 - [X] Youtube: Machine Learning Fundamentals: Cross Validation
0:06:04 - [X] Youtube: Machine Learning Fundamentals: The Confusion Matrix
0:07:12 - [X] Youtube: Machine Learning Fundamentals: Sensitivity and Specificity
0:11:46 - [X] Youtube: Machine Learning Fundamentals: Bias and Variance
0:06:36 - [X] Youtube: ROC and AUC, Clearly Explained!
0:16:26
Be familiar with inner working of models
- [ ] Article: Naive Bayes classification
- [ ] Article: Linear regression
- [ ] Article: Polynomial regression
- [ ] Article: Logistic regression
- [X] Article: Decision trees
- [ ] Article: K-nearest neighbors
- [X] Article: Support Vector Machines
- [ ] Article: Random forests
- [ ] Article: Boosted trees
- [ ] Article: Hacker's Guide to Fundamental Machine Learning Algorithms with Python
- [ ] Article: Neural networks: activation functions
- [ ] Article: Neural networks: training with backpropagation
- [ ] Article: Neural Network from scratch-part 1
- [ ] Article: Neural Network from scratch-part 2
- [ ] Article: Perceptron to Deep-Neural-Network
- [X] Article: One-vs-Rest strategy for Multi-Class Classification
- [X] Article: Multi-class Classification — One-vs-All & One-vs-One
- [X] Article: One-vs-Rest and One-vs-One for Multi-Class Classification
- [ ] Article: Deep Learning Algorithms - The Complete Guide
- [ ] Article: Machine Learning Techniques Primer
- [X] AWS: Understanding Neural Networks
- [ ] Book: Grokking Deep Learning
- [ ] Book: Make Your Own Neural Network
- [ ] Coursera: Neural Networks and Deep Learning
- [ ] Datacamp: Extreme Gradient Boosting with XGBoost
- [X] Datacamp: Ensemble Methods in Python
- [ ] StatQuest: Machine Learning
- [X] StatQuest: Fitting a line to data, aka least squares, aka linear regression.
0:09:21 - [X] StatQuest: Linear Models Pt.1 - Linear Regression
0:27:26 - [X] StatQuest: StatQuest: Linear Models Pt.2 - t-tests and ANOVA
0:11:37 - [X] StatQuest: Odds and Log(Odds), Clearly Explained!!!
0:11:30 - [X] StatQuest: Odds Ratios and Log(Odds Ratios), Clearly Explained!!!
0:16:20 - [X] StatQuest: Logistic Regression
0:08:47 - [X] Logistic Regression Details Pt1: Coefficients
0:19:02 - [X] Logistic Regression Details Pt 2: Maximum Likelihood
0:10:23 - [X] Logistic Regression Details Pt 3: R-squared and p-value
0:15:25 - [X] Saturated Models and Deviance
0:18:39 - [X] Deviance Residuals
0:06:18 - [X] Regularization Part 1: Ridge (L2) Regression
0:20:26 - [X] Regularization Part 2: Lasso (L1) Regression
0:08:19 - [X] Ridge vs Lasso Regression, Visualized!!!
0:09:05 - [X] Regularization Part 3: Elastic Net Regression
0:05:19 - [X] StatQuest: Principal Component Analysis (PCA), Step-by-Step
0:21:57 - [X] StatQuest: PCA main ideas in only 5 minutes!!!
0:06:04 - [X] StatQuest: PCA - Practical Tips
0:08:19 - [X] StatQuest: PCA in Python
0:11:37 - [X] StatQuest: Linear Discriminant Analysis (LDA) clearly explained.
0:15:12 - [X] StatQuest: MDS and PCoA
0:08:18 - [X] StatQuest: t-SNE, Clearly Explained
0:11:47 - [X] StatQuest: Hierarchical Clustering
0:11:19 - [X] StatQuest: K-means clustering
0:08:57 - [X] StatQuest: K-nearest neighbors, Clearly Explained
0:05:30 - [X] Naive Bayes, Clearly Explained!!!
0:15:12 - [X] Gaussian Naive Bayes, Clearly Explained!!!
0:09:41 - [X] StatQuest: Decision Trees
0:17:22 - [X] StatQuest: Decision Trees, Part 2 - Feature Selection and Missing Data
0:05:16 - [X] Regression Trees, Clearly Explained!!!
0:22:33 - [X] How to Prune Regression Trees, Clearly Explained!!!
0:16:15 - [X] StatQuest: Random Forests Part 1 - Building, Using and Evaluating
0:09:54 - [X] StatQuest: Random Forests Part 2: Missing data and clustering
0:11:53 - [X] The Chain Rule
0:18:23 - [X] Gradient Descent, Step-by-Step
0:23:54 - [X] Stochastic Gradient Descent, Clearly Explained!!!
0:10:53 - [ ] AdaBoost, Clearly Explained
0:20:54 - [ ] Gradient Boost Part 1: Regression Main Ideas
0:15:52 - [ ] Gradient Boost Part 2: Regression Details
0:26:45 - [ ] Gradient Boost Part 3: Classification
0:17:02 - [ ] Gradient Boost Part 4: Classification Details
0:36:59 - [X] Support Vector Machines, Clearly Explained!!!
0:20:32 - [X] Support Vector Machines Part 2: The Polynomial Kernel
0:07:15 - [X] Support Vector Machines Part 3: The Radial (RBF) Kernel
0:15:52 - [ ] XGBoost Part 1: Regression
0:25:46 - [ ] XGBoost Part 2: Classification
0:25:17 - [ ] XGBoost Part 3: Mathematical Details
0:27:24 - [ ] XGBoost Part 4: Crazy Cool Optimizations
0:24:27 - [ ] StatQuest: Fiitting a curve to data, aka lowess, aka loess
0:10:10 - [ ] Statistics Fundamentals: Population Parameters
0:14:31 - [ ] Principal Component Analysis (PCA) clearly explained (2015)
0:20:16 - [ ] Decision Trees in Python from Start to Finish
1:06:23
- [X] StatQuest: Fitting a line to data, aka least squares, aka linear regression.
- [ ] Udacity: Classification Models
- [ ] Youtube: Neural Networks from Scratch in Python
- [ ] Neural Networks from Scratch - P.1 Intro and Neuron Code
0:16:59 - [ ] Neural Networks from Scratch - P.2 Coding a Layer
0:15:06 - [ ] Neural Networks from Scratch - P.3 The Dot Product
0:25:17 - [ ] Neural Networks from Scratch - P.4 Batches, Layers, and Objects
0:33:46 - [ ] Neural Networks from Scratch - P.5 Hidden Layer Activation Functions
0:40:05
- [ ] Neural Networks from Scratch - P.1 Intro and Neuron Code
- [X] Youtube: Applied ML 2020 - 03 Supervised learning and model validation
1:12:00 - [ ] Youtube: Applied ML 2020 - 05 - Linear Models for Regression
1:06:54 - [ ] Youtube: Applied ML 2020 - 06 - Linear Models for Classification
1:07:50 - [X] Youtube: Applied ML 2020 - 07 - Decision Trees and Random Forests
1:07:58 - [X] Youtube: Applied ML 2020 - 08 - Gradient Boosting
1:02:12 - [ ] Youtube: Applied ML 2020 - 18 - Neural Networks
1:19:36 - [ ] Youtube: Applied ML 2020 - 12 - AutoML (plus some feature selection)
1:25:38
Be able to improve model generalization
- [ ] Article: Deep neural networks: preventing overfitting
- [ ] Article: Normalizing your data (specifically, input and batch normalization)
- [ ] Article: Batch Normalization
- [ ] Article: Are Deep Neural Networks Dramatically Overfitted?
- [ ] Article: In-layer normalization techniques for training very deep neural networks
- [ ] Article: Label Smoothing Explained using Microsoft Excel
- [X] Article: Uncertainty Quantification Part 4: Leveraging Dropout in Neural Networks (CNNs)
- [X] Article: Simple Ways to Tackle Class Imbalance
- [X] Youtube: Applied ML 2020 - 10 - Calibration, Imbalanced data
1:16:14 - [X] Youtube: Lecture 10: Troubleshooting Deep Neural Networks
Be familiar with fundamental ML concepts
- [ ] Article: Connections: Log Likelihood, Cross Entropy, KL Divergence, Logistic Regression, and Neural Networks
- [ ] Article: MLE and MAP — in layman’s terms
- [ ] An overview of gradient descent optimization algorithms
- [ ] Article: Optimization for Deep Learning Highlights in 2017
- [ ] Article: Gradient descent
- [ ] Article: Setting the learning rate of your neural network
- [ ] Article: Cross-entropy for classification
- [ ] Article: Dismantling Neural Networks to Understand the Inner Workings with Math and Pytorch
- [X] Datacamp: AI Fundamentals
- [ ] Datacamp: Foundations of Predictive Analytics in Python (Part 1)
- [ ] Datacamp: Foundations of Predictive Analytics in Python (Part 2)
- [X] Elements of AI
- [X] edX: Principles of Machine Learning
- [X] edX: Data Science Essentials
- [X] Fast.ai: Deep Learning for Coder (2020)
- [X] Youtube: Deep Double Descent
Be able to implement models in scikit-learn
- [X] Article: Stacking made easy with Sklearn
- [X] Article: Curve Fitting With Python
- [X] Article: A Guide to Calibration Plots in Python
- [X] Calmcode: human-learn
- [X] Datacamp: Supervised Learning with scikit-learn
- [X] Datacamp: Machine Learning with Tree-Based Models in Python
- [ ] Datacamp: Introduction to Linear Modeling in Python
- [X] Datacamp: Linear Classifiers in Python
- [ ] Datacamp: Generalized Linear Models in Python
- [X] Notebook: scikit-learn tips
- [X] Pluralsight: Building Machine Learning Models in Python with scikit-learn
- [ ] Video: human learn
- [X] Youtube: dabl: Automatic Machine Learning with a Human in the Loop
00:25:43 - [X] Youtube: Multilabel and Multioutput Classification -Machine Learning with TensorFlow & scikit-learn on Python
- [X] Youtube: DABL: Automatic machine learning with a human in the loop- AI Latim American SumMIT Day 1
Be able to implement models in Tensorflow and Keras
- [X] Coursera: Introduction to Tensorflow
- [X] Coursera: Convolutional Neural Networks in TensorFlow
- [X] Deeplizard: Keras - Python Deep Learning Neural Network API
- [ ] Book: Deep Learning with Python (Page: 276)
- [X] Datacamp: Deep Learning in Python
- [X] Datacamp: Convolutional Neural Networks for Image Processing
- [X] Datacamp: Introduction to TensorFlow in Python
- [X] Datacamp: Introduction to Deep Learning with Keras
- [X] Datacamp: Advanced Deep Learning with Keras
- [ ] Google: Machine Learning Crash Course
- [X] Pluralsight: Deep Learning with Keras
- [X] Udacity: Intro to TensorFlow for Deep Learning
Be able to implement models in PyTorch
- [ ] Article: Keeping Up with PyTorch Lightning and Hydra
- [ ] Article: The One PyTorch Trick Which You Should Know
- [ ] Article: How does automatic differentiation really work?
- [X] Article: 7 Tips To Maximize PyTorch Performance
- [X] Article: An introduction to PyTorch Lightning with comparisons to PyTorch
- [X] Article: Converting From Keras To PyTorch Lightning
- [X] Article: From PyTorch to PyTorch Lightning — A gentle introduction
- [X] Article: Introducing PyTorch Lightning Sharded: Train SOTA Models, With Half The Memory
- [X] Article: Sharded: A New Technique To Double The Size Of PyTorch Models
- [X] Article: Understanding Bidirectional RNN in PyTorch
- [ ] Article: A developer-friendly guide to mixed precision training with PyTorch
- [ ] Article: A developer-friendly guide to model pruning in PyTorch
- [ ] Article: A developer-friendly guide to model quantization with PyTorch
- [ ] Article: Tricks for training PyTorch models to convergence more quickly
- [X] Article: PyTorch Lightning Bolts — From Linear, Logistic Regression on TPUs to pre-trained GANs
- [X] Article: Scaling Logistic Regression Via Multi-GPU/TPU Training
- [X] Article: Training Neural Nets on Larger Batches: Practical Tips for 1-GPU, Multi-GPU & Distributed setups
- [X] Article: PyTorch Lightning 0.9 — synced BatchNorm, DataModules and final API!
- [X] Article: PyTorch Lightning: Metrics
- [X] Article: PyTorch Multi-GPU Metrics Library and More in PyTorch Lightning 0.8.1
- [ ] Article: Distributed model training in PyTorch using DistributedDataParallel
- [ ] Article: Distributed model training in PyTorch using DistributedDataParallel
- [ ] Article: EINSUM IS ALL YOU NEED - EINSTEIN SUMMATION IN DEEP LEARNING
- [ ] Article: Faster Deep Learning Training with PyTorch – a 2021 Guide
- [ ] Article: Fit More and Train Faster With ZeRO via DeepSpeed and FairScale
- [ ] Article: PyTorch Lightning V1.2.0- DeepSpeed, Pruning, Quantization, SWA
- [ ] Article: But what are PyTorch DataLoaders really?
- [X] Article: Using PyTorch + NumPy? You're making a mistake.
- [X] Article: How Wadhwani AI Uses PyTorch To Empower Cotton Farmers
- [X] Article: Taming LSTMs: Variable-sized mini-batches and why PyTorch is good for your health
- [ ] Article: How to Build a Streaming DataLoader with PyTorch
- [X] Article: Transform your ML-model to Pytorch with Hummingbird
- [X] Article: PyTorch Loss Functions: The Ultimate Guide
- [X] Article: Pad pack sequences for Pytorch batch processing with DataLoader
- [ ] Article: Model Parallelism
- [ ] Notebook: Tensor Arithmetic
- [ ] Notebook: Autograd
- [ ] Notebook: Optimization
- [ ] Notebook: Network modules
- [ ] Notebook: Datasets and Dataloaders
- [X] Documentation: Pytorch Lightning
- [X] Datacamp: Introduction to Deep Learning with PyTorch
- [X] Deeplizard: Neural Network Programming - Deep Learning with PyTorch
- [ ] Udacity: Intro to Deep Learning with PyTorch
- [X] Youtube: PyTorch Lightning 101
- [X] Youtube: SimCLR with PyTorch Lightning
- [ ] Youtube: PyTorch Performance Tuning Guide
26:41:00 - [X] Youtube: Skin Cancer Detection with PyTorch
- [X] Youtube: Learn with Lightning
- [X] Youtube: PyTorch Tutorial - RNN & LSTM & GRU - Recurrent Neural Nets
00:15:51 - [X] Youtube: Pytorch Zero to All
- [X] PyTorch Developer Day 2020 | Full Livestream
- [X] Youtube: Lightning Chat: How a Grandmaster Won a Kaggle Competition Using Pytorch Lightning
- [ ] Youtube: Production Inference Deployment with PyTorch
- [ ] Youtube: What is Automatic Differentiation?
- [ ] Youtube: JAX: accelerated machine learning research via composable function transformations in Python
Be able to implement models using cloud services
- [X] AWS: Amazon Transcribe Deep Dive: Using Feedback Loops to Improve Confidence Level of Transcription
- [X] AWS: Build a Text Classification Model with AWS Glue and Amazon SageMaker
- [X] AWS: Deep Dive on Amazon Rekognition: Building Computer Visions Based Smart Applications
- [X] AWS: Hands-on Rekognition: Automated Video Editing
- [X] AWS: Introduction to Amazon Comprehend
- [X] AWS: Introduction to Amazon Comprehend Medical
- [X] AWS: Introduction to Amazon Elastic Inference
- [X] AWS: Introduction to Amazon Forecast
- [X] AWS: Introduction to Amazon Lex
- [X] AWS: Introduction to Amazon Personalize
- [X] AWS: Introduction to Amazon Polly
- [X] AWS: Introduction to Amazon SageMaker Ground Truth
- [X] AWS: Introduction to Amazon SageMaker Neo
- [X] AWS: Introduction to Amazon Transcribe
- [X] AWS: Introduction to Amazon Translate
- [X] AWS: Introduction to AWS Marketplace - Machine Learning Category
- [X] AWS: Machine Learning Exam Basics
- [X] AWS: Neural Machine Translation with Sockeye
- [X] AWS: Process Model: CRISP-DM on the AWS Stack
- [X] AWS: Satellite Image Classification in SageMaker
- [X] Datacamp: Introduction to AWS Boto in Python
- [X] edX: Amazon SageMaker: Simplifying Machine Learning Application Development
Be able to apply unsupervised learning algorithms
- [ ] Article: Decrypt Generative Adversarial Networks (GAN)
- [ ] Article: GANs in computer vision - Conditional image synthesis and 3D object generation
- [ ] Article: GANs in computer vision - Improved training with Wasserstein distance, game theory control and progressively growing schemes
- [ ] Article: GANs in computer vision - Introduction to generative learning
- [ ] Article: GANs in computer vision - self-supervised adversarial training and high-resolution image synthesis with style incorporation
- [ ] Article: GANs in computer vision - semantic image synthesis and learning a generative model from a single image
- [X] Article: Paper Summary: DeViSE: A Deep Visual-Semantic Embedding Model
- [ ] Article: Contrastive Self-Supervised Learning
- [ ] Article: From Autoencoder to Beta-VAE
- [ ] Article: Self-Supervised Representation Learning
- [ ] Article: Self-supervised learning: The dark matter of intelligence
- [ ] Article: Understanding self-supervised and contrastive learning with "Bootstrap Your Own Latent" (BYOL)
- [ ] Article: How to Generate Images using Autoencoders
- [ ] Article: Introduction to autoencoders
- [ ] Article: Soft clustering with Gaussian mixed models (EM)
- [ ] Article: Variational autoencoders
- [ ] Article: Build a simple Image Retrieval System with an Autoencoder
- [ ] Article: Deep Inside Autoencoders
- [X] Article: An overview of proxy-label approaches for semi-supervised learning
- [X] Article: From Research to Production with Deep Semi-Supervised Learning
- [ ] Article: Affinity Propagation Algorithm Explained
- [ ] Article: Algorithm Breakdown: Affinity Propagation
- [X] Article: Create, Visualize and Interpret Customer Segments
- [ ] Article: A Framework For Contrastive Self-Supervised Learning And Designing A New Approach
- [ ] Article: EfficientDet Meets Pytorch-Lightning
- [ ] Article: Principal components analysis (PCA)
- [X] Article: RecSys 2020 - Takeaways and Notable Papers
- [ ] Article: Grouping data points with k-means clustering
- [ ] Article: A gentle introduction to HDBSCAN and density-based clustering
- [ ] Article: Deepfakes: Face synthesis with GANs and Autoencoders
- [X] Article: The 3 Deep Learning Frameworks For End-to-End Speech Recognition That Power Your Devices
- [ ] Article: Unsupervised Learning of Visual Features by Contrasting Cluster Assignments
- [ ] Berkeley: Deep Unsupervised Learning Spring 2020
- [X] L1 Introduction -- CS294-158-SP20 Deep Unsupervised Learning -- UC Berkeley, Spring 2020
1:10:02 - [X] L2 Autoregressive Models -- CS294-158-SP20 Deep Unsupervised Learning -- UC Berkeley, Spring 2020
2:27:23 - [ ] L3 Flow Models -- CS294-158-SP20 Deep Unsupervised Learning -- UC Berkeley -- Spring 2020
1:56:53 - [ ] L4 Latent Variable Models (VAE) -- CS294-158-SP20 Deep Unsupervised Learning -- UC Berkeley
2:19:33 - [ ] Lecture 5 Implicit Models -- GANs Part I --- UC Berkeley, Spring 2020
2:32:32 - [ ] Lecture 6 Implicit Models / GANs part II --- CS294-158-SP20 Deep Unsupervised Learning -- Berkeley
2:09:14 - [X] Lecture 7 Self-Supervised Learning -- UC Berkeley Spring 2020 - CS294-158 Deep Unsupervised Learning
2:20:41 - [ ] L8 Round-up of Strengths and Weaknesses of Unsupervised Learning Methods -- UC Berkeley SP20
0:41:51 - [X] L9 Semi-Supervised Learning and Unsupervised Distribution Alignment -- CS294-158-SP20 UC Berkeley
2:16:00 - [ ] L10 Compression -- UC Berkeley, Spring 2020, CS294-158 Deep Unsupervised Learning
3:09:49 - [X] L11 Language Models -- guest instructor: Alec Radford (OpenAI) --- Deep Unsupervised Learning SP20
2:38:19 - [ ] L12 Representation Learning for Reinforcement Learning --- CS294-158 UC Berkeley Spring 2020
2:01:56
- [X] L1 Introduction -- CS294-158-SP20 Deep Unsupervised Learning -- UC Berkeley, Spring 2020
- [ ] Datacamp: Customer Segmentation in Python
- [X] Datacamp: Unsupervised Learning in Python
- [X] Deck: Demystifying Self-Supervised Learning for Visual Recognition
- [X] DeepMind: Inefficient Data Efficiency
- [ ] Google: Clustering
- [X] Udacity: Segmentation and Clustering
- [X] Wandb: Unsupervised Visual Representation Learning with SwAV
- [X] Youtube: Applied ML 2020 - 14 - Clustering and Mixture Models
1:26:33 - [ ] Youtube: Applied ML 2020 - 13 - Dimensionality reduction
1:30:34 - [X] Youtube: BYOL: Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning (Paper Explained)
- [X] Youtube: A critical analysis of self-supervision, or what we can learn from a single image (Paper Explained)
- [X] Youtube: Week 10 – Lecture: Self-supervised learning (SSL) in computer vision (CV)
- [X] Youtube: CVPR 2020 Tutorial: Towards Annotation-Efficient Learning
- [X] Youtube: Yuki Asano | Self-Supervision | Self-Labelling | Labelling Unlabelled videos | CV | CTDS.Show #81
- [X] Youtube: Contrastive Clustering with SwAV
- [ ] Youtube: Variational Autoencoders - EXPLAINED!
0:17:36 - [X] Youtube: OptaProAnalyticsForum– Learning to watch football: Self-supervised representations for tracking data
- [X] Youtube: Can a Neural Net tell if an image is mirrored? – Visual Chirality
- [X] Youtube: Deep InfoMax: Learning deep representations by mutual information estimation and maximization
- [ ] Deep Learning Lecture Summer 2020
- [ ] Deep Learning: Unsupervised Learning - Part 1
- [ ] Deep Learning: Unsupervised Learning - Part 2
- [ ] Deep Learning: Unsupervised Learning - Part 3
- [ ] Deep Learning: Unsupervised Learning - Part 4
- [ ] Deep Learning: Unsupervised Learning - Part 5
- [ ] Deep Learning: Weakly and Self-Supervised Learning - Part 1
- [ ] Deep Learning: Weakly and Self-Supervised Learning - Part 2
- [ ] Deep Learning: Weakly and Self-Supervised Learning - Part 3
- [ ] Deep Learning: Weakly and Self-Supervised Learning - Part 4
- [X] ECCV 2020: New Frontiers for Learning with Limited Labels or Data
- [X] Youtube: Self-Supervised Learning - What is Next? - Workshop at ECCV 2020, August 28th
- [X] Youtube: Marco Cuturi - A Primer on Optimal Transport
- [X] Youtube: Sebastian Ruder: Neural Semi-supervised Learning under Domain Shift
- [X] Youtube: Clustering Algorithms
Be able to implement NLP models
- [X] Article: Fixing common Unicode mistakes with Python – after they’ve been made
- [ ] Article: 10 Popular Keyword Extraction Algorithms in Natural Language Processing
- [ ] Article: Deconstructing BERT
- [ ] Article: How To Create Data Products That Are Magical Using Sequence-to-Sequence Models
- [ ] Article: Why Rasa uses Sparse Layers in Transformers
- [ ] Article: Semantic Search On Documents
- [ ] Article: Locality-sensitive Hashing and Singular to Plural Noun Conversion
- [X] Article: Build A Keyword Extraction API with Spacy, Flask, and FuzzyWuzzy
- [ ] Article: What is Hidden in the Hidden Markov Model?
- [ ] Article: Unsupervised NER using BERT
- [ ] Article: Unsupervised creation of interpretable sentence representations
- [ ] Article: Unsupervised synonym harvesting
- [ ] Article: Zero shot NER using RoBERTA
- [X] Article: Maximizing BERT model performance
- [X] Article: Swiss army knife for unsupervised task solving
- [X] Article: 10 Exciting Ideas of 2018 in NLP
- [ ] Article: 74 Summaries of Machine Learning and NLP Research
- [X] Article: Advance BERT model via transferring knowledge from Cross-Encoders to Bi-Encoders
- [ ] Article: Learning to select data for transfer learning
- [ ] Article: T5 — a model that explores the limits of transfer learning
- [ ] Article: The State of Transfer Learning in NLP
- [X] Article: Haystack: The State of Search in 2021
- [X] Article: How to build a State-of-the-Art Conversational AI with Transfer Learning
- [ ] Article: Commonsense Reasoning for Natural Language Processing
- [ ] Article: Language Models
- [ ] Article: Paraphrasing
- [ ] Article: Poor man’s GPT-3: Few shot text generation with T5 Transformer
- [ ] Article: Text Generation
- [X] Article: Controlling Text Generation with Plug and Play Language Models
- [X] Article: What makes a good conversation?
- [ ] Article: How to steal modern NLP systems with gibberish?
- [X] Article: Intuition & Use-Cases of Embeddings in NLP & beyond
- [X] Article: The Illustrated BERT, ELMo, and co. (How NLP Cracked Transfer Learning)
- [X] Article: The Illustrated GPT-2 (Visualizing Transformer Language Models)
- [X] Article: The Illustrated Transformer
- [X] Article: The Illustrated Word2vec
- [ ] Article: A Deep Dive into the Reformer
- [ ] Article: A Survey of Long-Term Context in Transformers
- [ ] Article: Large Memory Layers with Product Keys
- [ ] Article: Optimal Transport and the Sinkhorn Transformer
- [ ] Article: Pattern-Exploiting Training
- [ ] Article: Talking-Heads Attention
- [ ] Article: How to Apply BERT to Arabic and Other Languages
- [ ] Article: Neural Language Models as Domain-Specific Knowledge Bases
- [X] Article: Domain-Specific BERT Models
- [X] Article: Existing Tools for Named Entity Recognition
- [X] Article: Smart Batching Tutorial - Speed Up BERT Training
- [ ] Article: Attention? An Other Perspective!: Part 2
- [ ] Article: Attention? An Other Perspective!: Part 3
- [ ] Article: Attention? An Other Perspective!: Part 4
- [ ] Article: Attention? An Other Perspective!: Part 5
- [X] Article: Attention? An Other Perspective!: Part 1
- [ ] Article: Rebuilding the spellchecker, pt.4: Introduction to suggest algorithm
- [ ] Article: Rebuilding the spellchecker: Hunspell and the order of edits
- [X] Article: Rebuilding the most popular spellchecker. Part 1
- [X] Article: Rebuilding the spellchecker, pt.2: Just look in the dictionary, they said!
- [X] Article: Rebuilding the spellchecker, pt.3: Lookup—compounds and solutions
- [ ] Article: Automatic Topic Labeling in 2018: History and Trends
- [ ] Article: Deep Learning for NLP Best Practices
- [X] Article: Attention and Memory in Deep Learning and NLP
- [X] Article: Long Short-Term Memory: From Zero to Hero with PyTorch
- [X] Article: Ten trends in Deep learning NLP
- [X] Article: The Illustrated Wav2vec
- [ ] Article: A Review of the Neural History of Natural Language Processing
- [ ] Article: ColumnTransformer Meets Natural Language Processing
- [ ] Article: Neural Transfer Learning for Natural Language Processing
- [ ] Article: Tracking the Progress in Natural Language Processing
- [X] Article: Implementing Bengio’s Neural Probabilistic Language Model (NPLM) using Pytorch
- [ ] Article: Leveraging Pre-trained Language Model Checkpoints for Encoder-Decoder Models
- [X] Article: How many data points is a prompt worth?
- [X] Article: NLP: Pre-trained Sentiment Analysis
- [X] Article: Transformer-based Encoder-Decoder Models
- [X] Article: Understanding BigBird's Block Sparse Attention"
- [ ] Article: Interactive Topic Modeling with BERTopic
- [ ] Article: Understanding Climate Change Domains through Topic Modeling
- [ ] Article: When Topic Modeling is Part of the Text Pre-processing
- [X] Article: Keyword Extraction with BERT
- [X] Article: Topic Modeling with BERT
- [ ] Article: Question Classification using Self-Attention Transformer — Part 1.1
- [ ] Article: Question Classification using Self-Attention Transformer — Part 1
- [ ] Article: Question Classification using Self-Attention Transformer — Part 2
- [ ] Article: Question Classification using Self-Attention Transformer — Part 3
- [ ] Article: Generalized Language Models
- [ ] Article: Learning Word Embedding
- [ ] Article: Reducing Toxicity in Language Models
- [ ] Article: The Transformer Family
- [X] Article: DialogRPT with Huggingface Transformers
- [X] Article: Hugging Face Reads - 01/2021 - Sparsity and Pruning
- [X] Article: Hugging Face Reads, Feb. 2021 - Long-range Transformers
- [X] Article: Porting fairseq wmt19 translation system to transformers
- [ ] Article: On word embeddings - Part 1
- [ ] Article: On word embeddings - Part 2: Approximating the Softmax
- [ ] Article: On word embeddings - Part 3: The secret ingredients of word2vec
- [ ] Article: Word embeddings in 2017: Trends and future direction
- [ ] Article: DaCy: New Fast and Efficient State-of-the-Art in Danish NLP!
- [ ] Article: State-of-the-Art Language Models in 2020
- [X] Article: ML and NLP Publications in 2020
- [X] Article: Zero-Shot Learning in Modern NLP
- [ ] Article: Introduction to recurrent neural networks
- [ ] Article: Understanding LSTM Networks
- [X] Article: Explaining RNNs without neural networks
- [X] Article: Understanding Convolutional Neural Networks for NLP
- [X] Article: Search (Pt 1) — A Gentle Introduction
- [X] Article: Search (Pt 2) — A Semantic Horse Race
- [X] Article: Search (Pt 3) — Elastic Transformers
- [ ] Article: Improved Few-Shot Text classification
- [ ] Article: Text classification from few training examples
- [X] Article: Multi-Label Text Classification
- [ ] Article: Semantic search using BERT embeddings
- [ ] Article: What Semantic Search Can do for You
- [ ] Article: How To Create Natural Language Semantic Search For Arbitrary Objects With Deep Learning
- [ ] Article: How to Implement a Beam Search Decoder for Natural Language Processing
- [X] Article: Creating a class-based TF-IDF with Scikit-Learn
- [X] Article: String Matching with BERT, TF-IDF, and more!
- [ ] Article: How to Use n-gram Models to Detect Format Errors in Datasets
- [ ] Article: The Unreasonable Effectiveness of Recurrent Neural Networks
- [ ] Article: A review of BERT based models
- [X] Article: Document clustering
- [ ] Article: Document search with fragment embeddings
- [ ] Doc: Huggingface Summary of the models
- [ ] Doc: Summary of the tokenizers
- [ ] Article: GPT-2 A nascent transfer learning method that could eliminate supervised learning in some NLP tasks
- [ ] Article: Evaluation Metrics for Language Modeling
- [ ] Article: Representation Learning and Retrieval
- [ ] Article: A survey of cross-lingual word embedding models
- [ ] Article: Unsupervised Cross-lingual Representation Learning
- [X] Article: Spelling Correction: How to make an accurate and fast corrector
- [X] Article: Speller100: Zero-shot spelling correction at scale for 100-plus languages
- [ ] Article: Understanding BERT’s Semantic Interpretations
- [ ] Article: Using NLP (BERT) to improve OCR accuracy
- [ ] Article: Brief review of word embedding families (2019)
- [ ] Article: Trends in input representation for state-of-art NLP models (2019)
- [ ] Article: An Overview of Multi-Task Learning in Deep Neural Networks
- [ ] Article: Multi-Task Learning Objectives for Natural Language Processing
- [X] Article: GPU Benchmarks for Fine-Tuning BERT
- [X] Article: Recent Advances in Language Model Fine-tuning
- [X] Article: The Current Best of Universal Word Embeddings and Sentence Embeddings
- [X] Article: Topic Modeling for Keyword Extraction
- [X] Article: Understanding ARPA and Language Models
- [ ] Article: Gaussian Mixture Models for Clustering
- [ ] Article: Explain NLP models with LIME & SHAP
- [X] Article: How to solve 90% of NLP problems: a step-by-step guide
- [ ] Article: Does GPT-2 Know Your Phone Number?
- [X] Article: How to Outperform GPT-3 by Combining Task Descriptions With Supervised Learning
- [ ] Article: LSTM Primer With Real Life Application( DeepMind Kidney Injury Prediction )*
- [ ] Article: T5 — XLNet — a clever language modeling solution
- [X] Article: Using an NLP Q&A System To Study Climate Hazards and Nature-Based Solutions
- [ ] Article: Hyperparameter Optimization for 🤗Transformers: A guide
- [ ] Article: How To Do Things With Words. And Counters
- [ ] Article: Automatically Summarize Trump’s State of the Union Address
- [ ] Article: Solving NER with BERT for any entity type with very little training data (compared to past approaches)
- [X] Article: 10 Things You Need to Know About BERT and the Transformer Architecture That Are Reshaping the AI Landscape
- [X] Article: Semantic Entailment
- [ ] Article: Shrinking fastText embeddings so that it fits Google Colab
- [X] Article: Fuzzy Matching/Fuzzy Logic Explained
- [ ] Article: Under the Hood of RNNs
- [X] Article: All Our N-gram are Belong to You
- [ ] Article: Perplexity Intuition (and its derivation)
- [ ] Article: Part of Speech Tagging with Hidden Markov Chain Models
- [X] Article: NLP Year In Review
- [X] Article: UNDERSTANDING WORD2VEC THROUGH CULTURAL DIMENSIONS
- [ ] Article: Exploring LSTMs
- [ ] Article: Aspect-Based Opinion Mining (NLP with Python)
- [ ] Article: pyLDAvis: Topic Modelling Exploration Tool That Every NLP Data Scientist Should Know
- [X] Article: ML and NLP Research Highlights of 2020
- [X] Article: Introducing spaCy
- [ ] Article: 3 subword algorithms help to improve your NLP model performance
- [ ] Article: Examining BERT’s raw embeddings
- [ ] Article: Making sense of LSTMs by example
- [X] Article: The Transformer Explained
- [X] Article: Understanding building blocks of ULMFIT
- [X] Article: Building a sentence embedding index with fastText and BM25
- [ ] Article: The Annotated GPT-2
- [X] Article: Key topics extraction and contextual sentiment of users reviews
- [X] Article: Google mT5 multilingual text-to-text transformer: A Brief Paper Analysis
- [ ] Article: Building RNNs is Fun with PyTorch and Google Colab
- [ ] Article: Faster and smaller quantized NLP with Hugging Face and ONNX Runtime
- [X] Article: Visualizing A Neural Machine Translation Model (Mechanics of Seq2seq Models With Attention)
- [X] Article: How I Used Deep Learning To Train A Chatbot To Talk Like Me (Sorta)
- [ ] Article: Generating Questions Using Transformers
- [X] Article: Feature-based Approach with BERT
- [ ] Article: Performers: The Kernel Trick, Random Fourier Features, and Attention
- [ ] Article: Text Similarities : Estimate the degree of similarity between two texts
- [ ] Article: NLP's ImageNet moment has arrived
- [ ] Article: Simple PyTorch Transformer Example with Greedy Decoding
- [ ] Article: Character level language model RNN
- [X] Article: How we used Universal Sentence Encoder and FAISS to make our search 10x smarter
- [X] Article: Adapting Text Augmentation to Industry problems
- [ ] Article: The Annotated Transformer
- [ ] Article: OpenAI's GPT-3 Language Model: A Technical Overview
- [X] Article: NLP for Supervised Learning - A Brief Survey
- [ ] Article: The 4 Biggest Open Problems in NLP
- [X] Article: How GPT3 Works
- [ ] Article: Why You Should Do NLP Beyond English
- [X] Article: Breaking the spell of the spelling check
- [X] Article: How to Write a Spelling Corrector
- [X] Article: Spellchecking by computer
- [X] Article: A Spellchecker Used to Be a Major Feat of Software Engineering
- [X] Article: 1000x Faster Spelling Correction algorithm (2012)
- [X] Article: The Pruning Radix Trie — a Radix Trie on steroids
- [X] Article: Text Data Cleanup - Dynamic Embedding Visualisation
- [ ] Article: Rotary Embeddings: A Relative Revolution
- [ ] Article: Using embeddings to help find similar restaurants in Search
- [ ] Article: Evolution of and experiments with feed ranking at Swiggy
- [ ] Article: Personalizing Swiggy POP Recommendations
- [ ] Article: Fan(s)tastic: Search for blazing-fast results
- [ ] Article: Find My Food: Semantic Embeddings for Food Search Using Siamese Networks
- [ ] Article: Learning To Rank Restaurants
- [ ] Article: Is Word Sense Disambiguation outdated?
- [ ] Article: Named-Entity evaluation metrics based on entity-level
- [ ] Article: Comparison Of Ngram Fuzzy Matching Approaches
- [ ] Article: String similarity — the basic know your algorithms guide!
- [ ] Article: Evolution of Word to Vector
- [X] Article: Unsupervised Auto-labeling of Websites
- [X] Article: A friendly introduction to Recurrent Neural Networks
- [X] Article: Introducing Translatotron: An End-to-End Speech-to-Speech Translation Model
- [ ] Book: Embeddings in Natural Language Processing
- [ ] Book: Linguistic Fundamentals for Natural Language Processing: 100 Essentials from Morphology and Syntax
- [ ] Coursera: Sequence Models
- [ ] Coursera: Natural Language Processing in TensorFlow
- [X] CMU: Low-resource NLP Bootcamp 2020
- [ ] CMU: Neural Nets for NLP 2021
- [X] CMU Neural Nets for NLP 2021 (1): Introduction
1:22:40 - [ ] CMU Neural Nets for NLP 2021 (2): Language Modeling, Efficiency/Training Tricks
0:58:24 - [ ] CMU Neural Nets for NLP 2021 (3): Building A Neural Network Toolkit for NLP, minnn
0:34:42 - [X] CMU Neural Nets for NLP 2021 (4): Efficiency Tricks for Neural Nets
0:43:28 - [ ] CMU Neural Nets for NLP 2021 (5): Recurrent Neural Networks
0:38:50 - [ ] CMU Neural Nets for NLP 2021 (6): Conditioned Generation
0:45:06 - [ ] CMU Neural Nets for NLP 2021 (7): Attention
0:38:23 - [ ] CMU Neural Nets for NLP 2021 (8): Distributional Semantics and Word Vectors
0:42:44 - [ ] CMU Neural Nets for NLP 2021 (9): Sentence and Contextual Word Representations
0:50:53 - [ ] CMU Neural Nets for NLP 2021 (11): Structured Prediction with Local Independence Assumptions
0:36:43 - [ ] CMU Neural Nets for NLP 2021 (10): Debugging Neural Nets (for NLP)
0:43:58 - [X] CMU Neural Nets for NLP 2021 (12): Model Interpretation
0:28:52 - [ ] CMU Neural Nets for NLP 2021 (13): Generating Trees and Graphs
0:41:05 - [ ] CMU Neural Nets for NLP 2021 (14): Margin-based and Reinforcement Learning for Structured Prediction
0:47:20 - [ ] CMU Neural Nets for NLP 2021 (15): Sequence-to-sequence Pre-training
0:27:22 - [ ] CMU Neural Nets for NLP 2021 (16): Machine Reading w/ Neural Nets
0:43:08 - [ ] CMU Neural Nets for NLP 2021 (17): Neural Nets + Knowledge Bases
0:44:19 - [ ] CMU Neural Nets for NLP 2021 (18): Advanced Search Algorithms
0:47:58 - [ ] CMU Neural Nets for NLP 2021 (19): Adversarial Methods
0:41:56 - [ ] CMU Neural Nets for NLP 2021 (20): Models w/ Latent Random Variables
0:41:06 - [ ] CMU Neural Nets for NLP 2021 (21): Multilingual Learning
0:33:10 - [ ] CMU Neural Nets for NLP 2021 (22): Bias in NLP
0:32:44 - [X] CMU Neural Nets for NLP 2021 (23): Document-level Models
0:40:04
- [X] CMU Neural Nets for NLP 2021 (1): Introduction
- [X] CMU Multilingual NLP 2020
- [ ] CMU Advanced NLP 2021
- [ ] CMU Advanced NLP 2021 (1): Introduction to NLP
1:08:39 - [ ] CMU Advanced NLP 2021 (2): Text Classification
1:16:56 - [ ] CMU Advanced NLP 2021 (3): Language Modeling and Neural Networks
1:16:37 - [ ] CMU Advanced NLP 2021 (4): Text Classification
1:14:19 - [ ] CMU Advanced NLP 2021 (5): Recurrent Neural Networks
1:13:43 - [ ] CMU Advanced NLP 2021 (6): Conditional Generation
1:17:56
- [ ] CMU Advanced NLP 2021 (1): Introduction to NLP
- [ ] CS685: Advanced Natural Language Processing
- [ ] UMass CS685 (Advanced NLP): Attention mechanisms
0:48:53 - [ ] UMass CS685 (Advanced NLP): Question answering
0:59:50 - [ ] UMass CS685 (Advanced NLP): Better BERTs
0:52:23 - [ ] UMass CS685 (Advanced NLP): Text generation decoding and evaluation
1:02:32 - [ ] UMass CS685 (Advanced NLP): Paraphrase generation
1:10:59 - [ ] UMass CS685 (Advanced NLP): Crowdsourced text data collection
0:58:31 - [ ] UMass CS685 (Advanced NLP): Model distillation and security threats
1:09:25 - [ ] UMass CS685 (Advanced NLP): Retrieval-augmented language models
0:52:13 - [ ] UMass CS685 (Advanced NLP): Implementing a Transformer
1:12:36 - [ ] UMass CS685 (Advanced NLP): vision + language
1:06:28 - [ ] UMass CS685 (Advanced NLP): exam review
1:24:36 - [ ] UMass CS685 (Advanced NLP): Intermediate fine-tuning
1:10:35 - [ ] UMass CS685 (Advanced NLP): ethics in NLP
0:56:57 - [ ] UMass CS685 (Advanced NLP): probe tasks
0:54:30 - [ ] UMass CS685 (Advanced NLP): semantic parsing
0:48:49 - [ ] UMass CS685 (Advanced NLP): commonsense reasoning (guest lecture by Lorraine Li)
0:58:53
- [ ] UMass CS685 (Advanced NLP): Attention mechanisms
- [X] Datacamp: Advanced NLP with spaCy
- [X] Datacamp: Building Chatbots in Python
- [X] Datacamp: Clustering Methods with SciPy
- [X] Datacamp: Feature Engineering for NLP in Python
- [X] Datacamp: Machine Translation in Python
- [X] Datacamp: Natural Language Processing Fundamentals in Python
- [X] Datacamp: Natural Language Generation in Python
- [X] Datacamp: RNN for Language Modeling
- [X] Datacamp: Regular Expressions in Python
- [X] Datacamp: Sentiment Analysis in Python
- [ ] Datacamp: Spoken Language Processing in Python
- [ ] Notebook: NNLM - Predict Next Word
- [ ] Notebook: Word2Vec
- [ ] Notebook: FastText Sentence Classification
- [ ] Notebook: TextCNN - Binary Sentiment Classification
- [ ] Notebook: TextRNN - Predict Next Step
- [ ] Notebook: TextLSTM - Autocomplete
- [ ] Notebook: Bi-LSTM - Predict Next Word in Long Sentence
- [ ] Notebook: SeqSeq - Change Word
- [ ] Notebook: Seq2Seq with Attention - Translate
- [ ] Notebook: Bi-LSTM with Attention - Binary Sentiment Classification
- [ ] Notebook: The Transformer - Translate
- [ ] Notebook: The Transformer - Greedy Decoder
- [ ] Notebook: BERT - NSP and MLM
- [X] Notebook: Logistic regression-Tf-Idf baseline
- [ ] RNN and LSTM
- [X] Spacy Tutorial
- [X] Stanford CS224U: Natural Language Understanding | Spring 2019
- [X] Stanford CS224N: Stanford CS224N: NLP with Deep Learning | Winter 2019
- [ ] Stanford: CS214 From Languages to Information
- [ ] 1 1 Regular Expressions 11 25
0:11:25 - [ ] 1 2 Regular Expression Substitutions
0:06:10 - [ ] 1 3 Words and Corpora
0:06:25 - [X] 1 4 Word Tokenization
0:08:21 - [ ] 1 5 Byte Pair Encoding
0:07:38 - [X] 1 6 Word Normalization
0:06:23 - [X] 2 1 Defining Minimum Edit Distance 7 04
0:07:05 - [X] 2 2 Computing Minimum Edit Distance 5 54
0:05:55 - [X] 2 3 Backtrace for Computing Alignments 5 55
0:05:56 - [X] 2 4 Weighted Minimum Edit Distance 2 47
0:02:48 - [X] 2 5 Minimum Edit Distance in Computational Biology 9 29
0:09:30 - [X] 3 1 Introduction to N grams 8 41
0:08:41 - [X] 3 2 Estimating N gram Probabilities 9 38
0:09:38 - [X] 3 3 Evaluation and Perplexity
0:12:37 - [X] 3 4 Generalization and Zeros 5 15
0:05:15 - [X] 3 5 Smoothing Add One 6 30
0:06:31 - [X] 3 6 Interpolation 10 25
0:10:25 - [X] 3 8 Kneser Ney Smoothing 8 59
0:08:59 - [ ] 5 1 What is Text Classification 8 12
0:08:12 - [ ] Naive Bayes Lecture 2 The Naive Bayes Classifier
0:12:24 - [ ] Naive Bayes 3 Learning in Naive Bayes
0:06:04 - [ ] Naive Bayes 4 Sentiment and Binary NB
0:08:14 - [ ] 4 5 More on Sentiment Classification
0:05:14 - [ ] 5 2 Naive Bayes Relationship to Language Modeling 4 35
0:04:36 - [ ] 5 7 Precision, Recall, and the F measure 16 16
0:16:17 - [ ] 5 8 Text Classification Evaluation 7 17
0:07:17 - [ ] Logistic Regression 1 Generative and Discriminative Classifiers
0:05:25 - [ ] Logistic Regression 2 Classification
0:07:48 - [ ] Logistic Regression 3 A Sentiment Example
0:05:09 - [ ] Logistic Regression 4 Cross Entropy Loss
0:07:59 - [ ] Logistic Regression 5 Stochastic Gradient Descent
0:09:46 - [ ] Logistic Regression 6 A worked example of gradient descent
0:05:10 - [ ] 7 1 Introduction to Information Retrieval 9 16
0:09:16 - [ ] 7 2 Term Document Incidence Matrices 8 59
0:08:59 - [ ] 7 3 The Inverted Index 10 42
0:10:43 - [ ] 7 4 Query Processing with the Inverted Index 6 43
0:06:44 - [ ] 7 5 The Boolean Retrieval Model 14 06
0:14:07 - [ ] 7 6 Phrase Queries and Positional Indexes 19 45
0:19:46 - [ ] 8 1 Introducing Ranked Retrieval 4 27
0:04:27 - [ ] 8 2 Scoring with the Jaccard Coefficient 5 06
0:05:07 - [ ] 8 3 Term Frequency Weighting 5 59
0:06:00 - [ ] 8 4 Inverse Document Frequency Weighting 10 16
0:10:17 - [ ] 8 5 TF IDF Weighting 3 42
0:03:42 - [ ] 8 6 The Vector Space Model 16 22
0:16:23 - [ ] 8 7 Calculating TF IDF Cosine Scores 12 47
0:12:48 - [ ] 8 8 Evaluating Search Engines 9 02
0:09:03 - [ ] Introduction to Named Entity Tagging
0:05:06 - [ ] Introduction to Part of Speech Tagging
0:09:03 - [ ] Vector 1 Word Meaning
0:09:09 - [ ] Vector 2 Vector Semantics
0:06:37 - [ ] Vector 3 Words and Vectors
0:05:16 - [ ] Vector 4 Cosine Similarity
0:04:23 - [ ] Vector 5 TF IDF
0:05:32 - [ ] Vector 6 Word2vec
0:07:39 - [ ] Vector 7 Learning in Word2vec
0:07:36 - [ ] Vector 8 Properties of Embeddings
0:06:08 - [ ] Neural Networks 1 Neural Units
0:05:41 - [ ] Neural Networks 2 XOR
0:07:32 - [ ] Neural Networks 3 Feedforward Neural Networks
0:08:55 - [ ] Neural Networks 4 Applying Feedforward Networks to NLP
0:07:15 - [ ] Neural Networks 5 Overview of Training
0:04:21 - [ ] Neural Networks 6 Computation Graphs and Backward Differentiation
0:10:31 - [ ] Dialog 1 Overview
0:03:11 - [ ] Dialogue 2 Human Conversation
0:10:31 - [ ] Dialogue 3 ELIZA
0:09:27 - [ ] Dialogue 4 Corpus Chatbots
0:09:35 - [ ] Dialogue 5 Frame Based Dialogue
0:07:41 - [ ] Dialogue 6 Dialogue State Architecture
0:08:58 - [ ] Dialogue 7 Dialogue State Architecture Policy and Generation
0:08:23 - [ ] Dialogue 8 Evaluation
0:04:38 - [ ] Dialogue 9 Design and Ethical Issues
0:03:29 - [ ] Recommender Systems 1 Introduction
0:06:02 - [ ] Recommender Systems 2 Content Based
0:05:50 - [ ] Recommender Systems 3 User User Collaborative Filtering
0:07:50 - [ ] Recommender Systems 4 Item Item Collaborative Filtering
0:06:52 - [ ] Recommender Systems 5 Simplified version for PA6
0:02:10 - [ ] 14 1 Anchor Text 3 39
0:03:40 - [ ] 14 2 PageRank Overview and Markov Chains 12 10
0:12:10 - [ ] 14 3 Computing PageRank 8 09
0:08:10 - [ ] Social Networks 1 Networks
0:06:58
- [ ] 1 1 Regular Expressions 11 25
- [X] TextBlob Tutorial Series
- [ ] Youtube: fast.ai Code-First Intro to Natural Language Processing
- [X] What is NLP? (NLP video 1)
0:22:42 - [X] Topic Modeling with SVD & NMF (NLP video 2)
1:06:39 - [X] Topic Modeling & SVD revisited (NLP video 3)
0:33:05 - [X] Sentiment Classification with Naive Bayes (NLP video 4)
0:58:20 - [ ] Sentiment Classification with Naive Bayes & Logistic Regression, contd. (NLP video 5)
0:51:29 - [ ] Derivation of Naive Bayes & Numerical Stability (NLP video 6)
0:23:56 - [ ] Revisiting Naive Bayes, and Regex (NLP video 7)
0:37:33 - [ ] Intro to Language Modeling (NLP video 8)
0:40:58 - [ ] Transfer learning (NLP video 9)
1:35:16 - [ ] ULMFit for non-English Languages (NLP Video 10)
1:49:22 - [ ] Understanding RNNs (NLP video 11)
0:33:16 - [ ] Seq2Seq Translation (NLP video 12)
0:59:42 - [ ] Word embeddings quantify 100 years of gender & ethnic stereotypes-- Nikhil Garg (NLP video 13)
0:47:17 - [ ] Text generation algorithms (NLP video 14)
0:25:39 - [ ] Implementing a GRU (NLP video 15)
0:23:13 - [ ] Algorithmic Bias (NLP video 16)
1:26:17 - [ ] Introduction to the Transformer (NLP video 17)
0:22:54 - [ ] The Transformer for language translation (NLP video 18)
0:55:17 - [X] What you need to know about Disinformation (NLP video 19)
0:51:21 - [ ] Article: Zero to Hero with fastai - Beginner
- [ ] Article: Zero to Hero with fastai - Intermediate
- [X] What is NLP? (NLP video 1)
- [ ] NLP Course | For You
- [ ] Word Embeddings
- [ ] Text Classification
- [ ] Language Modeling
- [ ] Seq2seq and Attention
- [X] Youtube: BERT Research Series
- [X] YouTube: Intro to NLP with Spacy
- [X] Talk: Practical NLP for the Real World
- [X] YouTube: Level 3 AI Assistant Conference 2020
- [X] Youtube: RASA Algorithm Whiteboard
- [X] Introducing The Algorithm Whiteboard
0:01:16 - [X] Rasa Algorithm Whiteboard - Diet Architecture 1: How it Works
0:23:27 - [X] Rasa Algorithm Whiteboard - Diet Architecture 2: Design Decisions
0:15:06 - [X] Rasa Algorithm Whiteboard - Diet Architecture 3: Benchmarking
0:22:34 - [X] Rasa Algorithm Whiteboard - Embeddings 1: Just Letters
0:13:48 - [X] Rasa Algorithm Whiteboard - Embeddings 2: CBOW and Skip Gram
0:19:24 - [X] Rasa Algorithm Whiteboard - Embeddings 3: GloVe
0:19:12 - [X] Rasa Algorithm Whiteboard - Embeddings 4: Whatlies
0:14:03 - [X] Rasa Algorithm Whiteboard - Attention 1: Self Attention
0:14:32 - [X] Rasa Algorithm Whiteboard - Attention 2: Keys, Values, Queries
0:12:26 - [X] Rasa Algorithm Whiteboard - Attention 3: Multi Head Attention
0:10:55 - [X] Rasa Algorithm Whiteboard: Attention 4 - Transformers
0:14:34 - [X] Rasa Algorithm Whiteboard - StarSpace
0:11:46 - [X] Rasa Algorithm Whiteboard - TED Policy
0:16:10 - [X] Rasa Algorithm Whiteboard - TED in Practice
0:14:54 - [X] Rasa Algorithm Whiteboard - Response Selection
0:12:07 - [X] Rasa Algorithm Whiteboard - Response Selection: Implementation
0:09:25 - [X] Rasa Algorithm Whiteboard - Countvectors
0:13:32 - [X] Rasa Algorithm Whiteboard - Subword Embeddings
0:11:58 - [X] Rasa Algorithm Whiteboard - Implementation of Subword Embeddings
0:10:01 - [X] Rasa Algorithm Whiteboard - BytePair Embeddings
0:12:44
- [X] Introducing The Algorithm Whiteboard
- [X] Youtube: A brief history of the Transformer architecture in NLP
- [X] Youtube: The Transformer neural network architecture explained. “Attention is all you need” (NLP)
- [X] Youtube: How does a Transformer architecture combine Vision and Language? ViLBERT - NLP meets Computer Vision
- [X] Youtube: Strategies for pre-training the BERT-based Transformer architecture – language (and vision)
- [X] Youtube: Ilya Sutskever - GPT-2
- [X] Youtube: NLP Masterclass | Modeling Fallacies in NLP
- [X] Youtube: What is GPT-3? Showcase, possibilities, and implications
- [X] Youtube: TextAttack: A Framework for Data Augmentation and Adversarial Training in NLP
- [X] Article: How the Embedding Layers in BERT Were Implemented
- [X] Youtube: Easy Data Augmentation for Text Classification
- [X] Youtube: Webinar: Special NLP Session with Hugging Face
- [X] Youtube: Spacy IRL 2019
- [X] Youtube: The Future of Natural Language Processing
- [X] Youtube: Sentiment Analysis: Key Milestones, Challenges and New Directions
- [X] Youtube: Simple and Efficient Deep Learning for Natural Language Processing, with Moshe Wasserblat, Intel AI
- [X] Youtube: Why not solve biological problems with a Transformer? BERTology meets Biology
- [X] Youtube: Self-attention step-by-step | How to get meaning from text
- [X] Youtube: Chat Bot with PyTorch
- [ ] Youtube: NLP with Friends Talks
- [X] NLP with Friends, Featured Friend: Tom McCoy
0:36:48 - [X] NLP with Friends, Featured Friend: Maarten Sap
0:36:11 - [ ] NLP with Friends, featured friend: Nitika Mathur
1:01:42 - [ ] NLP with Friends, Featured Friend: Sabrina J Mielke
1:01:28
- [X] NLP with Friends, Featured Friend: Tom McCoy
- [X] Youtube: Insincere Question Classification with PyTorch
- [ ] Crash Course: Linguistics
- [X] Crash Course Linguistics Preview
0:02:50 - [X] What is Linguistics?: Crash Course Linguistics #1
0:11:11
- [X] Crash Course Linguistics Preview
- [X] Youtube: Recent Advances in Language Pretraining and Generation
- [X] Youtube: Talks # 3: Lorenzo Ampil - Introduction to T5 for Sentiment Span Extraction
- [X] Youtube: Frontiers in ML: Learning from Limited Labeled Data: Challenges and Opportunities for NLP
- [X] Youtube: DeepLearning.ai NLP talk: Chris Manning
- [X] Youtube: DeepLearning.ai NLP talk: Oren Etzioni
- [X] Youtube: DeepLearning.ai NLP talk: Quoc Le
- [X] Youtube: What can MIR learn from transfer learning in NLP?
- [X] Youtube: The Narrated Transformer Language Model
- [X] Youtube: spaCy v3.0: Bringing State-of-the-art NLP from Prototype to Production
00:22:40 - [X] Youtube: Conversational AI with Transformers and Rule-Based Systems
1:53:24 - [X] Talk: High Performance Natural Language Processing
- [X] Talk: EmoTag1200: Understanding the Association between Emojis and Emotions
- [X] Youtube: Research Paper Walkthrough
- [X] Simple Unsupervised Keyphrase Extraction using Sentence Embeddings | Research Paper Walkthrough
0:21:23 - [X] Leveraging BERT for Extractive Text Summarization on Lectures | Research Paper Walkthrough
0:20:10 - [X] Data Augmentation Techniques for Text Classification in NLP | Research Paper Walkthrough
0:14:33 - [X] CRIM at SemEval-2018 Task 9: A Hybrid Approach to Hypernym Discovery | Research Paper Walkthrough
0:23:47 - [X] Data Augmentation using Pre-trained Transformer Model (BERT, GPT2, etc) | Research Paper Walkthrough
0:17:43 - [X] A Supervised Approach to Extractive Summarisation of Scientific Papers | Research Paper Walkthrough
0:19:01 - [X] BLEURT: Learning Robust Metrics for Text Generation | Research Paper Walkthrough
0:13:38 - [X] DeepWalk: Online Learning of Social Representations | ML with Graphs | Research Paper Walkthrough
0:17:44 - [X] LSBert: A Simple Framework for Lexical Simplification | Research Paper Walkthrough
0:20:27 - [X] SpanBERT: Improving Pre-training by Representing and Predicting Spans | Research Paper Walkthrough
0:14:21 - [X] Text Summarization of COVID-19 Medical Articles using BERT and GPT-2 | Research Paper Walkthrough
0:21:52 - [X] Extractive & Abstractive Summarization with Transformer Language Models | Research Paper Walkthrough
0:16:58 - [X] Unsupervised Multi-Document Summarization using Neural Document Model | Research Paper Walkthrough
0:15:11 - [X] SummPip: Multi-Document Summarization with Sentence Graph Compression | Research Paper Walkthrough
0:16:54 - [X] Combining BERT with Static Word Embedding for Categorizing Social Media | Research Paper Walkthrough
0:13:51 - [X] Reformulating Unsupervised Style Transfer as Paraphrase Generation | Research Paper Walkthrough
0:19:41 - [X] PEGASUS: Pre-training with Gap-Sentences for Abstractive Summarization | Research Paper Walkthrough
0:15:04 - [X] Evaluation of Text Generation: A Survey | Human-Centric Evaluations | Research Paper Walkthrough
0:15:54 - [X] TOD-BERT: Pre-trained Transformers for Task-Oriented Dialogue Systems (Research Paper Walkthrough)
0:15:25 - [X] TextRank: Bringing Order into Texts (Research Paper Walkthrough)
0:14:34 - [X] Node2Vec: Scalable Feature Learning for Networks | ML with Graphs (Research Paper Walkthrough)
0:14:33 - [X] HARP: Hierarchical Representation Learning for Network | ML with Graphs (Research Paper Walkthrough)
0:15:10 - [X] URL2Video: Automatic Video Creation From a Web Page | AI and Creativity (Research Paper Walkthrough)
0:15:21 - [X] On Generating Extended Summaries of Long Documents (Research Paper Walkthrough)
0:14:24 - [X] Nucleus Sampling: The Curious Case of Neural Text Degeneration (Research Paper Walkthrough)
0:12:48 - [X] T5: Exploring Limits of Transfer Learning with Text-to-Text Transformer (Research Paper Walkthrough)
0:12:47 - [X] DialoGPT: Generative Training for Conversational Response Generation (Research Paper Walkthrough)
0:13:17 - [X] Hierarchical Transformers for Long Document Classification (Research Paper Walkthrough)
0:12:46
- [X] Simple Unsupervised Keyphrase Extraction using Sentence Embeddings | Research Paper Walkthrough
- [X] NLP Summit 2020
- [ ] Youtube: Explainability for Natural Language Processing
- [X] Youtube: Gibberish Detector
- [X] Youtube: NLP Lecture 7 Constituency Parsing
- [X] NLP Lecture 7 - Overview of Constituency Parsing Lecture
0:01:50 - [X] NLP Lecture 7 - Introduction to Constituency Parsing
0:10:29 - [X] NLP Lecture 7(a) - Context Free Grammar
0:17:03 - [X] NLP Lecture 7(b) - Constituency Parsing
0:13:28 - [X] NLP Lecture 7(c) - Statistical Constituency Parsing
0:09:38 - [X] NLP Lecture 7(d) - Dependency Parsing
0:17:15
- [X] NLP Lecture 7 - Overview of Constituency Parsing Lecture
- [X] Youtube: LING 83 Teaching Video: Constituency Parsing
- [X] Youtube: SpaCy for Digital Humanities with Python Tutorials
- [ ] Youtube: Billion-scale Approximate Nearest Neighbor Search
- [X] Youtube: Data Science - Fuzzy Record Matching
- [X] Youtube: Minimum Edit Distance Dynamic Programming
- [X] Youtube: Cheuk Ting Ho - Fuzzy Matching Smart Way of Finding Similar Names Using Fuzzywuzzy
- [X] Youtube: What's in a Name? Fast Fuzzy String Matching - Seth Verrinder & Kyle Putnam - Midwest.io 2015
- [X] Youtube: Jiaqi Liu Fuzzy Search Algorithms How and When to Use Them PyCon 2017
- [X] Youtube: 1 + 1 = 1 or Record Deduplication with Python | Flávio Juvenal @ PyBay2018
- [X] Youtube: Mike Mull: The Art and Science of Data Matching
- [X] Youtube: Record linkage: Join for real life by Rhydwyn Mcguire
- [X] Youtube: Approximate nearest neighbors and vector models, introduction to Annoy
- [X] Youtube: Librosa Audio and Music Signal Analysis in Python | SciPy 2015 | Brian McFee
- [ ] Video: Recent Advances in LM Pre-training
- [X] Youtube: Deep Learning (for Audio) with Python
- [X] Youtube: Advanced Information Retrieval 2021 - TU Wien
- [X] AIR - 2021 Course Introduction
0:21:39 - [X] Crash Course IR - Fundamentals
0:46:31 - [ ] Crash Course IR - Evaluation
0:37:15 - [ ] Crash Course IR - Test Collections
0:51:12 - [ ] Word Representation Learning
0:42:02 - [ ] Sequence Modelling with CNNs and RNNs
0:55:04 - [ ] Transformer and BERT Pre-training
0:47:15 - [ ] Introduction to Neural Re-Ranking
0:59:20 - [ ] Transformer Contextualized Re-Ranking
0:49:06 - [ ] Domain Specific Applications
0:38:32 - [ ] Dense Retrieval ❤ Knowledge Distillation
0:59:28
- [X] AIR - 2021 Course Introduction
- [ ] Youtube: Introduction to Dense Text Representation
- [X] Introduction to Dense Text Representations - Part 1
0:12:56 - [X] Introduction to Dense Text Representations - Part 2
0:23:13 - [X] Introduction to Dense Text Representation - Part 3
0:38:07 - [ ] Training State-of-the-Art Sentence Embedding Models
0:43:43
- [X] Introduction to Dense Text Representations - Part 1
- [X] Youtube: Fine-tuning a large language model without your own supercomputer
- [X] Youtube: How to build a custom spell checker using python NLP
- [X] Youtube: Transformers 🤗 to Rule Them All? Under the Hood of the AI Recruiter Chatbot 🤖, with Keisuke Inoue
- [X] Youtube: Artificial Intelligence and Natural Language Processing in E-Commerce by Katherine Munro | smec
- [X] Youtube: Abhishek Thakur - Classifying Search Queries Without User Click Data
- [X] Youtube: Chatbots Revisted | by Abhishek Thakur | Kaggle Days Warsaw
- [X] Youtube: Abhishek Thakur - Is That a Duplicate Quora Question?
- [X] Youtube: Design Considerations for building ML-Powered Search Applications - Mark Moyou
- [X] Youtube: Analyze Customer Feedback in Minutes, Not Months
- [X] Youtube: NLP in Feedback Analysis - Yue Ning
- [X] Youtube: Productionizing an unsupervised machine learning model to understand customer feedback
- [X] Youtube: Extracting topics from reviews using NLP - Dr. Tal Perri
- [X] Youtube: Bringing innovation to online retail: automating customer service with NLP
- [X] Youtube: Transform customer service with machine learning (Google Cloud Next '17)
- [X] Youtube: Real life aspects of opinion sentiment analysis within customer reviews - Dr. Jonathan Yaniv
- [X] Youtube: Deep Learning Methods for Emotion Detection from Text - Dr. Liron Allerhand
- [X] Youtube: Learning How to Learn NLP : Developing Introductory Concepts Through Scaffolded Discoveries
- [X] Youtube: What are Transformer Neural Networks?
- [X] Youtube: Applied ML 2020 - 15 - Working with Text Data
1:27:08 - [X] Youtube: Applied ML 2020 - 16 - Topic models for text data
1:18:34 - [ ] Youtube: Applied ML 2020 - 17 - Word vectors and document embeddings
1:03:04 - [X] Youtube: A Briefish Introduction to Discourse Representation Theory
- [X] Youtube: HuggingFace Crash Course - Sentiment Analysis, Model Hub, Fine Tuning
- [X] Youtube: Huggingface Course Part 1
- [X] Youtube: Should we care about linguistics?
- [ ] Youtube: Transformers From Scratch
- [ ] How-to Use HuggingFace's Datasets - Transformers From Scratch #1
0:14:21 - [ ] Build a Custom Transformer Tokenizer - Transformers From Scratch #2
0:14:17 - [ ] Building MLM Training Input Pipeline - Transformers From Scratch #3
0:23:11 - [ ] Training and Testing an Italian BERT - Transformers From Scratch #4
0:30:38
- [ ] How-to Use HuggingFace's Datasets - Transformers From Scratch #1
- [X] In Search of Best Practices for NLP Projects | Ivan Bilan | PyData Pune Meetup | December 2020
0:50:00 - [X] Youtube: Generating and Understanding Natural Language with AI (Aidan Gomez, PhD)
0:52:12 - [X] Youtube: The giant leaps in language technology -- and who's left behind | Kalika Bali
- [ ] Youtube: ML for Audio Study Group
Be familiar with multi-modal machine learning
- [ ] CMU: MultiModal Machine Learning Fall 2020
- [ ] Lecture 1.1: Course Introduction
- [ ] Lecture 1.2: Multimodal applications and datasets
- [ ] Lecture 2.1: Basic concepts: neural networks
- [ ] Lecture 2.2: Basic concepts: network optimization
- [ ] Lecture 3.1: Visual unimodal representations
- [ ] Lecture 3.2: Language unimodal representations
- [ ] Lecture 4.1: Multimodal representation learning
- [ ] Lecture 4.2: Coordinated representations
- [ ] Lecture 5.1: Multimodal alignment
- [ ] Lecture 5.2: Alignment and representation
- [ ] Lecture 7.1: Alignment and translation
- [ ] Lecture 7.2: Probabilistic graphical models
- [ ] Lecture 8.1: Discriminative graphical models
- [ ] Lecture 8.2: Deep Generative Models
- [ ] Lecture 9.1: Reinforcement learning
- [ ] Lecture 9.2: Multimodal RL
- [ ] Lecture 10.1: Fusion and co-learning
- [ ] Lecture 10.2: New research directions
- [X] Youtube: W&B Paper Reading Group: MDETR with author Aishwarya Kamath
Be familiar with Recommendation Systems
- [ ] Google: Recommendation Systems
- [X] Pluralsight: Understanding Algorithms for Recommendation Systems
- [X] Youtube: Learning to Rank: From Theory to Production - Malvina Josephidou & Diego Ceccarelli, Bloomberg
- [X] Youtube: Learning "Learning to Rank"
- [X] Youtube: Learning to rank search results - Byron Voorbach & Jettro Coenradie [DevCon 2018]
Be able to implement computer vision models
- [ ] Article: Common architectures in convolutional neural networks
- [ ] Article: Convolutional neural networks
- [ ] Article: Densely Connected Convolutional Networks in Tensorflow
- [ ] Article: EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
- [ ] Article: Multimodal Neurons in Artificial Neural Networks
- [ ] Article: Understanding the receptive field of deep convolutional networks
- [ ] Article: Deep learning in medical imaging - 3D medical image segmentation with PyTorch
- [ ] Article: Intuitive Explanation of Skip Connections in Deep Learning
- [ ] Article: Localization and Object Detection with Deep Learning
- [ ] Article: Understanding coordinate systems and DICOM for deep learning medical image analysis
- [X] Article: The 9 Deep Learning Papers You Need To Know About (Understanding CNNs Part 3)
- [ ] Article: An overview of object detection: one-stage methods
- [ ] Article: An overview of semantic image segmentation
- [ ] Article: Evaluating image segmentation models
- [ ] Article: Semantic Segmentation in the era of Neural Networks
- [ ] Article: DenseNet Architecture Explained with PyTorch Implementation from TorchVision
- [ ] Article: Group Normalization
- [ ] Article: Squeeze and Excitation Networks Explained with PyTorch Implementation
- [ ] Article: What is Focal Loss and when should you use it?
- [ ] Article: Object Detection for Dummies Part 1: Gradient Vector, HOG, and SS
- [ ] Article: Object Detection for Dummies Part 2: CNN, DPM and Overfeat
- [ ] Article: Object Detection for Dummies Part 3: R-CNN Family
- [ ] Article: A Short Introduction to Generative Adversarial Networks
- [ ] Article: Semi-supervised Learning with GANs
- [ ] Article: Human Pose Estimation
- [X] Article: How to extract Key-Value pairs from Documents using deep learning
- [ ] Article: Building an image search service from scratch
- [ ] Article: Breaking Linear Classifiers on ImageNet
- [ ] Article: Essential Pil (Pillow) Image Tutorial (for Machine Learning People)
- [ ] Article: A Brief History of CNNs in Image Segmentation: From R-CNN to Mask R-CNN
- [ ] Article: YOLO - You only look once (Single shot detectors)
- [ ] Article: NonCompositional
- [ ] Article: Part 1: Deep Representations, a way towards neural style transfer
- [X] Article: Looking Inside The Blackbox — How To Trick A Neural Network
- [ ] Article: A gentle introduction to OCR
- [ ] Article: ECCV 2020: Some Highlights
- [X] AWS: Semantic Segmentation Explained
- [ ] Book: Deep Learning for Computer Vision with Python
- [ ] Book: Practical Python and OpenCV
- [ ] Coursera: Convolutional Neural Networks
- [ ] Datacamp: Image Processing in Python
- [X] Google: ML Practicum: Image Classification
- [ ] Stanford: CS231N Winter 2016
- [X] CS231n Winter 2016: Lecture 1: Introduction and Historical Context
1:19:08 - [X] CS231n Winter 2016: Lecture 2: Data-driven approach, kNN, Linear Classification 1
0:57:28 - [ ] CS231n Winter 2016: Lecture 3: Linear Classification 2, Optimization
1:11:23 - [ ] CS231n Winter 2016: Lecture 4: Backpropagation, Neural Networks 1
1:19:38 - [ ] CS231n Winter 2016: Lecture 5: Neural Networks Part 2
1:18:37 - [ ] CS231n Winter 2016: Lecture 6: Neural Networks Part 3 / Intro to ConvNets
1:09:35 - [ ] CS231n Winter 2016: Lecture 7: Convolutional Neural Networks
1:19:01 - [ ] CS231n Winter 2016: Lecture 8: Localization and Detection
1:04:57 - [ ] CS231n Winter 2016: Lecture 9: Visualization, Deep Dream, Neural Style, Adversarial Examples
1:18:20 - [ ] CS231n Winter 2016: Lecture 10: Recurrent Neural Networks, Image Captioning, LSTM
1:09:54 - [ ] CS231n Winter 2016: Lecture 11: ConvNets in practice
1:15:03 - [ ] CS231n Winter 2016: Lecture 12: Deep Learning libraries
1:21:06 - [ ] CS231n Winter 2016: Lecture 14: Videos and Unsupervised Learning
1:17:36 - [ ] CS231n Winter 2016: Lecture 13: Segmentation, soft attention, spatial transformers
1:10:59 - [X] CS231n Winter 2016: Lecture 15: Invited Talk by Jeff Dean
1:14:49
- [X] CS231n Winter 2016: Lecture 1: Introduction and Historical Context
- [ ] Youtube: Autoencoders - EXPLAINED
0:10:53 - [ ] Youtube: Building an Image Captioner with Neural Networks
0:12:54 - [ ] Youtube: Convolution Neural Networks - EXPLAINED
0:19:20 - [ ] Youtube: Depthwise Separable Convolution - A FASTER CONVOLUTION!
0:12:43 - [ ] Youtube: Generative Adversarial Networks - FUTURISTIC & FUN AI !
0:14:20 - [ ] Youtube: How Convolution Works
- [ ] Youtube: Mask Region based Convolution Neural Networks - EXPLAINED!
0:09:34 - [ ] Youtube: Sound play with Convolution Neural Networks
0:11:57 - [ ] Youtube: The Evolution of Convolution Neural Networks
0:24:02 - [X] Youtube: An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (Paper Explained)
- [X] Youtube: Deep Residual Learning for Image Recognition (Paper Explained)
- [ ] Youtube: Evolution of Face Generation | Evolution of GANs
0:12:23 - [ ] Youtube: ConvNets Scaled Efficiently
0:13:19 - [X] Youtube: Implementing ResNet from scratch
- [X] Youtube: DETR: End-to-End Object Detection with Transformers (Paper Explained)
- [ ] Youtube: Unpaired Image-Image Translation using CycleGANs
0:16:22 - [ ] Youtube: Formula Indexing, Search, and Entry in the MathSeer Project
0:52:07 - [ ] Youtube: What's new in computer vision | July Queensland AI
1:21:33
Be familiar with graphs and network data
Be familiar with timeseries and forecasting
- [ ] Datacamp: Machine Learning for Finance in Python
- [X] Datacamp: Introduction to Time Series Analysis in Python
- [ ] Datacamp: Machine Learning for Time Series Data in Python
- [ ] Datacamp: Intro to Portfolio Risk Management in Python
- [ ] Datacamp: Financial Forecasting in Python
- [X] Datacamp: Predicting CTR with Machine Learning in Python
- [X] Datacamp: Intro to Financial Concepts using Python
- [X] Datacamp: Fraud Detection in Python
- [ ] Datacamp: Forecasting Using ARIMA Models in Python
- [ ] Datacamp: Introduction to Portfolio Analysis in Python
- [ ] Datacamp: Credit Risk Modeling in Python
- [ ] Datacamp: Machine Learning for Marketing in Python
- [ ] Udacity: Machine Learning for Trading
- [ ] Udacity: Time Series Forecasting
- [ ] Youtube: Applied ML 2020 - 21 - Time Series and Forecasting
1:12:36
Be familiar with basics of Reinforcement Learning
Be able to perform hyperparameter tuning
- [X] A recipe for training neural networks
- [ ] Article: Hyperparameter tuning for machine learning models
- [ ] Article: Hacker's Guide to Hyperparameter Tuning
- [ ] Article: Environment and Distribution Shift
- [ ] Coursera: Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization
- [X] Datacamp: Model Validation in Python
- [X] Datacamp: Hyperparameter Tuning in Python
- [ ] Google: Testing and Debugging
- [ ] Troubleshooting Deep Neural Networks
- [ ] Youtube: How do GPUs speed up Neural Network training?
0:08:19 - [ ] Youtube: Why use GPU with Neural Networks?
0:09:43 - [X] Youtube: Auto-Tuning Hyperparameters with Optuna and PyTorch
Be familiar with literature on model interpretability
- [ ] Article: Model Interpretation Frameworks
- [X] Article: How to leverage Explainable Machine Learning
- [X] Article: TracIn — A Simple Method to Estimate Training Data Influence
- [ ] Article: How to Explain the Prediction of a Machine Learning Model?
- [ ] NeurIPS 2020: Tutorial on Explaining ML Predictions: State-of-the-art, Challenges, and Opportunities
- [ ] Youtube: Jay Alammar - Take A Look Inside Language Models With Ecco
- [X] Youtube: How do we check if a neural network has learned a specific phenomenon?
- [X] Youtube: What is Adversarial Machine Learning and what to do about it? – Adversarial example compilation
- [ ] Youtube: SE4AI: Explainability and Interpretability (Part 1)
1:17:45 - [ ] Youtube: SE4AI: Explainability and Interpretability (Part 2)
1:21:50
Be able to optimize models for inference
- [X] Article: A Survey of Methods for Model Compression in NLP
- [X] Article: Why you should convert your NLP pipelines to ONNX
- [ ] Article: Neural Network Pruning
- [ ] Article: FasterAI
- [X] Article: Is the future of Neural Networks Sparse? An Introduction (1/N)
- [X] Article: Sparse Neural Networks (2/N): Understanding GPU Performance.
- [X] Article: Block Sparse Matrices for Smaller and Faster Language Models
- [ ] Article: Plunging Into Model Pruning in Deep Learning
- [ ] Article: How to accelerate and compress neural networks with quantization
- [ ] Article: Scaling up BERT-like model Inference on modern CPU - Part 1
Be able to build interactive UI for models
- [X] Article: Build and Deploy a Dashboard with Streamlit
- [X] Article: New layout options for Streamlit
Be able to perform A/B testing
- [X] Article: Multi-Armed Bandit (MAB) – A/B Testing Sans Regret
- [X] Article: When to Run Bandit Tests Instead of A/B/n Tests
- [X] Article: A/B Testing Machine Learning Models (Deployment Series: Guide 08)
- [ ] Datacamp: Customer Analytics & A/B Testing in Python
- [ ] Udacity: A/B Testing
- [ ] Udacity: A/B Testing for Business Analysts
- [ ] Youtube: Hypothesis testing with Applications in Data Science
0:10:33
Be able to apply proper software engineering process
- [X] Pluralsight: Security Awareness: Basic Concepts and Terminology
- [X] Pluralsight: Secure Software Development
- [X] Pluralsight: Clean Architecture: Patterns, Practices, and Principles
- [ ] Thoughtbot: Software Development Process
- [ ] Thoughtbot: Refactoring
- [ ] Udacity: Design of Computer Programs
- [ ] Udacity: Product Design
- [ ] Udacity: Rapid Prototyping
- [ ] Udacity: Software Development Process
Be able to communicate and collaborate well
- [ ] Google: Technical Writing
- [X] Book: Emotional Intelligence
- [X] Book: How to Win Friends & Influence People
- [X] Book: Influence: The Psychology of Persuasion
- [X] Book: Leaders Eat Last: Why Some Teams Pull Together and Others Don't
- [X] Book: Multipliers: How the Best Leaders Make Everyone Smarter
- [X] Book: Soft Skills: The software developer's life manual
- [X] Book: The New One Minute Manager
- [X] Calmcode: Remote Work
- [X] Youtube: Building a psychologically safe workplace | Amy Edmondson | TEDxHGSE
- [X] Youtube: A short introduction to LaTeX and Overleaf
Be familiar with the hiring pipeline
- [X] Article: What You Need to Know Before Considering a PhD
- [X] Article: Advice to aspiring data scientists: start a blog
- [ ] Article: Systems Design Interview Guide
- [ ] Article: A Guide to Cold Emailing
- [ ] Article: Uber Data Science Interview
- [ ] Article: Google Data Science Interview
- [ ] Article: Facebook Data Science Interview
- [ ] Article: Acing the Data Science Interview — Part 1
- [ ] Article: Acing the Data Science Interview — Part 2
- [ ] Article: Amazon Data Science Interview
- [ ] Article: Microsoft Data Science Interview
- [ ] Article: Microsoft Data Science Interview
- [ ] Article: Apple AI Interview Questions — Acing the AI Interview
- [ ] Article: Salesforce Data Science Interview— Acing the AI Interview
- [ ] Article: LinkedIn Data Science Interview
- [ ] Article: Netflix Data Science Interview
- [ ] Article: Walmart Data Science Interview
- [ ] Article: Twitter Data Science Interview
- [ ] Article: Ebay Data Science Interview
- [ ] Article: Zillow Data Science Interview
- [ ] Article: Intel Data Science Interview
- [ ] Article: Adobe Data Science Interview
- [ ] Article: Tesla Data Science Interview
- [ ] Article: IBM Data Science Interview
- [ ] Article: Top Data Science Interview Questions & Answers
- [ ] Article: Top Data Science Interview Questions & Answers — Part 2
- [ ] Article: Top Data Science Interview Questions & Answers — Part 3
- [ ] Article: Data Science Interview Questions and Solutions — Linear and Logistic regression
- [ ] Article: Capital One Data Science Interview
- [ ] Article: Paypal Data Science Interview
- [ ] Article: Airbnb Data Science Interview
- [ ] Article: Spotify Data Science Interview Questions
- [ ] Article: Yelp Data Science Interview
- [ ] Article: Twitch Data Science Interview
- [ ] Article: Oracle Data Science Interview
- [ ] Article: Citrix Data Science Interview
- [ ] Article: Quora Data Science Interview
- [ ] Article: Splunk Data Science Interview
- [ ] Article: JP Morgan Data Science Interview
- [ ] Article: Stripe Data Science Interview Questions
- [ ] Article: Box Data Science Interview
- [ ] Article: Instacart Data Science Interview Questions
- [ ] Article: Square Data Science Interview
- [ ] Article: AmEx Data Science Interview
- [ ] Article: Citibank Data Science Interview
- [ ] Article: Sprint Data Science Interview
- [ ] Article: Dropbox Data Science Interview
- [ ] Article: Booking.com Data Science Interview
- [ ] Article: Lyft Data Science Interview
- [ ] Article: Expedia Data Science Interview
- [ ] Article: Shopify Data Science Interview Questions
- [ ] Article: Goldman Sachs Data Science Interviews
- [ ] Article: Workday Data Science Interviews
- [ ] Article: Acing Data Science Interviews
- [ ] Article: Analysis of Data Science Interview Questions
- [ ] Article: Visa Data Science Interviews
- [ ] Article: Data Science Quiz— Part 1
- [ ] Book: Machine Learning Interviews
- [X] Datacamp: Preparing for Statistics Interview Questions in Python
- [X] Datacamp: Practicing Machine Learning Interview Questions in Python
- [X] Datacamp: Kaggle Competition
- [X] Udacity: Optimize your GitHub
- [X] Udacity: Strengthen Your LinkedIn Network & Brand
- [X] Udacity: Data Science Interview Prep
- [X] Udacity: Full-Stack Interview Prep
- [ ] Udacity: Refresh Your Resume
- [ ] Udacity: Craft Your Cover Letter
- [ ] Udacity: Technical Interview
- [X] Youtube: Panel Discussion: Do I need a PhD to work in ML? (Full Stack Deep Learning - Spring 2021)
- [X] Youtube: The Importance of Writing in a Tech Career - Eugene Yan
- [X] Youtube: How to prepare for Machine Learning interviews- Part 1 | Applied AI Course
- [X] Youtube: How to prepare for Machine Learning interviews- Part 2 | Applied AI Course
- [X] Youtube: Guest Lecture - Chip Huyen - Machine Learning Interviews - Full Stack Deep Learning
- [X] Youtube: Tutorial: Technical Blogging for Python Programmers
Broaden Perspective
- [X] Book: Atomic Habits
- [X] Book: Deep Work
- [X] Book: Outliers: The Story of Success
- [X] Book: Platform: The Art and Science of Personal Branding
- [X] Book: Rich Dad Poor Dad
- [X] Book: The Power of Broke
- [X] Book: The 10X Rule
- [X] Book: The Millionaire Fastlane
- [X] Book: The Subtle Art of Not Giving a F**k
- [X] Calmcode: Pomodoro
- [X] Youtube: Why specializing early doesn't always mean career success | David Epstein
- [X] Youtube: Chamath Palihapitiya, Founder and CEO Social Capital, on Money as an Instrument of Change
- [X] Youtube: How to Build a Personal Monopoly with Jack Butcher
- [X] Youtube: How to Use Twitter
- [X] Youtube: A. Jesse Jiryu Davis - Write an Excellent Programming Blog - PyCon 2016
- [X] Youtube: The Great ML Stagnation (Mark Saroufim and Dr. Mathew Salvaris)
- [X] Youtube: What Machine Learning Can Teach Us About Life: 7 Lessons - Talk Python Live Stream
- [X] Youtube: Ross Tuck - Things I Believe Now That I'm Old at Laracon EU 2014
- [X] Youtube: "How to teach programming (and other things)?" by Felienne Hermans
0:52:08 - [X] Youtube: The secrets of learning a new language | Lýdia Machová