data-engineering-ecosystem
data-engineering-ecosystem copied to clipboard
Repo to migrate old wiki to, esp for devs and code examples
The spark intro contains the following invalid command: ``` local_machine:~$ scp -r -i ubuntu@:~/price_data ``` It should actually be something like: ```scp -r -i ubuntu@:~/price_data```
Currently in line 26 of `kafka_producer.py` the tutorial uses `self.producer.send_messages()`. For the latest version of `kafka-python` this should be `self.producer.send()`.
The instructions to run example flink stream at: 1. [https://github.com/InsightDataScience/data-engineering-ecosystem/wiki/flink#running-locally-using-the-command-line](https://github.com/InsightDataScience/data-engineering-ecosystem/wiki/flink#running-locally-using-the-command-line) 1. [https://github.com/InsightDataScience/data-engineering-ecosystem/wiki/flink#running-on-a-cluster](https://github.com/InsightDataScience/data-engineering-ecosystem/wiki/flink#running-on-a-cluster) Class name should be `streamExample` instead of `flinkStreamExample`.