aztk
aztk copied to clipboard
AZTK powered by Azure Batch: On-demand, Dockerized, Spark Jobs on Azure
`aztk spark cluster list-nodes --id {cluster-id}` With `--quiet` or `-q` flag that will only list the id. And `--master` or `-m` to only list the master node.
It looks like there is an issue with telegraph on reboot of node: ``` Configuration file '/etc/telegraf/telegraf.conf' ==> File on system created by you or by a script. ==> File...
The HEALTHCHECK [instruction](https://docs.docker.com/engine/reference/builder/#healthcheck) is a neat way of checking that an image is running as expected. However, it doesn't have built in support for restarting a container if it is...
In long-running jobs, I often see low priority nodes pre-empted, then return later, as expected. Usually, they come back up fine as a fresh node, but it's not uncommon to...
Currently the cluster run return value is a mess, especially in the SDK: The SDK returns a tuple of `(node_id, output)` where output can be a string representing the stdout...
We're using `azure spark cluster submit ...` to run automated jobs from a VSTS release pipeline. Doing so means driver logs are streamed back to VSTS, which can be helpful...
Supervisord is a process manager that will help ensure proper process launching of key services. It will also be a single location to monitor processes health and status giving a...
I would like to change the subnet for docker0 interfaces as it overlaps with my subnets in other VNETs. Is there a way to do that somehow when creating a...