dora Configure remote working directory

In this PR:

In the case of multiple daemons:
- the dataflow.yml

nodes:
  - id: rust-node
    _unstable_deploy:
      machine: A
      local: true
    custom:
      build: cargo build -p multiple-daemons-example-node
      source: ../../target/debug/multiple-daemons-example-node
      inputs:
        tick: dora/timer/millis/10
      outputs:
        - random
  - id: runtime-node
    _unstable_deploy:
      machine: B
      local: false
      working_dir: /home/ubuntu/dora/examples/multiple-daemons
    operators:
      - id: rust-operator
        build: cargo build -p multiple-daemons-example-operator
        shared-library: ../../target/debug/multiple_daemons_example_operator
        inputs:
          tick: dora/timer/millis/100
          random: rust-node/random
        outputs:
          - status
  - id: rust-sink
    _unstable_deploy:
      machine: A
      local: true
    custom:
      build: cargo build -p multiple-daemons-example-sink
      source: ../../target/debug/multiple-daemons-example-sink
      inputs:
        message: runtime-node/rust-operator/status

add two items for every node. local (whether they are the same as cli) and working_dir (the working_dir of the daemon for this dataflow.
- the local(default true) and working_dir(no default value) are all option. So if the dataflow is not distributed, you can write dataflow.yml just like now.
when you use absolute path
- if you specify the working_dir, we will use this working_dir
- if you not specify, we will change the working_dir to the home_directory(such as in linux,/home/miyamo). I do this because if you don't specify the working_dir but use absolute path, the dataflow can still run. .
when you use relative path
- if the node are local, we can use the dataflow.yml working_dir from cli, just like now.
- if the node is not local, we must specify working_dir for it(I also check this in check_dataflow), otherwise dora will throw an error.

follow #538 #534

Jun 10 '24 14:06 Gege-Wang

@phil-opp since the dataflow check is needed in cli and coordinator, I prefer to skip all path exist check in multiple daemons. the path exist check should be done by per-daemon, instead of cli and coordinator(maybe?). I changed the path(in ubuntu case) and tried this PR, the CI in my workflow and EC2 is good, but it seems there are some No left space error？

Jun 11 '24 01:06 Gege-Wang

Thanks for the PR! I'm not sure whether it's a good idea to implicitly change the working directory based on the number of machines. But we could add a config option to set a working directory for each machine. Then relative paths on those machines could be allowed again.

the path exist check should be done by per-daemon, instead of cli and coordinator(maybe?).

This seems like a good approach too. The CLI could send a CheckPaths message to the coordinator on dora check, which forwards it to the correct daemon for each node. The daemon could then check whether the path exists and report back to the coordinator, which then reports back to the CLI. For dora start this could be implemented as part of the existing Start message. What do you think about this idea @haixuanTao?

Jun 11 '24 09:06 phil-opp

@phil-opp since the dataflow check is needed in cli and coordinator, I prefer to skip all path exist check in multiple daemons.

the path exist check should be done by per-daemon, instead of cli and coordinator(maybe?).

I changed the path(in ubuntu case) and tried this PR, the CI in my workflow and EC2 is good, but it seems there are some No left space error？

The no left space error is probably independent of your PR.

Jun 11 '24 10:06 haixuanTao

Thanks for the PR! I'm not sure whether it's a good idea to implicitly change the working directory based on the number of machines. But we could add a config option to set a working directory for each machine. Then relative paths on those machines could be allowed again.

the path exist check should be done by per-daemon, instead of cli and coordinator(maybe?).

This seems like a good approach too. The CLI could send a CheckPaths message to the coordinator on dora check, which forwards it to the correct daemon for each node. The daemon could then check whether the path exists and report back to the coordinator, which then reports back to the CLI. For dora start this could be implemented as part of the existing Start message. What do you think about this idea @haixuanTao?

Sounds good to me ☺️

Jun 11 '24 10:06 haixuanTao

Would prefer to have a separate PR for working directory as we wanted to have abs path only for this PR.

Jun 11 '24 10:06 haixuanTao

I'm not sure whether it's a good idea to implicitly change the working directory based on the number of machines.

I have considered this question. since the check path and working_dir is relevant, in this pr, for multiple daemons in one dataflow, we must all use abs path, the working_dir seems like make no sense , except generator log(maybe?).

But we could add a config option to set a working directory for each machine.

yes, I have though about this, making working_dir configurable for every daemon, the implementation would conflict with the current implementation(the default /tmp).

Jun 11 '24 10:06 Gege-Wang

Would prefer to have a separate PR for working directory as we wanted to have abs path only for this PR.

The abs path only is implemented in #538.

Jun 11 '24 11:06 phil-opp

I left some issues about absolute path in #535 and why this PR is conflict with #538.

Jun 13 '24 08:06 Gege-Wang

Closed in favor of #658

Oct 07 '24 12:10 haixuanTao