gavel icon indicating copy to clipboard operation
gavel copied to clipboard

Code for "Heterogenity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020

Results 10 gavel issues
Sort by recently updated
recently updated
newest added

Fixes the `python setup.py egg_info did not run successfully` error while running `pip install -r scheduler/requirements.txt`. Replaces `sklearn` with `scikit-learn` in `requirements.txt`. `sklearn` is deprecated (see here: https://pypi.org/project/sklearn/)

Hi, Can I know from where the listed datasets of [artifact_evaluation.trace](https://github.com/stanford-futuredata/gavel/blob/c4fa40097339e9608ab71fd64c8bdea96a5e4a4c/scheduler/traces/physical_cluster/artifact_evaluation.trace) are downloaded? It would save me the effort of debugging the data-processing part. For example, I downloaded Monet2Photo from...

Hi, Is the AMI 'gavel' still available and how can I find it?

I wanted to run the test ```scheduler_tests.py```. I believe, for a given trace, this test will give me the schedule in a file ```/tmp/simple.output```. The traces used seems to be...

Hi, Are you going to distribute the code of SJT policy? I cannot find this from the repository. Thanks.

The lease variables `duration` and `max_duration` are too ambiguous - rename these

The `Scheduler` and `Profiler` classes currently share a lot of code - we can factor this out into a common superclass (e.g. `SchedulerMechanism`)

Bumps [torch](https://github.com/pytorch/pytorch) from 1.4.0 to 2.2.0. Release notes Sourced from torch's releases. PyTorch 2.2: FlashAttention-v2, AOTInductor PyTorch 2.2 Release Notes Highlights Backwards Incompatible Changes Deprecations New Features Improvements Bug fixes...

dependencies