Theodore Vasiloudis
*Issue #, if available:* Fixes https://github.com/awslabs/graphstorm/issues/1262 *Description of changes:* * Add status badges * Align quick start examples with main documentation, simplify/condense instructions * Add blog post links * Remove...
GraphStorm has a few blog posts on the AWS Blog explaining use cases. We should link to those from the main project README.
Because DistDGL, and by extension GraphStorm, assumes a shared filesystem to function properly, our SageMaker implementations need to implement various downloads and uploads to "fake" the...
Currently, when a user sets a specific number of top-k models to be retained, the model files are removed but the learnable embeddings of every epoch are maintained. The expected...
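To make the retention issue above concrete, here is a minimal sketch of selecting which epochs fall outside the top-k and listing both the model files and the learnable embeddings for removal. It assumes that the expected behavior is for embeddings to be pruned together with the model files, and that a higher validation score is better. The function names and the `epoch-N/model` and `epoch-N/emb` layout are hypothetical and are not GraphStorm's actual API or on-disk format.

```python
from typing import Dict, List


def epochs_to_prune(val_scores: Dict[int, float], top_k: int) -> List[int]:
    """Return the epochs that fall outside the top-k by validation score.

    Assumption: a higher score is better.
    """
    ranked = sorted(val_scores, key=lambda epoch: val_scores[epoch], reverse=True)
    return sorted(ranked[top_k:])


def artifacts_to_remove(epochs: List[int], save_root: str) -> List[str]:
    """List model AND embedding paths for the pruned epochs.

    The directory layout used here is hypothetical.
    """
    paths = []
    for epoch in epochs:
        paths.append(f"{save_root}/epoch-{epoch}/model")
        paths.append(f"{save_root}/epoch-{epoch}/emb")
    return paths


scores = {0: 0.61, 1: 0.70, 2: 0.66, 3: 0.72}
pruned = epochs_to_prune(scores, top_k=2)
to_remove = artifacts_to_remove(pruned, "/tmp/ckpt")
```

The sketch only computes the paths and deletes nothing, so the removal step can be reviewed separately.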
Currently we read files in GSProcessing by directly using the path provided by the user in the config in a `spark.read.parquet/csv(filepath)` call. Spark doesn't support wildcards when used like this,...
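One possible workaround for the wildcard limitation described above is to expand the pattern into an explicit file list before calling the reader. The sketch below shows only the expansion step in plain Python. `expand_wildcard` is a hypothetical helper, not part of GSProcessing, and the file listing is assumed to come from elsewhere, for example a filesystem or object-store listing.

```python
import fnmatch
from typing import List


def expand_wildcard(pattern: str, available_files: List[str]) -> List[str]:
    """Return the files matching a shell-style wildcard pattern, sorted.

    Note: with fnmatch, `*` also matches path separators.
    """
    return sorted(f for f in available_files if fnmatch.fnmatchcase(f, pattern))


# Hypothetical listing of files under the user-provided prefix.
listing = [
    "data/nodes/part-1.parquet",
    "data/nodes/part-0.parquet",
    "data/nodes/_SUCCESS",
]
matched = expand_wildcard("data/nodes/*.parquet", listing)
```

The resulting explicit list could then be passed to a reader call that accepts multiple paths, rather than passing the raw pattern.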
Currently early stop has two strategies that only trigger if a validation metric _decreases_ with two criteria: > There are two strategies: 1) **consecutive_increase**, early stop is triggered if the...
When users set `--do-repartition True` for a GSProcessing job, there's a chance that job will fail. Within GSP we don't fail the entire job when that happens, in order to...
SageMaker local execution allows users to configure the Docker containers using a local file under `$HOME/.sagemaker/config.yaml`. See https://aws.amazon.com/blogs/machine-learning/configure-and-use-defaults-for-amazon-sagemaker-resources-with-the-sagemaker-python-sdk/ for details. An example file can be: ```yaml local: local_code: true #...
Because we include a `graphstorm.sagemaker` module, we can get false positives in pylint such as ``` ************* Module python.graphstorm.sagemaker.sagemaker_partition [2024-06-17T22:04:35.164Z] python/graphstorm/sagemaker/sagemaker_partition.py:33:0: C0411: third party import "from joblib import Parallel, delayed" should...