Amog Kamsetty
Ah yes, that's right: it can also be UUIDs.

> GPU identifiers are given as integer indices or as UUID strings.

From https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#env-vars. In this case, then we should always...
@pcmoritz thoughts about the proposed change to always return strings?
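A minimal sketch of what "always return strings" could look like when parsing `CUDA_VISIBLE_DEVICES` (the helper name here is hypothetical, not Ray's actual implementation):

```python
import os
from typing import List


def get_cuda_visible_devices() -> List[str]:
    """Parse CUDA_VISIBLE_DEVICES, always returning string IDs.

    Entries may be integer indices ("0,1") or UUID strings
    ("GPU-8932f937-..."), so normalizing everything to strings
    avoids type surprises downstream. Hypothetical helper, for
    illustration only.
    """
    raw = os.environ.get("CUDA_VISIBLE_DEVICES")
    if raw is None:
        return []
    return [entry.strip() for entry in raw.split(",") if entry.strip()]
```

Returning strings unconditionally means callers no longer need to branch on whether the environment variable held indices or UUIDs.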
@dumpmemory what are the advantages of deepspeed vs. fsdp?
Is this ready to be merged in?
Perhaps try out compress gpt: https://github.com/yasyf/compress-gpt It seems to be built for exactly this use case.
Hey @reciprocated, @LouisCastricato, @ayulockin -- circling back on this thread. As @ayulockin mentions, you can use `tune.with_resources` to allocate multiple GPUs per trial, but the challenge is that you need a...
Some torchvision transforms work on a batched input. Should we allow users to also specify a separate batched transform for better performance? See our batch prediction benchmarks for an example.
API could be something like this:

```python
single_transform = transforms.ToTensor()
batch_transform = transforms.Compose([CenterCrop(...), Normalize(...)])
preprocessor = TorchVisionPreprocessor(
    transform=single_transform, batch_transform=batch_transform
)
```

A single preprocessor that can accept both options; the `batch_transform` arg is `Optional`.
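A pure-Python sketch of how the proposed dispatch could behave; the class and method names below are illustrative stand-ins, not Ray's actual implementation:

```python
from typing import Callable, List, Optional


class TorchVisionPreprocessor:
    """Illustrative sketch: apply batch_transform over the whole
    batch when given, otherwise map transform over each element."""

    def __init__(
        self,
        transform: Optional[Callable] = None,
        batch_transform: Optional[Callable] = None,
    ):
        if transform is None and batch_transform is None:
            raise ValueError("Provide transform and/or batch_transform")
        self.transform = transform
        self.batch_transform = batch_transform

    def transform_batch(self, batch: List):
        if self.batch_transform is not None:
            # One call over the whole batch -- this is where the
            # batched-transform performance win would come from.
            return self.batch_transform(batch)
        # Fall back to per-element application.
        return [self.transform(item) for item in batch]
```

The key design question is exactly this dispatch: prefer the batched path when the user supplies one, and fall back to element-wise application otherwise.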
Are we still planning to remove `iter_batches` from the base Dataset API and instead have `ds.to_iterator().iter_batches()`? Basically avoiding duplicate APIs in both `Dataset` and `DatasetIterator`
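To make the de-duplication concrete, here is a toy sketch of that split, with batch iteration living only on the iterator type (names and structure are illustrative, not the actual Ray Data classes):

```python
from typing import Iterator, List


class DatasetIterator:
    """Toy iterator type: the only place iter_batches lives."""

    def __init__(self, rows: List):
        self._rows = rows

    def iter_batches(self, batch_size: int) -> Iterator[List]:
        batch = []
        for row in self._rows:
            batch.append(row)
            if len(batch) == batch_size:
                yield batch
                batch = []
        if batch:
            yield batch


class Dataset:
    """Toy dataset: exposes to_iterator() instead of duplicating
    iter_batches on itself."""

    def __init__(self, rows: List):
        self._rows = rows

    def to_iterator(self) -> DatasetIterator:
        return DatasetIterator(self._rows)
```

Usage would then be `ds.to_iterator().iter_batches(batch_size)`, keeping a single canonical home for the iteration API.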
This was previously an intentional change, as `ray.wait()` time is not interpretable to the user. From the user's perspective, how are they supposed to interpret it?