NeMo-Curator
Deprecate `max_text_bytes_per_part`
cuDF now supports long strings, so we can deprecate the `max_text_bytes_per_part` parameter.
Related:
- https://github.com/NVIDIA/NeMo-Curator/pull/77
- https://github.com/NVIDIA/NeMo-Curator/issues/233
- https://github.com/NVIDIA/NeMo-Curator/pull/314
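For illustration only, one common way to deprecate a keyword argument like this is to keep accepting it and emit a `DeprecationWarning` when a caller passes it. The sketch below is not NeMo-Curator code: the function `read_text_partitions`, its signature, and the choice to ignore the value are all assumptions made for the example. Only the parameter name comes from this issue.

```python
import warnings

# Sentinel so we can tell "not passed" apart from an explicit None.
_UNSET = object()


def read_text_partitions(paths, max_text_bytes_per_part=_UNSET):
    """Hypothetical reader used only to illustrate the deprecation pattern.

    Assumption: once the parameter is deprecated, its value is ignored.
    """
    if max_text_bytes_per_part is not _UNSET:
        warnings.warn(
            "max_text_bytes_per_part is deprecated and has no effect; "
            "it will be removed in a future release.",
            DeprecationWarning,
            stacklevel=2,  # point the warning at the caller's line
        )
    # Placeholder body: a real reader would load the files here.
    return list(paths)
```

With this approach, existing callers that still pass the parameter keep working and see a warning, while callers that omit it see nothing.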
Will work on this after https://github.com/NVIDIA/NeMo-Curator/pull/316 is resolved.
Closing as no longer relevant after the Ray refactor.