jay
jay
## What type of PR is this: - [ ] BugFix - [x] Feature - [ ] Enhancement - [ ] Refactor - [ ] UT - [ ] Doc...
## Why I'm doing: Fixes #47367 ## What I'm doing: You can learn more about the background from pr's https://github.com/StarRocks/starrocks/pull/38466. This PR is mainly used to solve the problem that...
this pr will fix when spark connector using filter push down, be can't recognize slot name ## Why I'm doing: ## What I'm doing: Fixes #issue ## What type of...
## Why I'm doing: ## What I'm doing: Fixes #issue ## What type of PR is this: - [x] BugFix - [ ] Feature - [ ] Enhancement - [...
### What changes were proposed in this pull request? ### Why are the changes needed? ### Does this PR introduce _any_ user-facing change? ### How was this patch tested?
## Why are these changes needed? Using np.split to split a list in NumPy is not as performant as not using split_list, and the performance difference can be significant. ```python...
https://github.com/lancedb/lance/issues/3228
like this issue https://github.com/mosaicml/streaming/issues/832 Lance can perform random reads very well, and the primary key can bring good shuffle ability. However, streaming can provide a better shuffle algorithm for training....
Relevant information: https://github.com/ray-project/ray/pull/49070/files, https://github.com/ray-project/ray/issues/49063. I submitted a pull request in the Ray community regarding Ray's distributed read-write operations add_columns, delete_rows, and compaction. However, the Ray community believes that at this...
### What is the feature? A customizable sampler can obtain the current epoch. ### Any other context? _No response_