jay

Results 35 issues of jay

## What type of PR is this: - [ ] BugFix - [x] Feature - [ ] Enhancement - [ ] Refactor - [ ] UT - [ ] Doc...

## Why I'm doing: Fixes #47367 ## What I'm doing: You can learn more about the background from pr's https://github.com/StarRocks/starrocks/pull/38466. This PR is mainly used to solve the problem that...

this pr will fix when spark connector using filter push down, be can't recognize slot name ## Why I'm doing: ## What I'm doing: Fixes #issue ## What type of...

## Why I'm doing: ## What I'm doing: Fixes #issue ## What type of PR is this: - [x] BugFix - [ ] Feature - [ ] Enhancement - [...

### What changes were proposed in this pull request? ### Why are the changes needed? ### Does this PR introduce _any_ user-facing change? ### How was this patch tested?

## Why are these changes needed? Using np.split to split a list in NumPy is not as performant as not using split_list, and the performance difference can be significant. ```python...

triage
data

https://github.com/lancedb/lance/issues/3228

enhancement
python

like this issue https://github.com/mosaicml/streaming/issues/832 Lance can perform random reads very well, and the primary key can bring good shuffle ability. However, streaming can provide a better shuffle algorithm for training....

Relevant information: https://github.com/ray-project/ray/pull/49070/files, https://github.com/ray-project/ray/issues/49063. I submitted a pull request in the Ray community regarding Ray's distributed read-write operations add_columns, delete_rows, and compaction. However, the Ray community believes that at this...

### What is the feature? A customizable sampler can obtain the current epoch. ### Any other context? _No response_