Sabri Eyuboglu
Sabri Eyuboglu
Is it possible to create an ICD combining multivariate distributions and univariate distributions? Specifically, I'd like to fit a mixture model to some d + 1 dimensional data where the...
Cool task – thanks for putting together this resource. Many of the images in the dataset are near duplicates of one another – presumably because the KITTI images were taken...
A call to copy.deepcopy on a datapanel corrupts _block_index of the columns: ``` dp = mk.DataPanel({ "a": pd.Series([0,1,2,3]), "b": pd.Series([0,1,2,3]), "c": pd.Series([0,1,2,3]), }) dp.consolidate() print(dp["a"]._block_index) import copy dp = copy.deepcopy(dp)...
There are some examples in audioset who's start time and end time are outside of the length of the video. For example, ``` balanced_train_segments/YTID=kKf9OprN9nw_st=400.0_et=410.wav ``` When creating the Audioset DataPanel...
If a single DataPanel contains a chain of LambdaColumns, like so ``` dp["a_b"] = LambdaColumn(dp["a"], fn) dp["a_b_c"] = LambdaColumn(dp["a_b"], fn_2) ``` then indexing the DataPanel with `dp[0]` will perform the...
Meerkat imposes a `ListColumn` index on all `DataPanel`. In many cases, this is the slowest column in the dp and it bottlenecks performance, since all the other columns are based...
I’m envisioning is something in between a `map` and a `LambdaColumn` where the computation happens lazily but is cached once it’s computed. Right now, it’s either you do it all...
When we write `LambdaColumn` we also write the column or DataPanel underlying it. When writing a DataPanel with a `LambdaColumn` dependent on other columns in the `DataPanel` this can lead...
Add a `rename` to `DataPanel`, like https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.rename.html, except only for columns
Can probably use the RG identifier class for this