Zhuo Peng
Zhuo Peng
what is the type of your date/timestamp feature? I don't think the default slicer will be able to slice by ranges but you can implement your own slicer. A slicer...
btw, we are looking at allowing using SQL statements to do slicing which may be able to support your use case. However there's no timeline yet.
sorry, which DATETIME type did you mean? I don't think TFDV supports such types (only integral, floating and string/bytes). Our feature value slicer is no exception than other potential custom...
We alrady have a stats generator (tensorflow_data_validation/statistics/generators/cross_feature_stats_generator.py). You can try enabling it by specifying it in StatsOptions.generators But currently Facets does not visualize the results. We could attach the cross...
What do you mean by "visualizeable formatted" data? The schema and stats are protocol buffer [1] objects. They implemented `__str__` so if you `print()` them, you'll get a Protobuf Text...
Thanks for the clarification. We noted it in our internal bug tracker. What you suggested makes sense to me. But I'll check w/ the Kubeflow team to understand what their...
one caveat to that solution is that your original columns must not contain null (nil, None, etc).
I think there may be an easy patch to your existing solution so it handles Nulls correctly, but I'm not familiar with pandas APIs. Basically, we want to translate something...
@tanguycdls : if you have something workable, do you mind making a contribution?
@tanguycdls > actually List Array does not have a mask parameter in the from_arrays function? are you running w/ a nightly version It's true that ListArray.from_arrays doesn't accept a mask...