Tom Baeyens
Updated SQL metric expression docs (column and dataset level): https://deploy-preview-768--jovial-piroshki-65ff4d.netlify.app/soda/data-contracts-checks#sql-metric-expression
Updated SQL metric query docs (column and dataset level): https://deploy-preview-768--jovial-piroshki-65ff4d.netlify.app/soda/data-contracts-checks#sql-metric-query
Both of these check types require a `metric`...
The problem with the current identity generation is that dataset-level checks will often need an identity_suffix. We need to improve the default behavior so that in fewer circumstances the...
Goal: The goal is to create a better correlation mechanism between checks in the files and checks in Soda Cloud. We want to impose the least amount of burden on the...
More analysis notes: Solution 2 would also work best on the Soda Cloud backend. Based on the correlation properties, the contracts lib should create an `identity` property in the generated SodaCL...
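To make the idea of deriving an `identity` from correlation properties concrete, here is a minimal sketch. It is not the contracts library's implementation: the function name `check_identity`, the choice of properties (dataset, column, check type, optional suffix), the separator, and the hashing scheme are all assumptions for illustration only.

```python
import hashlib


def check_identity(dataset, column, check_type, identity_suffix=None):
    """Hypothetical helper: derive a short, stable identity for a check.

    The set of correlation properties used here is an assumption,
    not the list the contracts library actually uses.
    """
    # A dataset-level check has no column, so an empty string stands in for it.
    parts = [dataset, column or "", check_type, identity_suffix or ""]
    joined = "/".join(parts)
    # Hash the joined properties so the identity stays short and opaque.
    return hashlib.sha1(joined.encode("utf-8")).hexdigest()[:8]


# Two dataset-level checks of the same type collide unless a suffix is given.
first = check_identity("customers", None, "sql_metric_query")
second = check_identity("customers", None, "sql_metric_query", "by_country")
print(first != second)
```

In this sketch the suffix is only needed when two checks share every other property, which is the situation the comments above describe for dataset-level checks.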
Duplicate of #2108
See this section in the code https://github.com/sodadata/soda-core/blob/807866557c46e678137042ab5dbf0e4e106e0225/soda/contracts/soda/contracts/contract.py#L59 ``` # TODO explain filter_expression_sql, default filter and named filters # filter name must part of the identity of the metrics # -...
@Antoninj I think the solution should be to change the condition
```
if self.scan_yml.is_metric_enabled(Metric.FREQUENT_VALUES, column_name) \
        and (scan_column.is_number or scan_column.is_column_numeric_text_format):
```
to
```
if self.scan_yml.is_metric_enabled(Metric.FREQUENT_VALUES, column_name) \
        and scan_column.has_numeric_values:
```
...
Frequent values: how often a certain value occurs. All values are ordered by occurrence (aka frequency). The values with the highest frequencies are at the top of this list....
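The ordering described above can be sketched in a few lines of Python. This illustrates the concept only and is not how soda-sql computes the metric; the function name `frequent_values`, the `limit` parameter, and the choice to skip `None` values are assumptions.

```python
from collections import Counter


def frequent_values(values, limit=5):
    """Return up to `limit` (value, frequency) pairs, most frequent first."""
    # Skipping missing values is an assumption made for this sketch.
    counts = Counter(v for v in values if v is not None)
    # most_common sorts by descending count; ties keep first-seen order.
    return counts.most_common(limit)


sample = ["a", "b", "a", None, "c", "a", "b"]
print(frequent_values(sample, limit=2))  # [('a', 3), ('b', 2)]
```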
> Extreme values currently holds the column.profile.mins and column.profile.maxs, which translates to the Metric.MINS and Metric.MAXS in soda-sql.

Yes. Looks good to me.

> If I understand you correctly @tombaeyens...
> When I look at NNIP's Security Master client_b_id column I can see that the Extreme Values is enabled (which means mins and maxs metrics was calculated). The result of...