Roly Perera
See [notebook](https://github.com/alan-turing-institute/ptype-dmkd/blob/develop/notebooks/classify-data-as-anomalous.ipynb). Summary: Part 1 (False Negatives): - created a toy example - read dataset again and look at first few rows - instantiate ptype and fit schema - let...
Should we consider making missing/anomalous values column-specific rather than a global list? This could work the way pandas' `read_csv` handles it, e.g. `keep_default_na=False, na_values={'species': ['']}`.
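For reference, here is a minimal sketch of the pandas behaviour mentioned above. It only illustrates `read_csv` and is not a proposal for the ptype API; the toy data and the `code` column are made up for illustration.

```python
import io

import pandas as pd

# Hypothetical toy data: the empty string marks a missing value in `species`,
# while the string "NA" in `code` is a legitimate value that must be kept.
csv_text = "species,code\nsetosa,NA\n,US\n"

df = pd.read_csv(
    io.StringIO(csv_text),
    keep_default_na=False,        # disable the global default list of NA strings
    na_values={"species": [""]},  # treat "" as missing in `species` only
)

# Count the missing values pandas detected in each column.
missing_per_column = df.isna().sum().to_dict()
print(missing_per_column)
```

Because the default NA list is switched off, `"NA"` in `code` survives as an ordinary string, and only the empty `species` cell is read as missing.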
As discussed, this seems to either be a bug in the training code, or a consequence of poorly-chosen training data. For the training tests to be useful, they have to...
See [notebook](https://github.com/alan-turing-institute/ptype-dmkd/blob/develop/notebooks/merge-missing-data-encodings.ipynb). Summary: - read dataset using `utils.read_data` - examine some values in `LRE Ages 3-5 - Full Incl #` column - plot the frequencies of the unique values in...
The current system for naming types is overly complex: each type has an internal name (e.g. `"integer"`) and an associated PFSM. We then separately have a map (twice – see...
See [notebook](https://github.com/alan-turing-institute/ptype-dmkd/blob/develop/notebooks/demo_categorical_values.ipynb). I will add a summary for this. So far, we have the following use cases: - [ ] handling anomalous values misclassified as categorical values (e.g.,...
Local wrapper for `publish.sh` which gets the passwords from a keyring (or `.pypirc` file).
Every mouse event triggers a re-run of the appropriate analysis (e.g. linked inputs), which causes the d3.js chart to be rebuilt from scratch. This in turn causes `mouseenter` to be...
With #980 there is a fair amount of “flicker” as the data view grows and shrinks. It might be less intrusive to have the data shown in a separate column...
Now that the UI is responsive enough, we should switch to enabling selection based on `mouseenter`; the lack of this is a blocker for some important interaction features (see below). This may require...