JGSweets

Results 30 issues of JGSweets

**Please provide the issue you face regarding the documentation** The unstructured_profilers example needs a diff section that mimics the one in structured_profilers: https://github.com/capitalone/DataProfiler/blob/main/examples/structured_profilers.ipynb

Documentation
Help Wanted
good_first_issue

**Please provide the issue you face regarding the documentation** Need typo fix `appliied` -> `applied` in `Differences in Data` section https://github.com/capitalone/DataProfiler/blob/main/examples/structured_profilers.ipynb

Documentation
Help Wanted
good_first_issue

**Is your feature request related to a problem? Please describe.** Within https://github.com/capitalone/DataProfiler/blob/main/dataprofiler/profilers/numerical_column_stats.py#L350 we need to add PSI (population stability index): https://medium.com/model-monitoring-psi/population-stability-index-psi-ab133b0a5d42 The calculation should live in a helper function that is called within the...

New Feature
good_first_issue
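
As a rough illustration of the PSI helper requested above, here is a minimal sketch that works on pre-binned counts. The function name, the `eps` floor for empty bins, and the choice to pass counts rather than raw data are all assumptions made for this example; they are not taken from DataProfiler.

```python
import math


def population_stability_index(expected_counts, actual_counts, eps=1e-4):
    """Hypothetical helper: PSI between two histograms sharing the same bins.

    PSI = sum over bins of (actual% - expected%) * ln(actual% / expected%).
    `eps` is an assumed floor that keeps empty bins from producing log(0).
    """
    if len(expected_counts) != len(actual_counts):
        raise ValueError("both histograms must use the same bins")

    expected_total = sum(expected_counts)
    actual_total = sum(actual_counts)
    if expected_total == 0 or actual_total == 0:
        raise ValueError("histograms must not be empty")

    psi = 0.0
    for expected, actual in zip(expected_counts, actual_counts):
        # Convert counts to proportions, flooring at eps.
        p = max(expected / expected_total, eps)
        q = max(actual / actual_total, eps)
        psi += (q - p) * math.log(q / p)
    return psi
```

Identical histograms give a PSI of exactly 0, and the value grows as the two distributions drift apart.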

**Is your feature request related to a problem? Please describe.** Currently, `boto3` is installed as a default package in the DataProfiler. I suggest adding it as an optional package such...

New Feature
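
One common way to make a dependency such as `boto3` optional is to import it lazily and fail with an install hint only when it is actually needed. The sketch below shows that general pattern; the helper name `require_optional` and the extras name in the message are hypothetical and do not describe how DataProfiler is packaged.

```python
import importlib


def require_optional(module_name, extra):
    """Hypothetical helper: import an optional dependency on first use.

    `extra` is an assumed extras name, used only to build the error hint.
    """
    try:
        return importlib.import_module(module_name)
    except ImportError as exc:
        raise ImportError(
            f"'{module_name}' is required for this feature; "
            f"install it with the optional '{extra}' extra"
        ) from exc
```

Code paths that need S3 access could call `require_optional("boto3", "aws")` at the point of use, so a base install never imports `boto3`.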

All calculations should have a `get`, `update`, and `merge`, where `get` computes the calculation from raw data, `merge` takes two existing calculations and merges them, and `update` takes in new...

Documentation
Low Priority
contribution_day
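
The `get` / `update` / `merge` convention described above can be sketched with a running mean. The class `MeanCalc` is hypothetical, and because the issue text is truncated, the behavior of `update` (folding a new batch of raw data into an existing calculation) is an assumption.

```python
from dataclasses import dataclass


@dataclass
class MeanCalc:
    """Hypothetical calculation that tracks enough state to be mergeable."""

    count: int = 0
    total: float = 0.0

    @classmethod
    def get(cls, data):
        # Compute the calculation from raw data.
        values = list(data)
        return cls(count=len(values), total=float(sum(values)))

    def update(self, new_data):
        # Assumed semantics: fold a new batch of raw data into this calc.
        batch = MeanCalc.get(new_data)
        self.count += batch.count
        self.total += batch.total
        return self

    def merge(self, other):
        # Combine two existing calculations without touching raw data.
        return MeanCalc(self.count + other.count, self.total + other.total)

    @property
    def mean(self):
        return self.total / self.count if self.count else float("nan")
```

Storing the count and total rather than the mean itself is what makes `merge` exact: two means cannot be combined without knowing how many values each one summarizes.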

Currently, the tests pop the times dict instead of validating that the times are properly generated, which could be done by mocking the timeit functionality. Location: https://github.com/capitalone/DataProfiler/blob/56e34f837bb0667d69392fd06caf2145724452f4/dataprofiler/tests/profilers/test_categorical_column_profile.py We could utilize this similar functionality:...

Help Wanted
Medium Priority
Refactor
good_first_issue
contribution_day
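
To illustrate the idea of validating timings instead of discarding them, here is a minimal sketch that patches `time.time` with a deterministic clock. `TinyProfile` is a hypothetical stand-in, not DataProfiler's profiler, and it assumes the timing code reads the clock through `time.time()`.

```python
import time
from collections import defaultdict
from unittest import mock


class TinyProfile:
    """Hypothetical profiler that records how long each step takes."""

    def __init__(self):
        self.times = defaultdict(float)
        self.total = 0

    def update(self, values):
        start = time.time()
        self.total += sum(values)
        self.times["sum"] += time.time() - start


def make_fake_clock():
    # Each call returns the next whole second: 0.0, 1.0, 2.0, ...
    ticks = iter(range(1000))
    return lambda: float(next(ticks))


def run_mocked_update():
    profile = TinyProfile()
    # Patch the clock so the recorded duration is fully deterministic.
    with mock.patch("time.time", side_effect=make_fake_clock()):
        profile.update([1, 2, 3])
    return profile
```

With the fake clock, the two reads inside `update` are one tick apart, so a test can assert on the exact content of `times` rather than popping it.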

Currently, when pushing an update to a service, if the resource requirements overlap, the new version does not reuse resources from the old version. It would be...

serve

**General Information:** - OS: `linux/x86_64` - Python version: `3.10.14` - Library version: `DataProfiler==0.10.8` **Describe the bug:** On line: https://github.com/capitalone/DataProfiler/blob/f8b3e5dbd4b76f0ecc291911ace9e8e21cf1ecb1/dataprofiler/labelers/labeler_utils.py#L360 I receive the error: `TypeError: Metric.add_weight() got multiple values for argument...

Bug

This PR updates the schema config to allow specification of IAM roles for resources based on the skypilot naming conventions. Address: https://github.com/skypilot-org/skypilot/issues/3487 In `~/.sky/config.yaml`: * When setting for the controller...

Prior to this PR, ECS containers could not launch skypilot jobs because the type is reported as `container-role` rather than `iam-role` when evaluating `aws configure list`. This PR: *...