type_infer
type_infer copied to clipboard
Type inference for Machine Learning pipelines
### Short description of current behavior I am using a postgres database with a money column that has several different values. But mindsdb (or lightwood) infers it as categorial input....
Column `L1` in [this dataset](https://app.surgehq.ai/datasets/financial-transactions-intent-and-expense-category) is inferred as `short_text` but should probably be reported as categorical instead.
The current implementation of `type_infer` is not suitable to be used in distributed compute environments (i.e. non-scalable); currently, `type_infer` can only be executed in a single node and needs to...
Right now, documentation is in a very preliminar and rough state. Any help towards making it easier to understand the codebase and the purpose of the package will be of...
Important logic, we should unittest it with real-world examples and generated data
The current rule-based approach is failing to recognize a tag-filled column when it contains a large number of unique tags. Additionally, it should be robust to additional separators or formatting,...