Adam Pocock
Adam Pocock
Extend the CSVLoader class to read from different datasources/targets and different kinds of formats
The `CSVLoader` is designed to be a very simple and quick way of getting a numerical csv with a response column up off the disk and into Tribuo. The file...
Extend the CSVLoader class to read from different datasources/targets and different kinds of formats
I'm finishing off a tutorial on `RowProcessor` which uses `CSVDataSource` and `JsonDataSource` to load more complex columnar data from csv and json files respectively.
Extend the CSVLoader class to read from different datasources/targets and different kinds of formats
We already have something that can transparently figure out if it's a GZipped file elsewhere in OLCUT, which will return the appropriate input stream implementation. We could probably extend that...
Extend the CSVLoader class to read from different datasources/targets and different kinds of formats
So concretely there would be: - optional loading of gzip or zip compressed files through the data sources - loading files over the web (most libraries that do this provide...
Unfortunately I think the current Tribuo APIs don't have a good way of doing this. `RowProcessor` operates on a single row at a time, it never has the view of...
I've had a look through the default query set, and it flags a few things which aren't issues (like our use of hash functions for features). We also run static...
We're currently looking at Tribuo's serialization mechanisms, and after that we're going to look at exporting to ONNX format (as it seems odd to import ONNX models when we can't...
To follow up on this we've started landing ONNX export support in `main` (initially for Tribuo's linear models - https://github.com/oracle/tribuo/pull/154) and will expand that support across multiple model types for...
If you want to use a regex to generate the field processors at loading time then you're supplying the `Mapping` argument to the wrong constructor. You need to use this...
I think K-Medoids would be a useful algorithm to add into Tribuo, though we don't have plans to add it ourselves at the moment. Contributions to add it are welcome....