openml-python icon indicating copy to clipboard operation
openml-python copied to clipboard

feature: upload additional files in run object

Open uddhavphatak opened this issue 1 month ago • 1 comments

Metadata

  • Reference Issue: Fixes #1332
  • New Tests Added: No
  • Documentation Updated: No
  • Change Log Entry: feature: upload additional files in run object

Details

  • What does this PR implement/fix? Explain your changes.

This PR adds first-class support for attaching arbitrary additional files (e.g., ONNX models, binary representations, logs, custom metadata) to an OpenMLRun object and having them uploaded to the OpenML server when the run is published.

The PR introduces:

A new internal storage container:

self._additional_files: dict[str, tuple[str, bytes]]

A public method:

run.add_file("filename.ext", file_content)

Updated _get_file_elements() so that these files are included in the upload payload sent via _perform_api_call().

This matches the API design discussed in issue #1332 and the extension use-case described in openml-tensorflow@259f356.

  • Why is this change necessary? What is the problem it solves?

OpenML extensions (e.g., openml-tensorflow, ONNX flow exporters, model provenance modules) need a standard mechanism to attach arbitrary files to a run.

Until now:

  • Only predictions and trace files were supported.

-There was no generic user-facing API to upload additional artifacts.

-Extensions had to patch _get_file_elements() or depend on private methods.

This PR introduces a clean, extensible, documented mechanism for extensions and users to attach files to a run without modifying low-level upload code.

uddhavphatak avatar Nov 15 '25 11:11 uddhavphatak

Hi, This is my first PR to this repo, should I write test cases as well to support this enhancement addition?

uddhavphatak avatar Nov 15 '25 11:11 uddhavphatak