Julian Fong
Julian Fong
When I specify a `transformer_backbone` to something other than the google flan t5-large, i get these errors: ``` size mismatch for encoder.block.11.layer.1.DenseReluDense.wo.weight: copying a param with shape torch.Size([1024, 2816]) from...
Please do not merge, this pr is used to test functionality on tests for `PeftForecaster` relates to #7086
Implements the `PeftForecaster` other details will be added as progress is made #### Reference Issues/PRs Referencing #7077, #6580 and #6968 #### Does your contribution introduce a new dependency? If yes,...
@benHeid This is the draft pr for the `PeftTuner` that we discussed synchronously on 10/09/2024 You can always make a fork to play around with it if you wish :)...
DO NOT MERGE! Trying to fix macos errors in #6570 by setting threads = 1
This issue is being opened to track possible improvements or changes that can be made to help users debug tests when implementing estimators. As `check_estimator` is the primary tool to...
This PR aims to add support for AdapterPlus Github: https://github.com/visinf/adapter_plus Paper: https://arxiv.org/pdf/2406.06820 Integration of AdapterPlus into the `adapters` library will involve adding new parameters/options to the `BnConfig` Checklist of things...
Given that we specify a bottleneck config with layer normalization before the adapter bottle neck, and we specify the initial weights of the adapter to be of type `mam_adapter`, it...
Completes #440 and is a polars extension of #437 Adds polars version of dummy proba regressor - also to help test e2e input flow of polars tables In this pull...
Implement the `DummyProbaRegressor` but with complete end to end support in `skpro`. Some current limitations: `fit` inside `DummyProbaRegressor` uses `skpro.distributions` which only supports pandas dataframes - needs a workaround `predict_proba`...