Daniel van Strien
> Does it make sense to combine Part 1 and 2 of the lessons into a single Kaggle notebook? Happy to combine both parts into a single Kaggle notebook. > One concern I...
> @davanstrien are your Zenodo datasets linked in both parts of the lesson tutorials? Just want to make sure everything is synced up. The datasets are hosted on Kaggle so...
> CC: @davanstrien I should have some time to look at this on Friday :)
@albertvillanova I've tried this with a few different XML datasets. One issue I've run into is getting a `KeyError` when the attributes of a field differ from the first parsed...
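The comment above is cut off, so the exact cause of the `KeyError` is not known here. As a rough sketch only, assuming the error comes from records whose XML attributes differ from those of the first element, one way to sidestep it is to collect the union of attribute names first and fill missing ones with `None`. The sample XML, the `field` tag, and the `parse_fields` helper are all hypothetical and are not taken from the thread or from any library's loader.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample: the second <field> lacks the "lang" attribute.
XML = """<records>
  <field id="1" lang="en">first</field>
  <field id="2">second</field>
</records>"""


def parse_fields(xml_text):
    """Return one dict per <field>, all sharing the same set of keys."""
    root = ET.fromstring(xml_text)
    elements = root.findall("field")
    # Union of attribute names across every element, not just the first one.
    keys = sorted({key for el in elements for key in el.attrib})
    return [
        # Missing attributes become None instead of raising KeyError later.
        {**{key: el.attrib.get(key) for key in keys}, "text": el.text}
        for el in elements
    ]


rows = parse_fields(XML)
```

Each row then has the same keys, which is usually what a tabular dataset builder expects.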
Just to add, thanks to @julien-c, we will very soon have support for an array of models in our `base_model` metadata (https://huggingface.co/docs/hub/model-cards#specifying-a-base-model). This should make it possible to easily extract...
@cg123 sounds great! >Generating a model card with some pre-populated metadata sounds like a great idea. An array of models in base_models is easy enough. An approximate percentage breakdown could...
@Wauplin I've used a Mixin to implement this new `__repr__` format. Let me know if you have something else in mind :) I don't think the failing tests are because...
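For readers unfamiliar with the pattern, here is a minimal sketch of a mixin that supplies a `__repr__`. It is not the implementation from the pull request: the `ReprMixin` and `ModelInfo` names and the exact output format are assumptions made for illustration.

```python
class ReprMixin:
    """Hypothetical mixin: builds a repr from an instance's public attributes."""

    def __repr__(self) -> str:
        # vars() preserves attribute insertion order; skip private names.
        fields = ", ".join(
            f"{name}={value!r}"
            for name, value in vars(self).items()
            if not name.startswith("_")
        )
        return f"{type(self).__name__}({fields})"


class ModelInfo(ReprMixin):
    """Hypothetical class that inherits the shared __repr__."""

    def __init__(self, model_id: str, downloads: int) -> None:
        self.model_id = model_id
        self.downloads = downloads
        self._cache = {}  # private, so it is left out of the repr


info = ModelInfo("user/model", 42)
```

Putting the logic in a mixin keeps the formatting in one place, so any class that inherits from it gets the same `__repr__` without repeating code.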
> Hi, I have a similar issue as OP but the suggested solutions do not work for my case. Basically, I process documents through a model to extract the last_hidden_state,...
@jeremylhour glad this worked for you :)
Sorry to ping a long closed issue! Just for reference for anyone who stumbles upon this issue in the future, currently, Hugging Face datasets don't have hard size limits. There...