Luis Blanche
Even if I don't want to use a chat template, this function is called and sets `valid_data` to `None`, which results in the downstream error below https://github.com/huggingface/autotrain-advanced/blob/01673f192f56439f083fc8f84a414d62eb2f5d28/src/autotrain/trainers/clm/utils.py#L443-L463 ``` Special tokens...
https://github.com/huggingface/autotrain-advanced/blob/10defa96fc8ee6c0b56ef7f1ba44a750e84c6bef/src/autotrain/trainers/clm/utils.py#L333C1-L351C14

```python
def process_input_data(config):
    if config.data_path == f"{config.project_name}/autotrain-data":
        logger.info("loading dataset from disk")
        train_data = load_from_disk(config.data_path)[config.train_split]
    else:
        if ":" in config.train_split:
            dataset_config_name, split = config.train_split.split(":")
            train_data = load_dataset(
                config.data_path,
                name=dataset_config_name,
                split=split,
                ...
```
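
To make the `train_split` handling in `process_input_data` easier to follow, here is a minimal standalone sketch of the `"config_name:split"` convention it relies on. The helper name `parse_train_split` is hypothetical and is not part of autotrain. Unlike the quoted code, the sketch passes `maxsplit=1` so that a value with more than one colon does not raise on unpacking; that is an assumption for illustration, not the library's behavior.

```python
from typing import Optional, Tuple


def parse_train_split(train_split: str) -> Tuple[Optional[str], str]:
    """Hypothetical helper: split 'config_name:split' into its two parts.

    A value without a colon is treated as a bare split name with no
    dataset config name.
    """
    if ":" in train_split:
        # maxsplit=1 is an assumption of this sketch; the quoted code calls
        # split(":") without it.
        dataset_config_name, split = train_split.split(":", 1)
        return dataset_config_name, split
    return None, train_split


if __name__ == "__main__":
    print(parse_train_split("default:train"))
    print(parse_train_split("train"))
```

The two returned values correspond to the `name=` and `split=` arguments passed to `load_dataset` in the snippet above.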