tensor2tensor
Checkpoint key is never found when I run the t2t-decoder command
Description
t2t-decoder always fails because a key is not found in the checkpoint. After I delete the checkpoint, t2t-decoder runs. Which model is used in that case, and why?
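To narrow down which checkpoint the decoder would pick up, the index file named `checkpoint` in the output directory can be inspected. The sketch below is only an illustration: `read_latest_checkpoint` is a hypothetical helper, not part of tensor2tensor, and it assumes the index file uses TensorFlow's usual text format with a `model_checkpoint_path: "..."` line. If it returns `None`, there is no index for the Estimator to resolve, and as far as I know prediction then falls back to freshly initialized variables instead of trained weights, which would explain why decoding "works" after the deletion.

```python
import os
import re


def read_latest_checkpoint(model_dir):
    """Return the checkpoint prefix named in model_dir/checkpoint, or None.

    Hypothetical helper: it only parses the text index file and does not
    check that the checkpoint data files themselves exist.
    """
    index_path = os.path.join(model_dir, "checkpoint")
    if not os.path.isfile(index_path):
        return None  # no index file, so no checkpoint can be resolved
    with open(index_path, encoding="utf-8") as f:
        for line in f:
            # Matches only the "latest" entry, not all_model_checkpoint_paths.
            match = re.match(r'\s*model_checkpoint_path:\s*"(.*)"\s*$', line)
            if match:
                path = match.group(1)
                if os.path.isabs(path):
                    return path
                return os.path.join(model_dir, path)
    return None
```

Calling `read_latest_checkpoint("./train_evolved_transformer_v1")` before and after deleting the checkpoint should show whether a trained model is still being resolved.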
Environment information
OS:
$ pip freeze | grep tensor
mesh-tensorflow==0.1.17
tensor2tensor==1.15.7
tensorboard==2.3.0
tensorboard-plugin-wit==1.7.0
tensorflow==2.3.1
tensorflow-addons==0.11.2
tensorflow-datasets==4.0.1
tensorflow-estimator==2.3.0
tensorflow-gan==2.0.0
tensorflow-gpu==2.3.0
tensorflow-hub==0.9.0
tensorflow-metadata==0.24.0
tensorflow-probability==0.7.0
$ python -V
3.7.7
For bugs: reproduction and error logs
Steps to reproduce:
This is my decoder command:
t2t-decoder \
  --t2t_usr_dir=self_script \
  --problem=my_problem \
  --data_dir=./self_data \
  --model=evolved_transformer \
  --hparams_set=evolved_transformer_deep \
  --output_dir=./train_evolved_transformer_v1 \
  --decode_hparams="beam_size=4,alpha=0.6" \
  --decode_from_file=./decoder/test_C.txt \
  --decode_to_file=./decoder/test_OE.out
Error logs:
tensorflow.python.framework.errors_impl.NotFoundError: 2 root error(s) found.
  (0) Not found: Key evolved_transformer/body/decoder/layer_0/first_attend_to_encoder/multihead_attention/k/kernel not found in checkpoint
     [[{{node save/RestoreV2}}]]
  (1) Not found: Key evolved_transformer/body/decoder/layer_0/first_attend_to_encoder/multihead_attention/k/kernel not found in checkpoint
     [[{{node save/RestoreV2}}]]
     [[save/RestoreV2_1/_25]]
0 successful operations.
0 derived errors ignored.
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
  File "/home/WwhStuGrp/WwhStu11G/anaconda3/envs/py3.7-tensorflow/lib/python3.7/site-packages/tensorflow/python/training/saver.py", line 1299, in restore
    {self.saver_def.filename_tensor_name: save_path})
  File "/home/WwhStuGrp/WwhStu11G/anaconda3/envs/py3.7-tensorflow/lib/python3.7/site-packages/tensorflow/python/client/session.py", line 958, in run
    run_metadata_ptr)
  File "/home/WwhStuGrp/WwhStu11G/anaconda3/envs/py3.7-tensorflow/lib/python3.7/site-packages/tensorflow/python/client/session.py", line 1181, in _run
    feed_dict_tensor, options, run_metadata)
  File "/home/WwhStuGrp/WwhStu11G/anaconda3/envs/py3.7-tensorflow/lib/python3.7/site-packages/tensorflow/python/client/session.py", line 1359, in _do_run
    run_metadata)
  File "/home/WwhStuGrp/WwhStu11G/anaconda3/envs/py3.7-tensorflow/lib/python3.7/site-packages/tensorflow/python/client/session.py", line 1384, in _do_call
    raise type(e)(node_def, op, message)
tensorflow.python.framework.errors_impl.NotFoundError: 2 root error(s) found.
  (0) Not found: Key evolved_transformer/body/decoder/layer_0/first_attend_to_encoder/multihead_attention/k/kernel not found in checkpoint
     [[node save/RestoreV2 (defined at /lib/python3.7/site-packages/tensorflow_estimator/python/estimator/estimator.py:629) ]]
  (1) Not found: Key evolved_transformer/body/decoder/layer_0/first_attend_to_encoder/multihead_attention/k/kernel not found in checkpoint
     [[node save/RestoreV2 (defined at /lib/python3.7/site-packages/tensorflow_estimator/python/estimator/estimator.py:629) ]]
     [[save/RestoreV2_1/_25]]
0 successful operations.
0 derived errors ignored.
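A "Key ... not found in checkpoint" error means the graph built at decode time expects a variable name that the saved checkpoint does not contain. One way to check this is to list the variable names stored in the checkpoint and compare them with the key from the log. The following is a minimal sketch under assumptions: `missing_keys` and `list_checkpoint_keys` are hypothetical helper names, and the only TensorFlow call used is `tf.train.list_variables`, which returns `(name, shape)` pairs for a checkpoint directory or file.

```python
def missing_keys(expected_keys, checkpoint_keys):
    """Return the expected variable names that are absent from the checkpoint."""
    available = set(checkpoint_keys)
    return sorted(k for k in set(expected_keys) if k not in available)


def list_checkpoint_keys(ckpt_dir_or_file):
    """List variable names stored in a checkpoint (requires TensorFlow)."""
    import tensorflow as tf  # imported lazily so missing_keys works without it

    return [name for name, _shape in tf.train.list_variables(ckpt_dir_or_file)]


# Key copied from the error log above.
EXPECTED = [
    "evolved_transformer/body/decoder/layer_0/first_attend_to_encoder/"
    "multihead_attention/k/kernel",
]
```

For example, `missing_keys(EXPECTED, list_checkpoint_keys("./train_evolved_transformer_v1"))` returns the keys from the log that the checkpoint lacks. If the checkpoint holds similar names under a different scope or layer layout, that would suggest the model or hparams set used for training differs from the one passed to t2t-decoder.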