CorefQA

Getting error during mention proposal model training

Open shantanu778 opened this issue 2 years ago • 0 comments

System setup: TensorFlow 1.15, PyTorch 1.2, CUDA 10.0, Python 3.7.16, use_tpu False

I am getting this error during training:

ERROR:tensorflow:Error recorded from training_loop: GetNext() failed because the iterator has not been initialized.
Ensure that you have run the initializer operation for this iterator before getting the next element.
	 [[node IteratorGetNext (defined at /home/shantanu/anaconda3/envs/corefqa/lib/python3.7/site-packages/tensorflow_core/python/framework/ops.py:1748) ]]

Original stack trace for 'IteratorGetNext':
  File "./run/run_mention_proposal.py", line 192, in <module>
    tf.app.run()
  File "/home/shantanu/anaconda3/envs/corefqa/lib/python3.7/site-packages/tensorflow_core/python/platform/app.py", line 40, in run
    _run(main=main, argv=argv, flags_parser=_parse_flags_tolerate_undef)
  File "/home/shantanu/anaconda3/envs/corefqa/lib/python3.7/site-packages/absl/app.py", line 312, in run
    _run_main(main, args)
  File "/home/shantanu/anaconda3/envs/corefqa/lib/python3.7/site-packages/absl/app.py", line 258, in _run_main
    sys.exit(main(argv))
  File "./run/run_mention_proposal.py", line 129, in main
    window_size=model_config.window_size, max_num_mention=model_config.max_num_mention, is_training=True, drop_remainder=True), max_steps=num_train_steps)
  File "/home/shantanu/anaconda3/envs/corefqa/lib/python3.7/site-packages/tensorflow_estimator/python/estimator/tpu/tpu_estimator.py", line 3030, in train
    saving_listeners=saving_listeners)

I have no idea what is causing this error.

shantanu778, May 23 '23 22:05