tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Hi, when I tried to reproduce the Adafactor experiments on the en-de translation task, I encountered the following error: `AttributeError: 'AdafactorOptimizer' object has no attribute 'get_gradients'` Could anyone tell...
### Description Hi, I want to use Adafactor to replace Adam in my code, but I do not use the T2T framework. Based on the Google-released BERT-finetune framework, I...
Dear all, we want to train a UT model on our dataset. During training, after some steps, an OOM error occurs. We continued training with a decreased batch size, but...
### Description I am training a `Transformer` model on the `Librispeech` dataset using 4 GPUs with 8 CPU-cores. I have tested the following: #### Single-GPU ```bash export CUDA_VISIBLE_DEVICES=0 t2t-trainer \...
### Description Dear T2T team, I was trying to reproduce the result of Transformer base model in the original paper "Attention Is All You Need"; however, I found the hyper-parameters...
Hi, I am a little confused about why we should set `REFERENCE_TEST_TRANSLATE_DIR=t2t_local_exp_runs_dir_master/t2t_datagen/dev/newstest2014-deen-ref.en.sgm`, because in my mind the reference should be `de.sgm`. Do you have any idea? Thanks!
# Patching CVE-2007-4559 Hi, we are security researchers from the Advanced Research Center at [Trellix](https://www.trellix.com). We have begun a campaign to patch a widespread bug named CVE-2007-4559. CVE-2007-4559 is a...
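For context, CVE-2007-4559 concerns path traversal when extracting archives with Python's `tarfile` module. The issue text above is truncated, so the following is only a minimal sketch of a commonly used mitigation, not necessarily the patch proposed for this repository. The helper names `is_within_directory` and `safe_extract` are hypothetical and chosen for illustration.

```python
import os
import tarfile


def is_within_directory(directory, target):
    """Return True if target resolves to a path inside directory."""
    abs_directory = os.path.realpath(directory)
    abs_target = os.path.realpath(target)
    return os.path.commonpath([abs_directory, abs_target]) == abs_directory


def safe_extract(tar, path="."):
    """Extract all members, refusing any whose name escapes path.

    Hypothetical helper: it checks member names only and does not
    inspect symlink or hardlink targets.
    """
    for member in tar.getmembers():
        member_path = os.path.join(path, member.name)
        if not is_within_directory(path, member_path):
            raise ValueError(
                "Blocked path traversal in tar member: " + member.name
            )
    tar.extractall(path)
```

A call such as `safe_extract(tarfile.open("archive.tar"), "out_dir")` (file names assumed) would then raise `ValueError` for a member named `../evil.txt` instead of writing outside `out_dir`. Recent Python versions also provide extraction filters in `tarfile` itself, which may be preferable where available.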