meshed-memory-transformer issues

Results 57 meshed-memory-transformer issues

Sort by recently updated

Q/A visual for coding

Hi @baraldilorenzo, I 'm trying to improve the speed of beam_search. When doing it, I found this function: `visual = self._expand_visual(visual, cur_beam_size, selected_beam)` in the iter function of beam_search.py Please...

TrungThanhTran

Random output after several early epoch then start training

Hi @marcellacornia, When I started my train, I got random outputs for about the first five epochs, I mean it generated words. Then, it produced nothing, and I had to...

TrungThanhTran

Parallelizing the Network

Hi, Thanks for providing the implementation it really helps. I am getting the following error when trying to use Multiple GPUs with DataParallel. Please note the implementation works perfectly fine...

mujtabaasif

Hello, I would like to ask myself where is the description of the output generated by the model?

775266553

OSError: [Errno 24] Too many open files

File "train.py", line 100, in train_scst tokenizer_pool = multiprocessing.Pool() File "/root/anaconda3/envs/m2/lib/python3.6/multiprocessing/context.py", line 119, in Pool context=self.get_context()) File "/root/anaconda3/envs/m2/lib/python3.6/multiprocessing/pool.py", line 174, in __init__ self._repopulate_pool() File "/root/anaconda3/envs/m2/lib/python3.6/multiprocessing/pool.py", line 239, in _repopulate_pool w.start()...

775266553

coco_detections.hdf5

Hello, excuse me, I recently read the paper Meshed_memory, and I am a little confused about the data set coco_detections.hdf5 he used. Is the value extracted by the imgid_features key...

songbohui

Beam search error

Hi everyone, While evaluating the model, I got the following error: `RuntimeError: CUDA error: CUBLAS_STATUS_EXECUTION_FAILED when calling `cublasLtMatmul( ltHandle, computeDesc.descriptor(), &alpha_val, mat1_ptr, Adesc.descriptor(), mat2_ptr, Bdesc.descriptor(), &beta_val, result_ptr, Cdesc.descriptor(), result_ptr, Cdesc.descriptor(),...

Faiail

FileNotFoundError: [WinError 2] 系统找不到指定的文件。

something was wrong when run in ![image](https://user-images.githubusercontent.com/105775884/202346354-abc8d8ed-bc3f-431f-a291-f6be97d1dfde.png) ![image](https://user-images.githubusercontent.com/105775884/202346382-62a36102-4f0f-4695-bb69-2d71ba923dbc.png)

5132210

Vocabulary of the test split

Hi! Thanks for the written paper and the availabe code. I have what may be a stupid question, but I didn't find a straight answer to it anywhere: When evaluating...

gondimjoaom

Ensemble problem

Thanks for this amazing work! The confusion about the paper when I read is that how to use the ensemble trick. In my opinion, ensembles of mutiple models means training...

xarryon

meshed-memory-transformer
meshed-memory-transformer copied to clipboard

Metadata

Q/A visual for coding

Random output after several early epoch then start training

Parallelizing the Network

Hello, I would like to ask myself where is the description of the output generated by the model?

OSError: [Errno 24] Too many open files

coco_detections.hdf5

Beam search error

FileNotFoundError: [WinError 2] 系统找不到指定的文件。

Vocabulary of the test split

Ensemble problem

← Metadata

Owner

Metadata

meshed-memory-transformer meshed-memory-transformer copied to clipboard

Metadata

← Metadata

Owner

Metadata

meshed-memory-transformer
meshed-memory-transformer copied to clipboard