Optimus icon indicating copy to clipboard operation
Optimus copied to clipboard

Optimus: the first large-scale pre-trained VAE language model

Results 23 Optimus issues
Sort by recently updated
recently updated
newest added

Hello, Following up on the previous issue. I cannot afford to run your docker on a cloud instance and I dont have a gpu do you have any suggestions? I...

I'd like to run this on colab any suggestions? Awesome work super excited to experiment!

Dear authors, First, thank you for sharing the code! I was interested in the experiment of Label-Conditional Text Generation. I would have some questions about the losses. 1. [loss_lsd](https://github.com/ChunyuanLI/Optimus/blob/master/code/examples/big_ae/modules/ctrl_gen.py#L71) corresponds...

Currently, text is generated from latent point by sampling from distributions produced by generator over vocabulary of tokens. But since z is multivariate gaussian we can also sample from it...

Bumps [psutil](https://github.com/giampaolo/psutil) from 5.6.3 to 5.6.6. Changelog Sourced from psutil's changelog. 5.6.6 2019-11-25 Bug fixes 1179_: [Linux] Process cmdline() now takes into account misbehaving processes renaming the command line and...

dependencies

Dears Thanks for sharing your amazing work! I am trying to run the Label-Conditional Text Generation experiment, but unfortunately, I didn't find the entry point for the training where there...

Not sure what the purpose of these two losses is if they are cancelling each other. I believe this is a mistake. Can you please explain what is the goal...

Bumps [gitpython](https://github.com/gitpython-developers/GitPython) from 3.0.2 to 3.1.34. Release notes Sourced from gitpython's releases. 3.1.34 - fix resource leaking What's Changed util: close lockfile after opening successfully by @​skshetry in gitpython-developers/GitPython#1639 New...

dependencies

Can't access datasets at "https://github.com/ChunyuanLI/Optimus/blob/master/data/download_datasets.md"

Optimus/doc/optimus_finetune_language_models.md beta=0, latent size = 32 https://chunylcus.blob.core.windows.net/machines/msrdl/optimus/output/pretrain/philly_rr3_vc4_g8_base_vae_wikipedia_pretraining_beta_schedule_beta0.0_d1.0_ro0.5_ra0.25_32_v2/checkpoint-508523.zip Pre-trained model download is not available. Need Permissions?