dasher-web icon indicating copy to clipboard operation
dasher-web copied to clipboard

Integrate the Language Model - Google Research Repository

Open jcope opened this issue 4 years ago • 4 comments

https://github.com/google-research/mozolm

jcope avatar Mar 08 '21 16:03 jcope

Maybe here in the issue, or in a design document, could the following be answered please.

  • How does Mozolm handle persistent storage?
  • Could there be a public or other hosted service that runs the Mozolm server?

sjjhsjjh avatar Mar 08 '21 18:03 sjjhsjjh

So. We could run a server for the gRPC service. But there would be some major issues to check on privacy. But I think we should consider them. It might need a decent payment scheme behind it too to fund its running particularly if commercial entities use it. But the privacy. That’s a big thing.

willwade avatar Mar 08 '21 19:03 willwade

Maybe here in the issue, or in a design document, could the following be answered please.

  • How does Mozolm handle persistent storage?

At the moment the microservice will load whichever model and/or training (in case of PPM) files upon initialization. We haven't implemented the persistence in the microservice API, but it is there on a model implementation level - unlike the JavaScript implementation that uses a non-persistent suffix trie backend, Brian's implementation uses finite-state transducer which is trivially savable to disk. All we need is to hook-up a network request to flush the current state of the model to disk.

  • Could there be a public or other hosted service that runs the Mozolm server?

Yes.

agutkin avatar Jun 11 '21 23:06 agutkin

Cc-ing Brian (@roark-google) on this.

agutkin avatar Jun 11 '21 23:06 agutkin