Raymond Cheng comments

Repositories
Issues
Comments

Results 135 comments of


                                            Raymond Cheng

Dynamic Configurations

To be clear, this issue is mean to track dynamic resharding and failure recovery on the server-side. None of these should be visible to the client

Scheduling

We can start with running some profiling studies on application workloads

Multiple Talek Instances

Currently, we assume 1 instance => 1 process => 1 GPU High level question: What is the best way to run multiple Talek instances (e.g. to support multiple data sizes)

generalized followers

Right now it's set up for chaining. e.g. leader -> follower1 -> follower2 -> etc. any server can choose to terminate the chain. Is that not sufficient?

generalized followers

Sure, conceptually it's essentially a frontend gateway (that can coexist with any one of the servers). So the leader = follower + frontendGateway