
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

92 petals issues

This would help take the server load into account when planning a route for inference and fine-tuning.
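The load-aware routing idea could be sketched as follows. This is a minimal illustration, not Petals' actual routing code: the server records and their `load` field are hypothetical stand-ins for whatever metric (queue depth, throughput) servers would advertise via the DHT.

```python
# Sketch: pick the least-loaded candidate chain of servers.
# Server records and the "load" field are hypothetical; real Petals
# routing works over the DHT and considers block coverage as well.

def route_cost(chain):
    """Total advertised load along a candidate chain of servers."""
    return sum(server["load"] for server in chain)

def pick_route(candidate_chains):
    """Choose the candidate chain with the lowest aggregate load."""
    return min(candidate_chains, key=route_cost)

chains = [
    [{"id": "a", "load": 0.9}, {"id": "b", "load": 0.2}],
    [{"id": "c", "load": 0.3}, {"id": "d", "load": 0.3}],
]
best = pick_route(chains)  # the "c" -> "d" chain (cost 0.6 vs 1.1)
```

A real implementation would also have to balance load against latency and block availability, but the core idea is just a cost term added to route selection.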

We should create a detailed API reference for the public interface to ease development for new users.

documentation

I ran into this when trying to run https://github.com/petals-infra/chat.petals.dev:

```
$ flask run --host=0.0.0.0 --port=5000
Floating point exception (core dumped)
```

But I believe this is an issue with the...

I'm running a 660Ti, so I'm pretty used to it not playing well with other things. If there's something I can do, that would be great. Otherwise, this is just...

As the title says, how can I parallelize this?

```python
def generate_output(row):
    inputs = tokenizer(prompt, return_tensors="pt")["input_ids"]
    outputs = model.generate(inputs, max_new_tokens=185, temperature=0.0, eos_token_id=tokenizer.encode("}")[0])
    result = tokenizer.decode(outputs[0])
    completion = extract_completion(result)
    for index,...
```
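One way to parallelize a per-row generation loop like the one above is a thread pool. The sketch below uses a stub in place of the real tokenizer/model pipeline so it is self-contained; in actual use, `generate_output` would be the function from the snippet, and each thread should use its own inference session rather than sharing one.

```python
from concurrent.futures import ThreadPoolExecutor

# Sketch: map generate_output over many rows concurrently.
# generate_output here is a stub standing in for the
# tokenizer -> model.generate -> decode pipeline above.

def generate_output(row):
    # placeholder for the real generation call
    return f"completion for {row}"

rows = ["row0", "row1", "row2", "row3"]
with ThreadPoolExecutor(max_workers=4) as pool:
    # pool.map preserves input order in its results
    completions = list(pool.map(generate_output, rows))
```

Note that with a distributed client the bottleneck is usually network round-trips, so threads (rather than processes) are typically enough to overlap requests.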

I added my RTX 3080 to the swarm using:

```
conda install pytorch pytorch-cuda=11.7 -c pytorch -c nvidia
pip install git+https://github.com/bigscience-workshop/petals
python -m petals.cli.run_server enoch/llama-65b-hf --adapters timdettmers/guanaco-65b
```

But I still find my...

One of the obstacles to using Petals is the fact that there is no privacy. It would be great to add some features for this. I'm not an expert, but...

I am following this basic tutorial, and I'm wondering how I can save the fine-tuned model and use it later on. For example, in this tutorial, we fine-tune a...
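With parameter-efficient fine-tuning (as in the Petals tutorials), only the small set of trainable parameters needs saving; the frozen remote blocks stay in the swarm. A hedged sketch of that pattern, with a tiny `nn.Linear` standing in for the distributed model:

```python
import os
import tempfile

import torch
from torch import nn

# Sketch: save only the trainable parameters (e.g. prompt embeddings
# or adapter weights). The tiny Linear below is a stand-in for the
# distributed Petals model.

model = nn.Linear(4, 2)
model.weight.requires_grad_(False)  # pretend the backbone is frozen
# model.bias stays trainable, like prompt/adapter weights would

trainable = {n: p for n, p in model.named_parameters() if p.requires_grad}
path = os.path.join(tempfile.mkdtemp(), "trainable.pt")
torch.save(trainable, path)

# Later: rebuild the model and load just the trainable part.
# strict=False lets us load a partial state dict.
restored = nn.Linear(4, 2)
restored.load_state_dict(torch.load(path), strict=False)
```

Saving only the trainable subset keeps checkpoints small and avoids trying to serialize model weights you never held locally in the first place.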

Problem: if some (but not all) servers support a longer sequence length, running inference at that length would be very inefficient, because the client will constantly bump into short-length servers. Suggested...
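One possible direction, sketched under assumed field names: filter out servers whose advertised maximum sequence length cannot cover the request before planning the route, instead of discovering the mismatch mid-inference.

```python
# Sketch: pre-filter servers by advertised max sequence length.
# The server records and "max_seq_len" field are hypothetical.

def usable_servers(servers, seq_len):
    """Keep only servers whose advertised max length covers the request."""
    return [s for s in servers if s["max_seq_len"] >= seq_len]

servers = [
    {"id": "a", "max_seq_len": 2048},
    {"id": "b", "max_seq_len": 8192},
    {"id": "c", "max_seq_len": 4096},
]
long_ctx = usable_servers(servers, 4096)  # only "b" and "c" qualify
```

The trade-off is a smaller candidate pool for long requests, so the filter should probably apply per-request rather than globally.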

How would it run on the [FLUX network](https://runonflux.io/)? They have enterprise-grade hardware, lots of compute power, and much cheaper prices. I don't know about GPUs, but nevertheless, I...