petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
Hi there, does Petals currently support batch processing / parallel processing? For example, to increase resource usage or system throughput, we would like to see servers processing multiple prompts in parallel at the...
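For reference, client-side batching is already expressible with the Petals client; a minimal sketch follows (the model name and prompts are placeholders, and whether servers actually interleave the sequences server-side is exactly what this question asks):

```python
# A minimal sketch, assuming the public Petals client API
# (AutoDistributedModelForCausalLM); the model name is a placeholder.
import torch
from transformers import AutoTokenizer
from petals import AutoDistributedModelForCausalLM

MODEL = "bigscience/bloom-560m"  # placeholder; pick a model the swarm hosts
tokenizer = AutoTokenizer.from_pretrained(MODEL, padding_side="left")
model = AutoDistributedModelForCausalLM.from_pretrained(MODEL)

# Tokenize several prompts into one left-padded batch and generate.
prompts = ["A cat sat on", "The capital of France is"]
batch = tokenizer(prompts, return_tensors="pt", padding=True)
with torch.no_grad():
    outputs = model.generate(batch["input_ids"], max_new_tokens=16)
print(tokenizer.batch_decode(outputs, skip_special_tokens=True))
```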
Is there a way to donate system memory (RAM) instead of GPU VRAM? This may be more economical.
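(A hedged note: serving from CPU appears expressible in the CLI, assuming `run_server`'s `--device` option accepts `cpu`, e.g. `python -m petals.cli.run_server bigscience/bloom-560m --device cpu --num_blocks 4`, with the model name as a placeholder; CPU throughput would be far lower than a GPU's, which is the economics trade-off at issue.)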
Good afternoon. I noticed that the model behaves as if it has a pre-configured system prompt that cannot be changed in any way. Please tell me whether there is a way...
Hi there, I've been following this work for a few months and find it a really amazing idea to run LLMs over the Internet. I'm also trying to improve...
I am able to run `python -m petals.cli.run_server meta-llama/Meta-Llama-3.1-405B-Instruct --num_blocks 2 --max_disk_space=50G` for a bit, but it always eventually exits with an `AssertionError: Span served by this server is...
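(A hedged aside: since the assertion message points at the server's chosen span, one thing worth trying is pinning a fixed span with `--block_indices 0:2` in place of `--num_blocks 2`, assuming the installed version's `run_server` exposes that option; if the error is triggered by the span being re-chosen at runtime, pinning may avoid it.)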
This update enhances the logging mechanism in the RemoteGenerationMixin class, allowing for better traceability and debugging. The added logging provides insights into key steps of the token generation process without...
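A hedged, illustrative sketch of what such step-level logging could look like (everything below except the class name is an assumption, not the actual PR diff):

```python
# Illustrative stub only: real Petals code runs remote forward passes here.
import logging
import torch

logger = logging.getLogger(__name__)

class RemoteGenerationMixin:
    def generate(self, input_ids: torch.LongTensor, max_new_tokens: int = 20):
        logger.debug("generation started: batch=%d prompt_len=%d",
                     input_ids.shape[0], input_ids.shape[1])
        for step in range(max_new_tokens):
            # ... remote forward pass and token sampling would happen here ...
            logger.debug("generated token %d/%d", step + 1, max_new_tokens)
        logger.debug("generation finished")
        return input_ids
```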
This PR improves the speculative generation code by adding more explicit type hints, especially for the `streamer` parameter. Additionally, the code has been refactored for better readability and maintainability. These...
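For illustration, this is roughly what an explicit hint on `streamer` looks like (a hedged sketch; the function name and body are illustrative, not Petals' actual code):

```python
# Hypothetical signature showing a typed `streamer` parameter.
from typing import Optional
import torch
from transformers.generation.streamers import BaseStreamer

def speculative_generate(
    input_ids: torch.LongTensor,
    max_new_tokens: int = 20,
    streamer: Optional[BaseStreamer] = None,
) -> torch.LongTensor:
    if streamer is not None:
        streamer.put(input_ids)  # BaseStreamer defines put() and end()
    # ... speculative drafting and verification would happen here ...
    if streamer is not None:
        streamer.end()
    return input_ids
```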
Consider a case where a pre-trained model is hosted on only three servers: the first hosts blocks 1-4, the second blocks 2-64, and the third blocks 32-128....
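To make the case concrete, here is a hypothetical sketch (not Petals' actual routing code) of a greedy interval cover: at each position, pick the server whose hosted span reaches furthest:

```python
# Hypothetical helper, not Petals' router: cover blocks [first, last]
# with a chain of servers, each given as an inclusive (start, end) span.
def plan_route(spans, first_block=1, last_block=128):
    route, position = [], first_block
    while position <= last_block:
        # Among servers that can serve `position`, take the longest reach.
        candidates = [(s, e) for s, e in spans if s <= position <= e]
        if not candidates:
            raise RuntimeError(f"no server hosts block {position}")
        best = max(candidates, key=lambda span: span[1])
        route.append(best)
        position = best[1] + 1
    return route

# The three servers from the example: the route uses all of them.
print(plan_route([(1, 4), (2, 64), (32, 128)]))
# -> [(1, 4), (2, 64), (32, 128)]: blocks 1-4, then 5-64, then 65-128
```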
I tried to load the local model and ran into this issue:
> raise ValueError(
> ValueError: `rope_scaling` must be a dictionary with two fields, `type` and `factor`,...
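For reference, this validation usually comes from an older installed transformers that expects exactly two fields in `rope_scaling`; upgrading transformers is the usual fix. Below is a hedged sketch of a local workaround (the config path is hypothetical, and forcing `type`/`factor` is an assumption that can change model behavior):

```python
# Rewrite `rope_scaling` in a local checkpoint's config.json so that it
# has exactly the two fields the older validator expects. Hypothetical
# path; upgrading transformers is the safer fix.
import json
import pathlib

config_path = pathlib.Path("path/to/local-model/config.json")  # hypothetical
config = json.loads(config_path.read_text())
old = config.get("rope_scaling") or {}
config["rope_scaling"] = {"type": "linear", "factor": float(old.get("factor", 1.0))}
config_path.write_text(json.dumps(config, indent=2))
```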
I can't run the falcon-7b-instruct model; it gives me this error.