BuildFlow is an open source framework for building large-scale systems using Python. All you need to do is describe where your input is coming from and where your output should be written, and BuildF...

Results 38 buildflow issues

If you have low throughput with massive fan-out, it will actually be better to use load jobs. We should support this option.

gcp
io

Right now we just fail when we start but things still attempt to start. Ideally this would kill the pipeline immediately.

io

Some more file types: JSON, Avro, CSV

enhancement
io

I think this in particular is just a batch source, because you can't really stream in a file. We do have a similar streaming source for GCS files but that...

enhancement
io

The idea is you can have one processor that is an input into another processor. Right now this can kind of be accomplished by having the sink of one processor...

enhancement
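
To illustrate the idea of one processor feeding another, here is a minimal sketch in plain Python. It does not use BuildFlow's API; `Processor` and `chain` are hypothetical names introduced only for this example, and each processor is assumed to be a simple one-argument callable.

```python
from typing import Any, Callable

# Hypothetical type alias: a processor is any one-argument callable.
Processor = Callable[[Any], Any]


def chain(*processors: Processor) -> Processor:
    """Return a processor that feeds each processor's output into the next."""

    def run(element: Any) -> Any:
        for processor in processors:
            element = processor(element)
        return element

    return run


def parse(raw: str) -> int:
    # First stage: turn the raw input into a number.
    return int(raw)


def double(value: int) -> int:
    # Second stage: consumes the output of the first stage.
    return value * 2


pipeline = chain(parse, double)
result = pipeline("21")
```

In this sketch the hand-off happens in memory, so no intermediate sink is needed between the two stages.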

This applies to formats like Avro and Parquet that have a schema associated with them. If the user is writing a dataclass, we should verify that the schema matches with the existing file...

enhancement
io
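
A rough sketch of what such a check could look like, in plain Python. It assumes the file's schema has already been read into a mapping of column name to Python type; that mapping, the `Reading` dataclass, and `schema_mismatches` are all hypothetical and not part of BuildFlow.

```python
import dataclasses
from typing import Dict, List


@dataclasses.dataclass
class Reading:
    # Hypothetical user-defined output type.
    sensor_id: str
    value: float


def schema_mismatches(cls: type, file_schema: Dict[str, type]) -> List[str]:
    """List differences between a dataclass and an existing file schema."""
    declared = {f.name: f.type for f in dataclasses.fields(cls)}
    problems = []
    for name, declared_type in declared.items():
        if name not in file_schema:
            problems.append(f"field '{name}' is missing from the file schema")
        elif file_schema[name] is not declared_type:
            problems.append(
                f"field '{name}' is {declared_type.__name__} in the dataclass "
                f"but {file_schema[name].__name__} in the file"
            )
    for name in file_schema:
        if name not in declared:
            problems.append(f"column '{name}' is missing from the dataclass")
    return problems


# Assumed schema of an existing file, already mapped to Python types.
existing = {"sensor_id": str, "value": int, "site": str}
problems = schema_mismatches(Reading, existing)
```

A real check would also need to map Avro or Parquet types onto Python types, which this sketch leaves out.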

Can look at starting with Google Cloud Spanner for this, but ideally we can figure out something generic enough to extend to any database that supports change streams.

enhancement
aws
gcp
azure
io

Will likely just start with simple processing-time windowing (e.g. give me all messages in the last X seconds).

enhancement
runner
io
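
As a sketch of the "all messages in the last X seconds" idea, the following plain-Python class keeps messages tagged with their arrival time and drops those older than the window. `ProcessingTimeWindow` and its methods are hypothetical names, not BuildFlow APIs, and the optional `now` argument exists only to make the behavior easy to test. It assumes messages are added in non-decreasing time order.

```python
import time
from collections import deque
from typing import Any, Deque, List, Optional, Tuple


class ProcessingTimeWindow:
    """Keep only the messages that arrived within the last `window_seconds`."""

    def __init__(self, window_seconds: float) -> None:
        self.window_seconds = window_seconds
        self._items: Deque[Tuple[float, Any]] = deque()

    def add(self, message: Any, now: Optional[float] = None) -> None:
        # Stamp the message with the time it was processed, not an event time.
        arrived = time.monotonic() if now is None else now
        self._items.append((arrived, message))

    def recent(self, now: Optional[float] = None) -> List[Any]:
        current = time.monotonic() if now is None else now
        cutoff = current - self.window_seconds
        # Evict messages that arrived before the start of the window.
        while self._items and self._items[0][0] < cutoff:
            self._items.popleft()
        return [message for _, message in self._items]


window = ProcessingTimeWindow(window_seconds=10)
window.add("a", now=0)
window.add("b", now=5)
window.add("c", now=12)
in_window = window.recent(now=14)
```

With a 10-second window queried at time 14, the cutoff is 4, so only the messages that arrived at 5 and 12 remain.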