framework issues

Implement filesystem/sqlite cache for memory consuming operations?

# Overview We have some functions that require collecting data in memory like: - `checks.duplicate_row` - `checks.deviated_cell/value` - `resource.analyze` - etc We might provide an internal cache system (switching to...

roll

enhancement

Add an ability to parallel package transforms

# Overview Parallelization can be added to some steps/etc

roll

feature

Table aggregate does not work with len.

2

# Overview "table-aggregate" step when used with len doesn't work. ``` source = Resource(path="784/transform.csv") target = transform( source, steps=[ steps.table_normalize(), steps.table_aggregate( group_name="name", aggregation={"min": ("population", len)} ), ], ) print(target.schema) print(target.to_view())...

shashigharti

bug

Implement `steps.package_publish`

# Overview We need an ability to save metadata + data (package + all resources)

roll

feature

Review Row class (transform/mutability/table concept/etc)

# Overview As a part of v6's transform work. Probably we need to make it immutable (proxy for cells) for performance

roll

general

Implement `steps.resource_write`?

# Overview We need an ability to save metadata + data

roll

feature

Normalize bytes/hash calculation between UNIX and Windows?

# Overview At the moment, it doesn't match. Shall we normalize line endings etc? It's complicated because `python.csv` requires opening files without a universal newline. On the other hand, the...

roll

bug

Implement download delay

1

# Overview @pwalsh has wrote > sleep: > > it is a killer if you can't force a sleep between runs. This was a crude way to work around API...

roll

feature

Create benchmark and optimize the framework

2

# Overview The migration from `tabulator/tableschema/datapackage/goodtables` gave good speed improvement but we still can make it faster especially for working with numbers - https://github.com/frictionlessdata/frictionless-py/issues/461 # Tasks - [ ] create...

roll

general

Implement transform utils?

# Overview We only need to wrap corresponding PETL's functions.

roll

feature

framework
framework copied to clipboard

Metadata

Implement filesystem/sqlite cache for memory consuming operations?

Add an ability to parallel package transforms

Table aggregate does not work with len.

Implement `steps.package_publish`

Review Row class (transform/mutability/table concept/etc)

Implement `steps.resource_write`?

Normalize bytes/hash calculation between UNIX and Windows?

Implement download delay

Create benchmark and optimize the framework

Implement transform utils?

← Metadata

Owner

Metadata

framework framework copied to clipboard

Metadata

← Metadata

Owner

Metadata

framework
framework copied to clipboard