Agustín Benassi

Results: 111 issues by Agustín Benassi

In hierarchical groups of time series, some groups of series add up to an aggregated time series that is also present in the spreadsheet. In...

Idea
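The aggregation relationship described in this issue could be checked with a brute-force search. The following is only a minimal sketch under stated assumptions: `find_aggregates`, the `series` dictionary layout, and the tolerance values are hypothetical names chosen for illustration, not part of the project.

```python
from itertools import combinations
import math


def find_aggregates(series, min_size=2, rel_tol=1e-9):
    """Return {aggregate_name: component_names} for series that equal a sum of others.

    `series` maps a (hypothetical) series name to a list of floats; all lists
    are assumed to have the same length. The search is exponential in the
    number of series, so it only suits small groups.
    """
    found = {}
    names = list(series)
    for total in names:
        others = [name for name in names if name != total]
        for size in range(min_size, len(others) + 1):
            for group in combinations(others, size):
                # Sum the candidate components period by period.
                sums = [sum(vals) for vals in zip(*(series[n] for n in group))]
                if all(
                    math.isclose(s, t, rel_tol=rel_tol, abs_tol=1e-9)
                    for s, t in zip(sums, series[total])
                ):
                    # Keep only the first (smallest) matching group.
                    found.setdefault(total, group)
    return found


example = {
    "total": [10.0, 20.0],
    "a": [4.0, 5.0],
    "b": [6.0, 15.0],
    "c": [1.0, 1.0],
}
aggregates = find_aggregates(example)
```

In practice the candidate groups would probably be restricted to adjacent columns, since the exhaustive search does not scale.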

- The time header coordinate could be found with approximate string matching if `time_header` is provided.
- If the time header is not provided, an approximate string matching search could be performed...

Idea
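One way to picture the approximate matching idea is with the standard library's `difflib`. This is a sketch under assumptions: `find_time_header`, the `cells` mapping of coordinates to values, the fallback `candidates` list, and the `cutoff` value are all hypothetical and not taken from the issue.

```python
import difflib


def find_time_header(cells, time_header=None,
                     candidates=("date", "period", "year", "time"),
                     cutoff=0.6):
    """Return the coordinate whose string value best matches the time header.

    `cells` maps a coordinate such as "A3" to the cell value. If `time_header`
    is not given, a hypothetical list of common header words is used instead.
    """
    targets = [time_header] if time_header else list(candidates)
    best = None  # (score, coordinate)
    for coord, value in cells.items():
        if not isinstance(value, str):
            continue
        text = value.strip().lower()
        for target in targets:
            score = difflib.SequenceMatcher(None, text, target.lower()).ratio()
            if score >= cutoff and (best is None or score > best[0]):
                best = (score, coord)
    return best[1] if best else None


cells = {"A1": "Sales report", "A3": "Perido", "B3": "GDP", "A4": 2001}
with_header = find_time_header(cells, time_header="Period")
without_header = find_time_header(cells)
```

The misspelled "Perido" is still matched in both cases because its similarity to "period" is above the cutoff.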

- When looking for date values, if `openpyxl` can parse a cell directly into `datetime.datetime`, the time format is certain. But if strings have to be evaluated, there should...

Idea

- If the time index values are parsed directly into `datetime.datetime`, it would not be difficult to infer the frequency from a sample of the first time values.
- Strategies to...

Idea
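Inferring the frequency from the first few `datetime.datetime` values could look roughly like this. It is a sketch only: `infer_frequency`, the `sample_size` default, the day ranges, and the frequency codes ("D", "W", "M", "Q", "Y") are assumptions made for illustration.

```python
import datetime
from statistics import median


def infer_frequency(time_values, sample_size=6):
    """Guess a frequency code from the median gap between the first values."""
    sample = time_values[:sample_size]
    if len(sample) < 2:
        return None
    # Gap in days between each pair of consecutive time values.
    deltas = [(b - a).days for a, b in zip(sample, sample[1:])]
    step = median(deltas)
    if step == 1:
        return "D"
    if step == 7:
        return "W"
    if 28 <= step <= 31:
        return "M"
    if 89 <= step <= 92:
        return "Q"
    if 365 <= step <= 366:
        return "Y"
    return None


monthly = [datetime.datetime(2015, m, 1) for m in (1, 2, 3, 4)]
quarterly = [datetime.datetime(2015, m, 1) for m in (1, 4, 7, 10)]
yearly = [datetime.datetime(y, 1, 1) for y in (2014, 2015, 2016, 2017)]
```

Using the median gap rather than the first gap makes the guess less sensitive to a single irregular value in the sample.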

- The presence of missing values can be determined by following a data series' values together with the time index values. Whenever a data value is a string that cannot be converted...

Idea

- These two parameters should be discovered together. `blank_rows` indicates that a series is interrupted by blank rows from time to time, and `continuity` indicates (when `False`) that a series...

Idea

- Tag rows with the ratio of digits, as done for `data_starts` discovery. When the ratio goes significantly up or the Excel file ends (hitting the dimension boundary, as interpreted by `openpyxl`, or...

Idea

- Tag rows with the ratio of digits over total characters in the row. When a low ratio of digits shifts to a high ratio of digits consistently (for more...

Idea
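The digit-ratio tagging could be sketched as follows. The function names, the `threshold` of 0.5, and the `min_run` of 3 consecutive rows are hypothetical choices; the issue does not state specific values.

```python
def digit_ratio(row):
    """Ratio of digit characters over all characters in a row's non-empty cells."""
    text = "".join(str(value) for value in row if value is not None)
    if not text:
        return 0.0
    return sum(ch.isdigit() for ch in text) / len(text)


def find_data_starts(rows, threshold=0.5, min_run=3):
    """Return the 1-based number of the first row that opens a run of
    `min_run` consecutive rows whose digit ratio is at least `threshold`."""
    ratios = [digit_ratio(row) for row in rows]
    for i in range(len(ratios) - min_run + 1):
        if all(ratio >= threshold for ratio in ratios[i:i + min_run]):
            return i + 1
    return None


rows = [
    ["Quarterly report 2015", None],
    ["Date", "GDP"],
    ["2015-01-01", 1203.5],
    ["2015-04-01", 1250.1],
    ["2015-07-01", 1301.7],
]
data_starts = find_data_starts(rows)
```

Requiring a run of rows, rather than a single row, keeps a title that happens to contain a year from being mistaken for the start of the data.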

- From a header coordinate, look up to see whether there is a string value right before the header. That should indicate the presence of `composed_headers`. Also, a merged cell...

Idea

- Header coordinates could be discovered from the `data_starts` parameter alone, by looking up to the first string that is not an empty or `None` value. There should be a way...

Idea
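Looking up from `data_starts` to the first non-empty string could be sketched like this for a single column. `find_header_row` and the list-based `column_values` layout are hypothetical; a real implementation would read cells from the worksheet instead.

```python
def find_header_row(column_values, data_starts):
    """Return the 1-based row of the first non-empty string above `data_starts`.

    `column_values` lists one column's values from top to bottom, so index 0
    holds row 1. `data_starts` is the 1-based row where the data begins.
    """
    for row in range(data_starts - 1, 0, -1):
        value = column_values[row - 1]
        # Skip None, numbers, and strings that are empty or only whitespace.
        if isinstance(value, str) and value.strip():
            return row
    return None


column = ["Title", None, "GDP", "", None, 1203.5, 1250.1]
header_row = find_header_row(column, data_starts=6)
```

Here rows 5 and 4 are skipped because they hold `None` and an empty string, so the header is found in row 3.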