Manuel Aristarán

Results 34 issues of Manuel Aristarán

Here's a case that we might want to look into: https://www.dropbox.com/s/0i6ae5kgtcy0frb/s-013163.pdf It's definitely a "spreadsheet", but the lines-of-text / ruling-lines ratio is way below/above the heuristic's defined threshold.

- Move all of its methods to `Rectangle2D` - Make the "rectangular" entities (`TextElement`, `Page`, etc) inherit from `Rectangle2D` (we reopened it, anyway)

The test suite code has become a unmaintainable mess. Let's clean that up. While we're at it, we should merge [`icdar-groundtruth-tests`](https://github.com/jazzido/tabula-extractor/tree/icdar-groundtruth-tests) into `master`.

see PDFBox operator `setLineWidth` Also, see what to do with PDFs that draw lines as thin rectangles (as per discussion in #16)

make `TextElement#should_add_space?` accept an optional parameter `TOLERANCE_THRESHOLD`

When PDFBox 3.0.0 leaves RC status, will merge and release Tabula 1.1.0.

Hey, First off, thanks for `react-jsx-parser`! I've found a (possible) bug with boolean expressions. In a JSX string, this expression is not evaluated to the expected value: ``` ... {boundFunction(!boundValue)}...

help wanted

http://nlp.lsi.upc.edu/freeling/doc/userman/html/node84.html Perhaps, client/server mode can be specified as optional arguments to `Freeling::Analyzer.new`.