Sujee Maniyam
Sujee Maniyam
if allmark chooses a random portnumber please print it on startup e.g listening on port number : 30000
thanks for the report... I will update this soon http://sujee.net On Mon, Jan 6, 2014 at 6:24 AM, OhadR [email protected] wrote: > in FreqCounter1.java, > scan.addColumns(columns); does not compile. >...
no need to publish the 'bad word files' to pypi. But can we give a url to a accessible badwords file (we can point to our example from github (https://github.com/IBM/data-prep-kit/tree/dev/transforms/language/doc_quality/ray/ldnoobw)...
no code change necessary, just to be clear :-) I will work on an example showcasing: 1. downloading the bad-words files from a location (could be ours or any other...
with latest releases (1.0.0 + ) using doc_quality plugin works via pip install
I do see `document_hash` in the contents. I would like to see this propagated up as a top-level column in the output parquet. Along with actual file size. 
@dolfim-ibm with the new Docling integration, will this be addressed as well?
pdf2pq now blocked on #767
> I do like the idea of having this resource page, but I saw @Bytes-Explorer 's reservations about doing this. I discussed it with Nirmit this morning and his general...
> @sujee Need to sync with upstream and update branch with latest before editing latest. done. please check again. thx