spark-ocr-workshop icon indicating copy to clipboard operation
spark-ocr-workshop copied to clipboard

Public runnable examples of using John Snow Labs' OCR for Apache Spark.

Results 26 spark-ocr-workshop issues
Sort by recently updated
recently updated
newest added

I’ve created a custom pipelines that uses `DicomToImage` as well as `PdfToImage`. The pipelines works as expected when run normally. However, I encounter an issue when I attempt to convert...

Notebooks with error while writing Dicom files in Pyspark==3.5.0 env. For the time being added #NB_EVAL comment to the cells which can fail. 1. jupyter/Dicom/SparkOcrDicomDeIdentificationV3.ipynb 2. jupyter/Dicom/SparkOcrDeidentificationDicomWithEncapsulatedPDF.ipynb All other nb's...

I am not able to access this: https://pypi.johnsnowlabs.com/ Gives this: ``` 403 Forbidden Code: AccessDenied Message: Access Denied RequestId: QGHN8VPV3 (truncated) HostId: 3mdkTlnGi3YWyNhj (truncated) ```

Add Notebooks : SparkOCRPdfToText_V1.0.0_Bench.ipynb SparkOCRPdfToText_V1.1.2_Bench.ipynb

imagePath = "./data/signature/*" image_df = spark.read.format("binaryFile").load(imagePath) I am facing a path issue while running the code in google colab. I have 30 day trial version right now. Spark version: 3.0.2...