knime-textprocessing icon indicating copy to clipboard operation
knime-textprocessing copied to clipboard

KNIME - Text Processing Extension (Labs)

KNIME® Text Processing

Jenkins Quality Gate Status

This repository is maintained by the KNIME Team Rakete.

KNIME Analytics Platform - Text Processing Integration is designed to read, enrich, manipulate, and extract textual data, and transform it into numerical representations, such as document or term vectors. Once numerical vectors are created, regular KNIME nodes can be applied, for example, for predictive modeling, clustering analysis, or visualization.

Content

This repository contains the source code for KNIME - Text Processing Integration. The code is organized as follows:

  • org.knime.ext.textprocessing: Text Processing integration nodes and data types
  • org.knime.ext.textprocessing.dl4j: Text Processing Word2Vec integration
  • org.knime.ext.textprocessing.models: External models for tokenization and tagging (NER, POS, etc.)

Development Notes

You can find instructions on how to work with our code or develop extensions for KNIME Analytics Platform in the knime-sdk-setup repository on BitBucket or GitHub.

Join the Community!