Broken Links

Open rnowling opened this issue 2 years ago • 0 comments

The links to the Wikipedia page on N-grams and the L-p vector normalization are not being interpreted correctly on the TextFeaturizingEstimator page:

This estimator gives the user one-stop solution for doing:

    Language Detection
    [Tokenization](https://en.wikipedia.org/wiki/Lexical_analysis#Tokenization)
    [Text normalization](https://en.wikipedia.org/wiki/Text_normalization)
    [Predefined and custom stopwords removal](https://en.wikipedia.org/wiki/Stop_words)
    [Word-based or character-based Ngram extraction and SkipGram extraction (through the advanced [options](https://learn.microsoft.com/en-us/dotnet/api/microsoft.ml.transforms.text.textfeaturizingestimator?view=ml-dotnet))](https://en.wikipedia.org/wiki/N-gram)
    [TF, IDF or TF-IDF](https://en.wikipedia.org/wiki/Tf%E2%80%93idf)
    [L-p vector normalization](xref: Microsoft.ML.Transforms.LpNormNormalizingTransformer)
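
Two entries in the quoted list look like the likely culprits: the N-gram entry puts one link inside another link's text, and the L-p entry has a space after `xref:`. Under CommonMark rules, links cannot be nested and an unbracketed link destination cannot contain whitespace. The sketch below flags both patterns in a line of Markdown. The function name `find_link_problems` and the regular expressions are hypothetical helpers for illustration, not part of DocFX or ML.NET, and the docs build pipeline may apply additional rules of its own.

```python
import re

# Link text that itself contains a complete "[text](url)" link.
NESTED_LINK = re.compile(r"\[[^\[\]]*\[[^\[\]]*\]\([^()]*\)")

# A destination with whitespace that is not followed by a link title,
# e.g. "](xref: Some.Uid)". Titles such as '](url "title")' are skipped.
SPACE_IN_DEST = re.compile(r"\]\(([^()\s]+\s+(?![\s\"'(])[^()]*)\)")


def find_link_problems(line: str) -> list[str]:
    """Return labels for suspicious Markdown link patterns found in a line.

    Heuristic only: this is a regex scan, not a Markdown parser.
    """
    problems = []
    if NESTED_LINK.search(line):
        problems.append("nested link")
    if SPACE_IN_DEST.search(line):
        problems.append("whitespace in destination")
    return problems


if __name__ == "__main__":
    samples = [
        "[TF, IDF or TF-IDF](https://en.wikipedia.org/wiki/Tf%E2%80%93idf)",
        "[L-p vector normalization](xref: Microsoft.ML.Transforms.LpNormNormalizingTransformer)",
    ]
    for sample in samples:
        print(find_link_problems(sample), sample)
```

Run against the list above, this reports "nested link" for the N-gram entry and "whitespace in destination" for the L-p entry, and nothing for the other entries, which matches the two links named in this report.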

Document Details

Do not edit this section. It is required for learn.microsoft.com ➟ GitHub issue linking.

rnowling · Mar 04 '23 15:03