Kerem Turgutlu

Results: 10 issues by Kerem Turgutlu

_Be sure you've searched [the forums](https://forums.fast.ai) for the error message you received. Also, unless you're an experienced fastai developer, first ask on the forums to see if someone else has...

Hello, I believe the corpus and the `word_freqs` output used in the [BPE](https://github.com/huggingface/course/blob/main/chapters/en/chapter6/5.mdx#implementing-bpe) / [WordPiece](https://github.com/huggingface/course/blob/main/chapters/en/chapter6/6.mdx#implementing-wordpiece) implementations have a mismatch: for example, `Course -> course` is not capitalized in the corpus but is in `word_freqs`...
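A minimal sketch of the word-frequency step that the course's BPE walkthrough starts from, showing why a capitalization mismatch matters. The corpus lines and plain whitespace splitting here are assumptions to keep the example self-contained; the course itself uses the GPT-2 pre-tokenizer.

```python
from collections import Counter

# Toy corpus; the course's actual corpus may differ.
corpus = [
    "This is the Hugging Face Course.",
    "This chapter is about tokenization.",
]

word_freqs = Counter()
for text in corpus:
    # Whitespace splitting stands in for the GPT-2 pre-tokenizer here.
    word_freqs.update(text.split())

# Casing matters: "Course." and "course." would be distinct keys, so a
# corpus/word_freqs capitalization mismatch changes the resulting merges.
print(word_freqs)
```

Because merges are learned from these counts, `Course` and `course` being counted as different words leads to different BPE merge rules downstream.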

## Description

Modified the activation calculation to speed things up. For example, calling `numpy()` and `detach()` in a for loop over and over again is very slow and bad practice....
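A sketch of the pattern described above (an assumed illustration, not the PR's actual code): converting each activation to NumPy inside the loop forces a device sync and a copy per iteration, while stacking once after the loop pays that cost a single time.

```python
import torch

def slow_collect(acts):
    # Anti-pattern: per-iteration detach + CPU transfer + NumPy conversion.
    return [a.detach().cpu().numpy() for a in acts]

def fast_collect(acts):
    # Collect tensors first, then convert to NumPy once at the end.
    return torch.stack([a.detach() for a in acts]).cpu().numpy()

acts = [torch.randn(4) for _ in range(8)]
out = fast_collect(acts)
print(out.shape)  # (8, 4)
```

On GPU tensors the difference is much larger, since every `.cpu()`/`.numpy()` call inside the loop blocks on a device-to-host transfer.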

How can we download the CXR and mask data for research purposes? The command line gives an error and the web UI says "you don't have permission to download":

```
darwin dataset pull v7-labs/covid-19-chest-x-ray-dataset:all-images...
```

The following doesn't time out, nor does it return anything:

```python
url = "http://http-live.sr.se/srextra01-mp3-192"
article = newspaper.Article(url, request_timeout=5)
article.download()
```

Same with:

```python
from newspaper.network import get_html_2XX_only

article.config.__dict__
# {'MIN_WORD_COUNT': 300, 'MIN_SENT_COUNT': 7, 'MAX_TITLE':...
```
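A likely cause, sketched under an assumption: the URL is a live MP3 stream, so the HTTP response body never ends. The `timeout` that `requests` exposes (and that `request_timeout` presumably feeds) caps connect time and the gap between chunks, not total download time, so a server that keeps sending data never trips it. `read_with_deadline` and `endless_stream` below are hypothetical helpers, not part of newspaper's API; they show how a wall-clock deadline on the read loop does stop the hang.

```python
import time

def read_with_deadline(chunks, deadline_s=5.0, max_bytes=1 << 20):
    # Enforce a total wall-clock deadline and a byte cap while reading chunks
    # (e.g. chunks = resp.iter_content(8192) from a streaming requests call).
    start = time.monotonic()
    buf = bytearray()
    for chunk in chunks:
        buf.extend(chunk)
        if len(buf) >= max_bytes or time.monotonic() - start > deadline_s:
            break  # stop even though the stream itself has no end
    return bytes(buf)

def endless_stream():
    # Stand-in for a live radio stream: data never stops arriving.
    while True:
        yield b"x" * 10

data = read_with_deadline(endless_stream(), max_bytes=100)
print(len(data))  # 100
```

With a per-read timeout alone, each 10-byte chunk arrives promptly, so the connection looks healthy forever; only a total deadline or byte cap terminates it.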

I am trying to understand the role that active intents play in the dataset and to replicate the Active Intent Accuracy of 0.924 on MultiWOZ 2.1 [reported here](https://github.com/google-research/google-research/tree/master/schema_guided_dst#results). But I've noticed a lot of...

Data preparation involves downloading Reddit comment and submission data from https://files.pushshift.io/reddit/, and the stated total size is around 700 GB. However, the actual size of the data is around...

I am interested in coding a little demo with the pretrained MultiWOZ model. However, I am not able to figure out how to inject book info into the db pointer dynamically....

Thanks a lot for putting this repo together and providing the fresh CC dumps at HF. I was looking for a way to find dataset splits for other languages but...

I am writing a data pipeline to process Common Crawl and am referencing your code in the [pile-cc repo](https://github.com/EleutherAI/pile-cc). In this repo, the raw Pile-CC version accounts for 200 GB; however, in...