francesco-mollica

Results 2 issues of francesco-mollica

Why add (t/f) in this formula for discards:

```
t = 0.0001
f = np.array(list(self.word_frequency.values())) / self.token_count
self.discards = np.sqrt(t / f) + (t / f)
```
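For context, here is a minimal sketch comparing two subsampling variants that could explain the extra term. It assumes the snippet follows the reference word2vec C code, which, as far as I can tell, keeps a token with probability `(sqrt(f / t) + 1) * (t / f)`; that expands to `sqrt(t / f) + t / f`. The word2vec paper instead states a discard probability of `1 - sqrt(t / f)`. The function names below are hypothetical and are not part of the repository in question.

```python
import math


def keep_probability_c_code(freq, t=1e-4):
    """Keep probability in the style of the reference C code (assumed origin).

    (sqrt(freq / t) + 1) * (t / freq) simplifies to sqrt(t / freq) + t / freq.
    Values above 1 simply mean the word is always kept.
    """
    ratio = t / freq
    return math.sqrt(ratio) + ratio


def discard_probability_paper(freq, t=1e-4):
    """Discard probability as written in the word2vec paper: 1 - sqrt(t / freq)."""
    return 1.0 - math.sqrt(t / freq)


if __name__ == "__main__":
    # Relative frequencies chosen only for illustration.
    for freq in (1e-5, 1e-4, 1e-3, 1e-2):
        keep_c = min(1.0, keep_probability_c_code(freq))
        keep_paper = min(1.0, 1.0 - discard_probability_paper(freq))
        print(f"f={freq:.0e}  keep (C-style)={keep_c:.4f}  keep (paper)={keep_paper:.4f}")
```

If this reading is right, the array named `discards` in the snippet would hold keep probabilities rather than discard probabilities, and the added `t / f` term makes frequent words slightly more likely to be kept than under the paper's formula.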

    def evaluate_pair_count(self, window_size):
        return self.sentence_length * (2 * window_size - 1) - (
            self.sentence_count - 1) * (1 + window_size) * window_size

Where does this formula come from? Is it...
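One way to probe a formula like this is to compare it against a brute-force count on a toy corpus. The sketch below assumes a fixed symmetric window (every token within `window_size` positions of the center, inside the same sentence, forms one pair) and that `sentence_length` means the total number of tokens across all sentences. Both are assumptions on my part; the repository may sample pairs differently, so a mismatch would not by itself prove the formula wrong. All helper names are hypothetical.

```python
def count_pairs_brute_force(sentence_lengths, window_size):
    """Count (center, context) pairs by walking every position of every sentence."""
    total = 0
    for n in sentence_lengths:
        for i in range(n):
            lo = max(0, i - window_size)
            hi = min(n - 1, i + window_size)
            total += hi - lo  # positions in [lo, hi], excluding the center itself
    return total


def count_pairs_closed_form(sentence_lengths, window_size):
    """Closed form for the same convention.

    Valid when every sentence has at least window_size tokens:
    each sentence of length n contributes 2*n*w - w*(w + 1) pairs.
    """
    w = window_size
    return 2 * sum(sentence_lengths) * w - len(sentence_lengths) * w * (w + 1)


def issue_formula(sentence_length, sentence_count, window_size):
    """The expression quoted in the issue, with attributes turned into arguments."""
    return sentence_length * (2 * window_size - 1) - (
        sentence_count - 1) * (1 + window_size) * window_size


if __name__ == "__main__":
    lengths = [5, 7]  # toy corpus: two sentences
    w = 2
    print("brute force :", count_pairs_brute_force(lengths, w))
    print("closed form :", count_pairs_closed_form(lengths, w))
    print("issue formula:", issue_formula(sum(lengths), len(lengths), w))
```

If the quoted formula disagrees with the brute-force count on such inputs, it presumably rests on a different pairing convention or is meant as an estimate rather than an exact count.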