Rustam
> Reverting the uchardet changes (#52) sounds like a bad idea anyway. I didn’t mean to cancel, but to try to improve on the basis of the knowledge gained :)...
Maybe this will resolve #76 (see https://github.com/alberto-dev/notepad-plus-plus/commit/a504ebba54c41309f42006f8d82ecea435085731#diff-18d581d96114cd69e207975bf1c4fa43L249).
Take a look. Thus, the new encoding detections were deleted (https://github.com/notepad-plus-plus/notepad-plus-plus/pull/5414/commits/9a39fafd335f2e1e5af4b5a3251c7cd961ee5fe9#diff-7c6715d4fafa723d6682f3b295c32875L82). This allowed us to discard cases where the same metrics arise (https://github.com/CharsetDetector/UTF-unknown/issues/77#issuecomment-573397518).
Before adding, you need to make sure that everything will be fine; see https://github.com/CharsetDetector/UTF-unknown/issues/77#issuecomment-573397518
Hello! I created PR #63 for ease of understanding. To detect the encoding, prober objects are created. They are defined for multiple languages. With a small sample...
It seems to me that first we need to try to single out single-byte probers by language, as models
Hello, @304NotModified! Can we make breaking changes and mark everything in `src/Core` as `internal`? This would make it easier to change the code.
I think it would be nice if we could just change the source in `src/core` without thinking about breaking changes. That is, change the modifier from `public` to `internal`. I...
In the _Status Log_, the following metrics are the same:
> SBCS 0.8360017: [iso-8859-15]
> SBCS: 0.8360017 [iso-8859-15]
>
> SBCS 0.8360017: [iso-8859-1]
> SBCS: 0.8360017 [iso-8859-1]
>
> SBCS...
As I understand it, in this case it is easier to get the same statistics, because the character sets are so similar (see https://en.wikipedia.org/wiki/ISO-8859-1#Similar_character_sets).
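
To illustrate why the two encodings can end up with identical scores, here is a minimal Python sketch. It is not how UTF-unknown computes its metrics; it only uses Python's built-in `iso-8859-1` and `iso-8859-15` codecs to show how few byte values the two encodings disagree on. The helper name `differing_bytes` and the sample string are made up for this illustration.

```python
def differing_bytes(enc_a, enc_b):
    """Return the byte values that decode to different characters
    under the two single-byte encodings."""
    diffs = []
    for value in range(256):
        raw = bytes([value])
        if raw.decode(enc_a) != raw.decode(enc_b):
            diffs.append(value)
    return diffs


# Byte values where ISO-8859-1 and ISO-8859-15 disagree.
diffs = differing_bytes("iso-8859-1", "iso-8859-15")
print([hex(v) for v in diffs])

# A hypothetical sample that uses none of those byte values:
# both encodings decode it to exactly the same text.
sample = "café déjà vu".encode("iso-8859-1")
same = sample.decode("iso-8859-1") == sample.decode("iso-8859-15")
print(same)
```

Only eight byte values differ between the two encodings, so any sample that contains none of them looks exactly the same under both, and a statistics-based prober has nothing to tell them apart.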