Results: 40 comments by BrenBarn

I finally got irritated enough to take another look at this and managed to create a "manual override" like I described above. The change can be seen on my fork...

The author of DreamPie doesn't appear to be maintaining it. It would be cool to have a maintained version for sure. I don't know of any maintained fork. It seems...

"multistream" apparently means it was compressed in a different way (see [here](http://stackoverflow.com/questions/33642107/multistream-wikipedia-dump)), so maybe WikiExtractor doesn't know how to handle that. It works on non-multistream bz2 dump files.
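For anyone wanting to check this outside WikiExtractor, here is a minimal sketch using only Python's standard `bz2` module. It assumes a "multistream" file is essentially several bz2 streams concatenated back to back, and the sample data and the `decompress_streams` helper are made up for illustration. It says nothing about how WikiExtractor actually reads its input; it only shows why a reader that stops after the first stream would miss data.

```python
import bz2

# Two independently compressed streams, concatenated back to back.
# This mimics the assumed layout of a "multistream" file.
part_one = bz2.compress(b"first stream\n")
part_two = bz2.compress(b"second stream\n")
multistream = part_one + part_two

# bz2.decompress (Python 3.3+) reads through every stream in the input.
all_data = bz2.decompress(multistream)

# A single BZ2Decompressor stops at the end of the first stream and
# leaves the remainder in .unused_data, so a reader built on it must loop.
def decompress_streams(data):
    """Hypothetical helper: return the decompressed payload of each stream."""
    chunks = []
    while data:
        decomp = bz2.BZ2Decompressor()
        chunks.append(decomp.decompress(data))
        data = decomp.unused_data
    return chunks
```

If a tool only ever creates one decompressor object, it would see the first stream and silently drop the rest, which might be consistent with the behavior reported here.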

What is the status of this? As it is, since list contents are not nested, there's no easy way to remove a list (or a list item) completely from the...

I took a look at the parsing code. There seem to be several situations where linebreaks really ought to be considered tokens, but are not. This results in parsing that...

That's true, although I think it's not unreasonable to punt on handling `` since it's kind of a different kettle of fish. Even other HTML tags aren't interpreted inside ``.

> Would it be possible to eliminate the cross-language counting problem by adding a magic directive that allows a user to specify that there will only be one view of...

Strange. What could be causing it to work on some systems and not others?

You mean the latest Windows updates? I think so. I don't have Google Chrome installed, but I do have Vivaldi installed and `` works fine in Vivaldi.

I agree with the other comments that limiting to `__all__` should be the default behavior. It's great for IPython to be *able* to list hidden names to allow under-the-hood poking...
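To make the proposed default concrete, here is a rough sketch of the kind of filtering I mean. This is not IPython's actual completer code; `public_names` and the `demo` module are hypothetical names invented for the example. The idea is just: if a module defines `__all__`, offer only those names, otherwise fall back to everything that doesn't start with an underscore.

```python
import types

def public_names(module):
    """Return the names a completer might offer, preferring __all__ if defined."""
    declared = getattr(module, "__all__", None)
    if declared is not None:
        return sorted(declared)
    # No __all__: fall back to every attribute without a leading underscore.
    return sorted(name for name in dir(module) if not name.startswith("_"))

# A throwaway module with one intended-public name and one helper.
demo = types.ModuleType("demo")
demo.api = lambda: None
demo.helper = lambda: None
demo.__all__ = ["api"]
```

With `__all__` set, `public_names(demo)` offers only `api`; a separate opt-in switch could still expose `helper` and the other hidden names for under-the-hood poking.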