Karthikeyan Madathil

57 comments by Karthikeyan Madathil

I have updated the documentation with more information about our internals.

@gasyoun - Install is as simple as `pip install sanskrit_parser`, which is documented. If your concern is gensim's dependency on MS VC++, please take it up with them....

Thanks for the observation, @sumanthegde, and apologies for the delay in responding. @avinashvarna, could you take a look at whether this is relevant to our usage?

Sure, we should. @drdhaval2785 - I had looked at this, and I remember we'd discussed this briefly as well. Is this completely in PHP, or is there a Python version...

Thank you very much. It would be great if you could point to the parts of your PHP code that you think are best to reuse (I'm sure there are a lot)....

Current status - YAML format for Sutras defined and parser implemented. This allows Sutras to be coded easily. This is way better than coding directly in Python, but I'm not...

```
$ time python ../../scripts/sanskrit_generator -t rAma -p jas --verbose
unable to import 'smart_open.gcs', disabling that module
INFO Inputs [rAma, as]
INFO rAma ['prAtipadika', 'pum']
INFO as ['pratyaya', 'svAdi', 'sup',...
```

Since this subsumes UoHD, I think we can make this our primary test corpus for sandhi. We need to find a corpus for parsing.

I can see some erroneous spaces (which we can remove programmatically) and clear bad splits in SandhiKosh. They don't split some samAsas (which we do) and usually do not split...
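Removing the stray spaces could look something like the sketch below. It assumes each split is stored as a single string with parts joined by `+`, and that any whitespace inside a part is an error; both are assumptions about the data, and `normalize_split` is a hypothetical helper, not something in the repository.

```python
import re
from typing import List


def normalize_split(entry: str, sep: str = "+") -> List[str]:
    """Split an entry on `sep` and strip all whitespace from each part.

    Assumption: whitespace never belongs inside a part, so it is
    removed everywhere, not just at the edges.
    """
    parts = [re.sub(r"\s+", "", part) for part in entry.split(sep)]
    # Drop empty parts left behind by a leading or trailing separator.
    return [part for part in parts if part]


if __name__ == "__main__":
    print(normalize_split("rAma + asya "))
```

This only handles the whitespace problem; the bad splits themselves would still need to be filtered or corrected by hand.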

Take a look at branch multigraph, tests/SandhiKosh. `manual_test.py` runs the tests and writes the output to `Results.xls`. I've run 1000 tests so far, with 622 passing (62.2%). I will run the full dataset next.
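For tracking the number across runs, a tally along these lines might be handy. This is a standalone sketch, not what `manual_test.py` does; `pass_rate` and the `(name, passed)` pair format are hypothetical.

```python
from typing import Iterable, Tuple


def pass_rate(results: Iterable[Tuple[str, bool]]) -> Tuple[int, int, float]:
    """Return (passed, total, percentage) for (test_name, passed) pairs."""
    passed = 0
    total = 0
    for _name, ok in results:
        total += 1
        passed += int(ok)
    # Guard against an empty run to avoid dividing by zero.
    rate = 100.0 * passed / total if total else 0.0
    return passed, total, rate


if __name__ == "__main__":
    # Synthetic data shaped like the run reported above: 622 of 1000 pass.
    demo = [("test_%d" % i, i < 622) for i in range(1000)]
    passed, total, rate = pass_rate(demo)
    print("%d/%d passed (%.1f%%)" % (passed, total, rate))
```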