David Riccitelli

Results: 80 comments by David Riccitelli

We don't have a specific domain, we would like this to work on any content.

@wetneb is there a reason you're not using the _edges_ in OpenTapioca/Wikidata to run the classifier? Did you try already?

I am not sure. I am trying to understand how to solve the above reference case. I can see that Apple Inc. (Q312) has an edge to Steve Jobs (Q19837)...

I see. I think there's some tuning to do then. Take for example this scenario: "Apple is a fruit" works nicely: ![image](https://user-images.githubusercontent.com/11438/112032763-05b5d280-8b3d-11eb-8e70-1550e65a0169.png) ![image](https://user-images.githubusercontent.com/11438/112032778-09e1f000-8b3d-11eb-9cfd-602fef7bc1a4.png) "Apple Inc has been founded by Steve...

Yes, and this is the path that I followed. I thought I could find a better dataset for training, but this increases complexity. And then I thought, is there a...

Yes, sorry. But is this a classifier issue? "Apple Inc" is detected while "Apple" as an alias isn't. Sorry, I might not be too expert with the classifier. I am asking...

Great! In the next few days, I'll clean things up a bit and start making PRs.

Maybe we can ping someone from [scikit-learn](https://github.com/scikit-learn/scikit-learn) like @ogrisel (we worked together on Apache Stanbol) or @amueller, @thomasjpfan, @hermidalc... I see that there's an open issue, https://github.com/scikit-learn/scikit-learn/issues/11536