selvas4u

Results 12 comments of selvas4u

Elasticsearch Version is 1.2.1 and tika version is 1.4(tika-core-1.4)

I have reinstalled the river web plugin, Now its started perfectly. I have made the crawling using the data in the issue 18 by making incremental false. Still I got...

I tried it. Its working perfectly... Thanks a lot... But when I changed the URL to "https://www.google.com" or any https site, crawling is not working. Please help me in resolving...

When Im using the http url, then the log files shows the url has picked up by the Robot client. Please see the below logs [2014-07-28 11:25:00,000][INFO ][org.codelibs.elasticsearch.web.river.WebRiver] web.my_webJob is...

Im using elastic search 1.0.2 version and river-web 1.1.0

I'm using rest client for running the command.. Pls find below steps that i followed I'm used some other https site than www.google.com Create an Index http://localhost:9200/webindex Mapping http://localhost:9200/webindex/my_web/_mapping {...

Great .... Now its started crawling. But i got connection refused. is any proxy need to set? Please see the below logs [2014-07-29 16:56:46,067][WARN ][org.seasar.framework.container.assembler.BindingTypeShouldDef] Skip setting property, because property(requestListener)...

Im using robotsTxt as false. Then it should ignore the robot txt. Am i right? depends on your network environment - do you mean firewall or proxy?

is there configuration required for this for river-web?

We have meta values like this META NAME="title" CONTENT="Sample Page" META NAME="keywords" CONTENT="Test,Test1" META NAME="PageType" CONTENT="HomePage" We need to store these values in the elasticsearch index's input suggestor as follow...