hive-solr
hive-solr copied to clipboard
Code to index Hive tables to Solr and Solr indexes to Hive
Hi I'm pushing 600 million records every day to solr using Hive. As data is huge, solr is responding slow. Due to which i'm getting socket timeout exception. Read timeout....
Hi sorry to ask a question here in the Issues section, but I'm at a loss as to where else to ask. Does the solr-hive-serde integration support solr multiValued solr...
When the num of fields is 34 then import successfully,once the num larger than 34 , i will meet exception of connection reset. (There are 2 clusters for hive and...
Hi all, I am trying to use the hive_1x branch and have built _solr-hive-serde-3.0.0.jar_ based on your instructions. I was also able to add this jar in my Hive runtime...
I am trying to do the following and I am getting "FAILED: SemanticException Cannot find class 'com.lucidworks.hadoop.hive.LWStorageHandler'" `create external table IF NOT EXISTS movies_solr2(movieId INT, title STRING, genres STRING) STORED...
solr-hive-serde has jersey-client packaged into it, which is not used by the serde, but will be added to the classpath, and this causes issues for HiveServer2 and Ranger. I have...
When the solr-hive-serde is added as UDF to Hive, HiveServer2 fails to communicate with Ranger, and cannot download updated Ranger policies. This is due to a conflict caused by having...
Hi all, We are trying to index more than 250MLN rows from Hive table (ORC format) but we have noticed that the indexing is too slow. We have 9 SolR...
I've been using hive-solr to index my data to Solr collections very well. Last week I wanted to try Time Routed Alias feature of Solr, which behaved as expected when...
Today mixed case field return null value instead. For example: Using the following query: SELECT id, clientId, ownerId from my_solr_table limit 1 will return the values: - abcefg - null...