SolrTextTagger
Tagging within HTML attributes
If I send the text below to SolrTextTagger, it also annotates the attribute value of the object tag (term1), not just the element text (term2):
<html>
<head>
</head>
<body>
<object title="term1">term2</object>
</body>
</html>
Is there any way I can skip the attributes of HTML/XML tags?
Pull requests welcome :-)
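Until something like that lands, one possible client-side workaround is to blank out the markup before sending the text to the tagger. The sketch below is only an illustration and is not part of SolrTextTagger: `blank_out_tags` and `TAG_RE` are hypothetical names, and the regex assumes that attribute values never contain a literal `>`. Each tag is replaced with spaces of the same length, so the offsets returned for the cleaned text should still line up with the original document.

```python
import re

# Matches a whole tag such as <object title="term1"> or </body>.
# Assumption: attribute values never contain a literal '>' character.
TAG_RE = re.compile(r"<[^>]*>")


def blank_out_tags(markup: str) -> str:
    """Replace every tag with spaces of equal length, keeping newlines."""

    def _blank(match: re.Match) -> str:
        # Keep newlines so line structure (and character offsets) is unchanged.
        return re.sub(r"[^\n]", " ", match.group(0))

    return TAG_RE.sub(_blank, markup)


doc = '<object title="term1">term2</object>'
cleaned = blank_out_tags(doc)

# The attribute value is gone, and the element text keeps its original offset.
print(repr(cleaned))
print(cleaned.find("term2") == doc.find("term2"))
```

A regex is not a full HTML parser, so comments, CDATA sections, or script blocks would need extra handling. The point of the design is only that same-length replacement avoids any offset remapping afterwards.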