ruby_tika_app
A Ruby wrapper for the Tika jar (tika-app.jar) that extracts text from many file formats, such as PDF, XLS, and DOC.
Add an optional config file path parameter to the initialize method to allow for custom configuration. Closes #16
In our application we customize the Tika configuration, so it would be helpful to be able to pass in a config file path.
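As a rough illustration of the request above, the sketch below shows one way an optional config path could be threaded into the command that gets run. `TikaCommandSketch`, `jar_path`, and `config_path` are hypothetical names chosen for this example and are not the gem's actual class or parameters; the only thing taken from Tika itself is the `--config=<file>` option of tika-app.

```ruby
# Hypothetical wrapper for illustration only; not the gem's real class.
class TikaCommandSketch
  # jar_path and config_path are assumed parameter names.
  def initialize(document, jar_path: "tika-app.jar", config_path: nil)
    @document = document
    @jar_path = jar_path
    @config_path = config_path
  end

  # Returns the argument list for one Tika run, e.g. option = "--text".
  # An array can be handed to system(*args) without shell quoting issues.
  def command(option)
    args = ["java", "-jar", @jar_path]
    # Only add the flag when a config file was actually supplied.
    args << "--config=#{@config_path}" if @config_path
    args << option << @document
    args
  end
end

with_config = TikaCommandSketch.new("report.pdf", config_path: "tika-config.xml")
p with_config.command("--text")
```

Keeping the parameter optional with a `nil` default means existing callers that pass only a document would keep working unchanged.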
Hi, @mrcsparker! How can I add `--encoding=UTF-8` to the Tika app call? I need to run tika-app-1.19.1.jar as:
```
final_cmd = "#{@tika_cmd} #{option} --encoding=UTF-8 '#{@document}'"
```
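A minimal sketch of how the command string from the question could be assembled with an optional encoding follows. `build_tika_command` and its `encoding:` keyword are hypothetical names for this example, not part of the gem. Using `Shellwords.escape` on the document instead of wrapping it in single quotes is a design choice made here, so that file names containing spaces or quotes stay intact.

```ruby
require "shellwords"

# Hypothetical helper; tika_cmd is assumed to be something like
# "java -jar tika-app-1.19.1.jar" and option something like "--text".
def build_tika_command(tika_cmd, option, document, encoding: nil)
  parts = [tika_cmd, option]
  # --encoding=X is the tika-app flag for the output encoding.
  parts << "--encoding=#{encoding}" if encoding
  # Escape the document so the shell sees it as a single argument.
  parts << Shellwords.escape(document)
  parts.join(" ")
end

puts build_tika_command("java -jar tika-app-1.19.1.jar", "--text",
                        "my file.pdf", encoding: "UTF-8")
```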
NoMethodError: undefined method `=~' for #
  from /home/jasonp/.rvm/gems/ruby-2.1.1/gems/ruby_tika_app-1.5.0/lib/ruby_tika_app.rb:19:in `initialize'
  from (irb):3:in `new'
  from (irb):3
  from /home/jasonp/.rvm/gems/ruby-2.1.1/gems/railties-3.2.22/lib/rails/commands/console.rb:47:in `start'
  from /home/jasonp/.rvm/gems/ruby-2.1.1/gems/railties-3.2.22/lib/rails/commands/console.rb:8:in `start'
  from /home/jasonp/.rvm/gems/ruby-2.1.1/gems/railties-3.2.22/lib/rails/commands.rb:41:in `'
  from script/rails:6:in `require'
  from script/rails:6:in `'
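The trace shows only that the object passed to `new` did not respond to `=~` inside `initialize`; it does not show what that object was. One defensive approach, sketched below under the assumption that the constructor expects a String path or URL, is to convert the argument to a String before any pattern matching. `document_location` is a hypothetical helper, not something the gem provides.

```ruby
require "pathname"

# Hypothetical helper: coerce the constructor argument to a String location.
def document_location(document)
  return document if document.is_a?(String)
  # File and Tempfile objects expose their location through #path.
  return document.path if document.respond_to?(:path)
  # Pathname and most other objects give a usable string from #to_s.
  document.to_s
end

location = document_location(Pathname.new("docs/a.pdf"))
# A String always supports =~, so a URL check like this is now safe.
puts "remote document" if location =~ %r{\Ahttps?://\S+}
```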