May 20, 2010

Apache Tika = Power to Parse Almost Everything

Filed under: java, lucene — Tags: — Rahul Sharma @ 10:19 pm

The other day I was browsing through the subprojects that have evolved under Lucene. There are several of them, each serving a different purpose. That is how I landed on Apache Tika. At first it sounded like just another parser, but later, when I started playing with it, it turned out to be quite fun. (more…)