Also I want to know how to add meta data while indexing so that i can boost some parameters
There are several frameworks for extracting text suitable for Lucene indexing from rich text files (pdf, ppt etc.)
2.1m questions
2.1m answers
60 comments
57.0k users