Searching in large PDF files. What technology to use?
I made an applicant tracking system which has more than 20,000 CV and the best approach for you is ElasticSearch, Because:
- very high performance
- 100% accuracy for searching
- very easy to use with simplest APIs
- easy to backup with replicas
And I recommend to use ElasticSearch amazon service ES.
And about UI framework I just use JavaScript with FineUploader what hepled me a lot with chunking and parallel upload.