Question and answer

You are given the task of:

Analyzing a big dataset (2 TB) of contractual JSON documents organized by doc_id
Tagging entities (companies) in the documents using entity_id from a collection of company profiles
Building a search system that, in response to the query <company_name>:
    Retrieves related documents ranked by their relevance
    Retrieves related companies and their products
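
To make the three steps concrete, here is a toy in-memory sketch of the data flow. It is only an illustration under assumptions: the sample documents, the profile fields (name, products), the function names tag_entities and search, and the use of raw mention counts as "relevance" are all hypothetical and are not taken from the task or the attached file. It does not suggest any particular database.

```python
from collections import Counter

# Hypothetical company profiles keyed by entity_id (stand-in for the profile collection).
PROFILES = {
    "E1": {"name": "Acme Corp", "products": ["Anvil", "Rocket"]},
    "E2": {"name": "Globex", "products": ["Widget"]},
}

# Hypothetical contract documents keyed by doc_id (stand-in for the JSON corpus).
DOCS = {
    "D1": {"text": "Acme Corp agrees to supply Globex. Acme Corp retains title."},
    "D2": {"text": "Globex leases equipment to a third party."},
    "D3": {"text": "Acme Corp renews its licence."},
}


def tag_entities(docs, profiles):
    """Tag each document with entity mentions: doc_id -> Counter({entity_id: count}).

    Assumption: a mention is a naive exact match of the profile name.
    """
    tags = {}
    for doc_id, doc in docs.items():
        counts = Counter()
        for entity_id, profile in profiles.items():
            n = doc["text"].count(profile["name"])
            if n:
                counts[entity_id] = n
        tags[doc_id] = counts
    return tags


def search(company_name, profiles, tags):
    """Return ranked documents and co-mentioned companies for a company name."""
    # Resolve the query to an entity_id (case-insensitive exact name match).
    wanted = company_name.strip().lower()
    entity_id = next(
        (eid for eid, p in profiles.items() if p["name"].lower() == wanted), None
    )
    if entity_id is None:
        return {"documents": [], "companies": {}}

    # Rank documents by mention count (assumed relevance proxy), ties broken by doc_id.
    hits = [(doc_id, c[entity_id]) for doc_id, c in tags.items() if entity_id in c]
    hits.sort(key=lambda pair: (-pair[1], pair[0]))

    # Related companies: other entities tagged in the same documents, with products.
    related = {}
    for doc_id, _ in hits:
        for other_id in tags[doc_id]:
            if other_id != entity_id:
                profile = profiles[other_id]
                related[profile["name"]] = profile["products"]

    return {"documents": hits, "companies": related}


tags = tag_entities(DOCS, PROFILES)
result = search("Acme Corp", PROFILES, tags)
```

In this sketch, result["documents"] lists (doc_id, mention count) pairs in ranked order and result["companies"] maps each co-mentioned company name to its products. At the scale described in the task, each of the three in-memory structures would have to be backed by some persistent store, which is what the question below asks about.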

Question: What kinds of databases would you use for this task, and why?
The attached file gives you more information.
