README.md

Produce Europeana Entity Completion

Take the MongoDB dump and produce Solr documents:

./scripts/dump-surface-forms.py termlist.json outputsolr.json sameAs.json type

where sameAs.json contains only the URIs of the entities of the given type, termlist.json contains the descriptions of the entities, and type is the type of the entities in sameAs.json (concept, agent, etc.).

Get the surface forms in different languages.

The output contains JSON documents ready to be indexed in Solr:

export-language-surface-forms.py solrdocument.json wikidata.json enriched-solr-documents.json
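
Since the output is meant to be indexed in Solr, the following is a minimal sketch of how that could be done from Python using only the standard library. It is not part of this repository. The Solr base URL, the core name `entities`, and the helper names `build_solr_update_request` and `index_file` are all assumptions for illustration, and the sketch assumes the output file holds a single JSON array of documents.

```python
import json
import urllib.request


def build_solr_update_request(docs, solr_url="http://localhost:8983/solr", core="entities"):
    """Build (but do not send) a POST request that indexes `docs` in Solr.

    `solr_url` and `core` are hypothetical defaults; adjust them to your setup.
    """
    # Solr's JSON update handler accepts a JSON array of documents.
    body = json.dumps(docs).encode("utf-8")
    return urllib.request.Request(
        url=f"{solr_url}/{core}/update?commit=true",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def index_file(path, **kwargs):
    """Load a JSON array of documents from `path` and send it to Solr."""
    # Assumption: the file is one JSON array, not one document per line.
    with open(path, encoding="utf-8") as f:
        docs = json.load(f)
    request = build_solr_update_request(docs, **kwargs)
    with urllib.request.urlopen(request) as response:
        return response.status
```

With a running Solr instance, `index_file("enriched-solr-documents.json", core="entities")` would send the documents and return the HTTP status code. Building the request separately from sending it makes it possible to inspect the URL and payload before anything is indexed.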