- move 'input' files to resources - outputs not meant for downstream go in output/intermediate - csv outputs for downstream go in output/csv |
||
---|---|---|
.. | ||
analysis | ||
output | ||
resources | ||
src | ||
README.md | ||
requirements.txt |
Transliterations
This part of the project collects tranliterations of key phrases related to COVID-19 using Wikidata. We search the Wikidata API for entities in src/wikidata_search.py
and then we make simple SPARQL queries in src/wikidata_transliterations.py
to collect labels and aliases the entities. The labels come with language metadata. This seems to provide a decent initial list of relevant terms across multiple languages.