covid19/transliterations/README.md
Nathan TeBlunthuis 36167295ec Finish MVP for transliterations
code is reasonably well-written
checked that we get seemingly good data back
adding README
adding data
2020-03-24 22:06:45 -07:00

4 lines
443 B
Markdown

# Transliterations
This part of the project collects tranliterations of key phrases related to COVID-19 using Wikidata. We search the Wikidata API for entities in `src/wikidata_search.py` and then we make simple SPARQL queries in `src/wikidata_transliterations.py` to collect labels and aliases the entities. The labels come with language metadata. This seems to provide a decent initial list of relevant terms across multiple languages.