
Transliterations

This part of the project collects transliterations of key phrases related to COVID-19 using Wikidata. src/wikidata_search.py searches the Wikidata API for relevant entities, and src/wikidata_transliterations.py then runs simple SPARQL queries to collect the labels and aliases of those entities. The labels come with language metadata. This seems to provide a decent initial list of relevant terms across multiple languages.
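The two steps above can be sketched roughly as follows. This is a minimal illustration, not the code in src/: the function names (`build_search_params`, `build_label_query`, `parse_bindings`, `fetch_terms`) and the User-Agent string are hypothetical, and it uses only the standard library rather than mwapi. It assumes the public `wbsearchentities` API action and the Wikidata Query Service endpoint, where labels are exposed as `rdfs:label` and aliases as `skos:altLabel`.

```python
import json
import urllib.parse
import urllib.request

# Assumed public endpoint of the Wikidata Query Service.
WDQS_ENDPOINT = "https://query.wikidata.org/sparql"


def build_search_params(phrase, language="en"):
    """Parameters for a wbsearchentities request (step 1: find entities)."""
    return {
        "action": "wbsearchentities",
        "search": phrase,
        "language": language,
        "format": "json",
    }


def build_label_query(qid):
    """SPARQL for every label and alias of one entity (step 2)."""
    if not (qid.startswith("Q") and qid[1:].isdigit()):
        raise ValueError(f"not a Wikidata item ID: {qid!r}")
    return f"""
SELECT ?term ?kind WHERE {{
  {{ wd:{qid} rdfs:label ?term . BIND("label" AS ?kind) }}
  UNION
  {{ wd:{qid} skos:altLabel ?term . BIND("alias" AS ?kind) }}
}}
"""


def parse_bindings(response):
    """Turn a SPARQL JSON result into (language, kind, term) tuples."""
    rows = []
    for binding in response["results"]["bindings"]:
        term = binding["term"]
        # Language-tagged literals carry their tag under "xml:lang".
        rows.append((term.get("xml:lang", ""), binding["kind"]["value"], term["value"]))
    return rows


def fetch_terms(qid):
    """Run the query over the network and return the parsed rows."""
    url = WDQS_ENDPOINT + "?" + urllib.parse.urlencode(
        {"query": build_label_query(qid), "format": "json"}
    )
    # Hypothetical User-Agent; replace with one identifying your own tool.
    request = urllib.request.Request(
        url, headers={"User-Agent": "transliterations-example/0.1"}
    )
    with urllib.request.urlopen(request) as reply:
        return parse_bindings(json.load(reply))
```

Calling `fetch_terms` with an item ID returned by the search step would yield one row per label or alias, each tagged with its language code. Keeping query construction and result parsing separate from the network call makes both easy to check offline.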