covid19/transliterations
Nathan TeBlunthuis 09d171608f reorganize file structure
- move 'input' files to resources
- outputs not meant for downstream go in output/intermediate
- csv outputs for downstream go in output/csv
2020-03-29 21:49:57 -07:00
analysis regenerated following update to R src that creates this file 2020-03-28 17:31:36 -05:00
output reorganize file structure 2020-03-29 21:49:57 -07:00
resources reorganize file structure 2020-03-29 21:49:57 -07:00
src reorganize file structure 2020-03-29 21:49:57 -07:00
README.md Finish MVP for transliterations 2020-03-24 22:06:45 -07:00
requirements.txt add mwapi to requirements 2020-03-27 20:05:07 -07:00

Transliterations

This part of the project uses Wikidata to collect transliterations of key phrases related to COVID-19. First, src/wikidata_search.py searches the Wikidata API for entities that match those phrases. Then src/wikidata_transliterations.py runs simple SPARQL queries to collect the labels and aliases of those entities. Each label comes with language metadata. This seems to provide a decent initial list of relevant terms across multiple languages.
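The two steps can be sketched roughly as follows. This is an illustrative sketch, not the code in src/: it uses only the Python standard library instead of mwapi, and the function names, the user-agent string, and the exact query shape are assumptions made for the example. It relies on the public `wbsearchentities` API action and the Wikidata Query Service SPARQL endpoint.

```python
import json
import urllib.parse
import urllib.request

WIKIDATA_API = "https://www.wikidata.org/w/api.php"
SPARQL_ENDPOINT = "https://query.wikidata.org/sparql"
# Hypothetical user agent; replace with one that identifies your project.
USER_AGENT = "transliterations-sketch/0.1 (illustrative example)"


def build_search_params(term, language="en", limit=10):
    """Parameters for the wbsearchentities action (step 1: find entities)."""
    return {
        "action": "wbsearchentities",
        "search": term,
        "language": language,
        "type": "item",
        "limit": limit,
        "format": "json",
    }


def build_label_query(entity_id):
    """SPARQL query for every label and alias of one entity (step 2)."""
    return f"""
    PREFIX wd: <http://www.wikidata.org/entity/>
    PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
    PREFIX skos: <http://www.w3.org/2004/02/skos/core#>
    SELECT ?term ?kind WHERE {{
      {{ wd:{entity_id} rdfs:label ?term . BIND("label" AS ?kind) }}
      UNION
      {{ wd:{entity_id} skos:altLabel ?term . BIND("alias" AS ?kind) }}
    }}
    """


def parse_bindings(sparql_json):
    """Turn SPARQL JSON results into (language, kind, term) tuples."""
    rows = []
    for binding in sparql_json["results"]["bindings"]:
        term = binding["term"]
        # The language tag of a literal is reported under "xml:lang".
        rows.append((term.get("xml:lang", ""), binding["kind"]["value"], term["value"]))
    return rows


def fetch_json(url, params):
    """GET a URL with query parameters and decode the JSON response."""
    full_url = url + "?" + urllib.parse.urlencode(params)
    request = urllib.request.Request(full_url, headers={"User-Agent": USER_AGENT})
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def collect_terms(phrase):
    """Search for a phrase, then collect labels and aliases of each hit."""
    hits = fetch_json(WIKIDATA_API, build_search_params(phrase))["search"]
    rows = []
    for hit in hits:
        params = {"query": build_label_query(hit["id"]), "format": "json"}
        rows.extend(parse_bindings(fetch_json(SPARQL_ENDPOINT, params)))
    return rows
```

Calling `collect_terms("coronavirus")` would make live network requests and return one tuple per label or alias, each carrying its language code. The request building and response parsing are kept in separate functions so they can be checked without network access.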