38fdd07b39
- Renamed the articles.txt to something more specific Changes to both scripts: - Updated filenames to match the new standard - Reworked the logging code so that it can write to stderr by default. Because we can only call logging.basicConfig() once, this eneded up being a bigger changes. - Caused scripts to output git commits and export to track which code produced which dataset. - Caused programs to take files instead of directories as output (allows us to run programs more than once a day). Changes to the wikipedia_views/scripts/fetch_daily_views.py: - Change output that it outputs a sequence of JSON dictionaries (one per line) as per the standard we agreed to and which is what Twitter, Github, and other dumps do. Previous behavior was to create output a single JSON list object. - A number of other small changes and tweaks throughout. |
||
---|---|---|
transliterations | ||
wikipedia_views | ||
code_of_conduct.md | ||
LICENSE | ||
README.md |
COVID-19 Digital Observatory
The COVID-19 Digital Observatory collects, aggregates, and distributes data from social media, search engine results, and Wikipedia to support immediate public health response and social and data science research related to the pandemic.
The community data science collective is the early stages of building this project. We expect to make rapid progess and to begin releasing code and data soon.
We eagerly welcome contributors! Please get in touch. Contributors are held to the code of conduct.