Benjamin Mako Hill 38fdd07b39 changes to a bunch of the wikipedia view code
- Renamed the articles.txt to something more specific

Changes to both scripts:

- Updated filenames to match the new standard
- Reworked the logging code so that it can write to stderr by
  default. Because we can only call logging.basicConfig() once, this
  ended up being a bigger change.
- Caused scripts to output git commits and export to track which code
  produced which dataset.
- Caused programs to take files instead of directories as
  output (allows us to run programs more than once a day).
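
The changes shared by both scripts might look roughly like the sketch below. It is not the code from this commit: the option names (`--output-file`, `--logging-destination`, `--logging-level`) and the helper names are assumptions made for illustration. It shows a single `logging.basicConfig()` call that defaults to stderr, an output *file* argument, and a helper that looks up the current git commit so it can be logged next to the data.

```python
import argparse
import logging
import subprocess
import sys


def parse_args(argv=None):
    # All option names here are hypothetical, chosen for illustration.
    parser = argparse.ArgumentParser(description="Sketch of a fetch script")
    parser.add_argument("-o", "--output-file",
                        help="file to write data to (default: stdout)")
    parser.add_argument("-L", "--logging-destination",
                        help="file to write the log to (default: stderr)")
    parser.add_argument("-l", "--logging-level", default="info",
                        choices=["debug", "info", "warning", "error"])
    return parser.parse_args(argv)


def setup_logging(level_name="info", destination=None):
    # basicConfig() only takes effect the first time it is called,
    # so the file-vs-stderr decision has to be made in one place.
    level = getattr(logging, level_name.upper())
    if destination:
        logging.basicConfig(filename=destination, filemode="a", level=level)
    else:
        logging.basicConfig(stream=sys.stderr, level=level)


def current_git_commit():
    # Returns the commit hash of the working tree, or None when git is
    # missing or the script is not running inside a repository.
    try:
        result = subprocess.run(["git", "rev-parse", "HEAD"],
                                capture_output=True, text=True, check=True)
    except (OSError, subprocess.CalledProcessError):
        return None
    return result.stdout.strip()


def main(argv=None):
    args = parse_args(argv)
    setup_logging(args.logging_level, args.logging_destination)
    logging.info("Starting run, git commit: %s", current_git_commit())
    # Writing to a caller-chosen file (rather than a dated directory)
    # lets the same script run more than once a day.
    outfile = open(args.output_file, "w") if args.output_file else sys.stdout
    try:
        outfile.write("")  # placeholder for the real fetch-and-write step
    finally:
        if outfile is not sys.stdout:
            outfile.close()
```

Calling `main(["-o", "views.json"])` would log to stderr and write to `views.json`; the file name is only an example.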

Changes to the wikipedia_views/scripts/fetch_daily_views.py:

- Changed the output so that it is a sequence of JSON dictionaries
  (one per line), as per the standard we agreed to and which is what
  Twitter, Github, and other dumps do. Previous behavior was to
  output a single JSON list object.
- A number of other small changes and tweaks throughout.
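
As an illustration of the output change described above, here is a minimal sketch contrasting the two formats. The function names and the sample records are made up for the example and are not taken from fetch_daily_views.py.

```python
import io
import json


def write_json_lines(records, outfile):
    # New behavior: one JSON dictionary per line, so consumers can
    # stream the file without loading everything into memory.
    for record in records:
        outfile.write(json.dumps(record) + "\n")


def write_json_list(records, outfile):
    # Previous behavior: the whole dataset as a single JSON list object.
    json.dump(list(records), outfile)


# Hypothetical sample data, for illustration only.
sample = [
    {"article": "Example_A", "views": 10},
    {"article": "Example_B", "views": 20},
]

buf = io.StringIO()
write_json_lines(sample, buf)

# Reading the line-oriented format back, one record at a time.
parsed = [json.loads(line) for line in buf.getvalue().splitlines()]
```

A practical upside of the line-oriented format is that files can be concatenated or appended to without re-parsing the existing contents.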
2020-04-01 07:15:12 -07:00
transliterations reorganize file structure 2020-03-29 21:49:57 -07:00
wikipedia_views changes to a bunch of the wikipedia view code 2020-04-01 07:15:12 -07:00
code_of_conduct.md add code of conduct and elaborate description 2020-03-24 12:12:57 -07:00
LICENSE Initial commit 2020-03-24 11:10:29 -07:00
README.md Update README.md 2020-03-24 14:28:17 -07:00

COVID-19 Digital Observatory

The COVID-19 Digital Observatory collects, aggregates, and distributes data from social media, search engine results, and Wikipedia to support immediate public health response and social and data science research related to the pandemic.

The Community Data Science Collective is in the early stages of building this project. We expect to make rapid progress and to begin releasing code and data soon.

We eagerly welcome contributors! Please get in touch. Contributors are held to the code of conduct.