Commit Graph

  • befb87c8f5
    Merge pull request #20 from makoshark/master master Benjamin Mako Hill 2020-04-07 14:45:05 -0700
  • 1a27b68061
    Merge pull request #18 from CommunityDataScienceCollective/dsaez_submodule Benjamin Mako Hill 2020-04-07 14:43:26 -0700
  • e32e826083 updated script to ensure the correct working dir Benjamin Mako Hill 2020-04-07 16:39:58 -0500
  • 01de1facde ignore emacs temp files article_ids Nathan TeBlunthuis 2020-04-04 15:28:37 -0700
  • eae5464fd2 monitor pages from dsaez's wikidata crawler Nathan TeBlunthuis 2020-04-04 15:23:33 -0700
  • cfe21254d9 rename scripts Nathan TeBlunthuis 2020-04-04 15:23:00 -0700
  • c97028fabb update cron scripts with new data format Nathan TeBlunthuis 2020-04-04 15:20:34 -0700
  • 0c4cfcdfcf made cronjobs executable Benjamin Mako Hill 2020-04-04 11:19:43 -0500
  • 974dc48b12 Merge branch 'master' of github.com:CommunityDataScienceCollective/COVID-19_Digital_Observatory into dsaez_submodule dsaez_submodule Nathan TeBlunthuis 2020-04-03 15:34:16 -0700
  • a3e40a072f Add dsaez's submodule for crawling wikidata Nathan TeBlunthuis 2020-04-03 15:24:56 -0700
  • 152704df7c
    Merge pull request #17 from makoshark/master Kaylea Champion 2020-04-02 14:18:05 -0700
  • 13371fd83e
    Merge pull request #16 from aaronshaw/master Kaylea Champion 2020-04-02 14:16:57 -0700
  • 40f528f4ff revisions to reflect updated example filename and clean comments in R code aaronshaw 2020-04-02 13:38:32 -0500
  • 1cec120dfa changes to allow historical view data collection Benjamin Mako Hill 2020-04-02 13:28:34 -0500
  • 3d0d4eee76 updated to just write a single log file for each day Benjamin Mako Hill 2020-04-02 12:48:19 -0500
  • ba21acdf37 initial commit of revisions analysis example with output files aaronshaw 2020-04-02 10:59:44 -0500
  • 4ab432f399 removing outdated file names aaronshaw 2020-04-02 07:49:08 -0500
  • f770ade87a
    Merge pull request #15 from aaronshaw/master Aaron Shaw 2020-04-01 19:15:21 -0500
  • 576d882c04 renaming example analysis directories aaronshaw 2020-04-01 19:12:45 -0500
  • ff96d52cb9
    Merge pull request #12 from makoshark/master groceryheist 2020-04-01 16:36:56 -0700
  • ff5521d44b ignore __pycache__ Benjamin Mako Hill 2020-04-01 18:23:50 -0500
  • b26a2b5a86 fix bug in previous commit Benjamin Mako Hill 2020-04-01 18:22:36 -0500
  • 427eddd141 cleaned up unnecessary files Benjamin Mako Hill 2020-04-01 18:21:41 -0500
  • b457cd726b use the type= feature in argparse Benjamin Mako Hill 2020-04-01 18:13:02 -0500
  • 17c3f75389 Merge branch 'master' of github.com:CommunityDataScienceCollective/COVID-19_Digital_Observatory Benjamin Mako Hill 2020-04-01 17:19:33 -0500
  • 070d23f718 changes in response to code review by nate Benjamin Mako Hill 2020-04-01 17:16:34 -0500
  • 34f8b9a23e
    Merge pull request #14 from aaronshaw/aaronshaw-master Aaron Shaw 2020-04-01 16:58:02 -0500
  • 282588772e pointing at updated data url, adding explicit NA handling to factor, cutting unnecessary call to ggplot2, and updated corresponding output from new data file. May not work while kibo urls are getting resolved aaronshaw 2020-04-01 16:52:22 -0500
  • 4fe5deb013 Merge branch 'master' of github.com:CommunityDataScienceCollective/COVID-19_Digital_Observatory Benjamin Mako Hill 2020-04-01 16:42:16 -0500
  • d655e1ce93 tweaks to revision export code Benjamin Mako Hill 2020-04-01 16:39:53 -0500
  • 3f19805d36 fix bug in rev scraper script Benjamin Mako Hill 2020-04-01 15:49:28 -0500
  • 95d37cff7a change copy to move in cron scripts Benjamin Mako Hill 2020-04-01 15:49:02 -0500
  • 5739d1c404 Merge branch 'master' of github.com:makoshark/COVID-19_Digital_Observatory Benjamin Mako Hill 2020-04-01 15:18:50 -0500
  • 141871eda6 add two small shellscripts for automation Benjamin Mako Hill 2020-04-01 15:15:11 -0500
  • 04e00f363b address confusion with date Benjamin Mako Hill 2020-04-01 15:14:05 -0500
  • 06d2fd1563 fix bugs with the date stamps Benjamin Mako Hill 2020-04-01 10:47:33 -0500
  • 4f8a698c62
    Merge pull request #11 from jdfoote/master Aaron Shaw 2020-04-01 10:41:02 -0500
  • 4e1b7fbdfe fixed typo in debug message Benjamin Mako Hill 2020-04-01 08:18:05 -0700
  • 061105b7b4 Merge branch 'master' of github.com:CommunityDataScienceCollective/COVID-19_Digital_Observatory Benjamin Mako Hill 2020-04-01 07:53:40 -0700
  • 268f9e1cf3 added gitignore for wikipedia/data directory Benjamin Mako Hill 2020-04-01 07:52:15 -0700
  • 784458f206 renamed the wikipedia_views module to wikipedia Benjamin Mako Hill 2020-04-01 07:51:20 -0700
  • 6493361fbd added initial version of revision-scraper Benjamin Mako Hill 2020-04-01 07:42:38 -0700
  • cb26ecabda fixed typo in description of view scraper Benjamin Mako Hill 2020-04-01 07:42:24 -0700
  • 5c861cfca4 renamed daily views to make it clear that it's just enwiki Benjamin Mako Hill 2020-04-01 07:29:01 -0700
  • 38fdd07b39 changes to a bunch of the wikipedia view code Benjamin Mako Hill 2020-04-01 07:15:12 -0700
  • 6b05896aa5 Adding a tidyverse example (with very verbose comments) Jeremy Foote 2020-03-31 22:42:31 -0400
  • 8bb3db8b46 add examples using the translations data Nathan TeBlunthuis 2020-03-31 16:56:59 -0700
  • c8b886364f add documentation for the output files Nathan TeBlunthuis 2020-03-31 16:22:30 -0700
  • 29ae62c83e create 'latest.csv' to link to the most recent output. Nathan TeBlunthuis 2020-03-31 16:16:36 -0700
  • 687da1284f Merge branch 'master' of github.com:CommunityDataScienceCollective/COVID-19_Digital_Observatory Nathan TeBlunthuis 2020-03-31 16:01:43 -0700
  • 603a7b6ec3 update output Nathan TeBlunthuis 2020-03-31 16:01:38 -0700
  • 74667cf4dc use 'item' instead of 'entity' Nathan TeBlunthuis 2020-03-31 15:30:08 -0700
  • 3d142377ca rename compile script Nathan TeBlunthuis 2020-03-31 15:27:39 -0700
  • 55110c7f21 update compile script Nathan TeBlunthuis 2020-03-31 15:27:21 -0700
  • 4fd516a700 Improve README.md for keywords Nathan TeBlunthuis 2020-03-31 15:25:51 -0700
  • 98b07b8098 rename 'transliterations' to 'keywords' Nathan TeBlunthuis 2020-03-31 15:15:01 -0700
  • 20ad09d155
    Update README.md Aaron Shaw 2020-03-31 17:09:58 -0500
  • 10a7d915a5
    Merge pull request #10 from makoshark/master Kaylea Champion 2020-03-31 12:23:36 -0700
  • 72bf7bcd37 stop writing writing header to one-column list Benjamin Mako Hill 2020-03-31 08:35:23 -0700
  • 09d171608f reorganize file structure Nathan TeBlunthuis 2020-03-29 21:49:57 -0700
  • 50f58a3887 migrating to new directory structure Kaylea Champion 2020-03-29 13:42:01 -0500
  • a86c3a97ee
    Merge pull request #7 from kayleachampion/master Kaylea Champion 2020-03-29 11:39:32 -0700
  • 317c32cdb5 all march data Kaylea Champion 2020-03-29 00:19:54 -0700
  • 3bd1c684df adding a logs dir without adding my log files, assuming those don't belong in repo Kaylea Champion 2020-03-28 23:50:04 -0700
  • fa8e977741 new version of this from scrape. no double quotes around articles any more Kaylea Champion 2020-03-28 23:47:55 -0700
  • 4226b45b97 adds a scraper to update the articles file Kaylea Champion 2020-03-28 23:46:48 -0700
  • c7af46f8fb adds in new logging capability Kaylea Champion 2020-03-28 18:46:35 -0700
  • 05b8025e15
    Merge pull request #9 from aaronshaw/master Aaron Shaw 2020-03-28 20:42:40 -0500
  • 5dfbe3dab4 minimal analysis example with pageview data aaronshaw 2020-03-28 20:33:23 -0500
  • c0e50fe297
    Merge pull request #8 from aaronshaw/master Aaron Shaw 2020-03-28 17:38:20 -0500
  • 1f5b15f099 regenerated following update to R src that creates this file aaronshaw 2020-03-28 17:31:36 -0500
  • 9e0c92242e Loading data directly from github URL. Commenting out commands that assume cloned repository. aaronshaw 2020-03-28 17:30:37 -0500
  • 7b3062ffb1 Merge branch 'master' of https://github.com/CommunityDataScienceCollective/COVID-19_Digital_Observatory Kaylea Champion 2020-03-28 14:46:00 -0700
  • 033149776c
    Merge pull request #5 from kayleachampion/master Kaylea Champion 2020-03-28 14:17:21 -0700
  • dd7d968bb6
    Merge pull request #1 from CommunityDataScienceCollective/kaylea/master Kaylea Champion 2020-03-28 14:15:53 -0700
  • c690df4852 Merge branch 'kaylea/master' of github.com:CommunityDataScienceCollective/COVID-19_Digital_Observatory into kaylea/master kaylea/master Nathan TeBlunthuis 2020-03-28 14:13:46 -0700
  • f5ac92330c Merge branch 'kaylea/master' of github.com:CommunityDataScienceCollective/COVID-19_Digital_Observatory into kaylea/master Nathan TeBlunthuis 2020-03-28 14:12:36 -0700
  • 1b2bb7d1df Merge branch 'kaylea/master' of github.com:CommunityDataScienceCollective/COVID-19_Digital_Observatory into kaylea/master Nathan TeBlunthuis 2020-03-28 14:12:36 -0700
  • ee91df4c04 Read the whole input file before making api calls Nathan TeBlunthuis 2020-03-28 14:09:28 -0700
  • 24e5590836 Read the whole input file before making api calls Nathan TeBlunthuis 2020-03-28 14:09:28 -0700
  • 0fb8ac2ed9
    Merge pull request #4 from CommunityDataScienceCollective/translations groceryheist 2020-03-28 14:07:04 -0700
  • 2b56ed26f4 Update transliteration results for 2020-03-28 translations Nathan TeBlunthuis 2020-03-28 14:03:16 -0700
  • 207b1f8b95 Read entire input files before making api calls. Nathan TeBlunthuis 2020-03-28 13:55:52 -0700
  • 282208507a Keep better track of time. Nathan TeBlunthuis 2020-03-28 13:49:19 -0700
  • ed0641ecc7 Merge branch 'master' of https://github.com/CommunityDataScienceCollective/COVID-19_Digital_Observatory Kaylea Champion 2020-03-28 12:21:37 -0700
  • cd08294288 trialing new approach Kaylea Champion 2020-03-28 12:18:01 -0700
  • c677d8d70a trialing new approach Kaylea Champion 2020-03-28 12:17:45 -0700
  • e720653a23 typo fix Nathan TeBlunthuis 2020-03-28 10:01:43 -0700
  • a9f129f1d6 Merge branch 'translations' of github.com:CommunityDataScienceCollective/COVID-19_Digital_Observatory into translations Nathan TeBlunthuis 2020-03-28 09:58:43 -0700
  • 18118328cc
    Merge pull request #6 from aaronshaw/translations Aaron Shaw 2020-03-28 10:28:41 -0500
  • c025a526e8 a minimal example in R that outputs a table of top 5 related search terms per day per query aaronshaw 2020-03-28 10:18:33 -0500
  • 49c3203d78 A few suggestions for the python script: Nathan TeBlunthuis 2020-03-27 20:27:02 -0700
  • c54d8ba28a Reorganize wikipedia views subproject into subpackage. Nathan TeBlunthuis 2020-03-27 20:13:11 -0700
  • 6e7afee8b3 add mwapi to requirements Nathan TeBlunthuis 2020-03-27 20:05:07 -0700
  • 5ffb2cacd6 all data Kaylea Champion 2020-03-27 18:24:19 -0700
  • 7d7fe9aaf6 cleaning out commented code Kaylea Champion 2020-03-27 18:19:22 -0700
  • d845c30455 reorganizes comments Kaylea Champion 2020-03-27 18:17:39 -0700
  • 7ab95ae5f6 initial files Kaylea Champion 2020-03-27 18:10:13 -0700
  • e71b896cec makes TSV makes JSON Kaylea Champion 2020-03-27 18:08:43 -0700
  • 0cc1ffd0b6 many bug fixes Kaylea Champion 2020-03-27 17:24:18 -0700