Commit Graph

77 Commits

Author SHA1 Message Date
34f8b9a23e
Merge pull request #14 from aaronshaw/aaronshaw-master
pointing at updated data url, adding explicit NA handling to factor, …
2020-04-01 16:58:02 -05:00
aaronshaw
282588772e pointing at updated data url, adding explicit NA handling to factor, cutting unnecessary call to ggplot2, and updated corresponding output from new data file. May not work while kibo urls are getting resolved 2020-04-01 16:52:22 -05:00
4f8a698c62
Merge pull request #11 from jdfoote/master
Adding a tidyverse example (with very verbose comments)
2020-04-01 10:41:02 -05:00
Jeremy Foote
6b05896aa5 Adding a tidyverse example (with very verbose comments) 2020-03-31 22:42:31 -04:00
8bb3db8b46 add examples using the translations data 2020-03-31 16:56:59 -07:00
c8b886364f add documentation for the output files 2020-03-31 16:22:30 -07:00
29ae62c83e create 'latest.csv' to link to the most recent output. 2020-03-31 16:16:36 -07:00
687da1284f Merge branch 'master' of github.com:CommunityDataScienceCollective/COVID-19_Digital_Observatory 2020-03-31 16:01:43 -07:00
603a7b6ec3 update output 2020-03-31 16:01:38 -07:00
74667cf4dc use 'item' instead of 'entity' 2020-03-31 15:34:34 -07:00
3d142377ca rename compile script 2020-03-31 15:27:39 -07:00
55110c7f21 update compile script 2020-03-31 15:27:21 -07:00
4fd516a700 Improve README.md for keywords 2020-03-31 15:25:51 -07:00
98b07b8098 rename 'transliterations' to 'keywords' 2020-03-31 15:15:01 -07:00
20ad09d155
Update README.md
linking to project pages more fully
2020-03-31 17:09:58 -05:00
10a7d915a5
Merge pull request #10 from makoshark/master
stop writing header to one-column list
2020-03-31 12:23:36 -07:00
Benjamin Mako Hill
72bf7bcd37 stop writing header to one-column list
This feels like it's asking for trouble. Description of the contents
of the list is in the filename.
2020-03-31 08:35:23 -07:00
09d171608f reorganize file structure
- move 'input' files to resources
- outputs not meant for downstream go in output/intermediate
- csv outputs for downstream go in output/csv
2020-03-29 21:49:57 -07:00
Kaylea Champion
50f58a3887 migrating to new directory structure 2020-03-29 13:42:01 -05:00
a86c3a97ee
Merge pull request #7 from kayleachampion/master
cleanup with merge
2020-03-29 11:39:32 -07:00
Kaylea Champion
317c32cdb5 all march data 2020-03-29 00:19:54 -07:00
Kaylea Champion
3bd1c684df adding a logs dir without adding my log files, assuming those don't
belong in repo
2020-03-28 23:50:04 -07:00
Kaylea Champion
fa8e977741 new version of this from scrape. no double quotes around articles any
more
2020-03-28 23:47:55 -07:00
Kaylea Champion
4226b45b97 adds a scraper to update the articles file 2020-03-28 23:46:48 -07:00
Kaylea Champion
c7af46f8fb adds in new logging capability 2020-03-28 18:46:35 -07:00
05b8025e15
Merge pull request #9 from aaronshaw/master
minimal analysis example with pageview data
2020-03-28 20:42:40 -05:00
aaronshaw
5dfbe3dab4 minimal analysis example with pageview data 2020-03-28 20:33:23 -05:00
c0e50fe297
Merge pull request #8 from aaronshaw/master
Update to load data from github url and include 3/28 data in output
2020-03-28 17:38:20 -05:00
aaronshaw
1f5b15f099 regenerated following update to R src that creates this file 2020-03-28 17:31:36 -05:00
aaronshaw
9e0c92242e Loading data directly from github URL. Commenting out commands that assume cloned repository. 2020-03-28 17:30:37 -05:00
Kaylea Champion
7b3062ffb1 Merge branch 'master' of https://github.com/CommunityDataScienceCollective/COVID-19_Digital_Observatory 2020-03-28 14:46:00 -07:00
033149776c
Merge pull request #5 from kayleachampion/master
view data
2020-03-28 14:17:21 -07:00
dd7d968bb6
Merge pull request #1 from CommunityDataScienceCollective/kaylea/master
Some suggested changes.
2020-03-28 14:15:53 -07:00
c690df4852 Merge branch 'kaylea/master' of github.com:CommunityDataScienceCollective/COVID-19_Digital_Observatory into kaylea/master 2020-03-28 14:13:46 -07:00
f5ac92330c Merge branch 'kaylea/master' of github.com:CommunityDataScienceCollective/COVID-19_Digital_Observatory into kaylea/master 2020-03-28 14:13:26 -07:00
1b2bb7d1df Merge branch 'kaylea/master' of github.com:CommunityDataScienceCollective/COVID-19_Digital_Observatory into kaylea/master 2020-03-28 14:12:36 -07:00
ee91df4c04 Read the whole input file before making api calls 2020-03-28 14:12:17 -07:00
24e5590836 Read the whole input file before making api calls 2020-03-28 14:09:28 -07:00
groceryheist
0fb8ac2ed9
Merge pull request #4 from CommunityDataScienceCollective/translations
Transliterations: Use data from google trends and wikidata to find transliterations.
2020-03-28 14:07:04 -07:00
2b56ed26f4 Update transliteration results for 2020-03-28
- renamed results from yesterday into time stamped file
2020-03-28 14:03:16 -07:00
207b1f8b95 Read entire input files before making api calls.
This is nicer style to not hold onto resources for as long.
It will use a bit more memory.
2020-03-28 13:55:52 -07:00
282208507a Keep better track of time.
- Add timestamp to transliterations output file.
- Append wikidata search terms instead of overwriting
2020-03-28 13:52:54 -07:00
Kaylea Champion
ed0641ecc7 Merge branch 'master' of https://github.com/CommunityDataScienceCollective/COVID-19_Digital_Observatory
updates my branch with all the master changes so far
2020-03-28 12:21:37 -07:00
Kaylea Champion
cd08294288 trialing new approach 2020-03-28 12:18:01 -07:00
Kaylea Champion
c677d8d70a trialing new approach 2020-03-28 12:17:45 -07:00
e720653a23 typo fix 2020-03-28 10:01:43 -07:00
a9f129f1d6 Merge branch 'translations' of github.com:CommunityDataScienceCollective/COVID-19_Digital_Observatory into translations 2020-03-28 09:58:43 -07:00
18118328cc
Merge pull request #6 from aaronshaw/translations
minimal example in R
2020-03-28 10:28:41 -05:00
aaronshaw
c025a526e8 a minimal example in R that outputs a table of top 5 related search terms per day per query 2020-03-28 10:18:33 -05:00
49c3203d78 A few suggestions for the python script:
- using format strings (f-strings) is a nice way in python to build
strings using variables.
- you can read and process a file in one pass if you iterate over the
open file itself instead of reading it into a variable and then
looping
- i had to change your strip code when i stopped using csv reader
- my python linter and auto-formatter hate non-indented comments
- i added a few lines to print cases where we don't get Ok responses.
2020-03-27 20:43:29 -07:00
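
The suggestions in commit 49c3203d78 can be sketched roughly as follows. This is an illustration under stated assumptions, not the code from the repository: `fetch_status`, `process_terms`, and the sample search terms are hypothetical, and the stub stands in for whatever API call the real script makes.

```python
import io


def fetch_status(term):
    """Hypothetical stand-in for an API call; returns an HTTP-like status code.

    Assumption for illustration only: purely alphabetic terms succeed.
    """
    return 200 if term.isalpha() else 404


def process_terms(lines):
    """Process an open file (or any iterable of lines) in a single pass."""
    failures = []
    # Iterating over the file object itself yields one line at a time,
    # so there is no need to read it into a variable and loop again.
    for line in lines:
        # Without csv.reader, newlines and surrounding quotes are stripped by hand.
        term = line.strip().strip('"')
        if not term:
            continue
        status = fetch_status(term)
        if status != 200:
            # An f-string builds the message directly from the variables.
            print(f"No OK response for {term!r}: status {status}")
            failures.append(term)
    return failures


# io.StringIO stands in for a real file opened with `with open(path) as f:`.
sample = io.StringIO('coronavirus\n"covid-19"\n\npandemic\n')
failed = process_terms(sample)
```

Note that a later commit in this log (207b1f8b95) moves the other way and reads the entire input before making API calls, trading a bit more memory for holding the file open for less time.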