Commit Graph

4 Commits

Author | SHA1 | Message | Date
Nate E TeBlunthuis | 2d425600a8 | Update submissions to parse using the backfill queue. | 2020-08-11 22:37:36 -07:00
Nate E TeBlunthuis | aa84a7df03 | Bugfixes in scripts. | 2020-07-07 23:29:36 -07:00
Nate E TeBlunthuis | 40d4563770 | Build comments dataset similarly to submissions and improve partitioning scheme | 2020-07-07 11:45:43 -07:00
Nate E TeBlunthuis | 4ec9c14247 | Move the spark part of submissions_2_parquet to a separate script. | 2020-07-06 22:27:34 -07:00