collective / cdsc_reddit
35 Commits, 9 Branches, 0 Tags, 2.7 MiB
57951050c0
Commit Graph
3 Commits
| Author | SHA1 | Message | Date |
|---|---|---|---|
| Nate E TeBlunthuis | c666302b4a | remove is_submitter field from submissions which doesn't exist. | 2020-07-09 17:12:14 -07:00 |
| Nate E TeBlunthuis | 40d4563770 | Build comments dataset similarly to submissions and improve partitioning scheme | 2020-07-07 11:45:43 -07:00 |
| Nate E TeBlunthuis | 4ec9c14247 | Move the spark part of submissions_2_parquet to a separate script. | 2020-07-06 22:27:34 -07:00 |