13
0
Go to file
2020-07-05 23:27:05 -07:00
.gitignore update .gitignore 2020-07-03 13:55:25 -07:00
check_submission_shas.py Script for checking shas for submissions. 2020-07-03 13:35:46 -07:00
comments_2_parquet.py Create a second dataset sorted by author. 2020-07-05 23:27:05 -07:00
pull_pushshift_comments.sh update the reddit comment dumps 2020-07-03 10:41:13 -07:00
pull_pushshift_submissions.sh bugfix in retrieving old data and rename file. 2020-07-03 13:54:55 -07:00
submissions_2_parquet.py Create parquet datasets of reddit submissions from pushshift. 2020-07-05 23:20:17 -07:00