cdsc_reddit/datasets/submissions_2_parquet.sh

#!/usr/bin/env bash

## Run this script manually: we don't have a nice way to wait on parallel_sql jobs.

./parse_submissions.sh

start_spark_and_run.sh 1 "$(pwd)/submissions_2_parquet_part2.py"
Update submissions to parse using the backfill queue. 2020-08-12 05:37:36 +00:00
Script to run both parts of submissions_2_parquet.sh 2020-07-07 06:27:18 +00:00