Skip to content
This repository has been archived by the owner on Feb 23, 2022. It is now read-only.

Commit

Permalink
gzip befoe uploading saves time/space
Browse files Browse the repository at this point in the history
  • Loading branch information
ryankicks committed May 27, 2015
1 parent 0aaf7bf commit 0e87633
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions batch.py
Original file line number Diff line number Diff line change
Expand Up @@ -115,6 +115,8 @@ def process_files(path, table):
file_result = "master.json"
Utils.cat_all(path, file_result)

file_gz = Utils.gzip(file_result)

# load to bigquery
call_batch = "bq load --source_format=NEWLINE_DELIMITED_JSON --max_bad_records=500000 %s %s" % (table, file_result)
print call_batch
Expand Down

0 comments on commit 0e87633

Please sign in to comment.