Latest Posts

BigQuery + Colaboratory setup in 5 mins

For data analysis and exploration, Jupyter/IPython notebooks have often been the tool of choice for their ease in sharing work and explaining the thought process. If your data resides mainly …

Pass

argparse or argh … pass

As part of our data analytics infrastructure migration, we’re using the argparse package much more heavily to standardise how we run scripts. Along the way I ran into multiple …

Sampling

Random Sampling with BigQuery

BigQuery is great for storing huge amounts of data over which to run analytics. But sometimes, what we need is just a sample of a dataset for a dry …