Load a broken csv file into duckdb
Some CSV files have issues with them. Below is a code snippet to save the information in a csv file into a duckdb database, ignoring all the ‘broken’ rows.
Requirements Gathering
Requirements gathering is the process of understanding and documenting what characteristics a project must or should have. This is an important part of the project for success, after all if …
Export BigQuery table to Google Cloud Storage
A code example for how to write BigQuery SQL to export a table to Google Cloud Storage in csv and parquet file formats. Parquet CSV
Data Ops Crash Course
Data Ops, AKA Dev Ops for data. Essential content for learning about improving the velocity of development and effective maintenance of data products (products produced by data teams). Christopher Bergh …
Pandas for data analysis – Video mega guide
Pandas is the go to library in Python for analysing and manipulating data. This post lists some useful video resources for learning this package. Brandon Rhodes – Pandas From The …
Using SQL in Google Sheets
SQL is a great language for analysing and slicing and dicing data. The folks at Google who build Google Sheets appreciate this too. So much so that they have built …
Execute queries in BigQuery in parallel using python
An example of how to run multiple queries in parallel.