MarquinSmith

Analytical – Honest – Passionate

  • Home
  • Services
  • Mini Projects
    • Data Analytics Helper Tools
    • Bitcoin Price Dashboard
    • Journey to data engineering
    • Tarantino Film Dashboard
    • Woolworths stock price tracker
    • How to reason and argue
  • Reference
    • The BookShelf
    • SQL Cheat Sheet
    • Dockerfile Cheat Sheet
  • Health, fitness and wellbeing
    • Crossfit Workouts
    • Gymnastics
    • Strength and fitness masterclass
    • Nutrition Course
    • Workout Stats
    • Physical Activity Stats
    • 6 week pre season soccer fitness plan
  • About

Author: M Smith

Data Analysis

Load a broken csv file into duckdb

21st October 202421st October 2024 M SmithLeave a comment

Some CSV files have issues with them. Below is a code snippet to save the information in a csv file into a duckdb database, ignoring all the ‘broken’ rows.

Read More ›
Business

Requirements Gathering

15th October 202415th October 2024 M SmithLeave a comment

Requirements gathering is the process of understanding and documenting what characteristics a project must or should have. This is an important part of the project for success, after all if …

Read More ›
Data Analysis

Export BigQuery table to Google Cloud Storage

10th October 202415th October 2024 M SmithLeave a comment

A code example for how to write BigQuery SQL to export a table to Google Cloud Storage in csv and parquet file formats. Parquet CSV

Read More ›
Business

Data Ops Crash Course

8th October 202415th October 2024 M SmithLeave a comment

Data Ops, AKA Dev Ops for data. Essential content for learning about improving the velocity of development and effective maintenance of data products (products produced by data teams). Christopher Bergh …

Read More ›
Data Analysis

Pandas for data analysis – Video mega guide

7th October 20246th October 2024 M SmithLeave a comment

Pandas is the go to library in Python for analysing and manipulating data. This post lists some useful video resources for learning this package. Brandon Rhodes – Pandas From The …

Read More ›
Uncategorised

Physical Health Tracker – September 2024

6th October 20246th October 2024 M SmithLeave a comment
Read More ›
Personal Development

Physical Health Tracker – August 2024

1st September 20246th October 2024 M SmithLeave a comment
Read More ›
Data Analysis

Using SQL in Google Sheets

11th August 202410th August 2024 M SmithLeave a comment

SQL is a great language for analysing and slicing and dicing data. The folks at Google who build Google Sheets appreciate this too. So much so that they have built …

Read More ›
Data Analysis

Execute queries in BigQuery in parallel using python

10th August 202410th August 2024 M SmithLeave a comment

An example of how to run multiple queries in parallel.

Read More ›
Data Analysis

Copying BigQuery tables to a new dataset with bq command line tool

28th July 202428th July 2024 M SmithLeave a comment

With the bq command line tool moving multiple tables at a time from one BigQuery dataset to another can be done with a few lines of code. The tool can …

Read More ›

Posts pagination

1 … 4 5 6 7 8 9 10 … 17

Recent Posts

  • An introduction to data testing – a presentation
  • Physical Health Tracker – December 2025
  • Load an excel file into DuckDB database using Python
  • String replacement across multiple files in current directory using `sed`
  • Physical Health Tracker – November 2025

Categories

  • Automation
  • Business
  • cybersecurity
  • Data Analysis
  • Finance
  • Personal Development
  • Python
  • Uncategorised
  • Web Analytics

Archives

Copyright © 2026 MarquinSmith.
Theme Easthill by ThemeMunk. Powered by WordPress.