MarquinSmith

Analytical – Honest – Passionate

  • Home
  • Services
  • Mini Projects
    • Data Analytics Helper Tools
    • Bitcoin Price Dashboard
    • Journey to data engineering
    • Tarantino Film Dashboard
    • Woolworths stock price tracker
    • How to reason and argue
  • Reference
    • The BookShelf
    • SQL Cheat Sheet
    • Dockerfile Cheat Sheet
  • Health, fitness and wellbeing
    • Crossfit Workouts
    • Gymnastics
    • Strength and fitness masterclass
    • Nutrition Course
    • Workout Stats
    • Physical Activity Stats
    • 6 week pre season soccer fitness plan
  • About

Category: Data Analysis

Automation

Installing gcloud on ubuntu – 2026

4th June 20263rd June 2026 M SmithLeave a comment

A short guide on installing and authorising gcloud on ubuntu 26.04 LTS. gcloud is a command line tool that allows you to interact with Google Cloud Platform services such as …

Read More ›
Data Analysis

Basic Cypher operations

1st June 20263rd June 2026 M SmithLeave a comment

A small collection of basic cypher commands so I don’t have to Google them everytime. Count how many nodes of a type there are Create a named index See the …

Read More ›
Data Analysis

An introduction to data testing – a presentation

1st April 20261st April 2026 M SmithLeave a comment

Recently I attended Measurecamp Melbourne and gave an introductory talk about data testing. In it I covered the different types of “bad data” and some tools and approaches to test …

Read More ›
Data Analysis

Load an excel file into DuckDB database using Python

26th March 202626th March 2026 M SmithLeave a comment

A short example of how to load data that is in a sheet in an excel file into a DuckDB database saved on disk using python.

Read More ›
Automation

A command line tool to compare 2 different csv files – sweetviz edition

4th November 20254th November 2025 M SmithLeave a comment

In a previous post we used sweetviz to make a summary of a csv file, and ydata profiling to compare 2 different csv files. Well Sweetviz can also be used …

Read More ›
Data Analysis

Bots arrive on marquinsmith.com

3rd November 20252nd November 2025 M SmithLeave a comment

On this blog I self host Umami analytics to help me keep an eye on how many visitors come and which pages and types of content are most popular. I …

Read More ›
Automation

A `.sqlfluff` settings file I like

26th October 202526th October 2025 M SmithLeave a comment

When using SQLfluff to format sql files it is possible to set certain formatting rules that should be followed when formatting the the files. These rules can be specified in …

Read More ›
Automation

Setting up sqlfluff as a pre commit hook using UV

23rd October 202523rd October 2025 M SmithLeave a comment

SQL is a very flexible language when it comes to formatting. As long as the commands and syntax are in the right order then the SQL query will work. Whitespace …

Read More ›
Automation

A command line tool to compare 2 different csv files

20th September 202520th September 2025 M SmithLeave a comment

Below is a python script that uses the ydata-profiling library to create a report that compares two csv files. This is proving useful in verifying the similarity between original and …

Read More ›
Data Analysis

Calculate a reasonable daily usage quota for BigQuery

28th July 202528th July 2025 M SmithLeave a comment

A previous post showed how to set a ‘daily query usage‘ quota in BigQuery, and what that quota should be to stay within the free tier of BigQuery. Well, what …

Read More ›

Posts pagination

1 2 3 4 … 7

Recent Posts

  • Installing gcloud on ubuntu – 2026
  • Physical Health Tracker – January 2026
  • Basic Cypher operations
  • Getting started with Gemini CLI
  • Physical Health Tracker – 2025 in review

Categories

  • Automation
  • Business
  • cybersecurity
  • Data Analysis
  • Finance
  • Personal Development
  • Python
  • Uncategorised
  • Web Analytics

Archives

Copyright © 2026 MarquinSmith.
Theme Easthill by ThemeMunk. Powered by WordPress.