MarquinSmith

Analytical – Honest – Passionate

Tag: python

Data Analysis

An introduction to data testing – a presentation

1st April 2026 by M Smith

Recently I attended Measurecamp Melbourne and gave an introductory talk about data testing. In it I covered the different types of “bad data” and some tools and approaches to test …

Data Analysis

Load an excel file into DuckDB database using Python

26th March 2026 by M Smith

A short example of how to load data from a sheet in an Excel file into a DuckDB database saved on disk, using Python.

Automation

A command line tool to compare 2 different csv files – sweetviz edition

4th November 2025 by M Smith

In a previous post we used Sweetviz to summarise a CSV file, and ydata-profiling to compare two CSV files. Well, Sweetviz can also be used …

Automation

Setting up sqlfluff as a pre commit hook using UV

23rd October 2025 by M Smith

SQL is a very flexible language when it comes to formatting. As long as the commands and syntax are in the right order then the SQL query will work. Whitespace …

Automation

A command line tool to compare 2 different csv files

20th September 2025 by M Smith

Below is a Python script that uses the ydata-profiling library to create a report that compares two CSV files. This is proving useful in verifying the similarity between original and …

Automation

Python script to send an email with Sendgrid API

5th July 2025 by M Smith

A short Python script that uses the SendGrid API to send a basic email.

Automation

A python command line tool to create dates in a range

26th June 2025 / 25th June 2025 by M Smith

A simple tool to create dates in a date range at the terminal. Each date in the date range is printed to the screen allowing for redirection or incorporation into …
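The idea described above can be sketched with the standard library alone. This is a minimal illustration and not the script from the post: the names `iter_dates` and `main`, the ISO `YYYY-MM-DD` argument format, and the choice to include the end date are all assumptions.

```python
import argparse
from datetime import date, timedelta


def iter_dates(start: date, end: date):
    """Yield every date from start to end, inclusive (assumed behaviour)."""
    current = start
    while current <= end:
        yield current
        current += timedelta(days=1)


def main(argv=None):
    # Hypothetical CLI: two positional ISO dates, one output line per date.
    parser = argparse.ArgumentParser(description="Print each date in a range.")
    parser.add_argument("start", type=date.fromisoformat, help="first date, YYYY-MM-DD")
    parser.add_argument("end", type=date.fromisoformat, help="last date, YYYY-MM-DD")
    args = parser.parse_args(argv)
    for d in iter_dates(args.start, args.end):
        print(d.isoformat())
```

Calling `main(["2025-06-24", "2025-06-26"])` prints one date per line; because the output goes to standard output, it can be redirected to a file or piped into another command once `main()` is wired up as a script entry point.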

Data Analysis

SQLite – Getting started masterclass

24th June 2025 by M Smith

A collection of video resources on using and understanding SQLite: SQLite case study, SQLite with Python, SQLite for beginners …although some of the topics don’t seem like beginner topics …

Automation

Loading data into SQLite

23rd June 2025 by M Smith

Different approaches to load data into a SQLite database using the following Python libraries. Pandas: using the pandas library (alongside click and uv) to load a CSV file into a …
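For comparison, here is a minimal sketch of the same task using only the standard library (`csv` and `sqlite3`) rather than pandas, click, or uv. It is not the code from the post: the function name `load_csv` is hypothetical, and it assumes the first CSV row is a header and that every value can be stored as text.

```python
import csv
import sqlite3


def load_csv(conn: sqlite3.Connection, csv_path: str, table: str) -> None:
    """Create `table` from the CSV header row and insert every data row as text."""
    with open(csv_path, newline="", encoding="utf-8") as f:
        reader = csv.reader(f)
        header = next(reader)
        # Quote identifiers so headers containing spaces or quotes still work.
        cols = ", ".join('"' + h.replace('"', '""') + '"' for h in header)
        quoted_table = '"' + table.replace('"', '""') + '"'
        placeholders = ", ".join("?" for _ in header)
        conn.execute(f"CREATE TABLE IF NOT EXISTS {quoted_table} ({cols})")
        # Parameterised insert; each CSV row must match the header length.
        conn.executemany(f"INSERT INTO {quoted_table} VALUES ({placeholders})", reader)
    conn.commit()
```

Passing a connection from `sqlite3.connect("data.db")` (a placeholder file name) writes the table to disk, while `sqlite3.connect(":memory:")` keeps it in memory for quick checks.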

Automation

A command line tool to get csv summaries using Sweetviz

22nd June 2025 by M Smith

Working with data involves handling a lot of data files, most commonly CSV files. It can be very helpful to do some exploratory data analysis on the CSV file to …


Copyright © 2026 MarquinSmith.