An introduction to data testing – a presentation
Recently I attended MeasureCamp Melbourne and gave an introductory talk about data testing. In it I covered the different types of “bad data” and some tools and approaches to test …
Analytical – Honest – Passionate
A short example of how to use Python to load data from a sheet in an Excel file into a DuckDB database saved on disk.
In a previous post we used Sweetviz to make a summary of a CSV file, and ydata-profiling to compare two different CSV files. Well, Sweetviz can also be used …
SQL is a very flexible language when it comes to formatting. As long as the keywords and clauses are in the right order, the SQL query will work. Whitespace …
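To make the point about formatting concrete, here is a minimal sketch using Python's built-in sqlite3 module. The `orders` table, its rows, and the `run_query` helper are made up for illustration and do not come from the post. It runs the same query written on one line and spread over several lines, then checks that both return identical rows.

```python
import sqlite3


def run_query(sql: str) -> list[tuple]:
    """Run a query against a small in-memory table and return all rows."""
    conn = sqlite3.connect(":memory:")
    try:
        # Hypothetical sample data, only here to give the query something to read.
        conn.execute("CREATE TABLE orders (id INTEGER, amount REAL)")
        conn.executemany(
            "INSERT INTO orders VALUES (?, ?)",
            [(1, 10.0), (2, 25.5), (3, 7.25)],
        )
        return conn.execute(sql).fetchall()
    finally:
        conn.close()


# The same query, once on a single line and once spread over many lines.
compact = "SELECT id, amount FROM orders WHERE amount > 8 ORDER BY id"
spread_out = """
SELECT
    id,
    amount
FROM
    orders
WHERE
    amount > 8
ORDER BY
    id
"""

# Layout does not change the result; only the order of the clauses matters.
same_result = run_query(compact) == run_query(spread_out)
```

Both strings contain the same tokens in the same order, so the database treats them as the same query.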
Below is a Python script that uses the ydata-profiling library to create a report that compares two CSV files. This is proving useful in verifying the similarity between original and …
A short Python script that uses the SendGrid API to send a basic email.
A simple tool to create dates in a date range at the terminal. Each date in the date range is printed to the screen, allowing for redirection or incorporation into …
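A tool of this kind could be sketched in Python with only the standard library. This is an illustration under stated assumptions, not the tool from the post: the names `date_range` and `main`, the ISO `YYYY-MM-DD` argument format, and the choice to include the end date are all assumptions.

```python
import argparse
from datetime import date, timedelta


def date_range(start: date, end: date):
    """Yield each date from start to end, inclusive (assumed behaviour)."""
    current = start
    while current <= end:
        yield current
        current += timedelta(days=1)


def main(argv=None) -> None:
    """Parse two ISO dates and print every date between them, one per line."""
    parser = argparse.ArgumentParser(description="Print every date in a range.")
    parser.add_argument("start", type=date.fromisoformat, help="first date, YYYY-MM-DD")
    parser.add_argument("end", type=date.fromisoformat, help="last date, YYYY-MM-DD")
    args = parser.parse_args(argv)
    for day in date_range(args.start, args.end):
        # One date per line keeps the output easy to redirect or pipe.
        print(day.isoformat())
```

Calling `main()` from the usual `if __name__ == "__main__":` guard turns this into a script. Because each date goes to standard output on its own line, the result can be redirected to a file or fed to another command.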
A collection of video resources on using and understanding SQLite: SQLite case study, SQLite with Python, SQLite for beginners …although some of the topics don’t seem like beginner topics …
Different approaches to load data into a SQLite database using the following Python libraries: Pandas: using the pandas library (alongside click and uv) to load a CSV file into a …
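For comparison with the library-based approaches, here is a minimal baseline that uses only Python's standard library (csv and sqlite3) rather than pandas. It is a sketch, not one of the approaches from the post: the `load_csv` helper, the `people` table, and the sample rows are hypothetical. Columns are created without types, so every value is stored as text.

```python
import csv
import io
import sqlite3


def load_csv(conn: sqlite3.Connection, table: str, csv_file) -> int:
    """Create `table` from the CSV header and insert every data row.

    Returns the number of rows inserted. Identifiers are wrapped in double
    quotes; this sketch assumes the header and table name contain no
    double-quote characters.
    """
    reader = csv.reader(csv_file)
    header = next(reader)
    rows = list(reader)
    columns = ", ".join(f'"{name}"' for name in header)
    placeholders = ", ".join("?" for _ in header)
    conn.execute(f'CREATE TABLE IF NOT EXISTS "{table}" ({columns})')
    conn.executemany(f'INSERT INTO "{table}" VALUES ({placeholders})', rows)
    conn.commit()
    return len(rows)


# Hypothetical sample data held in memory instead of a file on disk.
sample = io.StringIO("id,name\n1,Ada\n2,Grace\n")
conn = sqlite3.connect(":memory:")
loaded = load_csv(conn, "people", sample)
```

To load a real file, pass an open file handle, for example `open(path, newline="")`, in place of the `StringIO` object, and connect to a database file instead of `":memory:"`.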
Working with data involves handling a lot of data files, most commonly CSV files. It can be very helpful to do some exploratory data analysis on the CSV file to …