Tag Archives: data

Python Fake Data With Faker

Fake data can be invaluable for testing or demonstrating behaviour without using live production data. It lets you protect your production data, or helps you get started when you don’t yet have a production system set up. This post gives an overview of the Python fake data package Faker, which is invaluable for generating this… Read More »

Open Archaeology Data

Archaeology can be an interesting source of data for projects or for exploring analytical techniques. This post lists a few sources of open archaeology data which you may wish to refer to. Journal of Open Archaeology Data: This isn’t a repository of data itself, but the articles published in the Journal of Open Archaeology Data typically… Read More »

Wikipedia Data Stream

Streaming data is an important part of modern data processing. If you are just starting out, and perhaps don’t yet work somewhere with access to a big data streaming infrastructure, it can be hard to know where to begin. This post walks you through a simple Wikipedia data stream example from the Wikimedia documentation. Wikipedia… Read More »