Open Archaeology Data

Archaeology can be an interesting source of data for projects or for exploring analytical techniques. This post lists a few sources of open archaeology data which you may wish to refer to. Journal of Open Archaeology Data: this isn’t a repository of data itself, but the articles published in the Journal of Open Archaeology Data typically…

Wikipedia Data Stream

Streaming data is an important part of modern data processing. If you are just starting out, and perhaps don’t yet work somewhere with access to a big data streaming infrastructure, it can be hard to know where to start. This post talks you through a simple Wikipedia data stream example from the Wikimedia documentation. Wikipedia…
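
As a rough illustration of what working with such a stream can involve, here is a minimal sketch that assumes the stream arrives as Server-Sent Events with a JSON payload on each `data:` line. The function name `iter_sse_events` and the sample lines are made up for illustration and are not taken from the post or from the Wikimedia documentation; the sample stands in for a live connection so the sketch runs offline.

```python
import json
from typing import Iterable, Iterator


def iter_sse_events(lines: Iterable[str]) -> Iterator[dict]:
    """Yield the decoded JSON payload of each Server-Sent Event in `lines`."""
    data_parts = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line.startswith("data:"):
            # The SSE format drops one optional space after the colon.
            value = line[len("data:"):]
            if value.startswith(" "):
                value = value[1:]
            data_parts.append(value)
        elif line == "" and data_parts:
            # A blank line ends the event; multi-line data is joined with "\n".
            yield json.loads("\n".join(data_parts))
            data_parts = []
        # Other fields (event:, id:) and comment lines (":...") are ignored.


# Hypothetical sample data, invented for this sketch (not real edits).
SAMPLE_LINES = [
    "event: message",
    'data: {"wiki": "enwiki", "title": "Archaeology", "type": "edit"}',
    "",
    ":heartbeat comment",
    "event: message",
    'data: {"wiki": "dewiki", "title": "Beispiel", "type": "new"}',
    "",
]

events = list(iter_sse_events(SAMPLE_LINES))
for event in events:
    print(event["wiki"], event["type"], event["title"])
```

To read a live stream, the same function could be fed the lines of a streaming HTTP response instead of `SAMPLE_LINES`; which endpoint and client library to use is left open here, since the excerpt does not say.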

A Machine Learning Workflow

Machine learning is an essential part of data science – a field which covers a range of activities from data acquisition and cleaning through to analytics and data visualisation. It can be helpful to think in terms of a machine learning workflow that gives structure to some of these processes. This post looks at…