These days I’ve been experimenting with cheap, non-intrusive smart meters that can be installed and programmed with minimal effort. Of course, I am referring here to setting up the
Category: Data science
Data correlation and clustering in PySpark
Most computations today are performed on cloud infrastructure, and much of it relies on Hadoop and Apache Spark. In this tutorial, I will show how you can set up
Making simple forecasts (with examples)
Wait… is it prediction or forecast? We often use the two terms interchangeably (I know I have), but they in fact refer to completely different things. Simply put, forecasting is
Simple analytics (with examples)
In a previous post, I explained how smart grid time series data can be cleaned in preparation for data analysis. Not all analyses involve predictions. In fact, the preliminary information
Cleaning up smart grid data (with examples)
You probably noticed in the featured image of this post that there are some funny negative values in the energy data. You might have seen some gaps as well. All
Smart grid data and time
When talking about smart grid data, it is hard to avoid the time variable. A blackout occurred at a specific time; high consumption will be expected between 1