End-to-end data science project from data generation to machine learning using applied DIY projects that you can build using MicroPython, C++ or even Python to read sensors and take decisions. This is combined with a support from Docker, Kubernetes, GCP and AWS.
Fig. 1: Data diagrams on Google Data Studio
This is the third tutorial from the series "implementing real-time data pipelines - from generation to models" and the other tutorials are the following:
Two weeks ago, I published a tutorial that explains how to connect an M5Stack running MicroPython to the Google Cloud Platform using the IoT Core, and I did mention that upcoming tutorials will examine the following topics:
This tutorial is about managing a Docker Engine remotely using Portainer connected to the protected Docker daemon socket (
TCP port 2376). By default, you can manage Docker locally through a non-networked UNIX socket (option
-v /var/run/docker.sock:/var/run/docker.sockwhile running Portainer). But, if you want the Docker Engine to be reachable through the network in a safe manner, you need to enable TLS by specifying the
--tlsverifyflag and pointing Docker’s
--tlscacertflag to a CA certificate. Then, the daemon only accepts connections from clients that are authenticated by a certificate si...
Analytics | 12 min | 103957
PEP-8 (sometimes PEP 8 or PEP8) is a coding standard and style guide for readability and long-term maintainability of code in Python. It was written in 2001 by Guido van Rossum, Barry Warsaw, and Nick Coghlan and provides guidelines and best practices on how to program Python code. PEPs stand for Python Enhancement Proposals, and they describe and document the way Python language evolves, providing a reference point (and in some way a standardization) for the pythonic way to write code.
This tutorial presents some of the most important key points of PEP-8. If you want to, you can read the ful...
Attention Docker Hub users - Docker Hub has been hacked!
An email containing the following highlight was sent to the users whose account data may have been exposed.
During a brief period of unauthorized access to a Docker Hub database, sensitive data from approximately 190,000 accounts may have been exposed (less than 5% of Hub users). Data includes usernames and hashed passwords for a small percentage of these users, as well as Github and Bitbucket tokens for Docker autobuilds (full email).
If you got this email you should (and if you didn't receive that email, do it too ;)):
This post is about data inside Docker containers. As I mentioned in the last post #Analytics: Docker for Data Science Environment, data in Docker can either be temporary or persistent. In this tutorial, I will focus on Docker volumes, but I will include some info about temporary data and bind mounts too.
Fig 1: Data in Docker container (source)
Inside a Docker container, there are two ways in which data can be kept temporarily. By default, files created inside a container are stored in the writable layer of the container. You do not have to do anything, but every...
Analytics | 3 min | 3681
Portainer is a management UI which allows you to easily manage your different Docker environments.
This is what I will try to accomplish in this tutorial. You will be able to:
- Run Portainer for Docker management on Windows, Linux and on a Cloud platform
- Start container with a predefined admin password, in case you are on a public network
If you are running Docker on Ubuntu or Mac OSX, you can start using Portainer as a Docker Container typing the following:
docker volume create portainer_data docker run -d -p 9000:9000 --name portainer --restart always -v /var/run/docker.sock:...
Analytics | 8 min | 21041
Today, I am opening a new section on my blog, and this time it is about analytics. As you may know, I've been working in research on IIoT and analytics for the last years, but up to now my blog has only shown my hobbyist projects. I want to change the focus of my website a little bit and add something about data analytics, machine learning, Docker technology etc. Everything that I will be publishing in this section is not new. There are many tutorials and great YouTube videos that explain these topics too, but I am going to focus on building an end-to-end data science project using some of the...
Analytics | 9 min | 18106
Docker is a technology that emerged for about 5 years and since then it has simplified the packaging, distribution, installation and execution of (complex) applications. Usually applications consist of many components that need to be installed and configured. Installing all needed dependencies and configuring them correctly is usually time consuming and frustrating for users, developers and administrators. Here is where Docker comes to simplify this process allowing developers and users to package these applications into containers.
A container image is lightweight, stand-alone, executable pa...
Analytics | 2 min | 34359
Docker is a technology that emerges for about 5 years and simplifies the packaging, distribution, installation and execution of (complex) applications. Portainer is a management UI which allows you to easily manage your different Docker environments. If you are here because of the post title, I do not have to explain anymore about the Docker technology and the management tool Portainer. But, if you need to know more about these two topics, I leave you two links:
Let's start with the installation of Portainer for Docker management on Windows 10 (running on a Linux Container)...