Blog
Network Insight.

For Prometheus monitoring, we want as many metrics as possible. These need to be stored so we can follow trends, understand what has happened historically, and better predict any issues. So, there are several parts to a Prometheus monitoring solution: we must collect the metrics, known …

Prometheus Monitoring

In this post, I would like to discuss Prometheus monitoring and its pull-based approach to metric collection. First, let us roll back in time, say ten years, and look at monitoring. Traditionally, you would have used something like Ganglia. Ganglia was often used to monitor CDN networks involving several …

Service Level Objectives

To support the new variations, your infrastructure is in the midst of a paradigm shift. As systems become more distributed and complex, methods for building and operating them are evolving, making visibility into your services and infrastructure more important than ever. All of this leads you to adopt new practices, such as Observability …

OpenShift | Networking

Traditional data center networks face several challenges that leave them unable to support today’s types of applications, such as microservices and containers. Therefore, we need a new set of networking technologies built into OpenShift that can more adequately deal with today’s landscape changes. Firstly, one of the main …

Observability and Controllability

What Is a Metric: Good for Known. When it comes to observability and controllability, one needs to understand the shortcomings of the metric. In reality, a metric is a single number, with tags optionally appended for grouping and searching those numbers. Metrics are disposable and cheap and have a …

Traditionally, we have built our networks based on a hierarchical design. This is often referred to as the traditional data center with a three-tier design, where we had an access layer, an aggregation layer, and a core layer. Historically, this design enabled a substantial amount of predictability because aggregation switch …

Service Level Objectives (SLOs)

Site Reliability Engineering (SRE) teams have tools such as Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Error Budgets that can guide them on the road to building a reliable system with the customer viewpoint as the metric. These new tools or technologies form the basis for a reliable …

Observability Platform

To understand the difference between observability and monitoring, we first need to discuss the role of monitoring. Monitoring is an evaluation that helps identify the most valuable and efficient use of resources. So the big question I put to you is: what should you monitor? This is the first step to …

OpenShift Security

Securing containerized environments is considerably different from securing the traditional monolithic application because of the inherent nature of the microservices architecture. We went from one to many, and there is a clear difference in attack surface and entry points. So there is a lot to consider for OpenShift …

System Observability

A considerable drive for innovation has spawned several megatrends that have affected how we manage and view our network infrastructure and have created the need for observability. In reality, we have seen the decomposition of everything, from one to many. Many services and dependencies in multiple locations need …

System Reliability

There have been considerable shifts in our environmental landscape that have caused us to examine how we operate and run our systems and networks. We have had a major shift with the introduction of various cloud platforms, their services, and containers, along with the complexity of managing distributed systems that …

Chaos Engineering

In the past, applications ran in single private data centers, potentially two data centers for high availability. There may have been some satellite PoPs, but generally everything was housed in a few locations. These types of data centers were on-premises, and all components were housed internally. As a result, …