The $23 Million DNS Disaster: Why CoreDNS is the Internet’s New Superhero

The DNS Revolution That’s Changing Everything

Last December, a single DNS misconfiguration at a major streaming platform caused a global outage that cost $23 million in lost revenue and affected 180 million users during the World Cup final. The root cause? Their legacy DNS server couldn’t handle the traffic spike, taking 47 minutes to resolve the issue.

Meanwhile, their competitor running CoreDNS experienced the same traffic surge but stayed online, gaining 2.3 million new subscribers that day.

This isn’t just another “infrastructure matters” story. This is about the invisible foundation of the internet that separates digital empires from digital disasters.

Continue reading “The $23 Million DNS Disaster: Why CoreDNS is the Internet’s New Superhero”

Integration of Prometheus with Cortex

As we promised in our previous blog Prometheus as Scale – Part 1 that in our next blog we will be writing about the implementation part of Cortex with Prometheus, so here we are with our promise. But before going to the implementation part, we would suggest you guys go through our first blog to know the need for it.

Previously we talked that Prometheus is becoming a go-to option for people who want to implement event-based monitoring and alerting. The implementation and management of Prometheus are quite easy. But when we have a large infrastructure to monitor or the infrastructure has started to grow you require to scale monitoring solution as well.

A few days back we were also in a similar kind of situation where one of our client’s infrastructure was growing as per the need and they need a resilient, scalable, and reliable monitoring system. Since they were already using the Prometheus, so we explored our option and came across an interesting project called “Cortex“.

Continue reading “Integration of Prometheus with Cortex”

Agentless Monitoring: Integrating Supabase Metrics with Grafana Cloud

The Power of Agentless Monitoring

Before we dive into the technicalities, let’s understand the core benefits of agentless monitoring. It’s like having a silent guardian for your application, one that doesn’t require the extra resources or management overhead that traditional agents do. This means less complexity, less maintenance overhead, and better accuracy in your monitoring.

TLDR; If you have a service exposing prometheus compatible endpoint, then you can scrape those metrics directly with services like Grafana Cloud without needing any intermediate agent.

Why Supabase and Grafana Cloud

Supabase is making waves as a top pick for developers needing a backend service, thanks to its solid PostgreSQL base and ease of use.

Grafana Cloud brings a lot to the table, with its easy-to-use features like drag-and-drop dashboards, smart alerts, and even some clever machine learning tricks to help you spot and fix problems before they blow up. In short, Grafana Cloud doesn’t just make monitoring simpler; it makes it smarter, helping developers keep their apps running smoothly and their users happy.

Continue reading “Agentless Monitoring: Integrating Supabase Metrics with Grafana Cloud”

Deploying Prometheus and Grafana on Kubernetes

Monitoring a Kubernetes Cluster is the need of the hour for any application following a microservices architecture. There are a bunch of solutions that one can implement to monitor their Kubernetes workload and one of them is Prometheus and Grafana. This article will help you to deploy Prometheus and Grafana in your kubernetes cluster with the help of prometheus-operator.

But before setting up these components let’s understand a bit about each of them.

Prometheus

Prometheus is a pull-based open-source monitoring and alerting tool originally built by SoundCloud. It works on a time-series database and scrapes metrics at a given interval from HTTP endpoints. After Kubernetes, Prometheus joined the Cloud Native Computing Foundation in 2016 as the second hosted project.

Alertmanager

The Alertmanager takes care of alerts sent by alerting tools such as the Prometheus server. It handles grouping, silencing, and routing them to the correct receiver integration such as email, PagerDuty, Slack, etc. It also supports the inhibition of alerts.

Grafana

Grafana is the visual representation of metrics collected by a data source which in our case happens to be Prometheus. We can create or import dashboards for grafana which will make use of promQL to visually represent metrics collected by Prometheus.

Continue reading “Deploying Prometheus and Grafana on Kubernetes”