Understanding COW and MOR in Apache Hudi: Choosing the Right Storage Strategy 

Apache Hudi (Hadoop Upserts Deletes and Incrementals) is a powerful framework designed for managing large datasets on cloud storage systems, enabling efficient data ingestion, storage, and retrieval. One of the key features of Hudi is its support for two distinct storage types: Copy-On-Write (COW) and Merge-On-Read (MOR). Each of these storage strategies has unique characteristics and serves different use cases. In this blog, we will explore COW and MOR.  Continue reading “Understanding COW and MOR in Apache Hudi: Choosing the Right Storage Strategy “

Setup Cross Cluster Replication for Data migration in Elasticsearch

 

Elasticsearch Cross Cluster Replication (CCR) is a powerful feature in Elasticsearch that facilitates seamless data migration and disaster recovery by allowing real-time replication of data from a remote source cluster to a target cluster. In this guide, we’ll cover setting up CCR to migrate data from a Remote Cluster in Singapore to a Local Cluster in Mumbai. Continue reading “Setup Cross Cluster Replication for Data migration in Elasticsearch”

Addressing the Rise of Cloud Security Threats: Best Practices for 2024-25

Cloud technologies have become essential for businesses seeking scalability and flexibility. However, as cloud adoption grows, so do the risks associated with securing these environments.

Cyberattacks, data breaches, and misconfigurations are increasingly targeting cloud infrastructures, making robust security measures a necessity.

To protect sensitive data and ensure business continuity, organizations must adopt proactive strategies to address these evolving threats.

In this article, we’ll outline best practices to strengthen cloud security and reduce vulnerabilities.

Continue reading “Addressing the Rise of Cloud Security Threats: Best Practices for 2024-25”

A Fun and Easy Guide to Monitoring and Observability With Prometheus

Hi Guys !! I am back with another interesting blog where we learn the concepts but in a funny and easy way.

What is Prometheus ?

Boring Version 💤💤

Prometheus is an open-source monitoring and alerting toolkit originally developed by SoundCloud in 2012. It was designed to monitor systems, track metrics, and trigger alerts based on those metrics. Prometheus uses a powerful query language called PromQL to collect and analyze time-series data from various services and applications. It stores data in a time-series database, making it easy to track trends over time. Prometheus is now a part of the Cloud Native Computing Foundation (CNCF) and is widely used in cloud-native environments for monitoring microservices, containers, and more.

Funny Version 😂😂

Imagine throwing a party where you need to keep track of everything — from who’s dancing to how loud the music is. Prometheus is like your super-organized friend who monitors it all in real-time, sending you alerts if the punch bowl is low or if a conga line breaks out. Born at SoundCloud in 2012, Prometheus quickly became the ultimate party planner for techies, ensuring everything runs smoothly in the cloud-native world.

Continue reading “A Fun and Easy Guide to Monitoring and Observability With Prometheus”