Talk to an Expert Start for Free

Last9 engineering

Posts about code, practices and experience of building last9

Extracting Account-Level CDN Metrics from Akamai Logs with Last9: A Practical Guide

Extracting Account-Level CDN Metrics from Akamai Logs with Last9

Prathamesh Sonpatki, Aditya Godbole

Last9’s Single Pane for High Cardinality Observability

Think Data Warehouse, NOT Database.

What needs to change in software monitoring?

Back to the Future: The R-C-A of alerting

Back to the Future: The R-C-A of alerting

Software Monitoring — Stuck in the 00s

Software Monitoring — Stuck in the 00s

Why your monitoring costs are high and how you can reduce them with Levitate

Why your monitoring costs are high

Deliver all your orders this December 31st 😉

The unresolved cost of High Cardinality

Prathamesh Sonpatki

A Time Series Data Warehouse vs A Time Series Database

Why you need a Time Series Data Warehouse

Real-Time Canary Deployment Tracking with Argo CD & Levitate

Real-Time Canary Deployment Tracking with Argo CD & Last9

Monitor Google Cloud Functions using Pushgateway and Levitate

Golang Concurrency Masterclass by Swati Modi at Gophercon 2023

Golang Concurrency Masterclass by Swati Modi at Gophercon 2023

Do more with your metrics by Piyush Verma at GopherConIndia 2022

Do more with your metrics by Piyush Verma

Unwiring High Cardinality - SRE Day 2023

Unwiring High Cardinality - SRE Day 2023

How to restart Kubernetes Pods with kubectl

How to restart Kubernetes Pods with kubectl

Levitate: Last9’s Managed TSDB Now on AWS Marketplace

Prathamesh Sonpatki

Standardize PromQL with Macros

PromQL Macros in Levitate

Prathamesh Sonpatki

GCP Managed Service For Prometheus vs. Levitate

Prathamesh Sonpatki

A case for Observability outside engineering teams

A case for Observability outside engineering teams

Understanding the Rasmussen model for failures

Understanding the Rasmussen model for failures

How we tame High Cardinality by Sharding a stream

How we tame High Cardinality by Sharding a stream

1979, a nuclear accident and SRE

1979, a nuclear accident and SRE

How we tame high cardinality in time series databases: Part 1

How we tame high cardinality in time series databases

Piyush Verma, Swati Modi

What Site Reliability Engineering needs — A swarm of rogue bees

What Site Reliability Engineering Needs: A Swarm of Bees

Take back control of your Monitoring with Levitate

Take back control of your Monitoring

Observability is a practice, not a job

Observability is a practice, not a job

Using a Golang package in Python using Gopy

Using a Golang package in Python using Gopy

Who should define Reliability — Engineering, or Product

Who should define Reliability — Engineering, or Product?

OSS vs Paid vs Managed OSS — Picking what works for your Observability journey

Observability—OSS vs Paid vs Managed OSS

Satyajeet Jadhav

Learnings integrating jmxtrans with Levitate

Learnings integrating jmxtrans

The neglected tech arctic winter — Internal SaaS expenses

What does "Cricket scale" mean for a Site Reliability Engineer?

Understanding “Cricket Scale”

What is MTBI?

Do your alerting tools improve outcomes for Business?

Rethinking Anomaly Detection: Focus on business outcomes

A good chunk of SRE woes can be traced back to the stronghold tribal knowledge across teams 😵‍💫

Observability is dead, long live observability

Self-managed Prometheus vs Managed Prometheus

The importance of structured communication in the world of SRE

The importance of structured communication in the world of SRE

The difference between DevOps, SRE, and Platform Engineering

The difference between DevOps, SRE, and Platform Engineering

Prathamesh Sonpatki

Golang's Stringer tool

How to improve Prometheus remote write performance at scale

India vs Pakistan: SRE and the Shannon Limit

Satyajeet Jadhav

Battling Alert Fatigue

Battling Alert Fatigue

Guide to Service Level Indicators and Setting Service Level Objectives

SLOs, SLIs, and SLAs: Understanding Key Service Metrics

Kubernetes Monitoring with Prometheus and Grafana

Kubernetes Monitoring with Prometheus and Grafana

Why We Auto-Delete Slack Messages at Last9

Why We Auto-Delete Slack Messages at Last9

Static Threshold vs. Dynamic Threshold Alerting

Static Threshold vs. Dynamic Threshold Alerting

How we won Dukaan over

How to calculate HTTP content-length metrics on cli

Choosing Effective SLIs

Running a Database on EC2 is Slowing It Down

Running a Database on EC2 is Slowing It Down

Jayesh Bapu Ahire, Akshay Chugh

Doing SRE the Right Way!

Doing SRE the Right Way!

Microservices - Tracking Dependencies

Microservices - Tracking Dependencies

Akshay Chugh, Jayesh Bapu Ahire

SLOs eased

SLOs eased

Piyush Verma, Saurabh Hirani

Rescuing a SPAghetti React project

Prathamesh Sonpatki

One year at Last9

One year at Last9

Prathamesh Sonpatki