Steal our best ideas — seriously.
We've spent years untangling logs. Now, we're sharing the tools, templates, and frameworks that helped us get there.
Observability Maturity Assessment
A comprehensive 20-point checklist designed to evaluate where your team stands today versus where you want to be. We cover alerting, distributed tracing, documentation, and on-call culture.
Use this to baseline your current state, identify gaps, and create a roadmap for improvement. It's the same framework we use for our enterprise clients.
Get the PDF (Free)The Assessment
PDF • 12 Pages • 2023 Edition
Everything you need to build better pipelines
Alert Template Library
Pre-built templates for Datadog, Grafana, and PagerDuty. Stop writing the same alert logic over and over.
Log Schema Cheat Sheet
A quick-reference guide for structuring JSON logs. Ensure your parsers don't break when new fields are added.
Incident Severity Matrix
Define what "Critical" actually means. A standardised severity scale to reduce alert fatigue across teams.
On-Call Schedule Builder
A spreadsheet template for rotating on-call duties. Includes rotation logic, handoff checklists, and escalation paths.
Past Webinar Recordings
-
OCT 24
Building Resilient Pipelines
How to design logging architectures that survive Kubernetes chaos.
-
SEP 12
The Art of the Post-Mortem
Moving from blame culture to systemic improvement.
-
AUG 05
Reducing Alert Fatigue
Strategies for trimming noise without losing signal.
Ask us directly
New resources every month
Join 2,000+ engineers getting smarter. We share templates, case studies, and industry insights directly to your inbox.