Postgres Emergency Room
Nikolay and Michael discuss PostgreSQL emergencies — both the psychological side of incident management, and some technical aspects too.
Here are some links to things they mentioned:
Here are some links to things they mentioned:
- Site Reliability Engineering resources from Google https://sre.google
- GitLab Handbook SRE https://handbook.gitlab.com/job-families/engineering/infrastructure/site-reliability-engineer
- Keeping Customers Streaming — The Centralized Site Reliability Practice at Netflix https://netflixtechblog.com/keeping-customers-streaming-the-centralized-site-reliability-practice-at-netflix-205cc37aa9fb
- Our monitoring checklist episode https://postgres.fm/episodes/monitoring-checklist
- Hannu Krosing talk on Postgres TV — Do you vacuum everyday? https://www.youtube.com/watch?v=JcRi8Z7rkPg
- Our episode on corruption https://postgres.fm/episodes/corruption
- Nikolay’s episode on stopping and starting Postgres faster https://postgres.fm/episodes/stop-and-start-postgres-faster
- Our episode on out of disk https://postgres.fm/episodes/out-of-disk
- The USE method (Brendan Gregg) https://www.brendangregg.com/usemethod.html
- Thundering herd problem https://en.wikipedia.org/wiki/Thundering_herd_problem
- pgwatch2 Postgres AI edition https://gitlab.com/postgres-ai/pgwatch2
~~~
What did you like or not like? What should we discuss next time? Let us know via a YouTube comment, on social media, or by commenting on our Google doc!
What did you like or not like? What should we discuss next time? Let us know via a YouTube comment, on social media, or by commenting on our Google doc!
~~~
Postgres FM is produced by:
- Michael Christofides, founder of pgMustard
- Nikolay Samokhvalov, founder of Postgres.ai
With special thanks to:
- Jessie Draws for the elephant artwork