Follow the latest DevOps job listings
Partners:
Monitoring & Logging
Marian Marinov
mm@yuhu.biz
Who am I?
● Director of Engineering at Web Hosting Canada
● Former partner and Head of DevOps at SiteGround
● A SysAdmin and System Architect
What do I have to monitor?
● 13 physical Linux machines
○ Storage capacity (df/df -i)
○ S.M.A.R.T. of the drives
○ RAID (HW or Soft)
○ Network (routes, traffic and usage)
○ Performance (CPU, Mem, I/O, Processes)
○ Kernel logs
○ Service logs
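The storage capacity item (df/df -i) can be sketched in Python as a minimal check, assuming a Linux host. The function names, the 90% limit and the result keys are illustrative assumptions, not something from the talk, and the percentages are only roughly what df prints.

```python
import os


def usage_percent(total: int, available: int) -> float:
    """Return the used share in percent; 0.0 when the total is zero."""
    if total <= 0:
        # Some filesystems report 0 inodes; treat that as "nothing to measure".
        return 0.0
    return 100.0 * (total - available) / total


def check_mount(path: str, limit: float = 90.0) -> dict:
    """Report block and inode usage for one mount point (like df and df -i)."""
    st = os.statvfs(path)
    blocks_pct = usage_percent(st.f_blocks, st.f_bavail)  # space, as in df
    inodes_pct = usage_percent(st.f_files, st.f_favail)   # inodes, as in df -i
    return {
        "path": path,
        "blocks_pct": blocks_pct,
        "inodes_pct": inodes_pct,
        # Hypothetical alert rule: either resource above the limit.
        "alert": blocks_pct >= limit or inodes_pct >= limit,
    }


if __name__ == "__main__":
    print(check_mount("/"))
```

Checking inodes separately matters because a filesystem with many small files can run out of inodes while still showing free space.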
What do I have to monitor?
● 1 UPS
● 2 APC PDUs
● 2 Switches (SNMP statistics)
● 2 Thermostats (traffic, temp, humidity)
● 40+ LXC containers
○ Performance (CPU, Mem, I/O, Processes)
○ Storage capacity (df/df -i)
○ Service logs
● 2-3 Wi-Fi access points
○ number of attached devices
○ traffic per-device
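Traffic figures from switches and access points usually come from ever-increasing byte counters (for example the SNMP IF-MIB objects ifInOctets, 32-bit, and ifHCInOctets, 64-bit), so a rate has to be derived from two readings. A minimal sketch of that arithmetic follows. The function name is hypothetical, and it assumes the counter wrapped at most once between the two polls.

```python
def counter_rate(prev: int, curr: int, seconds: float, bits: int = 32) -> float:
    """Average units per second between two readings of a wrapping counter.

    Assumption: the counter wrapped at most once between the readings.
    """
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    delta = curr - prev
    if delta < 0:
        # The counter passed its maximum and restarted from zero.
        delta += 1 << bits
    return delta / seconds


if __name__ == "__main__":
    # Two polls 30 seconds apart, 3000 bytes transferred in between.
    print(counter_rate(1000, 4000, 30))
```

On fast links a 32-bit counter can wrap more than once per polling interval, which is one reason to prefer the 64-bit counters where the device offers them.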
What do I have to monitor?
● A few things for which I want traffic and power-on time
○ 3 TVs
○ 3 Amplifiers
○ 4 Cameras
○ 1 Washing machine
○ 1 Dryer
What I wanted
● Single solution for log and metrics collection
● Single central interface
What I ended up having
● multiple Grafana dashboards
● monitor events, instead of reading logs
● a bunch of different log collectors
What I tested
● syslog-ng
● rsyslog
● Filebeat
● Prometheus node_exporter
● Loki
● Fluentd
● collectd
● StatsD
● Graylog
● PostgreSQL + TimescaleDB
● Grafana
Conclusions
● there is no one solution to rule them all
● SNMP is still the king for networking
● too many logging formats and DSLs
● collectd was the easiest
○ with the most metrics out-of-the-box
● Elasticsearch + Kibana require too many resources
○ Not usable for smaller setups
● Graylog uses a lot of CPU for the work it does
○ alerts can be based on number of events instead of parsing logs
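The idea of alerting on the number of events rather than on parsed log content can be sketched as a sliding-window counter. This is a minimal illustration, not how Graylog implements it. The class name, the threshold and the window length are all assumptions.

```python
from collections import deque


class EventRateAlert:
    """Fire when at least `threshold` events fall inside the time window."""

    def __init__(self, threshold: int, window_seconds: float):
        self.threshold = threshold
        self.window = window_seconds
        self.events = deque()  # timestamps of recent events, oldest first

    def record(self, timestamp: float) -> bool:
        """Record one event and return True if the alert condition holds."""
        self.events.append(timestamp)
        # Drop events that are a full window (or more) older than this one.
        while self.events and self.events[0] <= timestamp - self.window:
            self.events.popleft()
        return len(self.events) >= self.threshold


if __name__ == "__main__":
    alert = EventRateAlert(threshold=3, window_seconds=60)
    for ts in (0, 10, 20, 100):
        print(ts, alert.record(ts))
```

Only timestamps are kept, so the check stays cheap no matter how large or oddly formatted the log lines are.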
Installation / Setup
● basic apt-get:
○ rsyslog, syslog-ng, fluentd, collectd, filebeat, loki, node_exporter
○ StatsD required a full npm installation
Pros and Cons
● Syslog pros
○ can easily ingest netconsole kernel logging
○ very good performance
○ well documented and standardized interface
● Syslog cons
○ fire and forget
○ the syslog protocol
○ not enough parsing flexibility
○ syslog-ng was heavier than rsyslog
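To make the "fire and forget" and "the syslog protocol" points concrete, here is a minimal sketch of a BSD-style syslog datagram. The priority value is facility * 8 + severity, and a UDP send returns without any confirmation that the message arrived. The function names, the tag and the target address are illustrative assumptions, and the message is simplified: it omits the timestamp and hostname fields, which many receivers fill in themselves.

```python
import socket

# Standard syslog facility and severity codes.
FACILITY_KERN = 0
FACILITY_LOCAL0 = 16
SEVERITY_ERR = 3
SEVERITY_INFO = 6


def format_message(facility: int, severity: int, tag: str, text: str) -> bytes:
    """Build a simplified syslog payload: <PRI>tag: text."""
    pri = facility * 8 + severity
    return f"<{pri}>{tag}: {text}".encode("utf-8")


def send_udp(payload: bytes, host: str = "127.0.0.1", port: int = 514) -> None:
    """Send one datagram; nothing tells the sender whether it was received."""
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.sendto(payload, (host, port))


if __name__ == "__main__":
    message = format_message(FACILITY_LOCAL0, SEVERITY_INFO, "demo", "hello")
    print(message)
    send_udp(message)  # succeeds even if no syslog daemon is listening
```

Because the sender gets no acknowledgement, a dropped datagram is simply lost, which is the trade-off behind the good performance listed among the pros.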
Pros and Cons
● Loki/node_exporter/Filebeat/Fluentd pros
○ very good parsing capabilities
○ Filebeat was the easiest for me
○ reliable log delivery
○ different integrations
○ ready-made Grafana dashboards
● Loki/node_exporter/Filebeat/Fluentd cons
○ very heavy on CPU
○ Loki did not have a SysV init script :)
Interesting
● OAIEvals Collector - by Nikolay Stankov
DB integrations
1. Prometheus node_exporter
2. Fluentd
3. Filebeat
4. syslog
Not out of the box
● Custom local collectors still have to go directly to your metrics DB
● Having a producer/subscriber greatly reduces the performance hit
● Fluentd and Filebeat were the only ones supporting Kafka out of the box
○ https://github.com/hikhvar/mqtt2prometheus
○ https://github.com/toyokazu/fluent-plugin-mqtt-io
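The producer/subscriber point can be illustrated with an in-process queue standing in for a broker such as Kafka or MQTT. This is only a sketch under that assumption. The function name and batch size are hypothetical, and appending to a list stands in for the write to the metrics DB.

```python
import queue
import threading


def run_pipeline(samples, batch_size: int = 3):
    """Push samples through a queue; return the batches the subscriber wrote."""
    q = queue.Queue()
    batches = []  # stand-in for batched writes to a metrics DB

    def subscriber():
        batch = []
        while True:
            item = q.get()
            if item is None:  # sentinel: the producer is done
                break
            batch.append(item)
            if len(batch) >= batch_size:
                batches.append(batch)
                batch = []
        if batch:
            batches.append(batch)  # flush the partial last batch

    worker = threading.Thread(target=subscriber)
    worker.start()
    for sample in samples:
        q.put(sample)  # the producer only enqueues; it never waits on the DB
    q.put(None)
    worker.join()
    return batches


if __name__ == "__main__":
    print(run_pipeline([1, 2, 3, 4, 5, 6, 7]))
```

The collector's cost per sample is a queue insert, while the slower storage write happens once per batch on the subscriber side.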
Thank you!
Next event
Topic: Monitoring & Logging
Speaker: Marian Marinov
Date: 19.Mar.2024
Language: Bulgarian
Contacts:
Marian Marinov
GitHub profile
Facebook profile
What do I have on the containers?
● NextCloud
● Home Assistant
● Mirrors
● VPNs
● NetBox
● Monitoring (Grafana, StatPing)
● Games (Minecraft, CS, PVPGN)
● IRC (server, bouncers, bots)
● Matrix, Mattermost
● Backups
● Streaming (FOSDEM streamer setup)
● DBs (PostgreSQL, MySQL, Redis, DragonFly, Timescale, InfluxDB, Mongo)
● Vitess, ProxySQL
● MPI (Gearman, MQTT, Kafka, RabbitMQ)
● Web stuff - Wiki, HAProxy, Nginx, Varnish
● OpenShift, OpenStack, K8s on VMs and physical
● A lot of other experiments
What storage do I use?
● Local + LVM
● DRBD+OCFS2
● iSCSI
● cLVM + iSCSI
● GlusterFS
● OrangeFS
● I had in the past:
○ Ceph
○ NFS
○ cLVM + ATAoE
○ cLVM + NBD


Dev.bg DevOps March 2024 Monitoring & Logging
