Reconnoiter:
                 Large-scale trending and fault-detection



                             / another product built from pain



Sunday, August 1, 2010
Goals




                    •    make checks cheap: 10000+ checks on cheap hardware

                    •    centralized configuration management.

                    •    decentralized configuration manipulation.

                    •    decouple data collection from

                         •   visualization/trending

                         •   fault-detection

                    •    make life suck just a little less.




Architectural Design




System Components

                    •    noitd

                         •   C, hybrid thread/event model, async I/O, small, fast, efficient,

                         •   extensible in Lua.

                    •    stratcond

                         •   C, brokers and aggregates data, feeds PostgreSQL,

                         •   feeds Esper complex event processing system for fault-detection,

                         •   has a comet-style webserver built in for feeding web clients.

                    •    postgres

                    •    webconsole

                         •   PHP, almost entirely AJAX based,

                         •   uses canvas to draw.




Some basics on the architecture



                    •    Everything important happens over SSL:

                         •   Services exposed over SSL with certificates,

                         •   Client connects using client certificates as well.

                    •    Designed so that data collection is journalled and replayed

                         •   prevents data loss due to transient networking issues
                             between NOC and data center.

                    •    There are a lot of moving parts...

                         •   designed to work when the parts don’t get along.
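
The journal-and-replay idea above can be sketched in a few lines of shell. This is only an illustration of the pattern, not Reconnoiter's actual journal format or protocol: the file layout, the `record` and `replay` functions, and the line-count checkpoint are all assumptions made for the example. Each collected line is appended to a local journal first; delivery is attempted afterwards, and the checkpoint advances only when a line is accepted, so a transient network failure leaves the data in place for the next attempt.

```shell
#!/bin/sh
# Illustrative journal-and-replay sketch (hypothetical; not Reconnoiter's format).

journal=$(mktemp)   # append-only local journal
checkpoint=0        # number of journal lines already delivered

# record <line>: always write locally first, whether or not the link is up.
record() {
  printf '%s\n' "$1" >> "$journal"
}

# replay <send-command...>: deliver every undelivered line in order.
# Stops at the first failure and leaves the checkpoint where it was,
# so the same line is retried on the next call.
replay() {
  total=$(( $(wc -l < "$journal") ))
  while [ "$checkpoint" -lt "$total" ]; do
    next=$((checkpoint + 1))
    line=$(sed -n "${next}p" "$journal")
    "$@" "$line" || return 1
    checkpoint=$next
  done
}
```

A caller would pass whatever command ships one line upstream, for example `replay send_upstream`, where `send_upstream` is a hypothetical function that returns non-zero while the link is down.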




Installing the code




                    •    svn co https://labs.omniti.com/reconnoiter/tags/wangle

                    •    cd trunk

                    •    autoconf

                    •    ./configure

                    •    make

                    •    make install




Installing the database


                    •    createdb reconnoiter

                    •    createlang -d reconnoiter plpgsql

                    •    createuser reconnoiter

                    •    createuser stratcon

                    •    createuser prism

                    •    psql -U postgres reconnoiter

                         •   BEGIN;

                         •   \i sql/reconnoiter_ddl_dump.sql

                         •   COMMIT;

                    •    install the crontab in sql/crontab
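
The database steps above can also be run without an interactive psql session. The sketch below is one way to do that, assuming the PostgreSQL client tools are on the PATH and the script is run from the top of the source tree; the `run` and `setup_db` helpers and the `DRY_RUN` switch are inventions for this example. `--single-transaction` makes psql wrap the file in BEGIN/COMMIT, and `ON_ERROR_STOP` aborts on the first error. With `DRY_RUN=1` (the default here) the commands are only printed. On PostgreSQL 9.0 and later PL/pgSQL is installed by default, and `createlang` was removed in PostgreSQL 10, so that step may need to be dropped on newer servers. The crontab step is not covered.

```shell
#!/bin/sh
# Non-interactive sketch of the database setup (helper names are hypothetical).

DRY_RUN=${DRY_RUN:-1}   # 1 = print the commands, anything else = execute them

# run <command...>: print or execute, depending on DRY_RUN.
run() {
  if [ "$DRY_RUN" = 1 ]; then
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

setup_db() {
  run createdb reconnoiter || return 1
  run createlang -d reconnoiter plpgsql || return 1
  for role in reconnoiter stratcon prism; do
    run createuser "$role" || return 1
  done
  # Load the schema in one transaction; stop at the first SQL error.
  run psql -U postgres -d reconnoiter --single-transaction \
      -v ON_ERROR_STOP=1 -f sql/reconnoiter_ddl_dump.sql
}

setup_db
```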




Installing the web console




                    •    The web UI lives in trunk/web/ui

                    •    Set up Apache (left as an exercise for the reader)




SSL is underneath everything



                    •    Set up SSL certs:

                         •   Need a CA (even if a dummy CA)

                         •   need a cert (signed) for each noitd server

                         •   need a client cert (signed) for the stratcond server

                         •   need a web cert (signed) for the stratcond server (future**)

                    •    For more details, run make test and look at:

                         •   test-noit.conf

                         •   test-stratcon.conf
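
As a rough illustration of the certificate list above, the following sketch uses the `openssl` command-line tool to create a throwaway CA and sign one certificate for a noitd server and one client certificate for stratcond. The file names, subject names (`noit-1`, `stratcon`), key size, and lifetime are assumptions chosen for the example; the test configs mentioned above remain the reference for what Reconnoiter itself expects.

```shell
#!/bin/sh
# Throwaway CA plus two signed certificates (names and lifetimes are examples).
set -eu

certdir=$(mktemp -d)
cd "$certdir"

# 1. Dummy CA: a self-signed certificate with an unencrypted key.
openssl req -x509 -newkey rsa:2048 -nodes -days 365 \
  -subj "/CN=Dummy Reconnoiter CA" -keyout ca.key -out ca.crt

# 2. For each party: generate a key and a signing request, then sign it with the CA.
for name in noit-1 stratcon; do
  openssl req -new -newkey rsa:2048 -nodes \
    -subj "/CN=$name" -keyout "$name.key" -out "$name.csr"
  openssl x509 -req -in "$name.csr" -CA ca.crt -CAkey ca.key \
    -CAcreateserial -days 365 -out "$name.crt"
done

# 3. Check that both certificates chain back to the dummy CA.
openssl verify -CAfile ca.crt noit-1.crt stratcon.crt
echo "certificates written to $certdir"
```

A dummy CA like this is only suitable for testing; the private keys are written unencrypted to a temporary directory.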




Configure noitd




                         •   noit.conf

                         •   pretty good as a starting point, just clear out all the checks

                         •   module loading is at boot time, so make sure all the modules you
                             want are loaded.

                         •   checks can be added at run time via the text console.

                         •   noitd must be started as root; it drops privileges after module
                             initialization.

                         •   /usr/local/sbin/noitd -c /usr/local/etc/noit.conf




Configure stratcond




                         •   stratcon.conf

                         •   pretty good as a starting point, just clear out all the noit addresses

                         •   each noit in the field requires a line in the stratcon.conf file:

                             •   <noit address="10.225.209.25" port="43191" />

                         •   database passwords must be configured

                         •   hostname and document_domain must be configured

                         •   /usr/local/sbin/stratcond -c /usr/local/etc/stratcon.conf
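
Since every noit needs its own line, a larger fleet is easier to handle by generating those lines. The helper below is a hypothetical convenience, not part of Reconnoiter: it prints one `<noit>` element per address, using the port from the example above. The second address is made up for illustration, and where the lines belong inside stratcon.conf should be checked against the shipped sample file.

```shell
#!/bin/sh
# Print one <noit> element per address (hypothetical helper, not part of Reconnoiter).
# usage: noit_lines <port> <address>...
noit_lines() {
  port=$1
  shift
  for addr in "$@"; do
    printf '<noit address="%s" port="%s" />\n' "$addr" "$port"
  done
}

noit_lines 43191 10.225.209.25 10.225.209.26
```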




Thank You




                    •    Documentation at https://labs.omniti.com/trac/reconnoiter

                    •    Commercial support available from OmniTI.

                    •    Please join in.

                    •    OmniTI is hiring: http://omniti.com/is/hiring

                    •    Thanks!



