Greg Bruno
VP of Engineering
Workshop: Building a Cluster with Stacki
Datacenter Architecture
Frontend Services
Services to build backend nodes
◦ DHCP
◦ TFTP
◦ Named (optional)
Services to access backend nodes
◦ SSH key management
◦ Parallel execution shell
Host Configuration Spreadsheet
Backend Installation
Save your Host Configuration spreadsheet as a CSV
Import CSV on frontend
◦ “stack load hostfile file=hosts.csv”
Tell backend nodes to install on their next PXE boot
◦ “stack set host boot backend action=install”
PXE boot all backend nodes
Done!
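The steps above can be sketched in Python. This is a minimal illustration, not part of the original slides: it renders a host spreadsheet as CSV text and lists the two `stack` commands quoted above. The column names, the example host values, and the helper names `hosts_csv` and `install_commands` are assumptions made for illustration; check the column layout against the Host Configuration Spreadsheet format for your Stacki version. The commands are only printed, never executed.

```python
import csv
import io

# Assumed column layout for the host spreadsheet (hypothetical; verify it
# against the documentation for your Stacki version before use).
HOST_COLUMNS = ["NAME", "APPLIANCE", "RACK", "RANK",
                "IP", "MAC", "INTERFACE", "NETWORK"]


def hosts_csv(hosts):
    """Render a list of host dicts as CSV text with a header row."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=HOST_COLUMNS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(hosts)
    return buf.getvalue()


def install_commands(hostfile="hosts.csv"):
    """Return the frontend commands quoted on the slide, in order."""
    return [
        ["stack", "load", "hostfile", f"file={hostfile}"],
        ["stack", "set", "host", "boot", "backend", "action=install"],
    ]


# Example values are made up for illustration only.
example_hosts = [
    {"NAME": "backend-0-0", "APPLIANCE": "backend", "RACK": "0", "RANK": "0",
     "IP": "10.1.1.10", "MAC": "52:54:00:00:00:01",
     "INTERFACE": "eth0", "NETWORK": "private"},
]

if __name__ == "__main__":
    print(hosts_csv(example_hosts), end="")
    for argv in install_commands():
        print(" ".join(argv))
```

Keeping the commands as argument lists makes it straightforward to hand them to `subprocess.run` on a real frontend once the spreadsheet has been reviewed.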
BitTorrent-Inspired Package Installation
Stacki
Custom Partitioning
Storage Partition Spreadsheet
Custom Partitioning
Load the Storage Partition spreadsheet on the frontend
◦ “stack list storage partition”
◦ “stack load storage partition file=partition.csv”
Prep the host for reinstall
◦ “stack list host partition”
◦ “stack remove host partition backend-0-0”
◦ “stack set host attr backend-0-0 attr=nukedisks value=true”
◦ “stack set host boot backend-0-0 action=install”
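The per-host reinstall preparation above can be repeated for several hosts. The following Python sketch only builds and prints the three commands quoted on the slide; the helper names `reinstall_prep_commands` and `render` and the second host name `backend-0-1` are hypothetical. The `nukedisks` attribute name suggests that disks are wiped on reinstall, so treat these commands as destructive and review them before running anything.

```python
def reinstall_prep_commands(host):
    """Return the slide's per-host commands that prepare a reinstall."""
    return [
        ["stack", "remove", "host", "partition", host],
        ["stack", "set", "host", "attr", host, "attr=nukedisks", "value=true"],
        ["stack", "set", "host", "boot", host, "action=install"],
    ]


def render(commands):
    """Join each argument list into one printable command line."""
    return "\n".join(" ".join(argv) for argv in commands)


if __name__ == "__main__":
    # backend-0-1 is a made-up second host used only for illustration.
    for host in ["backend-0-0", "backend-0-1"]:
        print(render(reinstall_prep_commands(host)))
```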
Adding a Pallet
Adding CentOS to Default Box
We will substitute the “os” pallet with the full CentOS 6.7 distribution
◦ “stack list pallet”
◦ “stack add pallet CentOS*iso”
◦ “stack enable pallet CentOS”
◦ “stack disable pallet os”
Create a New Box with CentOS
Making a new box
◦ “stack list box”
◦ “stack add box centos”
◦ “stack enable pallet CentOS box=centos”
◦ “stack enable pallet stacki box=centos”
Assign a host to a new box
◦ “stack list host”
◦ “stack set host box backend-0-0 box=centos”
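The effect of the box commands above can be pictured with a toy model in Python. This is a sketch of the idea only, not Stacki's internal data model: a box is treated as a named set of enabled pallets, and each host points at one box. The starting state (a default box with the os and stacki pallets enabled) and the helper name `enable_pallet` are assumptions.

```python
def enable_pallet(boxes, pallet, box="default"):
    """Mark a pallet as enabled in the named box (toy model only)."""
    boxes.setdefault(box, set()).add(pallet)


# Assumed starting state: one default box with the os and stacki pallets.
boxes = {"default": {"os", "stacki"}}
hosts = {"backend-0-0": "default"}

# stack add box centos
boxes["centos"] = set()
# stack enable pallet CentOS box=centos
enable_pallet(boxes, "CentOS", box="centos")
# stack enable pallet stacki box=centos
enable_pallet(boxes, "stacki", box="centos")
# stack set host box backend-0-0 box=centos
hosts["backend-0-0"] = "centos"

if __name__ == "__main__":
    for name in sorted(boxes):
        print(name, sorted(boxes[name]))
    print(hosts)
```

In this model the default box is left unchanged, so hosts still assigned to it keep installing from the os pallet while backend-0-0 picks up the CentOS pallet on its next install.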
Boxes
Default
◦ OS Pallet
◦ Stacki Pallet
CentOS
◦ CentOS Pallet
◦ Stacki Pallet
Why is this hard and important?
The “Step 0” Problem
• Check namenodes are empty
• Format/start HDFS
• Create all directories
• Create all metastores
• Start services (HBase, Hive, Oozie, Sqoop, Impala, etc.)
• Deploy client configuration
• Configure database
• Set up/assign monitors (activity, services, and host)
• Test database connections
• Validate/resolve hostnames
• Consistent host timezones
• No bad kernel versions running
• (CDH) version consistency
• Java version consistency
• Daemons versions consistency
• Mgmt Agents versions consistency
• Host specification/SSH ports
• MUCH MORE …

• DHCP Server/Client setup
• TFTP/PXE configuration
• Server OS installation
• Node OS Install
• RAID configuration
• Boot configuration
• System/data disk partitioning
• Monitoring system setup and config
• Lights Out/IPMI setup
• User accounts added and synced
• SSH keys on all hosts
• Network node configuration
• Config Mgmt install and configuration
• Route configuration
• OS upgrades/updates
• Site specific software and configuration
• Host specification/SSH ports
• Security
• Firewall setup
• Cluster Mgmt utility
• Database install and config
• Multiple network config
• Package installation
• MUCH MORE …
Clusters are Different
Adding new servers to a cluster does require coordination
Newly added servers must:
• Have the same software stack as the original servers
• Have the same configuration as the original servers
• Know about the original servers
And the original servers must:
• Know about the new servers
Result: The management complexity added to the Operations staff is “exponential”
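One rough way to see why coordination grows faster than the server count is to count the "knows about" relationships listed above. The Python sketch below is an illustration only and is not taken from the slides: it assumes every server in a cluster must know about every other server, and the function names are hypothetical. Under that assumption the count grows quadratically, so it shows the faster-than-linear trend rather than a literal exponential.

```python
def awareness_links(n_servers):
    """Directed 'knows about' links if every server knows every other one."""
    return n_servers * (n_servers - 1)


def links_added(original, new):
    """Extra links created when `new` servers join `original` servers."""
    return awareness_links(original + new) - awareness_links(original)


if __name__ == "__main__":
    for n in (10, 100, 1000):
        print(n, awareness_links(n), links_added(n, 1))
```

Under the same assumption, adding one server to a group of independent machines creates no new links, while adding one to an n-server cluster creates 2n of them.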
Exponential Complexity
[Chart: Management Complexity vs. Number of Servers, with curves for General Data Center and Clusters]
The Pain Curve
[Chart: Management Complexity vs. Number of Servers, with curves for General Data Center and Clusters, annotated with PAIN]
The Pain Threshold
The pain threshold differs for every organization
Function of:
• cluster(s) size
• number of people in Operations
• Operations staff cluster expertise
Moore’s Law
Moore’s Law and Infrastructure Value
What it Means for You
Time is Money
The clock starts ticking when hosts land on your loading dock
Without your applications online, you have a paperweight that consumes power, cooling, and management’s attention

StackiFest16: Building a Cluster with Stacki - Greg Bruno
