SlideShare a Scribd company logo
OpenStack@IIIT-H
Dharmesh Kakadia (@dharmeshkakadia)
Shashank Sahni (@shredder12)
What we do
● Run an Indian Languages Search Engine
● Research
  ○   Information Extraction
  ○   Information Retrieval
  ○   Information Access
  ○   Virtualization and Cloud

● Users of
  ○ OpenStack
  ○ Hadoop
  ○ and lot of other FOSS
Before OpenStack...
Before OpenStack




    source: http://www.codeproject.com/KB/threads/hxgrid/image4.jpg
Problems
● Provisioning
  ○ Adhoc
  ○ Time consuming
  ○ Unmanaged

● User Management
  ○ No resource accounting
  ○ Access Control
  ○ Usage Restriction

● Storage
  ○ Data reliability
  ○ Duplication
More Problems...
● Cluster
  ○   Terrible Resource Utilization
  ○   New deployment => Too much time
  ○   Data Redundancy
  ○   Non-optimal deployments

● Academic
  ○ No cloud platform for experimentation
  ○ Large Scale sandboxed resource provisioning for
    students.
After OpenStack
OpenStack(KVM)
●   7 Compute nodes
    (8GB, quad-core)
●   1 nova-volume(2 TB,
    Raid-1)

Swift
● 3 storage nodes (2TB
  each)

OpenStack(LXC)
● 16 Compute nodes
  (6GB, dual core)
Provisioning
● Pre-configured images to quickly get started.

● VM of any capacity available at any time( 2
  a.m. Sunday morning)

● Snapshots
User Management
● Resource restrictions using Quota

● Project based collaboration and private
  resources

● Usage monitoring
Storage
This wasn't easy. We experimented with
● nova-volume
● Swift(diablo)
● GlusterFS
● Swift(Folsom)(current)
Storage
● Hadoop compatible distributed storage
● Glance image store
● Desktop backup utility using CloudFuse
● Data reliability
● No more Data Fragmentation
OpenStack in Academia
● Research
  ○ Inter cloud migration
  ○ Inter cloud scheduling
  ○ Performance Evaluation

● Resource provisioning for course
  assignments and projects.
  ○ 3 courses
  ○ 350+ students
  ○ 20+ projects
HadoopStack

● Big Data processing on Demand

● Entire ecosystem for Big Data - Hadoop
  Family, Spark, Mahout, R

● Multi-Cloud - OpenStack and AWS.
HadoopStack
Conclusion
● Using OpenStack

● Working with and around OpenStack

●   OpenStack is Awesome !!
Questions/Feedback ?

More Related Content

PDF
Introduction to Apache Tajo: Data Warehouse for Big Data
PDF
Apache Tajo on Swift: Bringing SQL to the OpenStack World
PDF
Using Ceph for Large Hadron Collider Data
PDF
SANSA ISWC 2017 Talk
PPTX
Apache Cassandra Lunch #54: Machine Learning with Spark + Cassandra Part 2
PDF
Introduction to Apache Tajo: Future of Data Warehouse
PDF
Openstack For Beginners
PDF
Mongo nyc nyt + mongodb
Introduction to Apache Tajo: Data Warehouse for Big Data
Apache Tajo on Swift: Bringing SQL to the OpenStack World
Using Ceph for Large Hadron Collider Data
SANSA ISWC 2017 Talk
Apache Cassandra Lunch #54: Machine Learning with Spark + Cassandra Part 2
Introduction to Apache Tajo: Future of Data Warehouse
Openstack For Beginners
Mongo nyc nyt + mongodb

What's hot (20)

PPTX
Time Series Data in a Time Series World
PPTX
Cassandra Lunch #59 Functions in Cassandra
PPTX
Need for Time series Database
PDF
Ceph Day Chicago: Using Ceph for Large Hadron Collider Data
PPTX
Apache Cassandra Lunch #67: Moving Data from Cassandra to Datastax Astra
PDF
Cncf meetup-rook
PDF
Cncf meetup-rook
PPTX
Apache cassandra
PDF
Improve Presto Architectural Decisions with Shadow Cache
PPTX
Comparing Orchestration
PDF
ManetoDB: Key/Value storage, BigData in Open Stack_Сергей Ковалев, Илья Свиридов
PPTX
Migration strategies for a mission critical cluster
PDF
Machine Learning & Data Science in the Age of the GPU: Smarter, Faster, Better
PDF
Geo exploration simplified with Elastic Maps
PPTX
Apache Cassandra Lunch #70: Basics of Apache Cassandra
PDF
Cassandra meetup slides - Oct 15 Santa Monica Coloft
PPTX
InfluxDb and Grafana fighting with data
PDF
Effectively deploying hadoop to the cloud
PDF
Tweaking perfomance on high-load projects_Думанский Дмитрий
PPTX
New Ceph capabilities and Reference Architectures
Time Series Data in a Time Series World
Cassandra Lunch #59 Functions in Cassandra
Need for Time series Database
Ceph Day Chicago: Using Ceph for Large Hadron Collider Data
Apache Cassandra Lunch #67: Moving Data from Cassandra to Datastax Astra
Cncf meetup-rook
Cncf meetup-rook
Apache cassandra
Improve Presto Architectural Decisions with Shadow Cache
Comparing Orchestration
ManetoDB: Key/Value storage, BigData in Open Stack_Сергей Ковалев, Илья Свиридов
Migration strategies for a mission critical cluster
Machine Learning & Data Science in the Age of the GPU: Smarter, Faster, Better
Geo exploration simplified with Elastic Maps
Apache Cassandra Lunch #70: Basics of Apache Cassandra
Cassandra meetup slides - Oct 15 Santa Monica Coloft
InfluxDb and Grafana fighting with data
Effectively deploying hadoop to the cloud
Tweaking perfomance on high-load projects_Думанский Дмитрий
New Ceph capabilities and Reference Architectures
Ad

Viewers also liked (8)

PDF
Mitakalab in Hongo
POTX
World War II
PDF
Snapeshare
PDF
Art Academy logo
PPTX
Waste expo business writing
PPTX
A journey to hawaii
PPTX
Resume Writing Workshop
PDF
Guts & OpenStack migration
Mitakalab in Hongo
World War II
Snapeshare
Art Academy logo
Waste expo business writing
A journey to hawaii
Resume Writing Workshop
Guts & OpenStack migration
Ad

Similar to Open stack @ iiit hyderabad (20)

PDF
Hadoop on OpenStack - Sahara @DevNation 2014
PDF
Rook: Storage for Containers in Containers – data://disrupted® 2020
PDF
Ceph Research at UCSC
PDF
Hadoop and OpenStack - Hadoop Summit San Jose 2014
PDF
Hadoop and OpenStack
PDF
Business Intelligent
PDF
Sparkler - Spark Crawler
PDF
Introduction into Ceph storage for OpenStack
PDF
Ippevent : openshift Introduction
ODP
Ceph Day NYC: Building Tomorrow's Ceph
ODP
Ceph Day Santa Clara: Keynote: Building Tomorrow's Ceph
ODP
Ceph Day Santa Clara: Ceph and Apache CloudStack
PPTX
AWS Big Data Demystified #1: Big data architecture lessons learned
PDF
Ceph Day New York: Ceph: one decade in
PDF
Ceph and Apache CloudStack
PDF
Ceph Day Seoul - Ceph: a decade in the making and still going strong
PPTX
Hadoop at LinkedIn
PDF
Terabyte-scale image similarity search: experience and best practice
PPTX
Introduction to pyspark for civil engineers
PPTX
The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...
Hadoop on OpenStack - Sahara @DevNation 2014
Rook: Storage for Containers in Containers – data://disrupted® 2020
Ceph Research at UCSC
Hadoop and OpenStack - Hadoop Summit San Jose 2014
Hadoop and OpenStack
Business Intelligent
Sparkler - Spark Crawler
Introduction into Ceph storage for OpenStack
Ippevent : openshift Introduction
Ceph Day NYC: Building Tomorrow's Ceph
Ceph Day Santa Clara: Keynote: Building Tomorrow's Ceph
Ceph Day Santa Clara: Ceph and Apache CloudStack
AWS Big Data Demystified #1: Big data architecture lessons learned
Ceph Day New York: Ceph: one decade in
Ceph and Apache CloudStack
Ceph Day Seoul - Ceph: a decade in the making and still going strong
Hadoop at LinkedIn
Terabyte-scale image similarity search: experience and best practice
Introduction to pyspark for civil engineers
The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...

More from openstackindia (20)

PDF
Copr HD OpenStack Day India
PDF
OPNFV & OpenStack
PDF
Your first patch to OpenStack
PPTX
OpenStack Neutron Behind The Senes
PDF
OpenStack Storage Buddy Ceph
PDF
OpenStack Watcher
PPTX
State of Containers in OpenStack
PPTX
The OpenStack Contribution Workflow
PPTX
Introduction to Cinder
PDF
OpenStack NFV Edge computing for IOT microservices
PDF
OpenStack Tempest and REST API testing
PDF
Deploying openstack using ansible
PDF
Ceph openstack-jun-2015-meetup
PPTX
Role of sdn controllers in open stack
PDF
Outreachy with-openstack-zaqar
PPTX
Enhancing OpenStack FWaaS for real world application
PDF
Openstack devops challenges
PPTX
Demistifying open stack storage
PPTX
OpenStack Heat
PPTX
Why open stack database as a service offerings are doomed
Copr HD OpenStack Day India
OPNFV & OpenStack
Your first patch to OpenStack
OpenStack Neutron Behind The Senes
OpenStack Storage Buddy Ceph
OpenStack Watcher
State of Containers in OpenStack
The OpenStack Contribution Workflow
Introduction to Cinder
OpenStack NFV Edge computing for IOT microservices
OpenStack Tempest and REST API testing
Deploying openstack using ansible
Ceph openstack-jun-2015-meetup
Role of sdn controllers in open stack
Outreachy with-openstack-zaqar
Enhancing OpenStack FWaaS for real world application
Openstack devops challenges
Demistifying open stack storage
OpenStack Heat
Why open stack database as a service offerings are doomed

Recently uploaded (20)

PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Electronic commerce courselecture one. Pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Encapsulation theory and applications.pdf
PPTX
Tartificialntelligence_presentation.pptx
PPTX
1. Introduction to Computer Programming.pptx
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
A comparative analysis of optical character recognition models for extracting...
PPTX
A Presentation on Artificial Intelligence
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Electronic commerce courselecture one. Pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
MIND Revenue Release Quarter 2 2025 Press Release
Digital-Transformation-Roadmap-for-Companies.pptx
The Rise and Fall of 3GPP – Time for a Sabbatical?
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Network Security Unit 5.pdf for BCA BBA.
Programs and apps: productivity, graphics, security and other tools
Encapsulation theory and applications.pdf
Tartificialntelligence_presentation.pptx
1. Introduction to Computer Programming.pptx
Building Integrated photovoltaic BIPV_UPV.pdf
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Dropbox Q2 2025 Financial Results & Investor Presentation
A comparative analysis of optical character recognition models for extracting...
A Presentation on Artificial Intelligence
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Per capita expenditure prediction using model stacking based on satellite ima...

Open stack @ iiit hyderabad