SlideShare a Scribd company logo
Open Cloud Engine
Open Source Big Data Platform
Flamingo Project ์†Œ๊ฐœ ๋ฐ ํ™œ์šฉ
Open Cloud Engine
Flamingo Project Leader
๊น€๋ณ‘๊ณค
(ceo@cloudine.co.kr)
2014.04.02 v0.9
๋น…๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์ด๋ž€ ๋ฌด์—‡์ธ๊ฐ€?
๋น… ๋ฐ์ดํ„ฐ ์ฑ…์ž„์ž์—๊ฒŒ ๋“ฃ๋Š” ํ”ํ•œ ์งˆ๋ฌธ
โ€ขโ€ฏ ๋น… ๋ฐ์ดํ„ฐ๊ฐ€ ๊ธฐ์กด์˜ DW๋ž‘ ์ฐจ์ด๊ฐ€ ๋ญ๊ฐ€ ์žˆ๋Š”์ง€ ๋ชจ๋ฅด๊ฒ ์Šต๋‹ˆ๋‹ค.
โ€ขโ€ฏ ๋‹จ์œ„ ๋ฐ์ดํ„ฐ๋งŒ ๋ด์„œ๋Š” ํฐ ๋ฐ์ดํ„ฐ๊ฐ€ ์—†์Šต๋‹ˆ๋‹ค. ์‚ฌ์—…์˜ ํƒ€๋‹น์„ฑ์„ ๋งŒ
๋“ค์ˆ˜๊ฐ€ ์—†์Šต๋‹ˆ๋‹ค. ์–ด๋–ป๊ฒŒ ํ•ด์•ผ ํ•˜๋‚˜์š”?
โ€ขโ€ฏ A๋ผ๋Š” ๋ฐ์ดํ„ฐ๊ฐ€ ์žˆ๋Š”๋ฐ ๊ทธ๊ฒƒ์œผ๋กœ ๋ญ˜ ํ•ด์•ผํ• ๊นŒ์š”?
โ€ขโ€ฏ ๋‹ค๋ฅธ ํšŒ์‚ฌ๋Š” ๋ญ ํ•œ๋‹ต๋‹ˆ๊นŒ? ํ˜น์‹œ ๋™์ข…์—…๊ณ„ ๋น„์Šทํ•œ ์‚ฌ๋ก€๊ฐ€ ์žˆ๋‚˜์š”?
โ€ขโ€ฏ ๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์„ ๋งŒ๋“ค๋ผ๋Š”๋ฐ ์ด๋†ˆ์ด ๋ญ๋ฅผ ํ•˜๋Š” ๋†ˆ์ธ์ง€ ๋ชจ๋ฅด๊ฒ 
์Šต๋‹ˆ๋‹ค.
๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์˜ ์—ญํ• ์— ๋Œ€ํ•œ ๊ณ ๋ฏผ
โ€ขโ€ฏ ๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์—์„œ ํ•˜๊ณ ์ž ํ•˜๋Š” ์ฃผ์š” ์—…๋ฌด๋Š” ๋ฌด์—‡์ธ๊ฐ€?
โ€ขโ€ฏ ๋ฐ์ดํ„ฐ ๋งˆ์ด๋‹, ํ†ต๊ณ„, ๋กœ๊ทธ ๊ด€๋ฆฌ(์ˆ˜์ง‘, ์ „์ฒ˜๋ฆฌ, โ€ฆ)
โ€ขโ€ฏ ๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์—์„œ ๋ˆ„๊ฐ€ ๋ฌด์Šจ ์ผ์„ ํ•˜๋Š”๊ฐ€?
โ€ขโ€ฏ ์‚ฌ์šฉ์ž์— ๋”ฐ๋ผ์„œ ํ”Œ๋žซํผ์˜ ๊ธฐ๋Šฅ์ด ์„œ๋กœ ๋‹ค๋ฅผ ์ˆ˜ ์žˆ๋‹ค.
โ€ขโ€ฏ ์šด์˜์ž๋Š” ๋Œ€๋ถ€๋ถ„ ๊ฐœ๋ฐœ์ž ์ถœ์‹ ์ด๊ธฐ ๋•Œ๋ฌธ์— ์‹œ์Šคํ…œ ๊ด€๋ฆฌ ๋ฐ ๋กœ๊ทธ ๊ด€๋ฆฌ์— ์ดˆ์ 
โ€ขโ€ฏ ์‚ฌ์šฉ์ž๊ฐ€ ๋ถ„์„๊ฐ€ ์ถœ์‹ ์ธ ๊ฒฝ์šฐ ๋ฐ์ดํ„ฐ ๋ถ„์„์„ ์œ„ํ•œ ํ™˜๊ฒฝ์˜ ์„ฑ์ˆ™๋„๊ฐ€ ์ดˆ์ 
โ€ขโ€ฏ ๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์„ ์‚ฌ์šฉํ•˜๋Š” ์‚ฌ์šฉ์ž์˜ ์ˆ˜๋Š”?
โ€ขโ€ฏ ์‚ฌ์šฉ์ž๊ฐ€ ๋งŽ๋‹ค๋ฉด ํ”Œ๋žซํผ์˜ ๊ธฐ๋Šฅ์„ฑ๊ณผ ์ธํ”„๋ผ์˜ ์ ‘๊ทผ์„ฑ์ด ์ค‘์š”
โ€ขโ€ฏ ํ”Œ๋žซํผ์ด ๋ฐ์ดํ„ฐ๋ฅผ ๋‹ค๋ฃจ๋Š” ํŠน์„ฑ ๋•Œ๋ฌธ์— ๋ณด์•ˆ์— ์ทจ์•ฝํ•  ์ˆ˜ ์žˆ๊ณ  Hadoop์€ ์‹ค
์ œ๋กœ ์ทจ์•ฝํ•จ
โ€ขโ€ฏ ๋‚˜๋Š” ์šด์˜์ž? ๊ธฐํš์ž? ๊ฐœ๋ฐœ์ž? ๋ถ„์„๊ฐ€?
โ€ขโ€ฏ ์ฑ…์ž„์ž์˜ ์—ญํ• ์— ๋”ฐ๋ผ์„œ ํ”Œ๋žซํผ์˜ ๊ธฐ๋Šฅ๋„ ๋‹ค๋ฅด๊ฒŒ ์ •์˜ํ•œ๋‹ค.
๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์ด ์ œ๊ณตํ•ด์•ผ ํ•˜๋Š” ๊ฒƒ
SOFTWARE STACK
๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์ด ์ œ๊ณตํ•ด์•ผ ํ•˜๋Š” ๊ฒƒ
INFRA MANAGEMENT
MONITORING
๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์ด ์ œ๊ณตํ•ด์•ผ ํ•˜๋Š” ๊ฒƒ
WORKFLOW
๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์ด ์ œ๊ณตํ•ด์•ผ ํ•˜๋Š” ๊ฒƒ
๋ถ„์„ ๋ฐ ์‹œ๊ฐํ™” ํ™˜๊ฒฝ
๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์ด ์ œ๊ณตํ•ด์•ผ ํ•˜๋Š” ๊ฒƒ
DASHBOARD
๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์ด ์ œ๊ณตํ•ด์•ผ ํ•˜๋Š” ๊ฒƒ
SECURITY
โ€ขโ€ฏ ACCESS
โ€ขโ€ฏ AUTHENTICATION
โ€ขโ€ฏ AUTHORIZATION
โ€ขโ€ฏ ENCRYPTION
โ€ขโ€ฏ AUDITING
โ€ขโ€ฏ POLICY
๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์ด ์ œ๊ณตํ•ด์•ผ ํ•˜๋Š” ๊ฒƒ
โ€ขโ€ฏ ๋ฐฐ์น˜ ์ž‘์—… ๊ด€๋ฆฌ์™€ ์ž‘์—… ๋ชจ๋‹ˆํ„ฐ๋ง
โ€ขโ€ฏ ๋ณ‘๋ ฌ ๋ถ„์„ ํ”„๋กœ๊ทธ๋žจ
โ€ขโ€ฏ ์‚ฌ์šฉ์ž์˜ ํ–‰์œ„์— ๋Œ€ํ•œ ๋ชจ๋‹ˆํ„ฐ๋ง
โ€ขโ€ฏ ๋ฆฌ์†Œ์Šค์— ๋Œ€ํ•œ ๊ฐ์ข… ์ ‘๊ทผ ํ†ต์ œ ์ •์ฑ… ๋ฐ ์‹œ์Šคํ…œ
โ€ขโ€ฏ ์ธํ”„๋ผ์˜ ์ ‘๊ทผ์„ฑ ํ–ฅ์ƒ์„ ์œ„ํ•œ ๋‹ค์–‘ํ•œ ๊ธฐ๋Šฅ๋“คโ€ฆ
Flamingo Project In Open Cloud Engine
โ€ขโ€ฏ ์›น ๊ธฐ์ˆ ์„ ํ™œ์šฉํ•˜์—ฌ ๋น… ๋ฐ์ดํ„ฐ ์ธํ”„๋ผ ๋ฐ ๋ฐ์ดํ„ฐ๋ฅผ ํŽธ๋ฆฌํ•˜๊ฒŒ ์‚ฌ์šฉ
ํ•˜๋„๋ก ํ•œ๋‹ค.
โ€ขโ€ฏ ์‚ฌ์šฉ์ž๊ฐ€ ๋ฐ์ดํ„ฐ๋ฅผ ์ž˜ ํ™œ์šฉํ•  ์ˆ˜ ์žˆ๋„๋ก ํ•œ๋‹ค.
โ€ขโ€ฏ ํ•˜๋‚˜์˜ ํ™”๋ฉด์—์„œ ์ž์œ ๋กญ๊ฒŒ ๋‹ค์–‘ํ•œ ์ž‘์—…์„ ํ•  ์ˆ˜ ์žˆ๋Š” ์ž‘์—… ๊ณต๊ฐ„์„
์ œ๊ณตํ•œ๋‹ค.
โ€ขโ€ฏ ๋‹ค์–‘ํ•œ ๋ถ„์„ ๋ฐ ์ฒ˜๋ฆฌ MapReduce๋ฅผ ์‰ฝ๊ฒŒ ์žฌํ™œ์šฉ ํ•  ์ˆ˜ ์žˆ๋„๋ก
ํ•œ๋‹ค.
โ€ขโ€ฏ ์˜คํ”ˆ์†Œ์Šค ๊ธฐ๋ฐ˜์œผ๋กœ ๋ชจ๋“  ์‹œ์Šคํ…œ์„ ์ œ๋Œ€๋กœ ๊ฐ–์ถ”๊ณ  ์ง„ํ–‰ํ•œ๋‹ค.
โ€ขโ€ฏ ๋‚จ์˜ ๊ฒƒ์— ์˜์กดํ•˜์ง€ ์•Š๊ณ  ์ง์ ‘ ๋‹ค ๋งŒ๋“ ๋‹ค.
โ€ขโ€ฏ ํ˜„์žฅ์˜ ์—…๋ฌด๋ฅผ ์ค‘์‹ฌ์œผ๋กœ ์„ค๊ณ„ํ•œ๋‹ค.
โ€ขโ€ฏ ๋‹ค๊ตญ์–ด ์ง€์›์„ ํ†ตํ•ด ๋‹ค์–‘ํ•œ ์‚ฌ๋žŒ๋“ค์ด ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ๋„๋ก ํ•œ๋‹ค.
โ€ขโ€ฏ Hadoop EcoSystem์„ ์ž˜ ์ง€์›ํ•œ๋‹ค.
Browser	
 ย 
๋””์ž์ด๋„ˆ	
 ย  Search	
 ย 
ํ˜•ํƒœ์†Œ
ย 
๋ถ„์„
ย 
๊ทธ๋ž˜ํ”„
ย 
๋ถ„์„
ย 
์‚ฌ์šฉ์ž๋ณ„
ย ํ‰
๊ฐ€
ย 
๋ฆฌ๋”
ย ์„ 
์ถœ
ย 
๋กœ๊ทธ
ย ๋ฐ์ดํ„ฐ
ย 
๋ฐ์ดํ„ฐ
ย ๋ถ„์„๊ฐ€
ย 
์„œ๋น„์Šค
ย ๊ธฐํš์ž
ย 
๋ฐ์ดํ„ฐ
ย ๋ถ„์„๊ฐ€
ย 
Browser	
 ย 
์ธํฌ๋ฉ”์ด์…˜ ์นดํƒˆ๋กœ๊ทธ	
 ย  Search	
 ย 
์ธํฌ๋ฉ”์ด์…˜ ์œ ํ˜•	
 ย  ๋ณด์•ˆ๋“ฑ๊ธ‰	
 ย  ์ƒ์„ฑ์ฃผ๊ธฐ	
 ย  ํ˜•์‹	
 ย 
์‚ฌ์šฉ์ž ์นœ๋ฐ€๋„	
 ย  1	
 ย  ๋งค์ผ ์ƒˆ๋ฒฝ2์‹œ	
 ย  XML	
 ย 
์•„์ดํ…œ ์ถ”์ฒœ	
 ย  2	
 ย  ๋งค์ผ ์ƒˆ๋ฒฝ 1์‹œ	
 ย  JSON	
 ย 
๊ตฌ๋งค ์„ฑํ–ฅ	
 ย  3	
 ย  ๋งค์ผ ์ €๋… 8์‹œ	
 ย  XML/JSON	
 ย 
์˜คํ”ผ๋‹ˆ์–ธ ๋ฆฌ๋” ์ ์ˆ˜	
 ย  2	
 ย  ๋งค์ผ ์˜ค์ „ 10
์‹œ	
 ย 
XML/JSON	
 ย 
๋ฐ์ดํ„ฐ
ย ์ด์šฉ์ž
ย 
์‹œ์Šคํ…œ
ย 
์˜คํ”ผ๋‹ˆ์–ธ
ย ๋ฆฌ๋”
ย ์ ์ˆ˜
ย 
Open
ย 
API
ย 
๋ฐ์ดํ„ฐ
ย ์‹œ๊ฐํ™”
๋ฅผ
ย ์œ„ํ•œ
ย Chart
ย 
์›Œํฌํ”Œ๋กœ์šฐ
ย ๋””์ž์ธ
ย 
์ˆ˜์ง‘
ย 
ย 
๋ฐ์ดํ„ฐ
ย ์ด์šฉ์ž
ย 
์„œ๋น„์Šค
ย 
ย 
์š”์ฒญ

More Related Content

PDF
OpenSource Big Data Platform - Flamingo ์†Œ๊ฐœ์™€ ํ™œ์šฉ
PDF
(์ฃผ)ํด๋ผ์šฐ๋‹ค์ธ & Flamingo ์†Œ๊ฐœ์„œ
PDF
Flamingo 1.2 ๋ฆด๋ฆฌ์ฆˆ์˜ ์ง€์› ๊ธฐ๋Šฅ ์ •๋ฆฌ
PDF
OpenSource Big Data Platform : Flamingo Project
PDF
Flamingo (FEA) Spark Designer
PPTX
2010 CASCON - Towards a integrated network of data and services for the life ...
PDF
IVI Kirov 27 June 2007 Presentation For Distribution
KEY
GeekMeet Intro - Filip C.T.E.
OpenSource Big Data Platform - Flamingo ์†Œ๊ฐœ์™€ ํ™œ์šฉ
(์ฃผ)ํด๋ผ์šฐ๋‹ค์ธ & Flamingo ์†Œ๊ฐœ์„œ
Flamingo 1.2 ๋ฆด๋ฆฌ์ฆˆ์˜ ์ง€์› ๊ธฐ๋Šฅ ์ •๋ฆฌ
OpenSource Big Data Platform : Flamingo Project
Flamingo (FEA) Spark Designer
2010 CASCON - Towards a integrated network of data and services for the life ...
IVI Kirov 27 June 2007 Presentation For Distribution
GeekMeet Intro - Filip C.T.E.

Viewers also liked (20)

PPT
Introduccion Al Movimiento Del Software Libre
PPT
Ds Consumer Samples
PPT
Deep Oceans
PPT
DS Furniture Samples
PPT
Gladneyfinal
PPS
ๆˆ‘ๅฐฑๆ˜ฏๅ–œๆญก้€™ๆจฃ็š„ไฝ 
ย 
PPT
Hopf anemia09
PPT
MNCs Presentation
ย 
PDF
EuRoSaIn presentazione
PDF
New Libertarian Manifesto
PPT
Part 4: New HIV Treatment Pipeline
ย 
PDF
Steria Recruitment Presentation
PPT
Multicultural health standards around the world
PDF
Line Upgrade Deferral Scenarios for Distributed Renewable Energy Resources
PPT
Social media analysis for toronto 2010 mayoral election
PDF
iPhone and Appstore
ย 
PDF
Ishii presentation
PPT
Health-e-cITi NJ
PDF
Cda esm waste oil disposal application part 2
PDF
How to write tech posts & talks
Introduccion Al Movimiento Del Software Libre
Ds Consumer Samples
Deep Oceans
DS Furniture Samples
Gladneyfinal
ๆˆ‘ๅฐฑๆ˜ฏๅ–œๆญก้€™ๆจฃ็š„ไฝ 
ย 
Hopf anemia09
MNCs Presentation
ย 
EuRoSaIn presentazione
New Libertarian Manifesto
Part 4: New HIV Treatment Pipeline
ย 
Steria Recruitment Presentation
Multicultural health standards around the world
Line Upgrade Deferral Scenarios for Distributed Renewable Energy Resources
Social media analysis for toronto 2010 mayoral election
iPhone and Appstore
ย 
Ishii presentation
Health-e-cITi NJ
Cda esm waste oil disposal application part 2
How to write tech posts & talks
Ad

Similar to OpenSource Big Data Platform - Flamingo v7 (20)

PDF
์ œ14ํšŒ JCO Presentation - Build Your Big Data Platform
PDF
201210 ๊ทธ๋ฃจํ„ฐ ๋น…๋ฐ์ดํ„ฐ_ํ”Œ๋žซํผ_์•„ํ‚คํ…์ณ_๋ฐ_์†”๋ฃจ์…˜_์†Œ๊ฐœ
ย 
PPTX
Open standard open cloud engine (3)
PPTX
OCE - Cno 2014 private sector oriented open paas oce
PDF
๋น…๋ฐ์ดํ„ฐ ๊ธฐ์ˆ  ํ˜„ํ™ฉ๊ณผ ์‹œ์žฅ ์ „๋ง(2014)
PPTX
DeView2013 Big Data Platform Architecture with Hadoop - Hyeong-jun Kim
ย 
PDF
ํƒœ๋ธ”๋กœ ์†Œํ”„ํŠธ์›จ์–ด(Tableau Software) ์†Œ๊ฐœ
ย 
PDF
HTML5/JSON ์„ ์ด์šฉํ•ด ๋ฒ”์šฉ 2D ๋งต์—๋””ํ„ฐ ์ œ์ž‘ํ•˜๊ธฐ
PDF
Sencha ExtJS๋ฅผ ํ™œ์šฉํ•œ Big Data Platform ๊ฐœ๋ฐœ ์‚ฌ๋ก€
PDF
GRUTER๊ฐ€ ๋“ค๋ ค์ฃผ๋Š” Big Data Platform ๊ตฌ์ถ• ์ „๋žต๊ณผ ์ ์šฉ ์‚ฌ๋ก€: GRUTER์˜ ๋น…๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ ๋ฐ ์ „๋žต ์†Œ๊ฐœ
ย 
PPTX
[Ankus Open Source Conference 2013] Introduction to ankus integration tool (f...
PPTX
Node.js์—์„œ ๊ณต๊ณตAPI๋ฅผ ํ™œ์šฉํ•ด์„œ ๊ฐœ๋ฐœํ•˜๊ธฐ
PPTX
4. แ„ƒแ…ขแ„‹แ…ญแ†ผแ„…แ…ฃแ†ผ แ„‹แ…กแ„แ…ตแ„แ…ฆแ†จแ„Žแ…ง แ„‰แ…ฅแ†ฏแ„€แ…จ แ„‘แ…ขแ„แ…ฅแ†ซ
PDF
์„œ๋ฒ„ํ•™๊ฐœ๋ก (๋ฐฑ์—”๋“œ ์„œ๋ฒ„ ๊ฐœ๋ฐœ์ž๋ฅผ ์œ„ํ•œ)
PDF
์—”ํ„ฐํ”„๋ผ์ด์ฆˆ ํ™˜๊ฒฝ์˜ ๋ฐ์ดํ„ฐ๋ชจ๋ธ ๊ด€๋ฆฌ ๋ฐฉ์•ˆ By ์— ๋ฐ”์นด๋ฐ๋กœ ๋ฐ๋ธŒ๊ธฐ์–ด 2015.12.03
ย 
PPTX
Big Data platform์„ ์œ„ํ•œ Sencha Ext JS ์‚ฌ๋ก€.
PDF
ํ™์„ฑ์šฐ, ๊ฒŒ์ž„ ์„œ๋ฒ„์˜ ๋ชฉ์ฐจ - ์‹œ์ž‘๋ถ€ํ„ฐ ์ถœ์‹œ๊นŒ์ง€, NDC2019
PDF
[141]์ง€๋‚œ 1๋…„๊ฐ„์˜ ์›จ์ผ ๋ธŒ๋ผ์šฐ์ €์™€ ๊ทธ ๋ฏธ๋ž˜ (๋ถ€์ œ: ์ œํ’ˆ ๋งค๋‹ˆ์ €๊ฐ€ ๋“ค๋ ค์ฃผ๋Š” ์ƒ์ƒํ•œ ๊ธฐ์ˆ /์ œํ’ˆ ์ด์•ผ๊ธฐ)
PDF
HTML5 ์ŠคํŽ™ ์†Œ๊ฐœ
PPTX
์‹ ๊ทœ ํ˜‘์—…๋„๊ตฌ ์‚ฌ์šฉ์ž ๊ต์œก(๊ณตํ†ต ๋น„๊ฐœ๋ฐœ์ž)
์ œ14ํšŒ JCO Presentation - Build Your Big Data Platform
201210 ๊ทธ๋ฃจํ„ฐ ๋น…๋ฐ์ดํ„ฐ_ํ”Œ๋žซํผ_์•„ํ‚คํ…์ณ_๋ฐ_์†”๋ฃจ์…˜_์†Œ๊ฐœ
ย 
Open standard open cloud engine (3)
OCE - Cno 2014 private sector oriented open paas oce
๋น…๋ฐ์ดํ„ฐ ๊ธฐ์ˆ  ํ˜„ํ™ฉ๊ณผ ์‹œ์žฅ ์ „๋ง(2014)
DeView2013 Big Data Platform Architecture with Hadoop - Hyeong-jun Kim
ย 
ํƒœ๋ธ”๋กœ ์†Œํ”„ํŠธ์›จ์–ด(Tableau Software) ์†Œ๊ฐœ
ย 
HTML5/JSON ์„ ์ด์šฉํ•ด ๋ฒ”์šฉ 2D ๋งต์—๋””ํ„ฐ ์ œ์ž‘ํ•˜๊ธฐ
Sencha ExtJS๋ฅผ ํ™œ์šฉํ•œ Big Data Platform ๊ฐœ๋ฐœ ์‚ฌ๋ก€
GRUTER๊ฐ€ ๋“ค๋ ค์ฃผ๋Š” Big Data Platform ๊ตฌ์ถ• ์ „๋žต๊ณผ ์ ์šฉ ์‚ฌ๋ก€: GRUTER์˜ ๋น…๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ ๋ฐ ์ „๋žต ์†Œ๊ฐœ
ย 
[Ankus Open Source Conference 2013] Introduction to ankus integration tool (f...
Node.js์—์„œ ๊ณต๊ณตAPI๋ฅผ ํ™œ์šฉํ•ด์„œ ๊ฐœ๋ฐœํ•˜๊ธฐ
4. แ„ƒแ…ขแ„‹แ…ญแ†ผแ„…แ…ฃแ†ผ แ„‹แ…กแ„แ…ตแ„แ…ฆแ†จแ„Žแ…ง แ„‰แ…ฅแ†ฏแ„€แ…จ แ„‘แ…ขแ„แ…ฅแ†ซ
์„œ๋ฒ„ํ•™๊ฐœ๋ก (๋ฐฑ์—”๋“œ ์„œ๋ฒ„ ๊ฐœ๋ฐœ์ž๋ฅผ ์œ„ํ•œ)
์—”ํ„ฐํ”„๋ผ์ด์ฆˆ ํ™˜๊ฒฝ์˜ ๋ฐ์ดํ„ฐ๋ชจ๋ธ ๊ด€๋ฆฌ ๋ฐฉ์•ˆ By ์— ๋ฐ”์นด๋ฐ๋กœ ๋ฐ๋ธŒ๊ธฐ์–ด 2015.12.03
ย 
Big Data platform์„ ์œ„ํ•œ Sencha Ext JS ์‚ฌ๋ก€.
ํ™์„ฑ์šฐ, ๊ฒŒ์ž„ ์„œ๋ฒ„์˜ ๋ชฉ์ฐจ - ์‹œ์ž‘๋ถ€ํ„ฐ ์ถœ์‹œ๊นŒ์ง€, NDC2019
[141]์ง€๋‚œ 1๋…„๊ฐ„์˜ ์›จ์ผ ๋ธŒ๋ผ์šฐ์ €์™€ ๊ทธ ๋ฏธ๋ž˜ (๋ถ€์ œ: ์ œํ’ˆ ๋งค๋‹ˆ์ €๊ฐ€ ๋“ค๋ ค์ฃผ๋Š” ์ƒ์ƒํ•œ ๊ธฐ์ˆ /์ œํ’ˆ ์ด์•ผ๊ธฐ)
HTML5 ์ŠคํŽ™ ์†Œ๊ฐœ
์‹ ๊ทœ ํ˜‘์—…๋„๊ตฌ ์‚ฌ์šฉ์ž ๊ต์œก(๊ณตํ†ต ๋น„๊ฐœ๋ฐœ์ž)
Ad

OpenSource Big Data Platform - Flamingo v7

  • 1. Open Cloud Engine Open Source Big Data Platform Flamingo Project ์†Œ๊ฐœ ๋ฐ ํ™œ์šฉ Open Cloud Engine Flamingo Project Leader ๊น€๋ณ‘๊ณค (ceo@cloudine.co.kr) 2014.04.02 v0.9
  • 3. ๋น… ๋ฐ์ดํ„ฐ ์ฑ…์ž„์ž์—๊ฒŒ ๋“ฃ๋Š” ํ”ํ•œ ์งˆ๋ฌธ โ€ขโ€ฏ ๋น… ๋ฐ์ดํ„ฐ๊ฐ€ ๊ธฐ์กด์˜ DW๋ž‘ ์ฐจ์ด๊ฐ€ ๋ญ๊ฐ€ ์žˆ๋Š”์ง€ ๋ชจ๋ฅด๊ฒ ์Šต๋‹ˆ๋‹ค. โ€ขโ€ฏ ๋‹จ์œ„ ๋ฐ์ดํ„ฐ๋งŒ ๋ด์„œ๋Š” ํฐ ๋ฐ์ดํ„ฐ๊ฐ€ ์—†์Šต๋‹ˆ๋‹ค. ์‚ฌ์—…์˜ ํƒ€๋‹น์„ฑ์„ ๋งŒ ๋“ค์ˆ˜๊ฐ€ ์—†์Šต๋‹ˆ๋‹ค. ์–ด๋–ป๊ฒŒ ํ•ด์•ผ ํ•˜๋‚˜์š”? โ€ขโ€ฏ A๋ผ๋Š” ๋ฐ์ดํ„ฐ๊ฐ€ ์žˆ๋Š”๋ฐ ๊ทธ๊ฒƒ์œผ๋กœ ๋ญ˜ ํ•ด์•ผํ• ๊นŒ์š”? โ€ขโ€ฏ ๋‹ค๋ฅธ ํšŒ์‚ฌ๋Š” ๋ญ ํ•œ๋‹ต๋‹ˆ๊นŒ? ํ˜น์‹œ ๋™์ข…์—…๊ณ„ ๋น„์Šทํ•œ ์‚ฌ๋ก€๊ฐ€ ์žˆ๋‚˜์š”? โ€ขโ€ฏ ๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์„ ๋งŒ๋“ค๋ผ๋Š”๋ฐ ์ด๋†ˆ์ด ๋ญ๋ฅผ ํ•˜๋Š” ๋†ˆ์ธ์ง€ ๋ชจ๋ฅด๊ฒ  ์Šต๋‹ˆ๋‹ค.
  • 4. ๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์˜ ์—ญํ• ์— ๋Œ€ํ•œ ๊ณ ๋ฏผ โ€ขโ€ฏ ๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์—์„œ ํ•˜๊ณ ์ž ํ•˜๋Š” ์ฃผ์š” ์—…๋ฌด๋Š” ๋ฌด์—‡์ธ๊ฐ€? โ€ขโ€ฏ ๋ฐ์ดํ„ฐ ๋งˆ์ด๋‹, ํ†ต๊ณ„, ๋กœ๊ทธ ๊ด€๋ฆฌ(์ˆ˜์ง‘, ์ „์ฒ˜๋ฆฌ, โ€ฆ) โ€ขโ€ฏ ๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์—์„œ ๋ˆ„๊ฐ€ ๋ฌด์Šจ ์ผ์„ ํ•˜๋Š”๊ฐ€? โ€ขโ€ฏ ์‚ฌ์šฉ์ž์— ๋”ฐ๋ผ์„œ ํ”Œ๋žซํผ์˜ ๊ธฐ๋Šฅ์ด ์„œ๋กœ ๋‹ค๋ฅผ ์ˆ˜ ์žˆ๋‹ค. โ€ขโ€ฏ ์šด์˜์ž๋Š” ๋Œ€๋ถ€๋ถ„ ๊ฐœ๋ฐœ์ž ์ถœ์‹ ์ด๊ธฐ ๋•Œ๋ฌธ์— ์‹œ์Šคํ…œ ๊ด€๋ฆฌ ๋ฐ ๋กœ๊ทธ ๊ด€๋ฆฌ์— ์ดˆ์  โ€ขโ€ฏ ์‚ฌ์šฉ์ž๊ฐ€ ๋ถ„์„๊ฐ€ ์ถœ์‹ ์ธ ๊ฒฝ์šฐ ๋ฐ์ดํ„ฐ ๋ถ„์„์„ ์œ„ํ•œ ํ™˜๊ฒฝ์˜ ์„ฑ์ˆ™๋„๊ฐ€ ์ดˆ์  โ€ขโ€ฏ ๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์„ ์‚ฌ์šฉํ•˜๋Š” ์‚ฌ์šฉ์ž์˜ ์ˆ˜๋Š”? โ€ขโ€ฏ ์‚ฌ์šฉ์ž๊ฐ€ ๋งŽ๋‹ค๋ฉด ํ”Œ๋žซํผ์˜ ๊ธฐ๋Šฅ์„ฑ๊ณผ ์ธํ”„๋ผ์˜ ์ ‘๊ทผ์„ฑ์ด ์ค‘์š” โ€ขโ€ฏ ํ”Œ๋žซํผ์ด ๋ฐ์ดํ„ฐ๋ฅผ ๋‹ค๋ฃจ๋Š” ํŠน์„ฑ ๋•Œ๋ฌธ์— ๋ณด์•ˆ์— ์ทจ์•ฝํ•  ์ˆ˜ ์žˆ๊ณ  Hadoop์€ ์‹ค ์ œ๋กœ ์ทจ์•ฝํ•จ โ€ขโ€ฏ ๋‚˜๋Š” ์šด์˜์ž? ๊ธฐํš์ž? ๊ฐœ๋ฐœ์ž? ๋ถ„์„๊ฐ€? โ€ขโ€ฏ ์ฑ…์ž„์ž์˜ ์—ญํ• ์— ๋”ฐ๋ผ์„œ ํ”Œ๋žซํผ์˜ ๊ธฐ๋Šฅ๋„ ๋‹ค๋ฅด๊ฒŒ ์ •์˜ํ•œ๋‹ค.
  • 5. ๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์ด ์ œ๊ณตํ•ด์•ผ ํ•˜๋Š” ๊ฒƒ SOFTWARE STACK
  • 6. ๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์ด ์ œ๊ณตํ•ด์•ผ ํ•˜๋Š” ๊ฒƒ INFRA MANAGEMENT MONITORING
  • 7. ๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์ด ์ œ๊ณตํ•ด์•ผ ํ•˜๋Š” ๊ฒƒ WORKFLOW
  • 8. ๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์ด ์ œ๊ณตํ•ด์•ผ ํ•˜๋Š” ๊ฒƒ ๋ถ„์„ ๋ฐ ์‹œ๊ฐํ™” ํ™˜๊ฒฝ
  • 9. ๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์ด ์ œ๊ณตํ•ด์•ผ ํ•˜๋Š” ๊ฒƒ DASHBOARD
  • 10. ๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์ด ์ œ๊ณตํ•ด์•ผ ํ•˜๋Š” ๊ฒƒ SECURITY โ€ขโ€ฏ ACCESS โ€ขโ€ฏ AUTHENTICATION โ€ขโ€ฏ AUTHORIZATION โ€ขโ€ฏ ENCRYPTION โ€ขโ€ฏ AUDITING โ€ขโ€ฏ POLICY
  • 11. ๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์ด ์ œ๊ณตํ•ด์•ผ ํ•˜๋Š” ๊ฒƒ โ€ขโ€ฏ ๋ฐฐ์น˜ ์ž‘์—… ๊ด€๋ฆฌ์™€ ์ž‘์—… ๋ชจ๋‹ˆํ„ฐ๋ง โ€ขโ€ฏ ๋ณ‘๋ ฌ ๋ถ„์„ ํ”„๋กœ๊ทธ๋žจ โ€ขโ€ฏ ์‚ฌ์šฉ์ž์˜ ํ–‰์œ„์— ๋Œ€ํ•œ ๋ชจ๋‹ˆํ„ฐ๋ง โ€ขโ€ฏ ๋ฆฌ์†Œ์Šค์— ๋Œ€ํ•œ ๊ฐ์ข… ์ ‘๊ทผ ํ†ต์ œ ์ •์ฑ… ๋ฐ ์‹œ์Šคํ…œ โ€ขโ€ฏ ์ธํ”„๋ผ์˜ ์ ‘๊ทผ์„ฑ ํ–ฅ์ƒ์„ ์œ„ํ•œ ๋‹ค์–‘ํ•œ ๊ธฐ๋Šฅ๋“คโ€ฆ
  • 12. Flamingo Project In Open Cloud Engine โ€ขโ€ฏ ์›น ๊ธฐ์ˆ ์„ ํ™œ์šฉํ•˜์—ฌ ๋น… ๋ฐ์ดํ„ฐ ์ธํ”„๋ผ ๋ฐ ๋ฐ์ดํ„ฐ๋ฅผ ํŽธ๋ฆฌํ•˜๊ฒŒ ์‚ฌ์šฉ ํ•˜๋„๋ก ํ•œ๋‹ค. โ€ขโ€ฏ ์‚ฌ์šฉ์ž๊ฐ€ ๋ฐ์ดํ„ฐ๋ฅผ ์ž˜ ํ™œ์šฉํ•  ์ˆ˜ ์žˆ๋„๋ก ํ•œ๋‹ค. โ€ขโ€ฏ ํ•˜๋‚˜์˜ ํ™”๋ฉด์—์„œ ์ž์œ ๋กญ๊ฒŒ ๋‹ค์–‘ํ•œ ์ž‘์—…์„ ํ•  ์ˆ˜ ์žˆ๋Š” ์ž‘์—… ๊ณต๊ฐ„์„ ์ œ๊ณตํ•œ๋‹ค. โ€ขโ€ฏ ๋‹ค์–‘ํ•œ ๋ถ„์„ ๋ฐ ์ฒ˜๋ฆฌ MapReduce๋ฅผ ์‰ฝ๊ฒŒ ์žฌํ™œ์šฉ ํ•  ์ˆ˜ ์žˆ๋„๋ก ํ•œ๋‹ค. โ€ขโ€ฏ ์˜คํ”ˆ์†Œ์Šค ๊ธฐ๋ฐ˜์œผ๋กœ ๋ชจ๋“  ์‹œ์Šคํ…œ์„ ์ œ๋Œ€๋กœ ๊ฐ–์ถ”๊ณ  ์ง„ํ–‰ํ•œ๋‹ค. โ€ขโ€ฏ ๋‚จ์˜ ๊ฒƒ์— ์˜์กดํ•˜์ง€ ์•Š๊ณ  ์ง์ ‘ ๋‹ค ๋งŒ๋“ ๋‹ค. โ€ขโ€ฏ ํ˜„์žฅ์˜ ์—…๋ฌด๋ฅผ ์ค‘์‹ฌ์œผ๋กœ ์„ค๊ณ„ํ•œ๋‹ค. โ€ขโ€ฏ ๋‹ค๊ตญ์–ด ์ง€์›์„ ํ†ตํ•ด ๋‹ค์–‘ํ•œ ์‚ฌ๋žŒ๋“ค์ด ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ๋„๋ก ํ•œ๋‹ค. โ€ขโ€ฏ Hadoop EcoSystem์„ ์ž˜ ์ง€์›ํ•œ๋‹ค.
  • 13. Browser ย  ๋””์ž์ด๋„ˆ ย  Search ย  ํ˜•ํƒœ์†Œ
  • 29. ย  Browser ย  ์ธํฌ๋ฉ”์ด์…˜ ์นดํƒˆ๋กœ๊ทธ ย  Search ย  ์ธํฌ๋ฉ”์ด์…˜ ์œ ํ˜• ย  ๋ณด์•ˆ๋“ฑ๊ธ‰ ย  ์ƒ์„ฑ์ฃผ๊ธฐ ย  ํ˜•์‹ ย  ์‚ฌ์šฉ์ž ์นœ๋ฐ€๋„ ย  1 ย  ๋งค์ผ ์ƒˆ๋ฒฝ2์‹œ ย  XML ย  ์•„์ดํ…œ ์ถ”์ฒœ ย  2 ย  ๋งค์ผ ์ƒˆ๋ฒฝ 1์‹œ ย  JSON ย  ๊ตฌ๋งค ์„ฑํ–ฅ ย  3 ย  ๋งค์ผ ์ €๋… 8์‹œ ย  XML/JSON ย  ์˜คํ”ผ๋‹ˆ์–ธ ๋ฆฌ๋” ์ ์ˆ˜ ย  2 ย  ๋งค์ผ ์˜ค์ „ 10 ์‹œ ย  XML/JSON ย  ๋ฐ์ดํ„ฐ
  • 44. ย 
  • 48. ย 
  • 84. ย  Future of Big Data Platform
  • 85. Flamingo Project โ€ขโ€ฏ ํ˜„์žฅ์—์„œ ์˜ค๋žซ๋™์•ˆ ๊ฒฝํ—˜ํ•œ ๊ฒฐ๊ณผ Hadoop ๊ธฐ๋ฐ˜ Big Data ํ™˜๊ฒฝ์€ ๊ธฐ๋Šฅ์„ฑ์ด ๋งค์šฐ ์ค‘์š” โ€ขโ€ฏ ๋งŽ์€ ์˜คํ”ˆ์†Œ์Šค๋“ค์ด ํ†ตํ•ฉ๋˜๋ฉด์„œ ๊ด€๋ฆฌ์˜ ์–ด๋ ค์›€์ด ๋ฐœ์ƒํ•˜๊ณ  ์žˆ๊ณ  ํ†ตํ•ฉํ™˜๊ฒฝ์„ ์ œ๊ณตํ•˜๋Š” UI๋„ ์ ˆ๋Œ€์ ์œผ๋กœ ๋ถ€์กฑ
  • 86. Flamingo์˜ ํ†ตํ•ฉ ํ™˜๊ฒฝ(Workbench) โ€ขโ€ฏ ์‚ฌ์šฉ์ž๋Š” ์ž‘์—… ๊ณต๊ฐ„ ๋‚ด์—์„œ ์ž์œ ๋กญ๊ฒŒ ์ด๋™ํ•˜๋ฉด์„œ ์ž‘์—…์„ ํ•  ์ˆ˜ ์žˆ ๋„๋ก ๊ตฌ์„ฑ โ€ขโ€ฏ ๊ฐ ํ™”๋ฉด์€ ์ตœ๋Œ€ํ•œ ๋…๋ฆฝ ๊ฐœ๋ฐœ์ด ๊ฐ€๋Šฅํ•˜๋„๋ก ๋ถ„๋ฆฌํ•˜์—ฌ ๊ตฌ์„ฑ โ€ขโ€ฏ ์žฌ์‚ฌ์šฉ ๊ฐ€๋Šฅํ•œ ๊ฒƒ์€ ์ปดํฌ๋„ŒํŠธํ™”์—ฌ ์ฝ”๋“œ ์ž‘์„ฑ์„ ์ตœ์†Œํ™” โ€ขโ€ฏ ๋ˆ„๊ตฌ๋‚˜ ์ถ”๊ฐ€ํ•  ์ˆ˜ ์žˆ๋„๋ก ์ตœ๋Œ€ํ•œ ๊ตฌ์กฐ๋ฅผ ๋‹จ์ˆœํ™”ํ•˜๊ณ  ๋Œ€์ค‘์ ์ธ ํ”„ ๋ ˆ์ž„์›Œํฌ๋ฅผ ์‚ฌ์šฉ โ€ขโ€ฏ ๊ฐœ๋ฐœ ๋ฐฉ๋ฒ•๋„ ๋ชจ๋‘ ํ‘œ์ค€ํ™” (๋„๊ตฌ, ์ ˆ์ฐจ, ๋งค๋‰ด์–ผ, ํ™˜๊ฒฝ ๋“ฑ๋“ฑ)
  • 88. File System Browser โ€ขโ€ฏ Hadoop์ด ํŒŒ์ผ์„ ๋‹ค๋ฃจ๋ฏ€๋กœ ํŒŒ์ผ ์‹œ์Šคํ…œ ๋ธŒ๋ผ์šฐ์ €์˜ ๊ธฐ๋Šฅ์€ ์ƒ๋‹นํžˆ ์ค‘์š”ํ•œ ๋ฉ”์ธ ๊ธฐ๋Šฅ โ€ขโ€ฏ ์‚ฌ์šฉ์ž๊ฐ€ Windows Explorer ์Šคํƒ€์ผ์˜ ์นœ์ˆ™ํ•œ UX๋ฅผ ๋”ฐ๋ผ๊ฐ€๋„๋ก ์„ค๊ณ„
  • 89. File System Browser ๋””๋ ‰ํ† ๋ฆฌ๋ฅผ Hive DB์™€ Table๋กœ ์ „ํ™˜ ๋ธŒ๋ผ์šฐ์ €์—์„œ๋Š” Hive DB์™€ Table ๊ฒฝ๋กœ๋ฅผ ๋‹ค๋ฅธ ์•„์ด์ฝ˜์œผ๋กœ ํ‘œ์‹œํ•˜์—ฌ ํ™•์ธ FLAMINGO์—์„œ๋Š” ์‚ฌ์šฉ์ž ๊ฐ€ ์ฃผ๋กœ ํ•˜๋Š” ํ–‰์œ„์— ์ตœ์  ํ™”ํ•˜์—ฌ ๊ธฐ๋Šฅ์„ ์ œ๊ณต
  • 90. File System Browser ๊ธฐ๋Šฅ ๊ณ ๋„ํ™” โ€ขโ€ฏ ํŒŒ์ผ ๋‚ด์šฉ ๋ฐ Block Location ๋ณด๊ธฐ ๊ธฐ๋Šฅ โ€ขโ€ฏ ์‚ฌ์šฉ์ž์˜ ๋“ฑ๊ธ‰๋ณ„ ๋””๋ ‰ํ† ๋ฆฌ ๋ฐ ํŒŒ์ผ ํ‘œ์‹œ ๋ฐ ๊ธฐ๋Šฅ ์ œํ•œ (Hadoop ์ž์ฒด ๊ธฐ๋Šฅ์€ ์—†์Œ) โ€ขโ€ฏ ์˜ˆ) ์ผ๋ฐ˜ ์‚ฌ์šฉ์ž์˜ ๊ฒฝ์šฐ /tmp ๋””๋ ‰ํ† ๋ฆฌ๋Š” ํ‘œ์‹œํ•˜์ง€ ์•Š์Œ โ€ขโ€ฏ ๋””๋ ‰ํ† ๋ฆฌ ๋ฐ ํŒŒ์ผ์˜ permission ์„ค์ • ๊ธฐ๋Šฅ โ€ขโ€ฏ ์‚ฌ์šฉ์ž์˜ ํ™ˆ ๋””๋ ‰ํ† ๋ฆฌ ๊ธฐ๋Šฅ (Hadoop ์ž์ฒด ๊ธฐ๋Šฅ์€ ์—†์Œ) โ€ขโ€ฏ ๋””๋ ‰ํ† ๋ฆฌ Quota ์„ค์ • ๊ธฐ๋Šฅ โ€ขโ€ฏ ํŒŒ์ผ ์‹œ์Šคํ…œ์˜ ํฌ๊ธฐ ์ •๋ณด๋ฅผ ์ฃผ๊ธฐ์ ์œผ๋กœ ๋คํ”„๋ฅผ ์ƒ์„ฑํ•˜๋Š” ๊ธฐ๋Šฅ ์ถ”๊ฐ€ (๋ชจ๋‹ˆํ„ฐ๋ง)
  • 91. Audit Log โ€ขโ€ฏ HDFS ๋“ฑ๊ณผ ๊ฐ™์€ File System ์ƒ์—์„œ ๋ฐœ์ƒํ•˜๋Š” ๋กœ๊ทธ์˜ ๊ธฐ๋ก์„ ๋ชจ๋‘ ๋‚จ๊ธฐ๊ณ  ์กฐํšŒ
  • 92. Workflow Designer โ€ขโ€ฏ ๋‹ค์–‘ํ•œ ๋ถ„์„ ๋ชจ๋“ˆ์„ ํƒ‘์žฌํ•  ์ˆ˜ ์žˆ๋„๋ก ์„ค๊ณ„ (์˜ˆ; Mahout) โ€ขโ€ฏ UI๋ฅผ ํ†ตํ•ด ๋ฏธ๋ฆฌ ์ œ๊ณตํ•˜๋Š” ๋ถ„์„ ๋ฐ ์ฒ˜๋ฆฌ ๋ชจ๋“ˆ์„ ๋“œ๋ž˜๊ทธ ์•ค ๋“œ๋กญ์œผ๋กœ ์ฒ˜๋ฆฌ โ€ขโ€ฏ ํ˜„์žฌ ๋ถ„์„ ์•Œ๊ณ ๋ฆฌ์ฆ˜ ๋ฐ ๊ธฐ์ดˆ ํ†ต๊ณ„ ๋ชจ๋“ˆ์€ ํ†ตํ•ฉ ์™„๋ฃŒ, Mahout, Giraph ํ†ตํ•ฉ ์ง„ํ–‰์ค‘. ์ถ”ํ›„ MR ETL ํ†ตํ•ฉ ์˜ˆ์ •.
  • 93. Big Workflow Case ํ˜„์žฅ์—์„œ ํ•„์š”ํ•˜๋‹ค๋ฉด ๋‹ค์ˆ˜์˜ ๋…ธ๋“œ๋กœ ๊ตฌ์„ฑํ•  ์ˆ˜ ์žˆ๋Š” ์›Œํฌํ”Œ๋กœ์šฐ๋ฅผ ์‹ค์ œ ๋กœ ๊ตฌํ˜„ํ•˜์—ฌ ์‚ฌ์šฉํ•จ.
  • 95. Apache Access Log To CSV ์ž‘์„ฑํ•œ MapReduce์˜ ํŒŒ๋ผ๋ฏธํ„ฐ ์˜ต์…˜ โ€ขโ€ฏ CSV ํŒŒ์ผ ๋ณ€ํ™˜์‹œ ํ•„์š”ํ•œ ์ปฌ๋Ÿผ ๊ตฌ๋ถ„์ž โ€ขโ€ฏ ํŒจํ„ด๊ณผ ๋‹ค๋ฅธ ๋กœ๊ทธ์˜ ๊ฒฝ์šฐ ํ‘œ์ค€ ์ถœ๋ ฅ์œผ๋กœ ๊ธฐ๋ก ํ• ์ง€ ์—ฌ๋ถ€(๋””๋ฒ„๊น…์šฉ) Apache Access Log์˜ ์œ„์น˜์™€ CSV ํŒŒ ์ผ์˜ ์œ„์น˜๋ฅผ ์ง€์ • MapReduce JAR ํŒŒ์ผ๊ณผ Driver ํด๋ž˜์Šค
  • 96. Workflow Designer โ€ขโ€ฏ ์ตœ์ข… ๊ฒฐ๊ณผ๋ฌผ์„ ์ƒ์„ฑํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š” ๋ณต์žกํ•œ ์›Œํฌํ”Œ๋กœ์šฐ๋ฅผ ๊ตฌ์„ฑํ•˜๊ฒŒ ๋จ โ€ขโ€ฏ MapReduce์˜ ํŠน์„ฑ์ƒ ํŒŒ์ผ์„ ๊ฐ€๊ณตํ•˜๋Š”๋ฐ ํ•œ๋ฒˆ์˜ ์ž‘์—…์ด ์•„๋‹Œ ๋‹ค์ˆ˜์˜ ์ž‘์—…์œผ ๋กœ ํ•ด์•ผํ•˜๋Š” ๊ฒฝ์šฐ ๋นˆ๋ฒˆํ•˜์—ฌ ์›Œํฌํ”Œ๋กœ์šฐ๋ฅผ ๋ณต์žกํ•˜๊ฒŒ ๋งŒ๋“ฌ โ€ขโ€ฏ ๊ตญ๋‚ด ์—”์ง€๋‹ˆ์–ด๋“ค์€ ์ ˆ๋Œ€์ ์œผ๋กœ Apache Hive์˜ SQL like Query Languag e๋ฅผ ์„ ํ˜ธํ•˜์—ฌ MapReduce๋ฅผ ๋งŽ์ด ์‚ฌ์šฉํ•˜์ง€ ์•Š์œผ๋ฏ€๋กœ ์›Œํฌํ”Œ๋กœ์šฐ ๋””์ž์ด๋„ˆ ์˜ ์ค‘์š”์„ฑ์ด ๋งŽ์ด ๋ถ€๊ฐ๋˜์ง€ ์•Š์Œ โ€ขโ€ฏ ํ˜„์—…์—์„œ ๋‹ค์–‘ํ•œ ๋กœ๊ทธ ํŒŒ์ผ์„ ๋‹ค๋ฃจ๋Š” ๊ฒฝ์šฐ ์›Œํฌํ”Œ๋กœ์šฐ ๋””์ž์ด๋„ˆ์™€ MapRedu ce๋Š” ๋งค์šฐ ์ค‘์š”ํ•จ
  • 97. Workflow Monitoring โ€ขโ€ฏ ์›Œํฌํ”Œ๋กœ์šฐ ๋””์ž์ด๋„ˆ์—์„œ ๋””์ž์ธํ•œ ์›Œํฌํ”Œ๋กœ์˜ ์‹คํ–‰์„ ๋ชจ๋‹ˆํ„ฐ๋ง. ์‹คํ–‰ ๋กœ๊ทธ๋ฅผ ์ •ํ™•ํ•˜๊ฒŒ ํ™•์ธํ•  ์ˆ˜ ์žˆ์Œ.
  • 98. Workflow Monitoring root@n02:~/flamingo_data/tmp/2014/03/31/90/JOB_20140331_172000_90_157566920/26385942 $ ls -lsa ํ•ฉ๊ณ„ 40 4 drwxr-xr-x 2 root root 4096 2014-03-31 17:23 . 4 drwxr-xr-x 20 root root 4096 2014-03-31 17:23 .. 16 -rw-r--r-- 1 root root 12731 2014-03-31 17:23 action.log ร ๏ƒ  ์‹คํ–‰ ๋กœ๊ทธ 4 -rwxrwxrwx 1 root root 1259 2014-03-31 17:23 core-site.xml 0 -rw-r--r-- 1 root root 0 2014-03-31 17:23 hadoop.job_201403300831_0471 ร ๏ƒ  MapReduce Job ID 4 -rwxrwxrwx 1 root root 852 2014-03-31 17:23 script.sh ร ๏ƒ  ์ปค๋งจ๋“œ ๋ผ์ธ root@n02:~/flamingo_data/tmp/2014/03/31/90/JOB_20140331_172000_90_157566920/26385942 $ ์›Œํฌํ”Œ๋กœ์šฐ์˜ ๋…ธ๋“œ๋Š” ๋‹ค ์ˆ˜์˜ MAPREDUCE JOB ์œผ๋กœ ๋™์ž‘ํ•  ์ˆ˜ ์žˆ์œผ๋ฏ€ ๋กœ ์ถ”์ ์ด ๊ฐ€๋Šฅํ•ด์•ผ ํ•จ ์‚ฌ์šฉ์ž ๊ด€์ ์˜ MapReduce ์‹คํ–‰ ์ด๋ ฅ
  • 99. Hadoop Job Monitoring Hadoop Job ๋ชจ๋‹ˆํ„ฐ๋ง์—์„œ๋„ ๋ฐ˜๋Œ€๋กœ ์ถ”์ ์ด ๋ชจ๋‘ ๊ฐ€๋Šฅํ•ด์•ผ ํ•จ.
  • 100. Expression Language (EL) โ€ขโ€ฏ ๋™์ ์ธ ๊ฐ’๋“ค์„ ์–ป๊ณ ์žํ•  ๋•Œ Workflow Designer์—์„œ ํ™œ์šฉ โ€ขโ€ฏ ์˜ˆ) ์˜ค๋Š˜ ๋‚ ์งœ : dateFormat(โ€˜yyyyMMddโ€™) dateFormat(โ€˜yyyy-MM-ddโ€™) โ€ขโ€ฏ ์›Œํฌํ”Œ๋กœ์šฐ๊ฐ€ ์‹คํ–‰ํ•  ๋•Œ ํŠน์ •ํ•œ ๊ฐ’๋“ค์€ ํ•ด๋‹น ์‹œ๊ฐ„์œผ๋กœ ๋Œ€์ฒด๋˜์–ด์•ผ ํ•˜๋Š” ๊ฒฝ์šฐ๊ฐ€ ๋ฐœ์ƒ โ€ขโ€ฏ ์˜ˆ) ์˜ค๋Š˜ ์‹คํ–‰ํ•˜๋Š” ์›Œํฌํ”Œ๋กœ์šฐ๋Š” ์–ด์ œ ๋‚ ์งœ์˜ ๋””๋ ‰ํ† ๋ฆฌ์— ๊ธฐ๋ก (์ผ๋ฐฐ์น˜) โ€ขโ€ฏ ์ œ๊ณตํ•˜๋Š” Expression Language โ€ขโ€ฏ dateFormat(โ€˜DATE FORMATโ€™) ร ๏ƒ  dateFormat(โ€˜yyyyMMddHHmmssโ€™) โ€ขโ€ฏ hostname, escapeString, โ€ขโ€ฏ yesterday, tommorow โ€ขโ€ฏ month, day, hour, minute, โ€ฆ ร ๏ƒ  day(โ€˜yyyyMMddโ€™, -1) :: ์–ด์ œ ๋‚ ์งœ (20131111) โ€ขโ€ฏ trim, concat, urlEncode, firstNotNull
  • 101. Expression Language (EL) ์ž…๋ ฅ ํ•„๋“œ์— ${EL} ํ˜•์‹์œผ๋กœ ์ž…๋ ฅํ•˜๋Š” ๊ฒฝ์šฐ ๋™์ ์œผ๋กœ ํ•ด์„ํ•˜์—ฌ ๊ฐ’์ด ๋ณ€๊ฒฝ๋จ.
  • 102. Hadoop Job Tracker Monitoring โ€ขโ€ฏ Hadoop์˜ Job Tracker ์ƒ์„ธ ์ •๋ณด๋ฅผ ๊ทธ๋ž˜ํ”„๋กœ ๋ณด์—ฌ์ฃผ๋Š” ๋ชจ๋‹ˆํ„ฐ๋ง ๊ธฐ๋Šฅ
  • 103. Hadoop Job Tracker Monitoring โ€ขโ€ฏ Hadoop Job์˜ ์ƒ์„ธ ์ •๋ณด๋ฅผ ์›๊ฒฉ์—์„œ ๋ชจ๋‘ ๋ชจ๋‹ˆํ„ฐ๋งํ•˜๊ณ  ์ถ”์  ๊ฐ€๋Šฅ
  • 104. Hive Editor Hive Metastore Browser โ€ขโ€ฏ ํŒŒ์ผ ์‹œ์Šคํ…œ์˜ ํŒŒ์ผ์„ SQL๋กœ ์กฐํšŒ, ๋ธŒ๋ผ์šฐ์ง•, ๋‹ค์šด๋กœ๋“œ โ€ขโ€ฏ Hive Metastore ๊ด€๋ฆฌ ๊ธฐ๋Šฅ์„ ์ œ๊ณตํ•˜์—ฌ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค์™€ ํ…Œ์ด๋ธ”์„ ํ†ตํ•ฉ ๊ด€๋ฆฌํ•  ์ˆ˜ ์žˆ๋„๋ก ๊ธฐ๋Šฅ์„ ์ œ๊ณต
  • 105. Hive ํŽธ์ง‘๊ธฐ ์ ์šฉ ์‚ฌ๋ก€ โ€ขโ€ฏ ์‹œ์Šคํ…œ์˜ ์‚ฌ์šฉ์ž ์ ‘๊ทผ ์ด๋ ฅ ๋กœ๊ทธ๋ฅผ Hive๋กœ ์กฐํšŒํ•˜๋Š” ์‚ฌ๋ก€ โ€“โ€ฏ ๋Œ€์ƒ ๋กœ๊ทธ์˜ ํ˜•์‹์ด ๋ฐ˜์ •ํ˜•์ด๋‚˜ ๋น„์ •ํ˜•์ธ ๊ฒฝ์šฐ ๋ฌธ์ œ ๋ฐœ์ƒ โ€“โ€ฏ ์นผ๋Ÿผ ์•ˆ์— Array, Map ๋“ฑ์˜ ์ด์ƒํ•œ ๊ตฌ์กฐ๋ฅผ ๊ฐ€์ง„ ๋กœ๊ทธ์˜ ๊ฒฝ์šฐ ๋ฌธ์ œ ๋ฐœ์ƒ โ€ขโ€ฏ ๋Œ€์ƒ ๋กœ๊ทธ๋Š” CSV ํ˜•์‹๊ณผ ๊ฐ™์€ ์ž˜ ์ •๋ฆฌ๋œ ํ˜•์‹์ด ์•„๋‹Œ ๋ฐ˜์ •ํ˜• ๋กœ๊ทธ ํ˜•์‹ TYPE=IPINSIDE TIME=2014-03-20 17:40:37 ID=guest0899349 MAC=AA-BB-01-18-68-68 NAT_IP=10.24 .104.104 NAT_IP_NATION=USA PROXY_USE=Y VPN_USE=Y REMOTE_USE=Y PROXY_IP=192.24.104.104 P ROXY_IP_NATION=USA VPN_IP=192.24.104.104 VPN_IP_NATION=USA SVC_CODE=SVC_CODE_0899349 HDD_D ISK=HDD_DISK_0899349 CPU_INFO=CPU_INFO_0899349 USE_OS_NATION=USA MESG=mesg..... time[139528 4830] rnd[875899349] unq[5000000] TYPE=IPINSIDE TIME=2014-03-20 17:40:37 ID=guest0899349 MAC=AA-BB-01-18-68-68 NAT_IP=10.24 .104.104 NAT_IP_NATION=USA PROXY_USE=Y VPN_USE=Y REMOTE_USE=Y PROXY_IP=192.24.104.104 P ROXY_IP_NATION=USA VPN_IP=192.24.104.104 VPN_IP_NATION=USA SVC_CODE=SVC_CODE_0899349 HDD_D ISK=HDD_DISK_0899349 CPU_INFO=CPU_INFO_0899349 USE_OS_NATION=USA MESG=mesg..... time[139528 4830] rnd[875899349] unq[5000000]
  • 106. Hive ํŽธ์ง‘๊ธฐ ์ ์šฉ ์‚ฌ๋ก€ TYPE=IPINSIDE TIME=2014-03-20 17:40:37 ID=guest0899349 MAC=AA-BB-01-18-68-68 NAT_IP=10.24.104.104 NAT_IP_NATION=USA PROXY_USE=Y VPN_USE=Y REMOTE_USE=Y PROXY_IP=192.24.104.104 PROXY_IP_NATION=USA VPN_IP=192.24.104.104 VPN_IP_NATION=USA SVC_CODE=SVC_CODE_0899349 HDD_DISK=HDD_DISK_0899349 CPU_INFO=CPU_INFO_0899349 USE_OS_NATION=USA MESG=mesg..... time[1395284 830] rnd[875899349] unq[5000 000]โ€
  • 107. Hive ํŽธ์ง‘๊ธฐ ์ ์šฉ ์‚ฌ๋ก€ CREATE DATABASE TEST LOCATION '/RAW'; CREATE EXTERNAL TABLE TEST.MAS ( type string, time string, id string, mac string, nat_ip string, nat_ip_nation string, proxy_use string, vpn_use string, remote_use string, proxy_ip string, proxy_ip_nation string, vpn_ip string, vpn_ip_nation string, svc_code string, hdd_disk string, cpu_info string, use_os_nation string, mesg string) PARTITIONED BY ( yyyy string, mm string, dd string) ROW FORMAT SERDE 'kr.cloudine.poc.MasSerde' LOCATION '/RAW/MAS'; ALTER TABLE MAS ADD PARTITION (YYYY='2014', MM='03', DD=โ€™25');
  • 109. Hive ํŽธ์ง‘๊ธฐ ์ ์šฉ ์‚ฌ๋ก€ public class MasSerde implements SerDe { private StructTypeInfo rowTypeInfo; private ObjectInspector rowOI; private ListString colNames; private ListObject row = new ArrayListObject(); Pattern p = Pattern.compile((.*?)); // ๋กœ๊ทธ ํŒŒ์ผ์˜ ์ •๊ทœ ํ‘œํ˜„์‹ @Override public Object deserialize(Writable blob) throws SerDeException { row.clear(); Matcher m = p.matcher(blob.toString()); // ๋กœ๊ทธ ํŒŒ์ผ์„ ์ •๊ทœ์‹์œผ๋กœ ํŒจํ„ด ๋งค์นญ List list = new ArrayList(); while (m.find()) { list.add(m.group(1)); // ํŒจํ„ด ๋งค์นญ์„ ํ†ตํ•ด ์ถ”์ถœํ•œ ์นผ๋Ÿผ ์ •๋ณด๋ฅผ ์ €์žฅ } String[] split = (String[]) list.toArray(new String[list.size()]); int i = 0; for (String fieldName : rowTypeInfo.getAllStructFieldNames()) { TypeInfo fieldTypeInfo = rowTypeInfo.getStructFieldTypeInfo(fieldName); row.add(parseField(split[i], fieldTypeInfo)); i++; } return row; } ... ์ƒ๋žต } HIVE QUERY ์‹คํ–‰์‹œ ๋กœ๊ทธ ํŒŒ์ผ์„ ๋กœ๋”ฉํ•  ๋•Œ DESERIALIZEํ•œ๋‹ค.
  • 111. Pig Script Editor โ€ขโ€ฏ Pig Latin Script๋ฅผ ํŽธ์ง‘ํ•˜๊ณ  ์ €์žฅ โ€ขโ€ฏ Pig Latin Script๋ฅผ ์‹คํ–‰ํ•˜๊ณ  ๊ด€๋ จ ์ด๋ ฅ์„ ๊ด€๋ฆฌํ•˜์—ฌ ๋น ๋ฅด๊ฒŒ ๋ฐ์ดํ„ฐ๋ฅผ ํ”„๋กœ์„ธ์‹ฑ
  • 112. Dashboard โ€ขโ€ฏ ๋ฐฐ์น˜ ์ž‘์—…์˜ ๋™์ž‘ ํ˜„ํ™ฉ์„ ๋ณด์—ฌ์ฃผ๋Š” UI
  • 113. Job Management โ€ขโ€ฏ ์›Œํฌํ”Œ๋กœ์šฐ๋ฅผ ์ฃผ๊ธฐ์ ์œผ๋กœ ์‹คํ–‰ํ•˜๋„๋ก ๋ฐฐ์น˜ ์ž‘์—…์„ ๋“ฑ๋กํ•˜๊ณ  ๋ชจ๋‹ˆํ„ฐ๋ง
  • 114. Job Management โ€ขโ€ฏ Cron Expression Fully Support
  • 115. ํ”„๋กœ์ ํŠธ ์ •๋ณด โ€ขโ€ฏ Source Forge (๋‹ค์šด๋กœ๋“œ) โ€“โ€ฏ http://www.sourceforge.net/projects/hadoop-manager โ€ขโ€ฏ ์œ„ํ‚ค (์„ค๋ช…์„œ ๋ฐ ๊ฐ์ข… ๊ธฐ์ˆ ์ž๋ฃŒ) โ€“โ€ฏ http://wiki.opencloudengine.org/pages/viewpage.action?pageId=8 19205 โ€ขโ€ฏ ์ด์Šˆ ๊ด€๋ฆฌ (๋ฒ„๊ทธ ๋ฐ ์‹ ๊ทœ ๊ธฐ๋Šฅ) โ€“โ€ฏ http://jira.opencloudengine.org โ€ขโ€ฏ ๋นŒ๋“œ ์„œ๋ฒ„ โ€“โ€ฏ http://build.opencloudengine.org โ€ขโ€ฏ ๊ตฌ๊ธ€ ๊ทธ๋ฃน์Šค : flamingo-project-kr@googlegroups.com โ€ขโ€ฏ facebook : https://www.facebook.com/groups/flamingo.workflow โ€ขโ€ฏ ์„œ๋ธŒ์Šคํฌ๋ฆฝ์…˜ (๊ธฐ์—… ๊ธฐ์ˆ ์ง€์›) : sales@cloudine.co.kr
  • 116. Flamingo Project์˜ ๋ฏธ๋ž˜ โ€ขโ€ฏ Big Data on Cloud โ€ขโ€ฏ Netra (OpenStack based Hadoop Provisioning) + Flamingo (Hadoop based Workspace) โ€ขโ€ฏ Open Source based Big Data Platform โ€ขโ€ฏ Apache Hadoop EcoSystem โ€ขโ€ฏ Big Data Management Using Flamingo โ€ขโ€ฏ Apache Hadoop PaaS (Platform as a Service) โ€ขโ€ฏ Big Data All In One Package
  • 117. Workflow Designer โ€ขโ€ฏ MapReduce ๊ฐœ๋ฐœ์ž ๋งˆ๋‹ค ๋ชจ๋‘ ํŒŒ๋ผ๋ฏธํ„ฐ ์ฒ˜๋ฆฌ๊ฐ€ ํ‹€๋ฆฌ๊ณ  ํ‘œ์ค€ํ™” ๋˜์–ด ์žˆ์ง€ ์•Š์Œ โ€ขโ€ฏ ์ด๋Ÿฌํ•œ ๋‹ค์–‘ํ•œ MapReduce๋ฅผ ๋น ๋ฅด๊ฒŒ ์–ด๋–ป๊ฒŒ ํ†ตํ•ฉํ•  ๊ฒƒ์ธ๊ฐ€?
  • 118. Workflow Designer โ€ขโ€ฏ ๋Œ€๋ถ€๋ถ„์˜ UI ์ปดํฌ๋„ŒํŠธ๋Š” ์žฌ์‚ฌ์šฉ ๊ฐ€๋Šฅํ•˜๋„๋ก ์„ค๊ณ„ํ•˜์—ฌ ์ปดํฌ๋„ŒํŠธ ํ˜•ํƒœ๋กœ ์ œ๊ณต โ€ขโ€ฏ MapReduce Module๊ณผ UI ํ†ตํ•ฉ ๋ฐฉ์‹์ด ํ‘œ์ค€ํ™” ๋˜์–ด ์žˆ์œผ๋ฉฐ ํ”„๋ ˆ์ž„์›Œํฌ๋กœ ์ œ๊ณต๋˜์–ด ๋น ๋ฅด๊ฒŒ ๊ฐœ๋ฐœ ๋ฐ ํ†ตํ•ฉ ๊ฐ€๋Šฅ ์žฌ์‚ฌ์šฉ ์ปดํฌ๋„ŒํŠธ UI ๊ตฌ์„ฑ
  • 119. Workflow Designer โ€ขโ€ฏ ๋ชจ๋“ˆ์˜ ์•„์ด์ฝ˜๋„ ๋ฉ”ํƒ€ ๋ฐ์ดํ„ฐ๋ฅผ ํ†ตํ•ด์„œ ์ •์˜ํ•˜์—ฌ ๋ณ„๋„ ์ฝ”๋“œ ์ž‘์„ฑ์„ ์ตœ์†Œํ™”ํ•˜๊ณ  โ€ขโ€ฏ ๊ด€๋ จ ๊ธฐ๋Šฅ์„ ํ†ตํ•ฉ ํ”„๋ ˆ์ž„์›Œํฌ๋กœ ์œ„์ž„ํ•˜๊ณ  ์‚ฌ์šฉ์ž๋Š” ๋ฉ”ํƒ€ ๋ฐ์ดํ„ฐ๋งŒ์œผ๋กœ ํ•ธ๋“ค๋ง