18. ๋ฐ์ดํฐ ๋ถ์ ํ๋ฆ
Load in memory
hash(url)
IP-City
Data
URL, Count(1)
Group by URL
Log Parsing
WorkGroup #1
(LogType=URL)
time batch 60 sec.
TOP 100
Order by count
Desc
URL, Count(1)
Group by URL
log
data
Log Parsing
Log Parsing
Count
(Distinct User)
HBase Table
hash(user_id)
Count
(Distinct User)
WorkGroup #2
(LogType=User)
time batch 20 sec.
31. ์์คํ ๊ตฌ์ฑ
Uploader
Application Server
ZooKeeper
Master Server
Server Cluster Membership
Genome Browser
Uploader
Data Server Failover
JDBC
Master Election
Client
Indexer
Genome Allocation
Cluster Configuration
Meta Management
Meta Infomation
Data Server #1
โฆ
Genome Unit #1
Disk
Index
Memory
Index
Data
File
Index
File
Index
File
Index
File
Index
File
Data
File
Index
File
Data
File
Index
File
Data
File
Index
File
Data
File
Index
File
Data
File
Index
File
Data
File
Index
File
Data
File
Index
File
Data
File
Hadoop DataNode
Hadoop DataNode
โฆ
Index
File
Data
File
Index
File
Data
File
Index
File
Data
File
Index
File
Data
File
Hadoop DataNode