SlideShare a Scribd company logo
HareQL:快速HBase查詢工具的發展過程
Development of HBase Client and HareQL
Mon-Fong Mike Jiang
江孟峰
Kuan-Yu Hubert Fan-Chiang
范姜冠宇
Tienyu Rebecca Lin
林恬伃
 Providing IT solutions
• System development for big data solutions
• Smart manufacturing related services
• Financial data systems
• Telecommunication data systems
• We are the Cloudera certificated professional
services team
2
About Us
 Big data Product since 2011
•Hare Data platform
•2013 HSP Innovative Product Award
•2014 Golden Award of the TOP 100 Innovative Products
•Cloudera Certified Technology (Only one in Taiwan)
What is Hare
It’s a NoSQL Database which is based on HBase
Support SQL to HBase directly
Provide DBMS-like Web UI
Provide JDBC/ODBC and Restful Service
Why Hare ?
Easy
• Click and start to use it
• Friendly user interface
• To involve your big data rapidly
Comfortable
• SQL language supported
• Data type management
• Multi-Cluster in one client
Faster
• Quick access to the data in HBase
• Powerful query engine for better performance
Compatible
• Based on the Hadoop/HBase System
• Highly compatible in ecosystem
Features
 Easy use (Web UI)
 Easy install
 Friendly UI
 One Client ; Many Clusters (Connection Manager)
 Bulkload UI
 Meta Manager (Schema Manager)
 Relation between HBase Table and Hive Table
 HareQL (High Speed SQL Query in HBase)
 JDBC Driver
 ODBC Driver (not support sentry)
 Restful Services
Software Stack
Hadoop
HBaseHive Spark
Solr Cloud
Security
KerberosSentry
Indexing
Hare Core
HBase
Client
Restful
Service
JDBC
ODBC
(No
Sentry) Spark Monitor
HDFS Client
System Architecture
HareQL
Hive: MapReduce
We replace MapReduces in Hive to HBase coprocessors.
We call the language “HareQL”.
HareQL has some advantages as below.
 Low- latency
 Query HBase table directly
 High performance
hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程
HareQL Architecture
Hive Parser make us support HiveQL
Final Reducer
Hive
Flow
Parser
Semantic
Analyzer
Logical
Plan Gen.
Logical
Optimizer
Physical
Plan Gen.
Physical
Optimizer
AST
QB
OP
Tree
OP
Tree
Task
Tree
Task
Tree
Advance
Parser
HareSemantic
Analyzer
Hare
Coprocessor
Execution
QB
Execution
Plan
Task
Plan
Map
Result
Hare
Flow
Result
From Hive to Hare
Metadata
 As we know, anything that can be converted to an array of
bytes can be stored in HBase. However, we have to convert the
data back correctly, or we can’t recognize the data.
 We integrated meta-store of Hive to HBase Client. We call the
data type of HBase column “Meta data”.
When to get Metadata ?
Advance Parser
HareSemanticAnalyzer
Hare Coprocessor
QB
Task Plan
Hive Meta
Hare Meta
Hare Restful Service
Table manipulation
Row manipulation
Bulkload data
Sending SQL
Scanning
Metadata manipulation
Structured Data
Semiconductor application
15
MES
System
FDC
System
SCM
System
SPC
System
EDA
System
SQM
System
ERP
System
PLM
System
Un-Structured Data
hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程
hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程
Schema Design in HBase
• Designed the row key according to the
access request
18
Application – Yieldata
- Easy Selection
- Smart Filter
- Clear View of the
Dataset
Yieldata – Root Cause Ranking
Thank you
• is-land Systems Inc.
• Company:www.is-land.com.tw
• Big Data:www.HareDB.com
• Email:service@haredb.com
• Addr : 新竹科學園區展業二路4號3樓

More Related Content

PDF
hbaseconasia2017: HBase Disaster Recovery Solution at Huawei
PDF
HBaseConAsia2018 Track1-1: Use CCSMap to improve HBase YGC time
PPTX
HBaseConAsia2018 Track1-5: Improving HBase reliability at PInterest with geo ...
PPTX
HBaseConAsia2018 Track3-7: The application of HBase in New Energy Vehicle Mon...
PDF
HBaseCon 2013:High-Throughput, Transactional Stream Processing on Apache HBase
PDF
hbaseconasia2017: Apache HBase at Netease
PDF
HBaseConAsia2018 Keynote 2: Recent Development of HBase in Alibaba and Cloud
PPTX
HBaseCon 2012 | Building a Large Search Platform on a Shoestring Budget
hbaseconasia2017: HBase Disaster Recovery Solution at Huawei
HBaseConAsia2018 Track1-1: Use CCSMap to improve HBase YGC time
HBaseConAsia2018 Track1-5: Improving HBase reliability at PInterest with geo ...
HBaseConAsia2018 Track3-7: The application of HBase in New Energy Vehicle Mon...
HBaseCon 2013:High-Throughput, Transactional Stream Processing on Apache HBase
hbaseconasia2017: Apache HBase at Netease
HBaseConAsia2018 Keynote 2: Recent Development of HBase in Alibaba and Cloud
HBaseCon 2012 | Building a Large Search Platform on a Shoestring Budget

What's hot (20)

PDF
HBaseConAsia2018 Keynote1: Apache HBase Project Status
PPTX
HBaseCon 2013: Using Coprocessors to Index Columns in an Elasticsearch Cluster
PPTX
Optimizing Apache HBase for Cloud Storage in Microsoft Azure HDInsight
PDF
HBaseConAsia2018 Track1-3: HBase at Xiaomi
PDF
25 snowflake
PPTX
HBaseConAsia2018 Track2-2: Apache Kylin on HBase: Extreme OLAP for big data
PPTX
HBaseCon 2015 General Session: Zen - A Graph Data Model on HBase
PPTX
October 2016 HUG: Architecture of an Open Source RDBMS powered by HBase and ...
PPTX
HBaseCon 2015: State of HBase Docs and How to Contribute
PDF
Kafka to the Maxka - (Kafka Performance Tuning)
PDF
Rails on HBase
PPTX
HBaseCon 2015: Optimizing HBase for the Cloud in Microsoft Azure HDInsight
PPTX
HBaseCon 2013: Streaming Data into Apache HBase using Apache Flume: Experienc...
PDF
Thug feb 23 2015 Chen Zhang
PPTX
Transform your DBMS to drive engagement innovation with Big Data
PDF
HBaseCon 2015- HBase @ Flipboard
PPTX
Chicago Data Summit: Geo-based Content Processing Using HBase
PPTX
HBase at Bloomberg: High Availability Needs for the Financial Industry
PPTX
Keynote: The Future of Apache HBase
PDF
Cassandra in e-commerce
HBaseConAsia2018 Keynote1: Apache HBase Project Status
HBaseCon 2013: Using Coprocessors to Index Columns in an Elasticsearch Cluster
Optimizing Apache HBase for Cloud Storage in Microsoft Azure HDInsight
HBaseConAsia2018 Track1-3: HBase at Xiaomi
25 snowflake
HBaseConAsia2018 Track2-2: Apache Kylin on HBase: Extreme OLAP for big data
HBaseCon 2015 General Session: Zen - A Graph Data Model on HBase
October 2016 HUG: Architecture of an Open Source RDBMS powered by HBase and ...
HBaseCon 2015: State of HBase Docs and How to Contribute
Kafka to the Maxka - (Kafka Performance Tuning)
Rails on HBase
HBaseCon 2015: Optimizing HBase for the Cloud in Microsoft Azure HDInsight
HBaseCon 2013: Streaming Data into Apache HBase using Apache Flume: Experienc...
Thug feb 23 2015 Chen Zhang
Transform your DBMS to drive engagement innovation with Big Data
HBaseCon 2015- HBase @ Flipboard
Chicago Data Summit: Geo-based Content Processing Using HBase
HBase at Bloomberg: High Availability Needs for the Financial Industry
Keynote: The Future of Apache HBase
Cassandra in e-commerce
Ad

Similar to hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程 (20)

PPTX
Unit II Hadoop Ecosystem_Updated.pptx
PPTX
Horizon for Big Data
PDF
OPTIMIZATION OF MULTIPLE CORRELATED QUERIES BY DETECTING SIMILAR DATA SOURCE ...
PPTX
BDA: Introduction to HIVE, PIG and HBASE
PDF
Big Data Conference April 2015
PPT
HBase and Hive at StumbleUpon Presentation.ppt
PDF
Intro to HBase - Lars George
PDF
Big data and mstr bridge the elephant
PDF
HBaseCon 2013: Integration of Apache Hive and HBase
PPTX
Hspark index conf
PPTX
Overview of big data & hadoop v1
PPTX
Overview of big data & hadoop version 1 - Tony Nguyen
PPTX
Overview of Big data, Hadoop and Microsoft BI - version1
PPTX
Big data processing engines, Atlanta Meetup 4/30
PPTX
Hive and Hbase inegration
PPTX
Hive vs Hbase, a Friendly Competition
PDF
Spark HBase Connector: Feature Rich and Efficient Access to HBase Through Spa...
PPTX
Big Data, Bigger Brains
PPTX
12 SQL On-Hadoop Tools
PDF
HBaseCon2017 Apache HBase at Didi
Unit II Hadoop Ecosystem_Updated.pptx
Horizon for Big Data
OPTIMIZATION OF MULTIPLE CORRELATED QUERIES BY DETECTING SIMILAR DATA SOURCE ...
BDA: Introduction to HIVE, PIG and HBASE
Big Data Conference April 2015
HBase and Hive at StumbleUpon Presentation.ppt
Intro to HBase - Lars George
Big data and mstr bridge the elephant
HBaseCon 2013: Integration of Apache Hive and HBase
Hspark index conf
Overview of big data & hadoop v1
Overview of big data & hadoop version 1 - Tony Nguyen
Overview of Big data, Hadoop and Microsoft BI - version1
Big data processing engines, Atlanta Meetup 4/30
Hive and Hbase inegration
Hive vs Hbase, a Friendly Competition
Spark HBase Connector: Feature Rich and Efficient Access to HBase Through Spa...
Big Data, Bigger Brains
12 SQL On-Hadoop Tools
HBaseCon2017 Apache HBase at Didi
Ad

More from HBaseCon (20)

PDF
hbaseconasia2017: Building online HBase cluster of Zhihu based on Kubernetes
PDF
hbaseconasia2017: HBase on Beam
PDF
hbaseconasia2017: Removable singularity: a story of HBase upgrade in Pinterest
PDF
hbaseconasia2017: HBase在Hulu的使用和实践
PDF
hbaseconasia2017: 基于HBase的企业级大数据平台
PDF
hbaseconasia2017: HBase at JD.com
PDF
hbaseconasia2017: Large scale data near-line loading method and architecture
PDF
hbaseconasia2017: Ecosystems with HBase and CloudTable service at Huawei
PDF
hbaseconasia2017: HBase Practice At XiaoMi
PDF
hbaseconasia2017: hbase-2.0.0
PDF
HBaseCon2017 Democratizing HBase
PDF
HBaseCon2017 Removable singularity: a story of HBase upgrade in Pinterest
PDF
HBaseCon2017 Quanta: Quora's hierarchical counting system on HBase
PDF
HBaseCon2017 Transactions in HBase
PDF
HBaseCon2017 Highly-Available HBase
PDF
HBaseCon2017 gohbase: Pure Go HBase Client
PDF
HBaseCon2017 Improving HBase availability in a multi tenant environment
PDF
HBaseCon2017 Spark HBase Connector: Feature Rich and Efficient Access to HBas...
PDF
HBaseCon2017 Efficient and portable data processing with Apache Beam and HBase
PDF
HBaseCon2017 HBase at Xiaomi
hbaseconasia2017: Building online HBase cluster of Zhihu based on Kubernetes
hbaseconasia2017: HBase on Beam
hbaseconasia2017: Removable singularity: a story of HBase upgrade in Pinterest
hbaseconasia2017: HBase在Hulu的使用和实践
hbaseconasia2017: 基于HBase的企业级大数据平台
hbaseconasia2017: HBase at JD.com
hbaseconasia2017: Large scale data near-line loading method and architecture
hbaseconasia2017: Ecosystems with HBase and CloudTable service at Huawei
hbaseconasia2017: HBase Practice At XiaoMi
hbaseconasia2017: hbase-2.0.0
HBaseCon2017 Democratizing HBase
HBaseCon2017 Removable singularity: a story of HBase upgrade in Pinterest
HBaseCon2017 Quanta: Quora's hierarchical counting system on HBase
HBaseCon2017 Transactions in HBase
HBaseCon2017 Highly-Available HBase
HBaseCon2017 gohbase: Pure Go HBase Client
HBaseCon2017 Improving HBase availability in a multi tenant environment
HBaseCon2017 Spark HBase Connector: Feature Rich and Efficient Access to HBas...
HBaseCon2017 Efficient and portable data processing with Apache Beam and HBase
HBaseCon2017 HBase at Xiaomi

Recently uploaded (20)

PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Modernizing your data center with Dell and AMD
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
NewMind AI Monthly Chronicles - July 2025
PPT
Teaching material agriculture food technology
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Electronic commerce courselecture one. Pdf
PDF
Empathic Computing: Creating Shared Understanding
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
cuic standard and advanced reporting.pdf
PDF
KodekX | Application Modernization Development
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
DOCX
The AUB Centre for AI in Media Proposal.docx
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Modernizing your data center with Dell and AMD
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
NewMind AI Monthly Chronicles - July 2025
Teaching material agriculture food technology
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
NewMind AI Weekly Chronicles - August'25 Week I
Electronic commerce courselecture one. Pdf
Empathic Computing: Creating Shared Understanding
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
20250228 LYD VKU AI Blended-Learning.pptx
cuic standard and advanced reporting.pdf
KodekX | Application Modernization Development
Digital-Transformation-Roadmap-for-Companies.pptx
Review of recent advances in non-invasive hemoglobin estimation
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
The AUB Centre for AI in Media Proposal.docx
“AI and Expert System Decision Support & Business Intelligence Systems”

hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程

  • 1. HareQL:快速HBase查詢工具的發展過程 Development of HBase Client and HareQL Mon-Fong Mike Jiang 江孟峰 Kuan-Yu Hubert Fan-Chiang 范姜冠宇 Tienyu Rebecca Lin 林恬伃
  • 2.  Providing IT solutions • System development for big data solutions • Smart manufacturing related services • Financial data systems • Telecommunication data systems • We are the Cloudera certificated professional services team 2 About Us  Big data Product since 2011 •Hare Data platform •2013 HSP Innovative Product Award •2014 Golden Award of the TOP 100 Innovative Products •Cloudera Certified Technology (Only one in Taiwan)
  • 3. What is Hare It’s a NoSQL Database which is based on HBase Support SQL to HBase directly Provide DBMS-like Web UI Provide JDBC/ODBC and Restful Service
  • 4. Why Hare ? Easy • Click and start to use it • Friendly user interface • To involve your big data rapidly Comfortable • SQL language supported • Data type management • Multi-Cluster in one client Faster • Quick access to the data in HBase • Powerful query engine for better performance Compatible • Based on the Hadoop/HBase System • Highly compatible in ecosystem
  • 5. Features  Easy use (Web UI)  Easy install  Friendly UI  One Client ; Many Clusters (Connection Manager)  Bulkload UI  Meta Manager (Schema Manager)  Relation between HBase Table and Hive Table  HareQL (High Speed SQL Query in HBase)  JDBC Driver  ODBC Driver (not support sentry)  Restful Services
  • 6. Software Stack Hadoop HBaseHive Spark Solr Cloud Security KerberosSentry Indexing Hare Core HBase Client Restful Service JDBC ODBC (No Sentry) Spark Monitor HDFS Client
  • 8. HareQL Hive: MapReduce We replace MapReduces in Hive to HBase coprocessors. We call the language “HareQL”. HareQL has some advantages as below.  Low- latency  Query HBase table directly  High performance
  • 10. HareQL Architecture Hive Parser make us support HiveQL
  • 11. Final Reducer Hive Flow Parser Semantic Analyzer Logical Plan Gen. Logical Optimizer Physical Plan Gen. Physical Optimizer AST QB OP Tree OP Tree Task Tree Task Tree Advance Parser HareSemantic Analyzer Hare Coprocessor Execution QB Execution Plan Task Plan Map Result Hare Flow Result From Hive to Hare
  • 12. Metadata  As we know, anything that can be converted to an array of bytes can be stored in HBase. However, we have to convert the data back correctly, or we can’t recognize the data.  We integrated meta-store of Hive to HBase Client. We call the data type of HBase column “Meta data”.
  • 13. When to get Metadata ? Advance Parser HareSemanticAnalyzer Hare Coprocessor QB Task Plan Hive Meta Hare Meta
  • 14. Hare Restful Service Table manipulation Row manipulation Bulkload data Sending SQL Scanning Metadata manipulation
  • 18. Schema Design in HBase • Designed the row key according to the access request 18
  • 19. Application – Yieldata - Easy Selection - Smart Filter - Clear View of the Dataset
  • 20. Yieldata – Root Cause Ranking
  • 21. Thank you • is-land Systems Inc. • Company:www.is-land.com.tw • Big Data:www.HareDB.com • Email:service@haredb.com • Addr : 新竹科學園區展業二路4號3樓