SlideShare a Scribd company logo
Hadoop
• id:sasata299
•
• Ruby Perl
•
Hadoop
816
30         3   1
(   )
Hadoopを業務で使ってみました
•
• GROUP BY        (
        (   Д`)

•                     7000   (
    )
Hadoopを業務で使ってみました
Hadoop
Hadoop

• Hadoop Streaming
•           Ruby

• Amazon EC2 Hadoop
•            50
Hadoop Streaming
•
         (
    )

•       Mapper Reducer
Hadoopを業務で使ってみました
HDFS
Mapper, Reducer
Java ( or JRuby )
     Java API
Hadoop Streaming


       …orz
Hadoopを業務で使ってみました
Hadoop                          cat



`hadoop dfs -cat s3://xxxx/user/root/in/hoge`
           HDFS
7000   (   )→
7000   (   )→

30
Hadoop   !!
• Hadoop Streaming     HDFS
                     (Hadoop
    cat          )

•                       7000     30
                        Hadoop
Hadoopを業務で使ってみました

More Related Content

KEY
800万人の"食べたい"をHadoopで分散処理
KEY
マーケティングのためのHadoop利用
KEY
961万人の食卓を支えるデータ解析
PPTX
Big Data in the Microsoft Platform
PPTX
Cloud Friendly Hadoop and Hive
PPTX
Intro to cassandra + hadoop
PDF
Yahoo! - Arun Murthy - Hadoop World 2010
800万人の"食べたい"をHadoopで分散処理
マーケティングのためのHadoop利用
961万人の食卓を支えるデータ解析
Big Data in the Microsoft Platform
Cloud Friendly Hadoop and Hive
Intro to cassandra + hadoop
Yahoo! - Arun Murthy - Hadoop World 2010

What's hot (19)

PPTX
Qubole Overview at the Fifth Elephant Conference
PPTX
Cassandra/Hadoop Integration
PPTX
Cloud Optimized Big Data
PDF
PySpark Cassandra - Amsterdam Spark Meetup
PPTX
Hadoop big data online training
PDF
Hadoop 101 - Big Data Technology
PDF
Flickr: Computer vision at scale with Hadoop and Storm (Huy Nguyen)
PDF
Hadoop: Past, Present and Future - v2.1 - SQLSaturday #340
PDF
Introduction to the Hadoop Ecosystem (codemotion Edition)
PDF
Introduction to Apache Hivemall v0.5.2 and v0.6
PPTX
Productive data engineer
KEY
Hive vs Pig for HadoopSourceCodeReading
PPTX
Hadoop introduction
PPTX
Drill at the Chug 9-19-12
PPT
Hadoop basics
PDF
Cassandra + Spark (You’ve got the lighter, let’s start a fire)
PPTX
Hadoop training
PDF
Introduction to pig & pig latin
PDF
Map reduce and hadoop at mylife
Qubole Overview at the Fifth Elephant Conference
Cassandra/Hadoop Integration
Cloud Optimized Big Data
PySpark Cassandra - Amsterdam Spark Meetup
Hadoop big data online training
Hadoop 101 - Big Data Technology
Flickr: Computer vision at scale with Hadoop and Storm (Huy Nguyen)
Hadoop: Past, Present and Future - v2.1 - SQLSaturday #340
Introduction to the Hadoop Ecosystem (codemotion Edition)
Introduction to Apache Hivemall v0.5.2 and v0.6
Productive data engineer
Hive vs Pig for HadoopSourceCodeReading
Hadoop introduction
Drill at the Chug 9-19-12
Hadoop basics
Cassandra + Spark (You’ve got the lighter, let’s start a fire)
Hadoop training
Introduction to pig & pig latin
Map reduce and hadoop at mylife
Ad

More from Tatsuya Sasaki (10)

KEY
からあげエンジニアについて
KEY
クックパッドでのemr利用事例
KEY
からあげとビーチと私
KEY
メタプログラミングでDSLを書こう
PDF
NoSQLデータベースが登場した背景と特徴
KEY
Hadoopをemr経由で利用する方法
KEY
COOKPADでのHadoop利用
KEY
Hadoop導入事例 in クックパッド
KEY
Hadoopを業務で使ってみた
からあげエンジニアについて
クックパッドでのemr利用事例
からあげとビーチと私
メタプログラミングでDSLを書こう
NoSQLデータベースが登場した背景と特徴
Hadoopをemr経由で利用する方法
COOKPADでのHadoop利用
Hadoop導入事例 in クックパッド
Hadoopを業務で使ってみた
Ad

Recently uploaded (20)

PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Modernizing your data center with Dell and AMD
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
Big Data Technologies - Introduction.pptx
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
GamePlan Trading System Review: Professional Trader's Honest Take
PDF
Advanced Soft Computing BINUS July 2025.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Advanced IT Governance
PDF
Machine learning based COVID-19 study performance prediction
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
Understanding_Digital_Forensics_Presentation.pptx
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Modernizing your data center with Dell and AMD
Dropbox Q2 2025 Financial Results & Investor Presentation
Chapter 3 Spatial Domain Image Processing.pdf
Big Data Technologies - Introduction.pptx
NewMind AI Weekly Chronicles - August'25 Week I
NewMind AI Monthly Chronicles - July 2025
GamePlan Trading System Review: Professional Trader's Honest Take
Advanced Soft Computing BINUS July 2025.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Advanced methodologies resolving dimensionality complications for autism neur...
Network Security Unit 5.pdf for BCA BBA.
Review of recent advances in non-invasive hemoglobin estimation
Spectral efficient network and resource selection model in 5G networks
Advanced IT Governance
Machine learning based COVID-19 study performance prediction
20250228 LYD VKU AI Blended-Learning.pptx

Hadoopを業務で使ってみました