SlideShare a Scribd company logo
MapR NFS
  7/6/2012

© 2012 MapR Technologies   Storage Service Architecture 1
MapR NFS
   Agenda
   • Before NFS
   • Exercise: Using hadoop fs
   • MapR NFS Overview
   • NFS Mounting Models
   • Exercise: Mount the Cluster to Itself and Copy a File




© 2012 MapR Technologies   Storage Service Architecture 2
MapR NFS
   Objectives
   At the end of this module you will be able to:
   • Describe the tools that are used with HDFS and their limitations
   • List the features and benefits of MapR Direct Access NFS
   • Identify several use cases for NFS on Hadoop
   • Describe the different models for mounting NFS volumes with MapR




© 2012 MapR Technologies   Storage Service Architecture 3
Before NFS




© 2012 MapR Technologies   Storage Service Architecture 4
Before NFS
     Getting work done without MapR NFS
      –   Example: mining web logs for meaningful data

          •   First: get the logs into the cluster
              – Do you stop writing to the logs?
              – How long does it take?
              – What do you do in the meantime?




                            hadoop fs -put, copyfromlocal

                            distcp

                            Flume




© 2012 MapR Technologies             Storage Service Architecture 5
Before NFS
     Getting work done without MapR NFS
      –   Example: mining web logs for meaningful data

          •   Next: getting answers from the data
              – What tools do you use to access data in your cluster?
              – Who is using these tools?




                         hadoop fs

                         copytolocal




© 2012 MapR Technologies             Storage Service Architecture 6
Exercise:
                              hadoop fs



© 2012 MapR Technologies   Storage Service Architecture 7
MapR NFS Overview




© 2012 MapR Technologies   Storage Service Architecture 8
Introducing Direct Access NFS
                                           Drag and drop
                                           Open files and apps
                                           Watch live data come in
                                           Fully read/write
                                           Multiple concurrent reads/writes
                                           Easy to get data in/out
                                           Same namespace paths as Hadoop
                                         Mount with
                                         -o tcp,ver=3,nolock




© 2012 MapR Technologies   Storage Service Architecture 9
Getting Data into the Cluster with NFS
     Simply mount your cluster filesystem locally
      –   /mapr/my.cluster.com/myvolume/…
     Redirect (or copy) logs to cluster filesystem




                             log data




© 2012 MapR Technologies   Storage Service Architecture 10
Using NFS with Clients
     Mount your cluster filesystem locally
      –   /mapr/my.cluster.com/myvolume/…
     Access all of the data in your cluster with standard applications
      –   Double-click a file (spreadsheet, report, etc.)
          •   edit, save, etc.
      –   Run a perl script
      –   grep, sed, awk, shell scripts, etc.
     Use Hadoop apps (Hive, Pig, Hbase, etc.)




© 2012 MapR Technologies         Storage Service Architecture 11
Using NFS with Clients
     Data is constantly changing
     How do I view the data at a point in time?
      –   Easy: take a snapshot
      –   cd to the snapshot volume
      –   View the data of interest
      –   Note: overhead for taking a snapshot: none to minimal
          •   The only write overhead is the data that changed
              –   Recall snapshot architecture




© 2012 MapR Technologies              Storage Service Architecture 12
Using NFS with Legacy Applications
     What if I have legacy applications that work with my data?
      –   Log analysis, reporting tools, etc.
     Mount the cluster where your tools run
      –   Take a snapshot if necessary
      –   Then use your app just as before




© 2012 MapR Technologies        Storage Service Architecture 13
NFS Mounting Models




© 2012 MapR Technologies   Storage Service Architecture 14
NFS Mounting Models

     Export to the world
      –   NFS gateway runs on selected gateway hosts
     Local server
      –   NFS server runs on local host
      –   Enables local compression and check summing
     Export to self
      –   NFS gateway runs on all data nodes, mounted from localhost




© 2012 MapR Technologies      Storage Service Architecture 15
Export to the World



                                        NFS
                                          NFS
                                       Server
                                           NFS
                                        Server
                                             NFS
                                          Server
                 NFS                       Server
                Client




© 2012 MapR Technologies    Storage Service Architecture 16
Local Server


                             Application

                                     NFS
                                    Server
                           Client




                                                               Cluster
                                                               Nodes



© 2012 MapR Technologies     Storage Service Architecture 17
Universal Export to Self


                                                               Cluster Nodes




                             Application

                                    NFS
                           Cluster Server
                           Node




© 2012 MapR Technologies         Storage Service Architecture 18
Nodes are Identical

                  Application
                                                                     Application
                       NFS
                                                                         NFS
              Cluster Server
              Node                                              Cluster Server
                                                                Node



                             Application

                                    NFS
                           Cluster Server
                           Node



© 2012 MapR Technologies           Storage Service Architecture 19
Exercise:
                       Mount the Cluster to
                       Itself and Copy a File


© 2012 MapR Technologies   Storage Service Architecture 20
Questions




© 2012 MapR Technologies   Storage Service Architecture 21

More Related Content

PPTX
58a migration
PPTX
80a disaster recovery
PPTX
55a remote cluster
PPTX
50a volumes
PPTX
70a monitoring & troubleshooting
PPTX
30a accessing your cluster
PDF
Hands on MapR -- Viadea
PPTX
13c planning
58a migration
80a disaster recovery
55a remote cluster
50a volumes
70a monitoring & troubleshooting
30a accessing your cluster
Hands on MapR -- Viadea
13c planning

What's hot (20)

PPTX
20a installation
PDF
MapR Tutorial Series
PDF
Hadoop Internals
PPTX
Introduction to Yarn
PPTX
NYC Hadoop Meetup - MapR, Architecture, Philosophy and Applications
PDF
Autonomous control in Big Data platforms: and experience with Cassandra
PDF
What's New and Upcoming in HDFS - the Hadoop Distributed File System
PDF
How to Increase Performance of Your Hadoop Cluster
PPTX
PPTX
Anatomy of Hadoop YARN
PPTX
MapReduce Container ReUse
PPT
Advanced Hadoop Tuning and Optimization
PDF
Hadoop Cluster With High Availability
PPTX
Ambari Meetup: NameNode HA
PPTX
Nn ha hadoop world.final
PDF
MapReduce and Hadoop
PPTX
Hadoop fault-tolerance
PPTX
HBaseCon 2015: DeathStar - Easy, Dynamic, Multi-tenant HBase via YARN
PPTX
Spark tunning in Apache Kylin
20a installation
MapR Tutorial Series
Hadoop Internals
Introduction to Yarn
NYC Hadoop Meetup - MapR, Architecture, Philosophy and Applications
Autonomous control in Big Data platforms: and experience with Cassandra
What's New and Upcoming in HDFS - the Hadoop Distributed File System
How to Increase Performance of Your Hadoop Cluster
Anatomy of Hadoop YARN
MapReduce Container ReUse
Advanced Hadoop Tuning and Optimization
Hadoop Cluster With High Availability
Ambari Meetup: NameNode HA
Nn ha hadoop world.final
MapReduce and Hadoop
Hadoop fault-tolerance
HBaseCon 2015: DeathStar - Easy, Dynamic, Multi-tenant HBase via YARN
Spark tunning in Apache Kylin
Ad

Similar to 52 nfs (20)

PPTX
Ted Dunning - Whither Hadoop
PPTX
July 2012 HUG: Using Standard File-Based Applications and SQL-Based Tools wit...
PPTX
NFS and ODBC
PDF
Design, Scale and Performance of MapR's Distribution for Hadoop
PDF
Google Compute and MapR
PPTX
MapR, Implications for Integration
PPTX
Upper layer protocol
PDF
HUG slides on NFS and ODBC
PPTX
12a architecture
PPTX
HBase with MapR
PDF
SAM SIG: Hadoop architecture, MapReduce patterns, and best practices with Cas...
PDF
NetApp Unified Scale-Out/Clustered Storage
PPT
PPTX
Oct 2012 HUG: Hadoop .Next (0.23) - Customer Impact and Deployment
PDF
Building Scale Free Applications with Hadoop and Cascading
PDF
Panasas pNFS Status (September 2010)
PDF
Intro to GlusterFS Webinar - August 2011
PPTX
Seattle Scalability Meetup - Ted Dunning - MapR
PPTX
Apache Hadoop
PPTX
Introduction to Data Storage and Cloud Computing
Ted Dunning - Whither Hadoop
July 2012 HUG: Using Standard File-Based Applications and SQL-Based Tools wit...
NFS and ODBC
Design, Scale and Performance of MapR's Distribution for Hadoop
Google Compute and MapR
MapR, Implications for Integration
Upper layer protocol
HUG slides on NFS and ODBC
12a architecture
HBase with MapR
SAM SIG: Hadoop architecture, MapReduce patterns, and best practices with Cas...
NetApp Unified Scale-Out/Clustered Storage
Oct 2012 HUG: Hadoop .Next (0.23) - Customer Impact and Deployment
Building Scale Free Applications with Hadoop and Cascading
Panasas pNFS Status (September 2010)
Intro to GlusterFS Webinar - August 2011
Seattle Scalability Meetup - Ted Dunning - MapR
Apache Hadoop
Introduction to Data Storage and Cloud Computing
Ad

More from mapr-academy (9)

DOCX
53 lab-nfs
DOCX
51 lab-volumes
PPTX
48a tuning
DOCX
42 lab-managing services
PPTX
41a managing services
PPTX
22 configuration
DOCX
14 lab-planing
PPTX
10c introduction
DOCX
3 map r installation & setup administration course description
53 lab-nfs
51 lab-volumes
48a tuning
42 lab-managing services
41a managing services
22 configuration
14 lab-planing
10c introduction
3 map r installation & setup administration course description

Recently uploaded (20)

PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
KodekX | Application Modernization Development
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
Big Data Technologies - Introduction.pptx
PDF
Empathic Computing: Creating Shared Understanding
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
cuic standard and advanced reporting.pdf
PDF
Encapsulation theory and applications.pdf
PDF
Approach and Philosophy of On baking technology
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPT
Teaching material agriculture food technology
Agricultural_Statistics_at_a_Glance_2022_0.pdf
KodekX | Application Modernization Development
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Diabetes mellitus diagnosis method based random forest with bat algorithm
Dropbox Q2 2025 Financial Results & Investor Presentation
Advanced methodologies resolving dimensionality complications for autism neur...
Digital-Transformation-Roadmap-for-Companies.pptx
The Rise and Fall of 3GPP – Time for a Sabbatical?
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Big Data Technologies - Introduction.pptx
Empathic Computing: Creating Shared Understanding
NewMind AI Monthly Chronicles - July 2025
CIFDAQ's Market Insight: SEC Turns Pro Crypto
cuic standard and advanced reporting.pdf
Encapsulation theory and applications.pdf
Approach and Philosophy of On baking technology
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
MYSQL Presentation for SQL database connectivity
Reach Out and Touch Someone: Haptics and Empathic Computing
Teaching material agriculture food technology

52 nfs

  • 1. MapR NFS 7/6/2012 © 2012 MapR Technologies Storage Service Architecture 1
  • 2. MapR NFS Agenda • Before NFS • Exercise: Using hadoop fs • MapR NFS Overview • NFS Mounting Models • Exercise: Mount the Cluster to Itself and Copy a File © 2012 MapR Technologies Storage Service Architecture 2
  • 3. MapR NFS Objectives At the end of this module you will be able to: • Describe the tools that are used with HDFS and their limitations • List the features and benefits of MapR Direct Access NFS • Identify several use cases for NFS on Hadoop • Describe the different models for mounting NFS volumes with MapR © 2012 MapR Technologies Storage Service Architecture 3
  • 4. Before NFS © 2012 MapR Technologies Storage Service Architecture 4
  • 5. Before NFS  Getting work done without MapR NFS – Example: mining web logs for meaningful data • First: get the logs into the cluster – Do you stop writing to the logs? – How long does it take? – What do you do in the meantime? hadoop fs -put, copyfromlocal distcp Flume © 2012 MapR Technologies Storage Service Architecture 5
  • 6. Before NFS  Getting work done without MapR NFS – Example: mining web logs for meaningful data • Next: getting answers from the data – What tools do you use to access data in your cluster? – Who is using these tools? hadoop fs copytolocal © 2012 MapR Technologies Storage Service Architecture 6
  • 7. Exercise: hadoop fs © 2012 MapR Technologies Storage Service Architecture 7
  • 8. MapR NFS Overview © 2012 MapR Technologies Storage Service Architecture 8
  • 9. Introducing Direct Access NFS  Drag and drop  Open files and apps  Watch live data come in  Fully read/write  Multiple concurrent reads/writes  Easy to get data in/out  Same namespace paths as Hadoop  Mount with -o tcp,ver=3,nolock © 2012 MapR Technologies Storage Service Architecture 9
  • 10. Getting Data into the Cluster with NFS  Simply mount your cluster filesystem locally – /mapr/my.cluster.com/myvolume/…  Redirect (or copy) logs to cluster filesystem log data © 2012 MapR Technologies Storage Service Architecture 10
  • 11. Using NFS with Clients  Mount your cluster filesystem locally – /mapr/my.cluster.com/myvolume/…  Access all of the data in your cluster with standard applications – Double-click a file (spreadsheet, report, etc.) • edit, save, etc. – Run a perl script – grep, sed, awk, shell scripts, etc.  Use Hadoop apps (Hive, Pig, Hbase, etc.) © 2012 MapR Technologies Storage Service Architecture 11
  • 12. Using NFS with Clients  Data is constantly changing  How do I view the data at a point in time? – Easy: take a snapshot – cd to the snapshot volume – View the data of interest – Note: overhead for taking a snapshot: none to minimal • The only write overhead is the data that changed – Recall snapshot architecture © 2012 MapR Technologies Storage Service Architecture 12
  • 13. Using NFS with Legacy Applications  What if I have legacy applications that work with my data? – Log analysis, reporting tools, etc.  Mount the cluster where your tools run – Take a snapshot if necessary – Then use your app just as before © 2012 MapR Technologies Storage Service Architecture 13
  • 14. NFS Mounting Models © 2012 MapR Technologies Storage Service Architecture 14
  • 15. NFS Mounting Models  Export to the world – NFS gateway runs on selected gateway hosts  Local server – NFS server runs on local host – Enables local compression and check summing  Export to self – NFS gateway runs on all data nodes, mounted from localhost © 2012 MapR Technologies Storage Service Architecture 15
  • 16. Export to the World NFS NFS Server NFS Server NFS Server NFS Server Client © 2012 MapR Technologies Storage Service Architecture 16
  • 17. Local Server Application NFS Server Client Cluster Nodes © 2012 MapR Technologies Storage Service Architecture 17
  • 18. Universal Export to Self Cluster Nodes Application NFS Cluster Server Node © 2012 MapR Technologies Storage Service Architecture 18
  • 19. Nodes are Identical Application Application NFS NFS Cluster Server Node Cluster Server Node Application NFS Cluster Server Node © 2012 MapR Technologies Storage Service Architecture 19
  • 20. Exercise: Mount the Cluster to Itself and Copy a File © 2012 MapR Technologies Storage Service Architecture 20
  • 21. Questions © 2012 MapR Technologies Storage Service Architecture 21