This document describes HopsWorks, a multi-tenant Hadoop-as-a-Service platform that is free to use at the SICS ICE research facility in Luleå, Sweden. Key points:
- SICS ICE is the world's first open data center dedicated to big data research, with resources such as Hadoop, Spark, and Flink available as a service.
- HopsWorks provides true multi-tenancy for Hadoop by assigning project-specific user IDs and group IDs, which isolate each project's data and enforce access controls.
- Metadata is kept consistent in a distributed database, where foreign keys preserve integrity when projects or data sets are modified or deleted.
- A