The document discusses IBM's enterprise graph analytics platform based on distributed graph databases and distributed graph computing. It focuses on TitanDB, an open-source distributed graph database that can store and query billions of vertices and edges across a cluster. TitanDB uses HBase for storage, Solr for indexing, and integrates with Spark via TinkerPop's SparkGraphComputer for distributed graph analytics. The platform is used to build enterprise data quality analytics systems that leverage graph representations of metadata to enable features like lineage tracing, data quality checks, and audit recommendations.
Related topics: