Open main menu

CDOT Wiki β

Changes

GPU621/Apache Spark

100 bytes added, 08:37, 30 November 2020
m
Setup
Once you are registered create the data cluster of master and slave nodes
Go to the Menu -> Big Data -> DataProc -> Clusters
Go to '''Menu -> Big Data -> DataProc -> Clusters''' [[File:Googlecloud-setup-96b.jpg]]
We will create 5 worker nodes and 1 master node using the N1 series General-Purpose machine with 4vCPU and 15 GB memory and a disk size of 32-50 GB for all nodes.
You can see the cost of your machine configuration per hour. Using machines with more memory, computing power, etc will cost more per hourly use.
[[File:Googlecloud-setup-6b.jpg]]
[[File:Googlecloud-setup-9.jpg]]
To view the individual nodes in the cluster go to '''Menu -> Virtual Machines -> VM Instances'''
[[File:Googlecloud-setup-11b.jpg]]
76
edits