Running big data components on Kubernetes
- Kubernetes cluster v1.18+
- kubectl on Linux, configured with the cluster-admin role
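
The two prerequisites above can be checked before installing. The following is only a minimal sketch, not part of this repository: the function names `version_at_least` and `check_prereqs` are made up for illustration, and it assumes that `kubectl get --raw /version` returns the server's `gitVersion` and that being allowed every verb on every resource is a good enough stand-in for the cluster-admin role.

```shell
# Succeeds if version $1 (e.g. "v1.21.3") is at least major.minor $2 (e.g. "1.18").
# Runs in a subshell so its helper variables do not leak into the caller.
version_at_least() (
  have="${1#v}"                          # drop a leading "v"
  have_major="${have%%.*}"
  rest="${have#*.}"
  have_minor="${rest%%.*}"
  have_minor="${have_minor%%[!0-9]*}"    # drop suffixes such as "+"
  want_major="${2%%.*}"
  want_minor="${2#*.}"
  [ "$have_major" -gt "$want_major" ] && exit 0
  [ "$have_major" -eq "$want_major" ] && [ "$have_minor" -ge "$want_minor" ]
)

# Hypothetical helper: checks the server version and the permissions of the
# current kubectl context. Needs a reachable cluster.
check_prereqs() {
  server_version="$(kubectl get --raw /version |
    sed -n 's/.*"gitVersion": *"\([^"]*\)".*/\1/p')"
  if ! version_at_least "$server_version" 1.18; then
    echo "cluster version '$server_version' is older than v1.18" >&2
    return 1
  fi
  if ! kubectl auth can-i '*' '*' --all-namespaces >/dev/null; then
    echo "current kubectl context does not have cluster-admin rights" >&2
    return 1
  fi
  echo "prerequisites look fine (server $server_version)"
}
```

After sourcing the snippet, `check_prereqs` can be run once against the target cluster; it prints the reason and returns non-zero if either check fails.
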
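- Plan which node runs which services, e.g.:

  namespace: bigdata

  | node | services |
  |---|---|
  | node1 | nn1 rm1 dn nm |
  | node2 | nn2 rm2 dn nm |
  | node3 | zk dn nm kafka |
  | node4 | zk dn nm kafka |
  | node5 | zk dn nm kafka |

  (nn = namenode, rm = resourcemanager, dn = datanode, nm = nodemanager, zk = zookeeper)

- Modify the configuration file `config.sh` to suit your plan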
- Run `./installl.sh`

| service | ports |
|---|---|
| zookeeper | 2181 |
| journalnode | 8485 8480 |
| namenode | 8020 9870 |
| resourcemanager | 8088 |
| historyserver | 10020 19888 |
| hive | 9083 10000 10002 |
| spark | 12222 10000 |
| trino | 8082 |
| clickhouse | 8123 |
| kafka | 9092 |
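
Once the services are up, the port table above can be used for a quick reachability check. This is a rough sketch under stated assumptions: `ports_for` and `check_service` are hypothetical helpers, not part of this repository; the host name passed in is whatever address exposes the service in your setup; and the `nc` on your machine is assumed to support `-z` and `-w`.

```shell
# Ports per service, copied from the table above.
ports_for() {
  case "$1" in
    zookeeper)       echo "2181" ;;
    journalnode)     echo "8485 8480" ;;
    namenode)        echo "8020 9870" ;;
    resourcemanager) echo "8088" ;;
    historyserver)   echo "10020 19888" ;;
    hive)            echo "9083 10000 10002" ;;
    spark)           echo "12222 10000" ;;
    trino)           echo "8082" ;;
    clickhouse)      echo "8123" ;;
    kafka)           echo "9092" ;;
    *)               echo "unknown service: $1" >&2; return 1 ;;
  esac
}

# Probe every port of one service on one host.
# Returns non-zero if any port is closed or the service is unknown.
check_service() {
  host="$1"
  service="$2"
  ports="$(ports_for "$service")" || return 1
  status=0
  for port in $ports; do
    if nc -z -w 2 "$host" "$port" 2>/dev/null; then
      echo "$service $host:$port open"
    else
      echo "$service $host:$port closed"
      status=1
    fi
  done
  return "$status"
}
```

For example, `check_service node1 namenode` would probe ports 8020 and 9870 on `node1`, assuming those ports are reachable from the machine where the check runs.
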
- NameNode failover is not implemented