Currently, the core class for BMMs is called `JointDistribution`, which may cause confusion with the TensorFlow Probability convention.