Description
Kia ora!
Thanks for the series. I finally got things working and it feels fantastic. I managed to get the whole thing going, so I thought I'd leave some tips in case anyone hits the struggles I did.
M1 Macbooks/Mac Mini
Anyone running on a new MacBook or Mac Mini (with the M1 chip) will find the published TensorFlow packages don't support M1 yet. Official support is reportedly very close (roughly a week away), but as a temporary workaround you can use:
npm install https://storage.googleapis.com/tfjs-testing/tensorflow-tfjs-node-0.0.0.tgz
(see tensorflow/tfjs#4514)
Only one encoded variable
I managed to get a 28x28 autoencoder reduced down to a single encoded value, expanding back out with no noticeable loss.
I couldn't do this at larger sizes such as 64x64, which only worked well down to four variables.
(A single value makes sense here because there is only one variable changing: the single x+y dimension of the square.)
Splitting the autoencoder
You can access the individual layers with autoencoder.layers, so I made a little helper function that splits the autoencoder by looking for the first increase in the number of units/nodes (i.e. the bottleneck).
export default function splitModel(autoencoder) {
  const index = autoencoder.layers.findIndex((layer, i) => {
    return layer.units > (autoencoder.layers[i - 1]?.units || Infinity);
  });
  return {
    encoderLayers: autoencoder.layers.slice(0, index),
    decoderLayers: autoencoder.layers.slice(index),
  };
}
BUT you can't just loop over these layers adding them to another sequential model: the layers don't carry their weights and the rest of their state across. I solved this with another helper function that creates a new dense layer with the same setup and then manually copies the weights.
(It is also important that you add the new layer to the model BEFORE setting the weights; until the layer is added and built it has no weights to set.)
function createFromLayers(layers) {
  const newModel = tf.sequential();
  layers.forEach((layer) => {
    // recreate the layer with the same size, activation, and input shape
    const newLayer = tf.layers.dense({
      units: layer.units,
      activation: layer.activation,
      inputShape: [layer.kernel.shape[0]],
    });
    // add BEFORE setting weights so the layer is built
    newModel.add(newLayer);
    newLayer.setWeights(layer.getWeights());
  });
  newModel.compile({
    optimizer: "adam",
    loss: "binaryCrossentropy",
    metrics: ["accuracy"],
  });
  return newModel;
}
Feeding random values
To generate new outputs, feed the decoder random values sized to match its input layer:
const numberOfDecoderInputs = decoderLayers[0].kernel.shape[0];
const x_input = tf.randomUniform([1, numberOfDecoderInputs], 0, 1); // min: 0, max: 1
Make sure the decoder is fed values in the 0-1 range
I found that my encoder half was returning values outside of the 0-1 range when using relu, so I swapped to sigmoid on the final encoding layer:
// ...encoding layers
// these two are the final encoding layers
autoencoder.add(
  tf.layers.dense({
    units: 128,
    activation: "relu",
  })
);
autoencoder.add(
  tf.layers.dense({
    units: 1,
    activation: "sigmoid",
    // sigmoid squeezes the output into the 0-1 range for feeding the decoder after splitting.
    // there are probably better solutions for this, but it worked for me!
  })
);
// ...decoding layers
Generating a gif
I found a really interesting behaviour by creating a gif of the full single-dimension input range (I didn't want to touch the browser, so I generated images and converted them to a gif).
The squares didn't simply start small at an input of 0 and grow in size. Instead the network learned a strange split: over 0-0.5 the squares grew as expected, then at 0.5 they jumped to the largest size and shrank from 0.5-1.
Generating the images:
new Array(1000).fill(0).forEach(async (x, i, y) => {
  const input = [[i / y.length]]; // each image is fed a value evenly distributed from 0-1
  const decoderOutput = await decoder.predict(tf.tensor(input)).array();
  await generateImage(decoderOutput[0], `./test/decoder_${i}.png`, width, height);
});
Generating a gif from the .png images in the current directory (e.g. run from inside the /test folder; note the scale value controls the resolution and can upscale if desired):
ffmpeg -framerate 60 -pattern_type glob -i '*.png' -r 15 -vf scale=28:-1 out.gif
Finally, the generateImage helper I use:
import { promises as fsp } from "fs";
import Canvas from "canvas";

export default async function generateImage(data, location, width, height) {
  const canvas = Canvas.createCanvas(width, height);
  const ctx = canvas.getContext("2d");
  const buffer = [];
  for (let n = 0; n < data.length; n++) {
    const val = Math.floor(data[n] * 255);
    buffer[n * 4 + 0] = val;
    buffer[n * 4 + 1] = val;
    buffer[n * 4 + 2] = val;
    buffer[n * 4 + 3] = 255;
  }
  const imageData = new Canvas.ImageData(new Uint8ClampedArray(buffer), width, height);
  ctx.putImageData(imageData, 0, 0);
  await fsp.writeFile(location, canvas.toBuffer());
}