
densenet not training when using tf.contrib.layers.recompute_grad #21

@Sirius083

I want to implement a memory-efficient DenseNet, following the code in
https://github.com/joeyearsley/efficient_densenet_tensorflow/blob/master/models/densenet_creator.py, but training gets stuck at the first epoch.
I have only changed the `add_layer` part:

    def add_layer(l):

        def _add_layer(l):
            shape = l.get_shape().as_list()
            in_channel = shape[3]
            with tf.variable_scope(name) as scope:
                c = BatchNorm('bn1', l)
                c = tf.nn.relu(c)
                c = conv('conv1', c, self.growthRate, 1)
                l = tf.concat([c, l], 3)
            return l

        if self.efficient:
            _add_layer = tf.contrib.layers.recompute_grad(_add_layer)

        return _add_layer(l)

I also added a keyword argument `efficient` to specify whether to use the memory-efficient version.
However, the training process still gets stuck.
Using TensorFlow 1.9 and tensorpack 0.9.1.
Do I need to change other parts of tensorpack?
Thanks in advance.
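For context, the idea behind `tf.contrib.layers.recompute_grad` is gradient checkpointing: intermediate activations are discarded after the forward pass and recomputed during backprop, trading extra compute for lower memory. Because the wrapped function is executed again on the backward pass, any variables it creates must be reachable and reusable on the second call, which is a common source of hangs with per-call wrapping. A minimal pure-Python sketch of the trade-off itself (hypothetical helper names, no TensorFlow involved):

```python
# Illustrates the memory/compute trade-off behind gradient checkpointing
# (the technique tf.contrib.layers.recompute_grad implements).
# All names here are hypothetical; layers are scalar functions for simplicity.

def forward(x, layers):
    """Plain forward pass: keeps every intermediate activation."""
    acts = [x]
    for f, _ in layers:
        acts.append(f(acts[-1]))
    return acts

def backward_checkpointed(x, layers, grad_out):
    """Backward pass that stores no activations up front: each layer's
    input is recomputed from x on demand (extra compute, less memory)."""
    grad = grad_out
    for i in reversed(range(len(layers))):
        a = x
        for f, _ in layers[:i]:   # recompute the input to layer i
            a = f(a)
        _, df = layers[i]
        grad = df(a) * grad       # chain rule for scalar-valued layers
    return grad

# Example: y = (2x)^2, so dy/dx = 8x.
layers = [
    (lambda a: 2.0 * a, lambda a: 2.0),      # f(a) = 2a,  f'(a) = 2
    (lambda a: a * a,   lambda a: 2.0 * a),  # g(a) = a^2, g'(a) = 2a
]
x = 3.0
y = forward(x, layers)[-1]                       # 36.0
grad = backward_checkpointed(x, layers, 1.0)     # 24.0 == 8 * 3
```

In real frameworks the recomputation happens inside the registered gradient function rather than in user code, but the doubling of forward work per checkpointed block is the same.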
