Skip to content

Problem when training the flow generator #163

@Specom

Description

@Specom

Hey there,
I am currently trying to train the flow generator in my DeepEthogram project (windows, anaconda, installed with pip, cudatoolkit=11.2 cudnn=8.1.0 , and tensorflow-gpu==2.10.0 ):

"(deepethogramg) C:\Users\Sander>python -m deepethogram
[2024-05-22 11:50:20,333] INFO [deepethogram.gui.main.setup_gui_cfg:1268] CWD: C:\Users\Sander
[2024-05-22 11:50:20,336] INFO [deepethogram.gui.main.setup_gui_cfg:1269] Configuration used: split:
reload: true
file: null
train_val_test:

  • 0.8
  • 0.2
  • 0.0
    compute:
    fp16: false
    num_workers: 8
    batch_size: auto
    min_batch_size: 8
    max_batch_size: 512
    distributed: false
    gpu_id: 0
    dali: false
    metrics_workers: 0
    reload:
    overwrite_cfg: false
    latest: false
    notes: null
    log:
    level: info
    run:
    type: gui
    label_view_width: 31
    control_arrow_jump: 31
    vertical_arrow_jump: 3
    cmap: deepethogram
    unlabeled_alpha: 0.1
    prediction_opacity: 0.2

[2024-05-22 11:50:33,212] WARNING [deepethogram.projects.fix_config_paths:1264] Erroneous name of config file in the config file itself, changing...
[2024-05-22 11:50:33,249] INFO [deepethogram.gui.main.initialize_project:1017] cwd: C:\Users\Sander
[2024-05-22 11:50:33,250] INFO [deepethogram.projects.convert_config_paths_to_absolute:1135] cwd in absolute: C:\Users\Sander
[2024-05-22 11:50:33,251] INFO [deepethogram.projects.convert_config_paths_to_absolute:1178] after absolute: {'class_names': ['background', 'walking', 'ghead', 'gfront', 'gabdomen', 'gback', 'gwing', 'flight', 'pushing'], 'config_file': 'C:\DeepEthogram\test_deepethogram\project_config.yaml', 'data_path': 'C:\DeepEthogram\test_deepethogram\DATA', 'labeler': None, 'model_path': 'C:\DeepEthogram\test_deepethogram\models', 'name': 'test', 'path': 'C:\DeepEthogram\test_deepethogram', 'pretrained_path': 'C:\DeepEthogram\test_deepethogram\models\pretrained_models'}
[2024-05-22 11:50:33,251] INFO [deepethogram.gui.main.initialize_project:1019] cwd: C:\Users\Sander
[2024-05-22 11:50:33,266] INFO [deepethogram.gui.main.initialize_project:1021] loaded project configuration: split:
reload: true
file: null
train_val_test:

  • 0.8
  • 0.2
  • 0.0
    compute:
    fp16: false
    num_workers: 8
    batch_size: 32
    min_batch_size: 8
    max_batch_size: 512
    distributed: false
    gpu_id: 0
    dali: false
    metrics_workers: 0
    reload:
    overwrite_cfg: false
    latest: false
    notes: null
    log:
    level: info
    run:
    type: gui
    model: null
    dir: C:\DeepEthogram\test_deepethogram\gui_logs\240522_115033
    label_view_width: 31
    control_arrow_jump: 31
    vertical_arrow_jump: 3
    cmap: deepethogram
    unlabeled_alpha: 0.1
    prediction_opacity: 0.2
    postprocessor:
    type: min_bout_per_behavior
    min_bout_length: 1
    augs:
    LR: 0.5
    UD: 0.0
    brightness: 0.25
    contrast: 0.1
    crop_size: null
    degrees: 10
    grayscale: 0.5
    hue: 0.1
    normalization:
    'N': 324172800
    mean:
    • 0.26870773349589994
    • 0.26870773349589994
    • 0.26870773349589994
      std:
    • 0.23503533203484067
    • 0.23503533203484067
    • 0.23503533203484067
      pad: null
      random_resize: false
      resize:
  • 224
  • 224
    saturation: 0.1
    project:
    class_names:
  • background
  • walking
  • ghead
  • gfront
  • gabdomen
  • gback
  • gwing
  • flight
  • pushing
    config_file: C:\DeepEthogram\test_deepethogram\project_config.yaml
    data_path: C:\DeepEthogram\test_deepethogram\DATA
    labeler: null
    model_path: C:\DeepEthogram\test_deepethogram\models
    name: test
    path: C:\DeepEthogram\test_deepethogram
    pretrained_path: C:\DeepEthogram\test_deepethogram\models\pretrained_models
    sequence:
    filter_length: 15
    train:
    loss_weight_exp: 1.0

[2024-05-22 11:50:33,268] INFO [deepethogram.gui.main.initialize_project:1022] cwd: C:\DeepEthogram\test_deepethogram\gui_logs\240522_115033
[2024-05-22 11:50:33,281] INFO [deepethogram.gui.main.project_loaded_buttons:175] Number finalized labels: 0
[2024-05-22 11:50:33,588] INFO [deepethogram.gui.main.initialize_video:226] Record for loaded video: {'flow': None, 'label': None, 'output': None, 'rgb': 'C:\DeepEthogram\test_deepethogram\DATA\2024-05-14_A02_005_camera_01\2024-05-14_A02_005_camera_01.mp4', 'keypoint': None, 'key': '2024-05-14_A02_005_camera_01'}
[2024-05-22 11:50:48,936] INFO [deepethogram.gui.main.get_selected_models:1136] {'flow_generator': {'no pretrained weights': None, '200221_115158_TinyMotionNet': 'C:\DeepEthogram\test_deepethogram\models\pretrained_models\200221_115158_TinyMotionNet\checkpoint.pt'}, 'feature_extractor': {'no pretrained weights': None, '200415_125824_hidden_two_stream_kinetics_degf': 'C:\DeepEthogram\test_deepethogram\models\pretrained_models\200415_125824_hidden_two_stream_kinetics_degf\checkpoint.pt'}, 'sequence': {'': None}}
[2024-05-22 11:50:48,936] INFO [deepethogram.gui.main.flow_train:343] flow_train called with args: ['python', '-m', 'deepethogram.flow_generator.train', 'project.path=C:\DeepEthogram\test_deepethogram', 'reload.weights=C:\DeepEthogram\test_deepethogram\models\pretrained_models\200221_115158_TinyMotionNet\checkpoint.pt']
[2024-05-22 11:50:52,987] INFO [deepethogram.projects.convert_config_paths_to_absolute:1135] cwd in absolute: C:\DeepEthogram\test_deepethogram\gui_logs\240522_115033
[2024-05-22 11:50:52,988] INFO [deepethogram.projects.convert_config_paths_to_absolute:1178] after absolute: {'class_names': ['background', 'walking', 'ghead', 'gfront', 'gabdomen', 'gback', 'gwing', 'flight', 'pushing'], 'config_file': 'C:\DeepEthogram\test_deepethogram\project_config.yaml', 'data_path': 'C:\DeepEthogram\test_deepethogram\DATA', 'labeler': None, 'model_path': 'C:\DeepEthogram\test_deepethogram\models', 'name': 'test', 'path': 'C:\DeepEthogram\test_deepethogram', 'pretrained_path': 'C:\DeepEthogram\test_deepethogram\models\pretrained_models'}
[2024-05-22 11:50:53,000] INFO [main.flow_generator_train:54] args: C:\Users\Sander\anaconda3\envs\deepethogramg\lib\site-packages\deepethogram\flow_generator\train.py project.path=C:\DeepEthogram\test_deepethogram reload.weights=C:\DeepEthogram\test_deepethogram\models\pretrained_models\200221_115158_TinyMotionNet\checkpoint.pt
[2024-05-22 11:50:53,000] INFO [main.flow_generator_train:62] configuration used ~~~~~
[2024-05-22 11:50:53,010] INFO [main.flow_generator_train:63] split:
reload: true
file: null
train_val_test:

  • 0.8
  • 0.2
  • 0.0
    compute:
    fp16: false
    num_workers: 8
    batch_size: 32
    min_batch_size: 8
    max_batch_size: 512
    distributed: false
    gpu_id: 0
    dali: false
    metrics_workers: 0
    reload:
    overwrite_cfg: false
    latest: false
    weights: C:\DeepEthogram\test_deepethogram\models\pretrained_models\200221_115158_TinyMotionNet\checkpoint.pt
    notes: null
    log:
    level: info
    augs:
    brightness: 0.25
    contrast: 0.1
    hue: 0.1
    saturation: 0.1
    color_p: 0.5
    grayscale: 0.5
    crop_size: null
    resize:
  • 224
  • 224
    dali: false
    random_resize: false
    pad: null
    LR: 0.5
    UD: 0.0
    degrees: 10
    normalization:
    'N': 324172800
    mean:
    • 0.26870773349589994
    • 0.26870773349589994
    • 0.26870773349589994
      std:
    • 0.23503533203484067
    • 0.23503533203484067
    • 0.23503533203484067
      train:
      lr: 0.0001
      scheduler: plateau
      num_epochs: 10
      steps_per_epoch:
      train: 1000
      val: 200
      test: 20
      min_lr: 5.0e-07
      stopping_type: learning_rate
      milestones:
  • 50
  • 100
  • 150
  • 200
  • 250
  • 300
    weight_loss: true
    patience: 3
    early_stopping_begins: 0
    viz_metrics: true
    viz_examples: 10
    reduction_factor: 0.1
    loss_weight_exp: 1.0
    loss_gamma: 1.0
    label_smoothing: 0.05
    oversampling_exp: 0.0
    regularization:
    style: l2_sp
    alpha: 1.0e-05
    beta: 0.001
    flow_generator:
    type: flow_generator
    flow_loss: MotionNet
    flow_max: 10
    input_images: 11
    flow_sparsity: false
    smooth_weight_multiplier: 1.0
    sparsity_weight: 0.0
    loss: MotionNet
    max: 5
    n_rgb: 11
    arch: TinyMotionNet
    weights: pretrained
    cmap: deepethogram
    control_arrow_jump: 31
    label_view_width: 31
    postprocessor:
    min_bout_length: 1
    type: min_bout_per_behavior
    prediction_opacity: 0.2
    project:
    class_names:
  • background
  • walking
  • ghead
  • gfront
  • gabdomen
  • gback
  • gwing
  • flight
  • pushing
    config_file: C:\DeepEthogram\test_deepethogram\project_config.yaml
    data_path: C:\DeepEthogram\test_deepethogram\DATA
    labeler: null
    model_path: C:\DeepEthogram\test_deepethogram\models
    name: test
    path: C:\DeepEthogram\test_deepethogram
    pretrained_path: C:\DeepEthogram\test_deepethogram\models\pretrained_models
    run:
    type: train
    model: flow_generator
    dir: C:\DeepEthogram\test_deepethogram\models\240522_115052_flow_generator_train
    sequence:
    filter_length: 15
    unlabeled_alpha: 0.1
    vertical_arrow_jump: 3

[2024-05-22 11:50:53,123] INFO [main.flow_generator_train:67] Total trainable params: 1,951,784
[2024-05-22 11:50:53,286] INFO [deepethogram.projects.get_weightfile_from_cfg:1068] loading pretrained weights: C:\DeepEthogram\test_deepethogram\models\pretrained_models\200221_115158_TinyMotionNet\checkpoint.pt
reloading weights...
[2024-05-22 11:50:53,287] INFO [deepethogram.utils.load_state:341] loading from checkpoint file C:\DeepEthogram\test_deepethogram\models\pretrained_models\200221_115158_TinyMotionNet\checkpoint.pt...
[2024-05-22 11:50:53,314] INFO [main.get_metrics:364] key metric is SSIM
[2024-05-22 11:50:53,323] INFO [deepethogram.data.augs.get_gpu_transforms:246] GPU transforms: {'train': Sequential(
(0): ToFloat()
(1): VideoSequential(
(RandomHorizontalFlip_0): RandomHorizontalFlip(p=0.5, p_batch=1.0, same_on_batch=False)
(RandomRotation_1): RandomRotation(degrees=10, p=0.5, p_batch=1.0, same_on_batch=False, resample=bilinear, align_corners=True)
(ColorJitter_2): ColorJitter(brightness=0.25, contrast=0.1, saturation=0.1, hue=0.1, p=0.5, p_batch=1.0, same_on_batch=False)
(RandomGrayscale_3): RandomGrayscale(p=0.5, p_batch=1.0, same_on_batch=False)
)
(2): NormalizeVideo()
(3): StackClipInChannels()
), 'val': Sequential(
(0): ToFloat()
(1): NormalizeVideo()
(2): StackClipInChannels()
), 'test': Sequential(
(0): ToFloat()
(1): NormalizeVideo()
(2): StackClipInChannels()
), 'denormalize': Sequential(
(0): UnstackClip()
(1): DenormalizeVideo()
)}
[2024-05-22 11:50:53,323] INFO [deepethogram.base.init:95] scheduler mode: min
[2024-05-22 11:50:53,393] INFO [deepethogram.losses.get_regularization_loss:205] Regularization: L2_SP. Pretrained file: C:\DeepEthogram\test_deepethogram\models\pretrained_models\200221_115158_TinyMotionNet\checkpoint.pt alpha: 1e-05 beta: 0.001
[2024-05-22 11:50:53,412] INFO [deepethogram.flow_generator.losses.init:179] Using MotionNet Loss with settings: smooth_weights: [0.01, 0.02, 0.04, 0.08, 0.16] flow_sparsity: False sparsity_weight: 0.0
Traceback (most recent call last):
File "C:\Users\Sander\anaconda3\envs\deepethogramg\lib\site-packages\deepethogram\base.py", line 398, in get_trainer_from_cfg
log_every_n_steps=1)
File "C:\Users\Sander\anaconda3\envs\deepethogramg\lib\site-packages\pytorch_lightning\utilities\argparse.py", line 348, in insert_env_defaults
return fn(self, **kwargs)
TypeError: init() got an unexpected keyword argument 'reload_dataloaders_every_epoch'

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "C:\Users\Sander\anaconda3\envs\deepethogramg\lib\runpy.py", line 193, in _run_module_as_main
"main", mod_spec)
File "C:\Users\Sander\anaconda3\envs\deepethogramg\lib\runpy.py", line 85, in _run_code
exec(code, run_globals)
File "C:\Users\Sander\anaconda3\envs\deepethogramg\lib\site-packages\deepethogram\flow_generator\train.py", line 374, in
flow_generator_train(cfg)
File "C:\Users\Sander\anaconda3\envs\deepethogramg\lib\site-packages\deepethogram\flow_generator\train.py", line 78, in flow_generator_train
trainer = get_trainer_from_cfg(cfg, lightning_module, stopper)
File "C:\Users\Sander\anaconda3\envs\deepethogramg\lib\site-packages\deepethogram\base.py", line 413, in get_trainer_from_cfg
log_every_n_steps=1)
File "C:\Users\Sander\anaconda3\envs\deepethogramg\lib\site-packages\pytorch_lightning\utilities\argparse.py", line 348, in insert_env_defaults
return fn(self, **kwargs)
TypeError: init() got an unexpected keyword argument 'progress_bar_refresh_rate'
[2024-05-22 11:50:53,974] INFO [deepethogram.gui.main.flow_train:353] Training finished. If you see error messages above, training did not complete successfully.
[2024-05-22 11:50:53,974] INFO [deepethogram.gui.main.flow_train:359] ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
[2024-05-22 11:50:53,982] INFO [deepethogram.gui.main.project_loaded_buttons:175] Number finalized labels: 0"

Could you please help me on this?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions