Skip to content

Error with model creation from timm "vit_base_patch32_224" : 'TimmWrapperConfig' object has no attribute 'num_hidden_layers' #163

@andrewcaunes

Description

@andrewcaunes

Hi, thank you for your work !

Just tried to run some of the main demo in a clone of the 2.0 version of the lib and got an error, I guess the demo should be updated.
Fresh conda env with only "pip install vit_prisma"

Code ran :
`from vit_prisma.models.base_vit import HookedViT
from vit_prisma.configs.HookedViTConfig import HookedViTConfig
from vit_prisma.training import trainer
from vit_prisma.dataloaders.induction import InductionDataset

from vit_prisma.training.training_utils import PrismaCallback
from vit_prisma.models.base_vit import HookedViT

model_function = HookedViT.from_pretrained("vit_base_patch32_224",
center_writing_weights=True,
center_unembed=True,
fold_ln=True,
refactor_factored_attn_matrices=True,
)`

Error:
Traceback (most recent call last):
File "", line 198, in _run_module_as_main
File "", line 88, in _run_code
File "/projects/ViT-Prisma/project/test.py", line 26, in
model_function = HookedViT.from_pretrained("vit_base_patch32_224",
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "
/projects/ViT-Prisma/src/vit_prisma/models/base_vit.py", line 780, in from_pretrained
cfg = convert_pretrained_model_config(
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/projects/ViT-Prisma/src/vit_prisma/prisma_tools/loading_from_pretrained.py", line 633, in convert_pretrained_model_config
'n_layers' : hf_config.num_hidden_layers,
^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "
/miniconda3/envs/prisma_1/lib/python3.12/site-packages/transformers/configuration_utils.py", line 211, in getattribute
return super().getattribute(key)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
AttributeError: 'TimmWrapperConfig' object has no attribute 'num_hidden_layers'

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions