Does self-distillation really work? #8

@LASTLINEK

Hi, I recently downloaded your code and ran some experiments. I found that self-distillation performed about the same as simply adding a label loss at each stage. Have you encountered this as well?
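
For concreteness, here is a minimal sketch of the two objectives being compared. It is an illustration under assumptions, not code from the repository: it assumes "self-distillation" means each shallower stage is also trained to match the temperature-softened output of the deepest stage via a KL term, and the names `label_only_loss`, `self_distillation_loss`, `alpha`, and `temperature` are hypothetical.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(logits, label):
    """Cross-entropy between the softmax of `logits` and a hard label index."""
    return -math.log(softmax(logits)[label])


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions given as lists."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def label_only_loss(stage_logits, label):
    """Baseline: a label (cross-entropy) loss at every stage."""
    return sum(cross_entropy(z, label) for z in stage_logits)


def self_distillation_loss(stage_logits, label, alpha=0.3, temperature=3.0):
    """Assumed self-distillation objective (hypothetical weighting).

    The deepest stage is trained on the label only and acts as the teacher.
    Each shallower stage mixes a label loss with a KL term that pulls its
    softened prediction toward the teacher's softened prediction.
    """
    teacher = softmax(stage_logits[-1], temperature)
    loss = cross_entropy(stage_logits[-1], label)
    for z in stage_logits[:-1]:
        student = softmax(z, temperature)
        loss += (1 - alpha) * cross_entropy(z, label)
        loss += alpha * kl_divergence(teacher, student)
    return loss


# Example: three stages, three classes, true class 0 (made-up logits).
logits = [[1.0, 0.5, -0.2], [2.0, 0.1, -1.0], [3.0, -0.5, -2.0]]
print(label_only_loss(logits, 0))
print(self_distillation_loss(logits, 0))
```

In this sketch the two objectives coincide when `alpha` is 0, and the distillation term vanishes whenever a shallower stage already predicts the same softened distribution as the deepest one. Whether that explains the similar accuracy depends on the actual loss weights and any feature-level terms used in the repository.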
