Is this the right way to compute KL divergence?

The code in `utils.py` related to compute KL divergence is as follows, but I think maybe this is not the KL divergence but cross entropy.

https://github.com/kevinyaobytedance/llm_unlearn/blob/647f309519f91c29d87e62cf63d9a43759810040/utils.py#L199-L203

Why not directly use PyTorch [`KLDivLoss`](https://pytorch.org/docs/stable/generated/torch.nn.KLDivLoss.html#torch.nn.KLDivLoss)?


	# P: pretrained model; Q: current model.
	prob_p = torch.nn.functional.softmax(pretrained_outputs.logits, -1)
	prob_q = torch.nn.functional.softmax(normal_outputs.logits, -1)

	loss = -(prob_p * torch.log(prob_q + 1e-12)).sum(-1).mean()

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Is this the right way to compute KL divergence? #4

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Is this the right way to compute KL divergence? #4

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions