Logging Per-Sample Evaluation Loss During Inference in Swift PT for Qwen3-VL

### Checklist / 检查清单

- [x] I have searched existing issues, and this is a new question or discussion topic. / 我已经搜索过现有的 issues，确认这是一个新的问题与讨论。

### Question Description / 问题描述

I am currently using the Swift PT pre-training framework to pre-train a Qwen3-VL-2B model.

During inference/evaluation, I would like to leverage Swift's internal loss computation to obtain the loss value for each individual sample in the evaluation dataset. My goal is to analyze model behavior at the sample level and identify examples with particularly high or low loss.

At the moment, Swift reports only the average loss across the entire evaluation dataset. However, I need to generate a log containing the loss value for every sample.

Is there an existing configuration, callback, or recommended approach within Swift to record and export per-sample evaluation losses? If not, what would be the best way to modify the evaluation pipeline to achieve this?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Logging Per-Sample Evaluation Loss During Inference in Swift PT for Qwen3-VL #9518

Checklist / 检查清单

Question Description / 问题描述

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Logging Per-Sample Evaluation Loss During Inference in Swift PT for Qwen3-VL #9518

Description

Checklist / 检查清单

Question Description / 问题描述

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions