Guidance for OP with multiple data fields to be processed #411

Open
@yxdyc

Search before continuing

  • I have searched the Data-Juicer issues and found no similar feature requests.

Description

Currently, users may be unsure how to support multiple data fields in a single OP, for example, when developing an OP that processes both text_key="question" and text_key="answer".
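To make the request concrete, here is a minimal sketch of what a multi-field OP could look like. It is not Data-Juicer's actual API: the class name `MultiFieldMapper`, the `text_keys` parameter, and the `process` method are hypothetical, chosen only to illustrate applying one transform to both "question" and "answer".

```python
from typing import Callable, Dict, List


class MultiFieldMapper:
    """Hypothetical OP that applies one text transform to several fields."""

    def __init__(self, text_keys: List[str], transform: Callable[[str], str]):
        # text_keys is an assumed parameter name, not a confirmed Data-Juicer argument.
        self.text_keys = text_keys
        self.transform = transform

    def process(self, sample: Dict[str, str]) -> Dict[str, str]:
        # Apply the same transform to each configured field; other fields are untouched.
        for key in self.text_keys:
            sample[key] = self.transform(sample[key])
        return sample


op = MultiFieldMapper(text_keys=["question", "answer"], transform=str.strip)
result = op.process({"question": "  What is 2 + 2? ", "answer": " 4 ", "id": "q1"})
```

Whether an OP should accept a list of keys like this, or be instantiated once per key, is exactly the kind of question the guidance would need to answer.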

In addition, we should document the expected type of text-related keys: each value must be a str rather than a list or dict, for the sake of efficiency and coding convenience. This is an implicit assumption shared by all text-related OPs.
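The str-only assumption could also be checked explicitly. The helper below is an illustrative sketch only; `check_text_fields` is a hypothetical name and is not part of Data-Juicer. It rejects list or dict values in the configured text fields.

```python
from typing import Any, Dict, List


def check_text_fields(sample: Dict[str, Any], text_keys: List[str]) -> None:
    """Raise TypeError if any configured text field is not a plain str."""
    for key in text_keys:
        value = sample[key]
        if not isinstance(value, str):
            raise TypeError(
                f"Field {key!r} must be str, got {type(value).__name__}"
            )


# Passes silently: both fields hold plain strings.
check_text_fields({"question": "Why?", "answer": "Because."}, ["question", "answer"])
```

A sample such as `{"question": ["a", "b"]}` would raise a `TypeError` here, which surfaces the implicit assumption as an early, readable error.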

Use case

Related issue: #380

Additional

No response

Are you willing to submit a PR for this feature?

  • Yes, I'd like to help by submitting a PR!

Labels: enhancement (New feature or request)
