「各項目」「各プロジェクト」の辞書複合語登録#527
Merged
Merged
Conversation
- SKK-JISYO.akaza に「各項目」「各プロジェクト」を複合語として登録 - 「かくこうもく」単体変換では rerank により「各項目」が1位になる - 長い文では k-best に乗らないため今後のスコア調整が必要 - AGENTS.md に k-best の k 値制約(k=5 が現実的上限)を追記 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
5bd110c to
3fcf2b7
Compare
Merged
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
各/書く/核の同音異義語問題に対して、辞書に複合語エントリを追加変更内容
辞書登録
かくこうもく /各項目/— 「かくこうもく」単体変換では rerank により「各項目」が1位になることを確認かくぷろじぇくと /各プロジェクト/AGENTS.md
既知の制約
長い文(例: 「かくこうもくのいみをしりたい」)では、
各項目パスの viterbi コストが高く k-best (k=10) に乗らないため、「書く/項目」が選ばれる。単体変換(「かくこうもく」のみ)では rerank で正しく「各項目」が1位になる。長い文での改善は今後のスコア調整課題。テスト結果
🤖 Generated with Claude Code