Releases · Qingfeng-233/KeyAtten

What's New in v0.2.0

QK LoRA Keyword Extraction

QK LoRA adapter trained on Qwen3-Embedding-0.6B (Q/K projection LoRA fine-tuning)
Training data: CSL 2000 + ShenCeCup 800 + Multi-Domain ~7995 = 10769 samples (extractive filtered)
ShenCeCup (1000 docs): F1@5=0.4653, F1@10=0.3292, R@10=0.7325
Outperforms Gemini flash-lite by +14% F1@10, +23% R@10, ~500x faster (0.02s vs 11s per doc)

Performance Optimization

�uild_model_bundle now accepts dtype parameter ('bfloat16' / 'float16' / 'float32' / 'auto')
KeyAttenExtractor passes dtype through to model loading
All forward passes upgraded from orch.no_grad() to orch.inference_mode()
Dynamic max_length from model.config.max_position_embeddings (no more hardcoded 512)
bf16 + SDPA: ~50% memory reduction for long document inference

New Features (since v0.1.0)

Decoder-only causal attention adaptation (auto layer recommendation)
Token-span candidate scoring (candidate_scoring='token_span')
Gravity candidates for unseen keyphrases (�nable_gravity=True)
Optional nested dedup for top-5 results
External token input and domain dictionary support
Length bias parameter for academic scenarios

Benchmark Highlights

Method	Dataset	F1@5	F1@10	R@10
QK LoRA (sigmoid)	ShenCeCup 1000	0.4653	0.3292	0.7325
Gemini flash-lite	ShenCeCup 1000	0.4006	0.2894	0.5973

Acknowledgments

Thanks to the LinuxDo community for their support.

KeyAtten v0.1.0

基于 Transformer Attention 机制的关键词提取框架

包含内容

4 种纯 Attention 方法（cls_attn、received_attn、samrank、fusion_attn）

4 种 Attention-IDF 混合方法

中英双语支持

词级语义权重输出

7 个公开数据集、14 种方法的完整评测报告

评测代码

附件 keyatten-benchmark-code.tar.gz 为完整评测代码存档。

SHA-256: 1760e974241209a85f74fd94ff2aecd1f4f9c7704bcc5045eed8e162cf1aef6e

许可证

MIT

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

What's New in v0.2.0

QK LoRA Keyword Extraction

Performance Optimization

New Features (since v0.1.0)

Benchmark Highlights

Acknowledgments

Uh oh!

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

KeyAtten v0.1.0

包含内容

评测代码

许可证

Uh oh!

Releases: Qingfeng-233/KeyAtten

v0.2.0 — QK LoRA & Performance Optimization

What's New in v0.2.0

QK LoRA Keyword Extraction

Performance Optimization

New Features (since v0.1.0)

Benchmark Highlights

Acknowledgments

Uh oh!

v0.1.0 - Initial Public Release

KeyAtten v0.1.0

包含内容

评测代码

许可证

Uh oh!