TigerResearch/MedCT

MedCT

MedCT is the world's first clinical terminology built for a non-English community, specifically the Chinese-speaking one. It achieves state-of-the-art performance on medical named entity recognition (NER) and named entity linking (NEL) tasks in both English and Chinese.

Our first release contains:

  1. The MedCT bilingual (en and zh) clinical terminology dictionary, with entries of the form {concept_id, [synonyms]}: MedCT;
  2. The MedCT named entity recognition (NER) models: MedLink-en, MedLink-zh;
  3. The MedCT foundation model: MedBERT;
  4. Our MedCT-clinical-notes dataset, including:
    • For the NER and NEL tasks, 7.4K real-world clinical notes in Chinese (medct_ner_notes.csv) and 61K entity mention annotations made according to the MedCT graph (medct_ner_annotations.csv).
    • For the search task, 20 clinical queries (medct_search_queries.csv) and 2K discharge notes with relevance annotations (medct_search_notes.csv).
    • For the clinical note summarization task, 91 raw discharge notes with summaries produced by humans, by an LLM, and by MedCT-augmented generation (medct_summary_notes.csv), together with Likert-scale preference scores annotated by human physicians (medct_summary_scores.csv).
  5. The MedCT and search application demos.
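
As a rough illustration of the {concept_id, [synonyms]} dictionary shape mentioned in item 1, the sketch below builds a reverse index from synonym to concept ID and uses it for exact-match lookup. This is only a minimal sketch under stated assumptions: the concept IDs, the synonyms, and the helper names `build_synonym_index` and `link_mention` are hypothetical, the on-disk format of the released dictionary is not specified here, and exact string matching is a stand-in for illustration, not how the MedLink models perform NER or NEL.

```python
from collections import defaultdict

# Hypothetical entries: these concept IDs and synonyms are invented for
# illustration and are NOT taken from the released MedCT dictionary.
terminology = {
    "C0001": ["hypertension", "high blood pressure", "高血压"],
    "C0002": ["diabetes mellitus", "diabetes", "糖尿病"],
}


def build_synonym_index(terminology):
    """Map each normalized synonym to the set of concept IDs that list it."""
    index = defaultdict(set)
    for concept_id, synonyms in terminology.items():
        for synonym in synonyms:
            index[synonym.strip().lower()].add(concept_id)
    return dict(index)


def link_mention(mention, index):
    """Return the sorted concept IDs whose synonyms exactly match the mention."""
    return sorted(index.get(mention.strip().lower(), set()))


index = build_synonym_index(terminology)
print(link_mention("High Blood Pressure", index))  # ['C0001']
print(link_mention("糖尿病", index))                # ['C0002']
```

Because English and Chinese synonyms share one concept ID in this layout, a mention in either language resolves to the same concept, which is the property a bilingual terminology is meant to provide.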
