st-poc-glossary-poc

Glossary cleaner

This part cleans an existing glossary by making sure no term duplicates another, either lexically or semantically. It does this by grouping terms by similarity and giving the glossary manager tools to resolve possible conflicts.

1. Data Uploading and processing

Create a UI that allows users to load the existing glossary by uploading a CSV file, by connecting to the glossary database, or via an API call.
Load the data into a DataFrame (df).
Embed the list of terms and append the embedding columns to the df.
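
The loading and embedding steps above can be sketched as follows. This is a minimal, dependency-free illustration and not the project's implementation: rows are kept as a list of dicts standing in for the df, the column name `term` is an assumption, and `toy_embed` is a hypothetical placeholder (hashed character trigrams) that would be swapped for a real embedding model.

```python
import csv
import hashlib
import io
import math


def load_glossary(csv_text):
    """Parse CSV text into a list of row dicts, one per glossary term."""
    return list(csv.DictReader(io.StringIO(csv_text)))


def toy_embed(text, dim=64):
    """Hypothetical stand-in for an embedding model.

    Hashes character trigrams into a fixed-size vector and L2-normalises it.
    It captures spelling overlap only, not meaning.
    """
    vec = [0.0] * dim
    padded = f"  {text.lower().strip()} "
    for i in range(len(padded) - 2):
        gram = padded[i:i + 3]
        bucket = int(hashlib.md5(gram.encode("utf-8")).hexdigest(), 16) % dim
        vec[bucket] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def add_embeddings(rows, column="term"):
    """Append an 'embedding' entry to every row (column name is assumed)."""
    for row in rows:
        row["embedding"] = toy_embed(row[column])
    return rows


# Made-up sample data, for illustration only.
SAMPLE = (
    "term,definition\n"
    "Customer,A party that buys goods\n"
    "Client,A party that buys services\n"
)
rows = add_embeddings(load_glossary(SAMPLE))
```

With pandas, the same flow would read the file into a DataFrame and add the embedding as a new column; the list of dicts here only keeps the sketch runnable without extra packages.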

2. Flag possible conflicts and collisions

Build a mechanism to flag possible duplicates.
Use GPT to suggest either merging terms or discriminating between them.
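
One possible shape for the flagging mechanism is sketched below, assuming each term already has an embedding vector. A pair is flagged when it is lexically close (string similarity from `difflib`) or semantically close (cosine similarity of the embeddings), and flagged pairs are then merged into groups. The function names, the thresholds, and the hand-made two-dimensional embeddings are all illustrative assumptions, not values taken from the project.

```python
import math
from difflib import SequenceMatcher
from itertools import combinations


def cosine(a, b):
    """Cosine similarity of two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def flag_conflicts(terms, embeddings, lexical_threshold=0.85, semantic_threshold=0.9):
    """Return (term_a, term_b, lexical, semantic) for every suspicious pair."""
    flagged = []
    for i, j in combinations(range(len(terms)), 2):
        lexical = SequenceMatcher(None, terms[i].lower(), terms[j].lower()).ratio()
        semantic = cosine(embeddings[i], embeddings[j])
        if lexical >= lexical_threshold or semantic >= semantic_threshold:
            flagged.append((terms[i], terms[j], lexical, semantic))
    return flagged


def group_conflicts(pairs):
    """Merge flagged pairs into groups of related terms using union-find."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b, *_ in pairs:
        parent[find(a)] = find(b)

    groups = {}
    for term in list(parent):
        groups.setdefault(find(term), []).append(term)
    return sorted(sorted(group) for group in groups.values())


# Hand-made toy vectors, for illustration only.
terms = ["Customer", "Customers", "Client", "Invoice"]
embeddings = [[1.0, 0.0], [0.99, 0.1], [0.95, 0.2], [0.0, 1.0]]

pairs = flag_conflicts(terms, embeddings)
groups = group_conflicts(pairs)
```

Each resulting group could then be handed to GPT with a request to suggest either a merge or a sharper distinction between the definitions; that call is left out here because the prompt and model choice are not specified in this plan.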

3. Conflict resolution

Create a UX for resolving ownership conflicts through communication tools, e.g. Slack (via the Slack API), suggested conversations, etc.
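
As one hedged illustration of the Slack route, the sketch below builds a notification for a group of conflicting terms and posts it to a Slack incoming webhook, which accepts a JSON body with a `text` field. The function names, the message wording, and the `owners` mapping are assumptions made for this example; the webhook URL would come from the workspace's own Slack app configuration.

```python
import json
import urllib.request


def build_conflict_message(terms, owners):
    """Build a Slack webhook payload describing a possible conflict.

    `owners` maps a term to its owner; both names are hypothetical.
    """
    lines = [
        f"- {term} (owner: {owners.get(term, 'unassigned')})"
        for term in terms
    ]
    text = (
        "Possible glossary conflict between these terms:\n"
        + "\n".join(lines)
        + "\nPlease agree on merging them or on distinguishing their definitions."
    )
    return {"text": text}


def post_to_slack(webhook_url, payload):
    """POST the payload to a Slack incoming webhook and return the HTTP status."""
    request = urllib.request.Request(
        webhook_url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=10) as response:
        return response.status


# Building the payload needs no network access; sending it does.
payload = build_conflict_message(
    ["Customer", "Client"],
    {"Customer": "sales-team"},
)
```

Keeping message construction separate from sending makes the wording easy to review and test before anything is posted to a channel.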


slal2003/st-poc-glossary

Folders and files

Latest commit

History

Repository files navigation

st-poc-glossary-poc

Glossary cleaner

1. Data Uploading and processing

2. Flag possibe conflicts and collusions

3. Conflict resolution

Adding a new term

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages