| Method | Dense | Sparse | GPU | CPU | Multiprocessing | HPC | Description | Example |
|---|---|---|---|---|---|---|---|---|
| NMFk | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | NMF with Automatic Model Determination | Link |
| Custom NMFk | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | Use Custom NMF Functions with NMFk | Link |
| TriNMFk | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | NMF with Automatic Model Determination for Clusters and Patterns | Link | |
| RESCALk | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | RESCAL with Automatic Model Determination | Link |
| RNMFk | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | Recommender NMFk | Link |
| SymNMFk | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | NMFk with Symmetric Clustering | Link | |
| WNMFk | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | NMFk with weighting, used for recommendation systems | Link | |
| HNMFk | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | Hierarchical NMFk | Link |
| BNMFk | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | Boolean NMFk | Link | |
| LMF | ✔️ | ✔️ | ✔️ | Logistic Matrix Factorization | Link | |||
| SPLIT | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | Joint NMFk factorization of multiple data via SPLIT | Link | |
| SPLITTransfer | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | Supervised transfer learning method via SPLIT and NMFk | Link |
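
Several methods above are described as factorization "with Automatic Model Determination". The sketch below illustrates only the basic idea of scanning candidate ranks `k` and comparing reconstruction error. It uses plain NumPy with standard multiplicative updates, not the TELF API; the function names `nmf_multiplicative` and `relative_error` are hypothetical, and the rank-selection criteria actually used by NMFk (which also consider the stability of the solutions) are not reproduced here.

```python
import numpy as np


def nmf_multiplicative(X, k, n_iter=500, eps=1e-9, seed=0):
    """Factor a non-negative matrix X (n x m) into W (n x k) and H (k x m).

    Hypothetical helper using standard multiplicative updates for the
    Frobenius-norm objective. This is NOT the TELF implementation.
    """
    rng = np.random.default_rng(seed)
    n, m = X.shape
    W = rng.random((n, k)) + eps
    H = rng.random((k, m)) + eps
    for _ in range(n_iter):
        # Updates keep W and H non-negative; eps avoids division by zero.
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        W *= (X @ H.T) / (W @ H @ H.T + eps)
    return W, H


def relative_error(X, W, H):
    """Relative Frobenius reconstruction error of the factorization."""
    return np.linalg.norm(X - W @ H) / np.linalg.norm(X)


# Synthetic non-negative data with a known latent rank of 3 (an assumption
# made purely for illustration).
rng = np.random.default_rng(42)
X = rng.random((30, 3)) @ rng.random((3, 20))

# Scan candidate ranks and record the reconstruction error for each one.
errors = {}
for k in range(1, 6):
    W, H = nmf_multiplicative(X, k)
    errors[k] = relative_error(X, W, H)

for k, err in errors.items():
    print(f"k={k}: relative error {err:.4f}")
```

In a scan like this, the error typically drops sharply up to the true latent rank and then flattens, which is why reconstruction error alone is usually combined with a stability measure when selecting `k`.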

| Method | Multiprocessing | HPC | Description | Example |
|---|---|---|---|---|
| Vulture | ✔️ | ✔️ | Advanced text processing tool for cleaning and NLP | Link |
| Beaver | ✔️ | ✔️ | Fast matrix and tensor building tool for text mining | Link |
| iPenguin | ✔️ | Online information retrieval tool for Scopus, SemanticScholar, and OSTI | Link | |
| Orca | ✔️ | Duplicate author detector for text mining and information retrieval | Link | |
| Squirrel | | | Dataset pruning tool for documents | Link |

| Method | Description | Example |
|---|---|---|
| Wolf | Graph centrality and ranking tool | Link |
| Peacock | Data visualization and generation of actionable statistics | Link |
| SeaLion | Generic report generation tool | Link |
| Fox | Report generation tool for text data from NMFk using OpenAI | Link |
| ArcticFox | Report generation tool for text data from HNMFk using local LLMs | Link |

| Method | Description | Example |
|---|---|---|
| Cheetah | Fast search by keywords and phrases | Link |
| Bunny | Dataset generation tool for documents and their citations/references | Link |
| Penguin | Text storage tool | Link |
| Lynx | Streamlit UI | Link |
| Termite | Knowledge graph building tool | Link |

| Example | Description | Link |
|---|---|---|
| NM Law Data | Domain-specific data for the AI and RAG system described in our paper on New Mexico law, which uses the TELF pipeline | Link |
| Full TELF Pipeline | An end-to-end pipeline demonstration, from collection to analysis | Link |