LM Toolkit Refactor #381

dcgaines · 2025-03-07T17:46:28Z

Merging toolkit refactor into Banff LM branch for sim testing.

Overview

Replaced all custom models in the language module with language model adapters. Adapters rely on aactextpredict, our new LM toolkit, for the heavy lifting and only need to handle BciPy-specific things like special space and backspace characters and response type properties.

Ticket

Link a pivotal ticket here

Contributions

Deprecated LanguageModel classes in favor of LanguageModelAdapter classes.
Consolidated predict methods into the super class, only override when needed (Oracle).
Renamed KenLM model to NGram to match the aactextpredict package.
Updated all references to KenLM and LanguageModel classes to match new names/classes

Test

Ran all pytest cases

Documentation

Language module README updated. Added links to textpredict repo and AAC adapting arXiv paper.

Changelog

Is the CHANGELOG.md updated with your detailed changes?
Not yet.

…ates

TODO: finish processing script, integrate LLM

…linting

…ed in that package now

… textpredict package

tab-cmd

I'll wait for @lawhead to review since he has more experience here. I would keep the base class as LanguageModel or BciPyLanguageModel, but the others could be annotated as Adapters extending from that. We may want our own Uniform here without an adapter. I understand why you need it in the toolkit, but it's simple enough to keep here, and it could be a good example of how to build an LM in BciPy.

The toolkit doesn't seem to work for 3.10.6. >=3.7,<3.11?

Also, some linting errors!

codacy-production · 2025-03-24T18:02:32Z

Coverage summary from Codacy

See diff coverage on Codacy

Coverage variation	Diff coverage
Report missing for `aa216ca`¹	✅ 67.91% (target: 10.00%)

Coverage variation details

	Coverable lines	Covered lines	Coverage
Common ancestor commit (`aa216ca`)	Report Missing	Report Missing	Report Missing
Head commit (`6779982`)	9078	6139	67.63%

Coverage variation is the difference between the coverage for the head and common ancestor commits of the pull request branch: <coverage of head commit> - <coverage of common ancestor commit>

Diff coverage details

	Coverable lines	Covered lines	Diff coverage
Pull request (#381)	134	91	67.91%

Diff coverage is the percentage of lines that are covered by tests out of the coverable lines that the pull request added or modified: <covered lines added or modified>/<coverable lines added or modified> * 100%

See your quality gate settings Change summary preferences

Codacy didn't receive coverage data for the commit, or there was an error processing the received data. Check your integration for errors and validate that your coverage setup is correct. ↩

lawhead

Thanks for all of your effort on this PR. I like that it moves the details important for developing and evaluating language models into a separate space that can evolve independently from BciPy. However, there are a few things I would like to see implemented differently. Some of this feedback is detailed so let me know if you want to setup a meeting to discuss.

Language models have always been an important component of BciPy and we need to retain that priority while more formally specifying an API. While the textpredict library provides most of the language models used in BciPy, it does not provide all of them (ex. Oracle LM). We also need to leave open the potential for users to bring their own models. So we still need to retain a LanguageModel class to specify what is required of a language model for use in BciPy.

Now that we are using Python 3.8+, we have a few more options than when this code was originally written. Rather than tightly coupling this with textpredict and using the LanguageModel ABC class in that library, I propose creating a LanguageModel Protocol (https://peps.python.org/pep-0544) in BciPy. The textpredict library can still maintain its own LanguageModel base class, and all models would implicitly implement this protocol.

from typing import Literal, Protocol

class LanguageModel(Protocol):

  def predict(self, evidence: List[str]) -> List[Tuple[str, float]]:
    ...
    
  def configure(self, params: Dict[str, Any]) -> None:
    """Configure the language model. Assumes a no-arg constructor.
       See below regarding parameters."""

  # Tasks don't use word prediction yet, so maybe it's still optional and not included in the protocol.
  def set_response_type(self, response_type: Literal['symbol', 'word']):
    ...

Many of the adapters have similar code for handling spaces and backspaces. Is it possible to have a single adapter for all textpredict models?
Regarding language model parameters, I would prefer to establish a different mechanism for passing parameters to language models than using lm_params, which seems specific to what's currently in textpredict and may easily get out of sync. Maybe we have another value in parameters.json for this that is a serialized json string.

  "lm_params": {
    "value": "{}",
    "section": "lang_model_config",
    "name": "Language Model Parameters",
    "helpTip": "Parameters passed to the selected language model.",
    "recommended": [
    ],
    "editable": false,
    "type": "str"
  }

The language_model helper currently depends on importing LanguageModel subclasses to know what's available. This mechanism should be re-worked to be a registry allowing other models to be included. This is a lower priority and can be pushed to a subsequent ticket.
I agree with Tab that BciPy should have its own Uniform LM.

…col. Adjusted subclasses and references

dcgaines · 2025-03-28T17:48:31Z

I've addressed 1. and 5. with the previous two commits.

Regarding 2., I think that it is theoretically possible, but I worry that it would get far too messy with each model type requiring different parameters to initialize. I think having them all separate is likely cleaner. As it stands, the majority of the adapters inherit the same predict method. I considered moving some of the symbol set modifications from the model init methods into the super class init method, but now that it is a protocol, it might not be necessary/wanted for models in BciPy that aren't actually adapters.

I think that this would get very messy very quickly. Some of the models have several parameters that they require, which would make for a very long serialized json that wouldn't be very readable. Also, changing to a single parameter like this would remove the current ability to have default values for each type of model.

I agree that 4. should be a separate ticket. I took a quick stab at doing this and it seems that there's an extra layer of complication because BciPyLanguageModel is a Protocol as well.

lawhead · 2025-04-01T16:45:25Z

bcipy/language/main.py

@@ -18,26 +18,28 @@ def __str__(self):
        return self.value


-class LanguageModel(ABC):
-    """Parent class for Language Models."""
+class BciPyLanguageModel(Protocol):


This should just be LanguageModel. It's already name-spaced in the bcipy package. As far as I can tell we never import the textpredict LanguageModel base class, but if we did we can use an import alias (https://docs.python.org/3/reference/simple_stmts.html#import) to disambiguate.

lawhead · 2025-04-07T18:26:04Z

bcipy/language/main.py

-class LanguageModel(ABC):
-    """Parent class for Language Models."""
+class LanguageModel(Protocol):
+    """Protocol for BciPy Language Models."""


The LanguageModel Protocol is used for defining an interface (or contract) that any classes being used by our code must implement. It is intended for typing code that uses language models. It is not intended for code re-use. Implementing classes don't need to subclass the Protocol.

This interface should be minimal and only the methods that are used by the calling code. Also, protocol methods shouldn't have a body (just use ...). See my earlier comment regarding what the implementation could look like.

If you need code reuse for the adapters you could have a common LanguageModelAdapter parent class or use a mixin approach.

With Protocols we need to change the language_model registry process. It currently depends on LanguageModel.__subclasses__, which is a fairly brittle pattern and won't work with structural subtyping. The simplest change for this PR would be hard code the supported models in the language_models_by_name function. Then in the init_language_model function models should be instantiated using an empty constructor and configured using the protocol methods. I'm happy to work on a followup PR for registration.

If you want to work through any of this over a call let me know.

…ng requirement every time

dcgaines · 2025-04-19T20:53:17Z

I think that addresses all the changes we talked about. Let me know if there's anything I missed.

lawhead · 2025-04-22T01:10:13Z

I think that addresses all the changes we talked about. Let me know if there's anything I missed.

When I pulled the branch to test this I realized we had an issue with our protocols. I made a few changes in a branch that I think resolves it. I'll go ahead and setup a PR tomorrow for you to review.

…sort order.

LM Toolkit Refinements

lawhead

I think this is good to merge. Thanks for all your work on this.

tab-cmd and others added 30 commits January 8, 2025 12:43

add script for simulating lm change with different phrases, small upd…

737a8e2

…ates

add processing script, add NullDAQ

a18a35c

Update to custom typing parameters

8d9fb88

TODO: finish processing script, integrate LLM

Add progress bar and WIP update to average phrases across LM

c68803c

update the process script to the new output format

c284227

Add final phrases

0bdcfc7

add plotting and stats to the processing script

ab9244a

add missing params command

1b12409

add more logging and custom metrics

205d929

Integration of causal model, add phrases, update processing scripts

ff6e9e2

matrix processing

a58b8cf

update figure

0dc02fe

move script to a group demo

2ac395e

Merge remote-tracking branch 'origin/2.0.0' into BANFF_lm

a262d40

reset devices default

9b29a51

remove retry logic from language model init

32b82f6

reset static defaults

8cf25e1

update parameters

a43ba68

remove bad test

0fb843a

lint

7e5994b

remove integration tests (for now) and add some info to sim README + …

3ee8b9b

…linting

drop support for 3.8

a9ebcd4

Added textpredict dependency, removed LM dependencies that are includ…

7b7e560

…ed in that package now

Refactored main language model classes into adapters that use the new…

4f827d9

… textpredict package

Renamed ngram model

b408af6

Updated imports

3461308

More ngram renaming

5f3016f

More ngram renaming, adjusted mixture default params

956ca98

Updated textpredict version

7d96e9a

Converted mixture model to adapter

0c4a167

tab-cmd force-pushed the BANFF_lm branch from 71bbe11 to a9ebcd4 Compare March 17, 2025 17:51

tab-cmd requested changes Mar 17, 2025

View reviewed changes

tab-cmd added 2 commits March 20, 2025 01:39

Merge branch '2.0.0' into lm-toolkit

fceca14

lint

f809adb

tab-cmd deleted the branch 2.0.0 March 20, 2025 08:44

tab-cmd closed this Mar 20, 2025

tab-cmd reopened this Mar 20, 2025

tab-cmd changed the base branch from BANFF_lm to 2.0.0 March 20, 2025 08:53

Update textpredict dependency to fix 3.10

de49406

lawhead requested changes Mar 27, 2025

View reviewed changes

dcgaines added 2 commits March 28, 2025 12:16

Converted LanguageModelAdapter base class to BciPyLanguageModel proto…

616d57f

…col. Adjusted subclasses and references

Restored BciPy's own uniform model

348decd

dcgaines requested review from tab-cmd and lawhead March 28, 2025 17:48

lawhead reviewed Apr 1, 2025

View reviewed changes

Renamed base LM class back to LanguageModel

a52209f

lawhead reviewed Apr 7, 2025

View reviewed changes

dcgaines added 4 commits April 15, 2025 10:46

Removed language module exclusion from mypy

9831833

Deprecated response type, added separate protocols for char and word LMs

7e3cd92

Updated dependency to the new toolkit name

a22f375

Loosened textslinger dependency to allow minor changes without updati…

3fe2cf9

…ng requirement every time

dcgaines requested a review from lawhead April 19, 2025 20:52

lawhead and others added 2 commits April 22, 2025 10:56

Refactored Language Model protocols; fixed linting issues and import …

777ba10

…sort order.

Merge pull request #390 from CAMBI-tech/lm-toolkit-refinements

6779982

LM Toolkit Refinements

lawhead approved these changes Apr 23, 2025

View reviewed changes

tab-cmd merged commit 3bab3e3 into 2.0.0 Apr 29, 2025
8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

LM Toolkit Refactor #381

LM Toolkit Refactor #381

Uh oh!

dcgaines commented Mar 7, 2025

Uh oh!

tab-cmd left a comment •

edited

Loading

Uh oh!

codacy-production bot commented Mar 24, 2025 •

edited

Loading

Uh oh!

lawhead left a comment

Uh oh!

dcgaines commented Mar 28, 2025

Uh oh!

lawhead Apr 1, 2025

Uh oh!

lawhead Apr 7, 2025

Uh oh!

dcgaines commented Apr 19, 2025

Uh oh!

lawhead commented Apr 22, 2025

Uh oh!

lawhead left a comment

Uh oh!

Uh oh!

Uh oh!

LM Toolkit Refactor #381

LM Toolkit Refactor #381

Uh oh!

Conversation

dcgaines commented Mar 7, 2025

Overview

Ticket

Contributions

Test

Documentation

Changelog

Uh oh!

tab-cmd left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

codacy-production bot commented Mar 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Coverage summary from Codacy

See diff coverage on Codacy

See your quality gate settings Change summary preferences

Footnotes

Uh oh!

lawhead left a comment

Choose a reason for hiding this comment

Uh oh!

dcgaines commented Mar 28, 2025

Uh oh!

lawhead Apr 1, 2025

Choose a reason for hiding this comment

Uh oh!

lawhead Apr 7, 2025

Choose a reason for hiding this comment

Uh oh!

dcgaines commented Apr 19, 2025

Uh oh!

lawhead commented Apr 22, 2025

Uh oh!

lawhead left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

tab-cmd left a comment •

edited

Loading

codacy-production bot commented Mar 24, 2025 •

edited

Loading