-
Notifications
You must be signed in to change notification settings - Fork 4
Open
Description
Describe the bug
When trying to run ops.text.clean.filter_pos("NOUN", keep_matching_tokens=True), getting: module 'jange.ops.text.clean' has no attribute 'filter_pos'.
When changed to ops.text.clean.pos_filter("NOUN", keep_matching_tokens=True), getting: OSError: [E050] Can't find model 'en_core_web_sm'. It doesn't seem to be a shortcut link, a Python package or a valid path to a data directory.
To Reproduce
From examples:
clusters_ds = ds.apply(
ops.text.clean.pos_filter("NOUN", keep_matching_tokens=True),
ops.text.encode.tfidf(max_features=5000, name="tfidf"),
ops.cluster.minibatch_kmeans(n_clusters=5),
result_collector=result_collector,
)
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels