Commit

remove comment
maxjakob committed Jan 25, 2024
1 parent 0ca0e68 commit 3158306
Showing 1 changed file with 0 additions and 1 deletion.
1 change: 0 additions & 1 deletion notebooks/search/tokenization.ipynb
@@ -201,7 +201,6 @@
"source": [
"We can observe:\n",
"- There are special tokens `[CLS]` and `[SEP]` to model the beginning and end of the text. These two extra tokens will become relevant below.\n",
"- All tokens are lower-cased.\n",
"- Punctuation marks are their own tokens.\n",
"- Compound words are split into two tokens, for example `hitmen` becomes `hit` and `##men`.\n",
"\n",
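The observations in the cell above (special `[CLS]`/`[SEP]` tokens, punctuation as separate tokens, and `hitmen` splitting into `hit` and `##men`) can be illustrated with a minimal sketch. It assumes a WordPiece-style tokenizer, which the `##` continuation prefix suggests, and uses greedy longest-match-first splitting. The vocabulary `TOY_VOCAB` and the helpers `wordpiece` and `tokenize` are hypothetical names made up for this example. They are not taken from the notebook or from any library.

```python
import re

# Hypothetical toy vocabulary, for illustration only; a real model ships its own.
TOY_VOCAB = {"[CLS]", "[SEP]", "[UNK]", "hit", "##men", "the", "ran", ".", ","}


def wordpiece(word, vocab):
    """Split one word into sub-word pieces, greedy longest match first."""
    pieces = []
    start = 0
    while start < len(word):
        end = len(word)
        match = None
        # Shrink the candidate from the right until it is found in the vocabulary.
        while end > start:
            candidate = word[start:end]
            if start > 0:
                # Pieces that continue a word carry the "##" prefix.
                candidate = "##" + candidate
            if candidate in vocab:
                match = candidate
                break
            end -= 1
        if match is None:
            # No piece matched: the whole word maps to the unknown token.
            return ["[UNK]"]
        pieces.append(match)
        start = end
    return pieces


def tokenize(text, vocab=TOY_VOCAB):
    """Wrap the sub-word pieces of a text in [CLS] ... [SEP]."""
    # Runs of word characters become words; each punctuation mark stands alone.
    words = re.findall(r"\w+|[^\w\s]", text)
    tokens = ["[CLS]"]
    for word in words:
        tokens.extend(wordpiece(word, vocab))
    tokens.append("[SEP]")
    return tokens


print(tokenize("the hitmen ran."))
# ['[CLS]', 'the', 'hit', '##men', 'ran', '.', '[SEP]']
```

With this toy vocabulary, `hitmen` is not a vocabulary entry, so the longest matching prefix `hit` is taken first and the remainder is matched as the continuation piece `##men`. A real tokenizer's splits depend entirely on its trained vocabulary.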