Commit 5302f2a
* fix duplicate `bos` token when `context==""`
* add docs
* check tokenizer.add_bos_token for bos control
* fix params
* skip duplicate bos
* fix bos token handling
* fix bos token handling
* fix bos_token handling
* fixup! default add_special_tokens as unset
* `self.tokenizer.bos_token` can be None
* fix type
* Update lm_eval/models/huggingface.py
Co-authored-by: Cyrus Leung <[email protected]>
* refactor bos token handling logic
* add tests for bos
* fix tests
---------
Co-authored-by: Cyrus Leung <[email protected]>
1 parent 90950a8 commit 5302f2a
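
The BOS fixes listed in the commit message (skip a duplicate BOS, respect the tokenizer's BOS setting, tolerate a `bos_token` of None) can be illustrated with a minimal sketch. This is not the code from the commit, whose diff body is not reproduced here. The function `prepend_bos_once` and its parameters are hypothetical names chosen for illustration, and the sketch works on plain lists of token ids rather than a real tokenizer.

```python
from typing import List, Optional


def prepend_bos_once(
    token_ids: List[int],
    bos_token_id: Optional[int],
    add_bos: bool,
) -> List[int]:
    """Return a copy of token_ids with at most one leading BOS id.

    Hypothetical helper (assumption, not the library's API):
    - add_bos stands in for a tokenizer-level setting such as add_bos_token.
    - bos_token_id may be None for tokenizers that define no BOS token.
    """
    # Nothing to add if BOS is disabled or the tokenizer has no BOS token.
    if not add_bos or bos_token_id is None:
        return list(token_ids)
    # Skip the prepend when the sequence already starts with BOS.
    if token_ids and token_ids[0] == bos_token_id:
        return list(token_ids)
    return [bos_token_id] + list(token_ids)


# An empty context yields exactly one BOS, and running the helper a second
# time on its own output does not add another one.
once = prepend_bos_once([], bos_token_id=1, add_bos=True)
twice = prepend_bos_once(once, bos_token_id=1, add_bos=True)
```

Checking the first id before prepending keeps the helper idempotent, which is the property the "skip duplicate bos" item appears to be after.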
File tree
5 files changed, +772 -72 lines changed
- lm_eval
  - api
  - models
- tests/models