Releases · IBM/unitxt

30 Jan 14:14

elronbandel

1.6.0

23d9a84

1.6.0

What's Changed

BugFixes:

Fix errors in datasets by @elronbandel in #495
Fix manifest to include all files by @elronbandel in #496
Fix templates for qa datasets by @elronbandel in #501
Fix attach summarization cards to the summarization task by @arielge in #506
Fix catalog main entry by @matanor in #520
Fix ibm cos bucket reader cache returning wrong result when loader limit is set by @yoavkatz in #527

New Assets:

Add multidoc2dial dataset in its abstractive and extractive forms by @arielge in #504
Rag metrics by @assaftibm in #508
Add regression task and fix stsb to use it by @elronbandel in #505
Add new templates from fm-eval to public unitxt by @OfirArviv in #481
Add few public datasets cards from IBM internal fmeval project by @OfirArviv in #502

Enhancements

Enhanced UI by @Roni-Friedman in #511
Prepare load_dataset and evaluate api to fit the paper by @elronbandel in #510
Update catalog navigation Index on documentation side menu for easier catalog browsing by @matanor in #518
Add catalog summary printing functionality by @elronbandel in #519
Add docstrings to some operators and templates so that they show in the respective catalog cards by @dafnapension in #522
Added UNITXT_TEST_METRIC_DISABLE to disable test_metric by @yoavkatz in #526
Update docs introduction and component sections by @matanor in #525

Breaking Changes

Processors are now operators and should be redefined accordingly
The functionality of unitxt dataset creation moved from dataset to dataset_utils

Full Changelog: 1.5.0...1.6.0

Contributors

OfirArviv, elronbandel, and 6 other contributors


22 Jan 19:09

elronbandel

1.5.3

61cb213

Unitxt 1.5.3

What's Changed

Update version to 1.5.3 by @elronbandel in #513

Full Changelog: 1.5.2...1.5.3

Contributors

elronbandel


22 Jan 18:05

elronbandel

1.5.2

184588d

Unitxt 1.5.2

What's Changed

Update version to 1.5.1 by @elronbandel in #497
Fix templates for qa datasets by @elronbandel in #501
Add multidoc2dial dataset in its abstractive and extractive forms by @arielge in #504
attach summarization cards to the summarization task by @arielge in #506
Prepare load_dataset and evaluate api to fit the paper by @elronbandel in #510
enhanced UI by @Roni-Friedman in #511
Update version to 1.5.2 by @elronbandel in #512

Full Changelog: 1.5.1...1.5.2

Contributors

elronbandel, arielge, and Roni-Friedman


18 Jan 14:59

elronbandel

1.5.1

4f6bcf0

1.5.1

What's Changed

1.5.0 by @elronbandel in #494
Fix all datasets by @elronbandel in #495
Fix manifest by @elronbandel in #496

Full Changelog: 1.5.0...1.5.1

Contributors

elronbandel


18 Jan 13:57

elronbandel

1.5.0

1e540d3

Unitxt 1.5.0

What's Changed

Add Perplexity by @assaftibm in #442
Balance demos in fair_tos datasets by @yoavkatz in #473
postprocessors become extensions of FieldOperator by @dafnapension in #468
Initial UI by @Roni-Friedman in #475
introducing text at the top of loaders.py by @dafnapension in #474
Improve Unitxt UI and add support for multiple catalogs by @Roni-Friedman in #476
Improve ExtractMostCommonFieldValues performance by building a counter of values on the fly instead of creating a list as long as the stream by @dafnapension in #471
Add a new Perturbate operator that is handy for faking a prediction as a perturbed version of the target, or for any other perturbation use case, by @dafnapension in #456
add a card for HF xsum, a summary dataset by @dafnapension in #479
Add QA, NER, Targeted sentiment and Generation tasks, Llama and Alpaca formats and instructions and whitespace augmentor by @matanor in #483
Cfpb product by @ilyashnil in #485
Improve catalog UI on documentation website by separating catalog objects to files and adding information per catalog item by @matanor in #461
Update QA templates by @matanor in #486
Length balancer docstring by @matanor in #487
Add process_instance function to every multi stream operator for easier testing and usage at instance level by @elronbandel in #488
Add LRU caching for catalog artifact loading to minimize IO overhead and enhance performance by @elronbandel in #489
Make postprocessors general operators that operate on 'prediction' and on 'references', which enables the use of any operator as a postprocessor, by @dafnapension in #484
Improve UI code presentation and organization by @Roni-Friedman in #491
Simplify ui launching with the console command unitxt-explore by @elronbandel in #492
Xmmlu template multilingual by @gitMichal in #493
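
The counting change described for ExtractMostCommonFieldValues (#471) can be illustrated with a minimal sketch. This is not the unitxt implementation: the function name `most_common_field_values` and its parameters are hypothetical, and only the general idea (tally values while iterating, rather than collecting every value into a list first) is taken from the note above.

```python
from collections import Counter
from typing import Any, Dict, Iterable, List


def most_common_field_values(
    stream: Iterable[Dict[str, Any]], field: str, top_k: int
) -> List[Any]:
    """Return the top_k most frequent values of `field` in a stream.

    Hypothetical helper for illustration only; not part of unitxt.
    """
    counts: Counter = Counter()
    for instance in stream:
        # Update the tally per instance; no stream-length list is ever built.
        counts[instance[field]] += 1
    return [value for value, _ in counts.most_common(top_k)]


# A generator stands in for a stream that is consumed exactly once.
instances = ({"label": label} for label in ["a", "b", "a", "c", "a", "b"])
top_two = most_common_field_values(instances, "label", top_k=2)
```

Memory use here grows with the number of distinct values rather than with the length of the stream, which is the likely source of the improvement the entry describes.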

New Contributors

@Roni-Friedman made their first contribution in #475

Full Changelog: 1.4.6...1.5.0

Contributors

gitMichal, elronbandel, and 6 other contributors


11 Jan 16:01

elronbandel

1.4.6

12cac48

Unitxt 1.4.6

What's Changed

Fix automatic dataset and metric uploads to the Hugging Face hub by @elronbandel in #466

Full Changelog: 1.4.5...1.4.6

Contributors

elronbandel


11 Jan 15:40

elronbandel

1.4.4

2a1314d

Unitxt 1.4.4

What's Changed

Fix HuggingFace integration by changing logging.py to logging_utils.py to comply with hf dataset conventions by @elronbandel in #457
Add FilterByQuery and ExecuteQuery as simple operators receiving textual python query by @dafnapension in #449
New multi label template by @yoavkatz in #462
Delete old formats and make SystemFormat the only existing format by @elronbandel in #458

Full Changelog: 1.4.3...1.4.4

Contributors

elronbandel, dafnapension, and yoavkatz


09 Jan 18:36

elronbandel

1.4.3

837b5c6

Unitxt 1.4.3

What's Changed

Change formats to be backward compatible by @elronbandel in #453

Full Changelog: 1.4.2...1.4.3

Contributors

elronbandel


08 Jan 15:09

elronbandel

1.4.2

40d2952

Unitxt 1.4.2

Enhancements

Added ability to calculate confidence interval for additional scores beyond the main_score by @assaftibm in #431
Improved error messaging in test cards and added option to disable via environment variable by @yoavkatz in #440
Added support for Japanese language in sacrebleu by @yoavkatz in #433
Introduced binary recall, precision metrics, and advanced filtering operators by @lilacheden in #424
Added text completion task and modified lm harness template by @perlitz in #429
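
For readers unfamiliar with confidence intervals over per-instance scores (#431), here is a minimal sketch of a percentile bootstrap. The release notes do not say which method unitxt uses, so the method, the function name `bootstrap_ci`, and all parameters are assumptions made for illustration.

```python
import random
from statistics import mean
from typing import Sequence, Tuple


def bootstrap_ci(
    scores: Sequence[float],
    n_resamples: int = 1000,
    confidence: float = 0.95,
    seed: int = 0,
) -> Tuple[float, float]:
    """Percentile-bootstrap confidence interval for the mean of `scores`.

    Hypothetical helper; not the unitxt API.
    """
    rng = random.Random(seed)
    n = len(scores)
    # Resample with replacement and record the mean of each resample.
    means = sorted(mean(rng.choices(scores, k=n)) for _ in range(n_resamples))
    alpha = (1.0 - confidence) / 2.0
    lower = means[int(alpha * n_resamples)]
    upper = means[min(n_resamples - 1, int((1.0 - alpha) * n_resamples))]
    return lower, upper


# Per-instance scores for any metric, not only the main score.
instance_scores = [1, 0, 1, 1, 0, 1, 1, 1, 0, 1]
lower, upper = bootstrap_ci(instance_scores)
```

The same routine can be applied to each score column in turn, which is one way to read "additional scores beyond the main_score".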

Internal code changes

Converted instructions to operators by @elronbandel in #450

Bug fixes

Fixed split definition in debater datasets by @ilyashnil in #443
Added new requirements for sacrebleu by @gitMichal in #448

Documentation

Documented release notes process by @matanor in #444
Added guidelines for updating Unitxt documentation by @matanor in #446

Non backward compatible changes

Implemented SystemFormat instance operator and updated BasicRecipe to use all operators, removing renderers and ICLFormat by @dafnapension in #423
(this change can break any code using the deprecated ICLFormat)
Additional verification introduced, aiming for uniquely determined results by @dafnapension in #435

New Contributors

@lilacheden made their first contribution in #424

Full Changelog: 1.4.1...1.4.2

Contributors

perlitz, gitMichal, and 7 other contributors


31 Dec 10:45

yoavkatz

1.4.1

a27c171

Unitxt 1.4.1

(Same as 1.4.0 - rereleasing due to release process error)

Enhancements

New random generation mechanism to remove dependency between different random generators by @matanor in #414
New MultipleChoiceTemplate which changes all QA datasets by @elronbandel in #405
New MAP, MRR, and Retrieval@K metrics by @assaftibm in #422
New LoadFromKaggle loader which allows direct loading of datasets from Kaggle by @ilyashnil in #413
New StringContainment metric that checks whether one of the references is contained in the prediction by @ellarabi in #394
New ConvertToBoolean post processor that changes the prediction to either TRUE or FALSE by @ellarabi in #394
15 new open source classification datasets by @ilyashnil in #410, #418
Documentation is now automatically generated in each release (#384)
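
As a reminder of what one of the retrieval metrics listed above measures, here is a minimal sketch of mean reciprocal rank (MRR) in plain Python. It follows the standard textbook definition and is not the unitxt implementation from #422; the function names and the example document ids are hypothetical.

```python
from typing import List, Sequence, Set


def reciprocal_rank(ranked_ids: Sequence[str], relevant_ids: Set[str]) -> float:
    """Return 1/rank of the first relevant item, or 0.0 if none is retrieved."""
    for rank, doc_id in enumerate(ranked_ids, start=1):
        if doc_id in relevant_ids:
            return 1.0 / rank
    return 0.0


def mean_reciprocal_rank(
    rankings: List[Sequence[str]], relevants: List[Set[str]]
) -> float:
    """Average the reciprocal rank over all queries."""
    scores = [reciprocal_rank(r, rel) for r, rel in zip(rankings, relevants)]
    return sum(scores) / len(scores) if scores else 0.0


# Three queries: first hit at rank 2, at rank 1, and no hit at all.
rankings = [["d3", "d1", "d2"], ["d5", "d4"], ["d9"]]
relevants = [{"d1"}, {"d5"}, {"d7"}]
mrr = mean_reciprocal_rank(rankings, relevants)  # (0.5 + 1.0 + 0.0) / 3
```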

Bug fixes

Fixed and improved error checking in multi label F1 by @yoavkatz in #390
Changed translations to use normalized_sacrebleu by @gitMichal in #426

Non backward compatible changes

MultipleChoice QA datasets need to move to new MultipleChoiceTemplates
Translation BLEU metric is now between 0 and 1 instead of 0 and 100
The new randomization mechanism may change the selection of demos, randomized text augmentation, or any other random choice.
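
Why independent generators change previously observed random choices can be seen in a small sketch. It assumes one way to decouple generators, namely deriving each from a master seed plus a component name. The release notes do not describe how unitxt does this, so `make_generator`, `pick_demos`, and the component names are hypothetical.

```python
import random
from typing import List


def make_generator(master_seed: int, name: str) -> random.Random:
    """Derive a dedicated generator from a master seed and a component name.

    Hypothetical helper; not the unitxt API.
    """
    # A string seed is hashed deterministically by random.Random.
    return random.Random(f"{master_seed}:{name}")


def pick_demos(consume_augmentation_first: bool) -> List[int]:
    """Select three demo indices, optionally after unrelated random draws."""
    demos_rng = make_generator(42, "demos")
    augment_rng = make_generator(42, "augmentation")
    if consume_augmentation_first:
        # Draws for augmentation do not advance the demos generator.
        augment_rng.random()
    return demos_rng.sample(range(100), 3)


same_selection = pick_demos(False) == pick_demos(True)
```

With a single shared generator, the extra augmentation draw would shift the state used for demo selection. Moving from a shared scheme to an independent one therefore produces different selections for the same seed, which is consistent with the compatibility note above.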

What's Changed

metric and processor related to robustness evaluation by @ellarabi in #394
Removed variable set in debugging by @yoavkatz in #409
Adding simple datasets by @ilyashnil in #410
Add multiple choice template and fix mmlu by @elronbandel in #405
Fixed error check in multi label F1 by @yoavkatz in #390
Fix multiple choice template by @perlitz in #416
example of kaggle loader by @ilyashnil in #413
Medical abstract by @ilyashnil in #418
Multilabel and jsons by @ilyashnil in #419
Improve dataset addition tutorial + Fix auto uploads of the catalog to the web by @elronbandel in #384
Improve docs by @elronbandel in #420
Fix docs and docs compilation tests by @elronbandel in #421
Hard code classes names by @elronbandel in #425
Retrieval metrics by @assaftibm in #422
modify to different bleu impl. by @gitMichal in #426
Improve docs by @elronbandel in #427
Independent random generators by @matanor in #414

New Contributors

@ellarabi made their first contribution in #394

Full Changelog: 1.3.1...1.4.0

Contributors

perlitz, gitMichal, and 6 other contributors

