A repository for the paper "Test It Before You Trust It: Applying Software Testing for Trustworthy In-context Learning." This repository contains scripts, datasets, prompts, and results for running and evaluating prompt-based experiments, including question answering (QnA) with and without context, and sentiment analysis.
## Repository Structure

```
Experiment/
│
├── readme.md
├── Datasets/
├── Prompts/
├── Results/
└── Scripts/
```
### Datasets/
Contains raw and processed datasets used for the experiments.

- `chat_dataset.csv`, `clean_qna_dataset.csv`: CSV files with QnA data.
- `strategyqa_train.json`: JSON dataset for QnA tasks.
- `Datasets.md`: Documentation about the datasets.
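For a quick look at the data, the files can be loaded with standard tooling. This is a minimal sketch; the fields printed are whatever the files actually contain, and the assumption that the StrategyQA JSON is a top-level list of records should be checked against the file itself:

```python
import json

import pandas as pd

# Inspect the QnA CSV; the columns depend on the file itself.
qna = pd.read_csv("Datasets/clean_qna_dataset.csv")
print(qna.columns.tolist())
print(qna.head())

# Load StrategyQA training data, assuming the top level is a list of records.
with open("Datasets/strategyqa_train.json") as f:
    strategyqa = json.load(f)
print(strategyqa[0])
```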
### Prompts/
Contains prompt templates for different tasks and settings.

- `qna_with_context/`: Prompts for QnA tasks with context (e.g., coreference, fairness, negation, robustness).
- `qna_without_context/`: Prompts for QnA tasks without context.
- `sentiment/`: Prompts for sentiment analysis.
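A template is typically instantiated by substituting the question (and, for the with-context setting, the context) into placeholders. The sketch below uses plain Python string formatting with made-up placeholder names; the repository's actual templates may use a different format:

```python
# Illustrative only: the template text and the {context}/{question}
# placeholders are assumptions, not the repository's actual templates.
template = (
    "Answer the question using the given context.\n"
    "Context: {context}\n"
    "Question: {question}\n"
    "Answer:"
)

prompt = template.format(
    context="Paris has been the capital of France since the 10th century.",
    question="What is the capital of France?",
)
print(prompt)
```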
### Results/
Stores outputs and evaluation results.

- `qa_with_context/`: Results for QnA with context.
- `qa_without_context/`: Results for QnA without context.
- `sentiment_result/`: Results for sentiment analysis.
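To recompute a headline metric from a saved output file, something like the following works. The file name and the `prediction`/`label` column names are assumptions about the schema, so adjust them to the actual files:

```python
import pandas as pd

# Hypothetical file name and column names; match them to the real output schema.
results = pd.read_csv("Results/qa_without_context/results.csv")
accuracy = (results["prediction"] == results["label"]).mean()
print(f"Accuracy: {accuracy:.2%}")
```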
### Scripts/
Contains all code and notebooks for running the experiments.

- `01_sentiment_notebook.ipynb`: Sentiment analysis experiments.
- `02_qna_no_context_notebook.ipynb`: QnA experiments without context.
- `03_qna_with_context_notebook.ipynb`: QnA experiments with context.
- `PromptOps/`: Python package with utility modules (e.g., template formatters, perturbation, test suite).
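The perturbation utilities in `PromptOps` are not documented here, so as a rough illustration of the idea, the sketch below applies a character-swap ("typo") perturbation of the kind used in robustness testing. The function is hypothetical, not part of the package's API:

```python
import random

def perturb_typo(text: str, rate: float = 0.1, seed: int = 0) -> str:
    """Swap adjacent letters at random to simulate typos (a robustness perturbation)."""
    rng = random.Random(seed)
    chars = list(text)
    for i in range(len(chars) - 1):
        if chars[i].isalpha() and chars[i + 1].isalpha() and rng.random() < rate:
            chars[i], chars[i + 1] = chars[i + 1], chars[i]
    return "".join(chars)

print(perturb_typo("What is the capital of France?", rate=0.3))
```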
## Usage

- **Datasets**: Place or update datasets in the `Datasets/` folder.
- **Prompts**: Edit or add prompt templates in the `Prompts/` subfolders.
- **Scripts**: Run the notebooks in `Scripts/` to generate prompts, run models, and evaluate results.
- **Results**: Find generated outputs and evaluation metrics in the `Results/` folders.
- Update API keys and file paths in scripts as needed.
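One way to avoid hard-coding keys is to read them from the environment before the notebooks call the model. `OPENAI_API_KEY` below is an assumed variable name; use whichever provider and name the notebooks actually expect:

```python
import os

# Assumed variable name; rename to match the model provider the notebooks use.
api_key = os.environ.get("OPENAI_API_KEY")
if api_key is None:
    raise RuntimeError("Set OPENAI_API_KEY before running the notebooks.")
```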