@@ -14,49 +14,8 @@ cd llm-threat-model
1414
1515The first step is to execute a selected attack method on a specified model.
1616
17- <details ><summary > Supported Attacks: </summary ><p >
18-
19- - ** PRS** : A black-box adaptive attack that combines in-context attack with random search.
20- - ** BEAST** : A black-box adaptive attack that iteratively refines test cases based on feedback.
21- - ** GCG** : A gradient-based attack that directly leverages model gradients for generating adversarial examples.
22- - ** AutoDan** : A dynamic attack that adapts based on the model's response patterns.
23- - ** PAIR** : A similarity-based attack that tries to fool the model by finding similar but adversarial cases.
24-
25- These attacks can be found in the ` baselines ` folder and configured with YAML files in the ` configs/method_configs/ ` folder.
26-
27- </p ></details >
28-
29- <details ><summary > Supported Models: </summary ><p >
30-
31- You can run attacks on a variety of pre-trained language models. Below are some of the supported models:
32- - ** LLaMA** : Versions 2, 3, 3.1, and 3.2 with sizes ranging from 7B to 70B, safety-tuned.
33- - ** Vicuna** : Both 7B and 13B, version 1.5, optimized for chat-based applications.
34- - ** StableLM Zephyr** : A lightweight, robust model focused on resource efficiency.
35- - ** Starling** : Optimized models for both alpha and beta variants.
36- - ** Gemma** : Versions 1 and 2 with sizes ranging from 2B to 9B, safety-tuned.
37- - ** R2D2** : Model, proposed in [[ 1]] ( #-acknowledgements-and-citation- ) , adversarially safety-tuned from Zephyr-7b.
38-
3917These models can be found in the corresponding model configurations defined in the YAML files under ` configs/model_configs/ ` .
4018
41- </p ></details >
42-
43- <details ><summary > Recommended Models with Fast Tokenization </summary ><p >
44-
45- We recommend using models with fast tokenization. Here are some common choices:
46-
47- - ** vicuna_7b_v1_5_fast**
48- - ** starling_lm_7B_alpha_fast**
49- - ** llama2_7b_fast**
50- - ** llama2_13b_fast**
51- - ** llama3_8b_fast**
52- - ** llama3_1_8b_fast**
53- - ** gemma_7b_it_fast**
54- - ** gemma2_2b_it_fast**
55- - ** llama3_2_1b_fast**
56- - ** llama3_2_3b_fast**
57-
58- </p ></details >
59-
6019<details ><summary > Command Breakdown: </summary ><p >
6120To run an attack on a model, you need to specify the following:
6221
0 commit comments