Add an Eval harness

### Pre-checks

- [x] I searched existing issues and discussions

### What problem are you trying to solve?

```text
A killer feature would be an eval harness that runs different models and shows how they compare to each other across the most popular evals available.
```

### What would you like NexaSDK to do?

```markdown
- Add an eval harness that lets the user select from the most popular evals available, as well as run custom evals supplied via .json.
```
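To make the request more concrete, here is a minimal sketch of what a custom .json eval and a model comparison loop could look like. Everything in it is an assumption for illustration: the JSON layout (`prompt` / `expected`), the `run_eval` function, and the stand-in model functions are hypothetical and do not correspond to any existing NexaSDK API. Scoring is plain exact match, which real evals would likely replace with task-specific metrics.

```python
import json
from typing import Callable, Dict

# Hypothetical custom-eval format: a JSON list of {"prompt", "expected"} cases.
CUSTOM_EVAL_JSON = """
[
  {"prompt": "2 + 2 =", "expected": "4"},
  {"prompt": "Capital of France?", "expected": "Paris"}
]
"""


def run_eval(models: Dict[str, Callable[[str], str]], eval_json: str) -> Dict[str, float]:
    """Return exact-match accuracy per model over the cases in eval_json."""
    cases = json.loads(eval_json)
    scores = {}
    for name, generate in models.items():
        # Count cases where the stripped model output equals the expected string.
        correct = sum(
            generate(case["prompt"]).strip() == case["expected"] for case in cases
        )
        scores[name] = correct / len(cases) if cases else 0.0
    return scores


# Stand-in "models": plain functions, NOT real NexaSDK inference calls.
def model_a(prompt: str) -> str:
    return {"2 + 2 =": "4", "Capital of France?": "Paris"}.get(prompt, "")


def model_b(prompt: str) -> str:
    return "4"


if __name__ == "__main__":
    print(run_eval({"model_a": model_a, "model_b": model_b}, CUSTOM_EVAL_JSON))
```

In an actual harness, each callable would wrap a loaded model, and the popular evals would ship as built-in files in the same format as the custom ones.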

### Alternatives you've considered

```text

```

### Who does this help, and how much?

```text

```

### Additional context

```markdown

```

Add an Eval harness #1070
