Scrape FIPS algorithm data

Initial description by @J08nY 

> Data from the FIPS algorithm dataset is not utilized and mined fully. We can follow the links to the algorithm page and get more data that will help us. This can help in cert id cleanup to get rid of the algo references.

## Details

Currently, the `FIPSAlgorithm` object is built from rows of a pandas DataFrame constructed merely from the [list of Algorithms](https://csrc.nist.gov/projects/Cryptographic-Algorithm-Validation-Program/validation-search?searchMode=implementation&page=), see below

https://github.com/crocs-muni/sec-certs/blob/f41d077185f7e40d1a524bbfc9c4a11dbd312f73/src/sec_certs/dataset/fips_algorithm.py#L98

This table does not include valuable attributes found on the individual pages of the algorithm. The proposed enhancement should:
- Track the URL for each of the algorithms, e.g., https://csrc.nist.gov/projects/Cryptographic-Algorithm-Validation-Program/details?product=3989
- Scrape the contents of the corresponding html and extract the values for the following attributes:
    - algorithm type
    - description
    - version
    - algorithm capabilities
- The `FIPSAlgorithm` object (see below) should be enriched with the attributes mentioned above.

https://github.com/crocs-muni/sec-certs/blob/f41d077185f7e40d1a524bbfc9c4a11dbd312f73/src/sec_certs/sample/fips_algorithm.py#L13

## Further guidance

One can isolate the pipeline stage that processes the algorithm dataset simply by

```python
from sec_certs.dataset.fips_algorithm import FIPSAlgorithmDataset

alg_dset = FIPSAlgorithmDataset.from_web()
alg_dset.to_json("/path/to/some/file.json")
```

The PR implementing this enhancement should modify the [parse_algorithms_from_html](https://github.com/crocs-muni/sec-certs/blob/f41d077185f7e40d1a524bbfc9c4a11dbd312f73/src/sec_certs/dataset/fips_algorithm.py#L97) method.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Scrape FIPS algorithm data #276

Details

Further guidance

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Scrape FIPS algorithm data #276

Description

Details

Further guidance

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions