[BUG]: Requesting clear error or smoother automation around site-explorer for DPU BMC #557

@pmilano-nvidia

Describe the bug
During host ingestion, after running em replace-all -f expected-machines.json and then site-explorer, the carbide-api log shows an error for the DPU BMC of a host that is to be ingested:

msg="Failed to fetch machine setup status." error="Missing key HostPrivilegeLevel in JSON at Systems/{}/Bios" location="crates/api/src/site_explorer/redfish.rs:266"

Force-restarting the DPU via the Redfish API allows ingestion to continue:

curl -u user:pass -H "Content-Type: application/json" -X POST https://<DPU BMC IP>/redfish/v1/Systems/Bluefield/Actions/ComputerSystem.Reset -d '{"ResetType": "ForceRestart"}'
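
For illustration, the manual workaround above could be wrapped in a small script. This is only a sketch under assumptions: the function names are hypothetical, and the Bios URL is inferred by substituting Bluefield into the Systems/{}/Bios path from the error message. It reads the Bios resource, and issues the same ForceRestart only when HostPrivilegeLevel is absent.

```bash
#!/usr/bin/env bash
# Sketch only: function names are hypothetical, and the Bios URL is an
# assumption derived from "Systems/{}/Bios" in the carbide-api error.

# Exit 0 when the JSON on stdin mentions the HostPrivilegeLevel key.
# A plain text match keeps the sketch free of a JSON parser dependency.
has_host_privilege_level() {
  grep -q '"HostPrivilegeLevel"'
}

# Usage: restart_if_key_missing <DPU BMC IP> <user:pass>
restart_if_key_missing() {
  local bmc_ip="$1" creds="$2"
  local base="https://${bmc_ip}/redfish/v1/Systems/Bluefield"
  local body

  # Stop early if the BMC cannot be reached, so a network failure
  # is never mistaken for a missing key.
  body="$(curl -sS -f -u "$creds" "${base}/Bios")" || return 1

  if printf '%s' "$body" | has_host_privilege_level; then
    echo "HostPrivilegeLevel present; no restart needed"
    return 0
  fi

  echo "HostPrivilegeLevel missing; sending ForceRestart"
  curl -sS -f -u "$creds" -H "Content-Type: application/json" -X POST \
    "${base}/Actions/ComputerSystem.Reset" -d '{"ResetType": "ForceRestart"}'
}
```

After sourcing the script, it would be invoked as restart_if_key_missing <DPU BMC IP> user:pass. Whether the key reliably appears after one restart has not been verified beyond the single case reported here.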

Steps/Code to reproduce bug

  1. Run em replace-all -f expected-machines.json to begin ingesting the desired host.
  2. Run site-explorer against the host.
  3. See the error in the carbide-api log:
msg="Failed to fetch machine setup status." error="Missing key HostPrivilegeLevel in JSON at Systems/{}/Bios" location="crates/api/src/site_explorer/redfish.rs:266"

Environment details (please complete the following information):

  • Environment location: data hall dev environment.
  • Method of NVIDIA Bare Metal Manager install: Running latest version of docker image with sha: bb8676823f3d27238e50b5066eef73182e55865d6b965147387b75c614f0e86b

Additional context
Requesting that this manual intervention be automated away, and/or that the error message state clearly that a restart is required. Thank you!
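
As a rough illustration of the kind of message that would help, the sketch below post-processes log lines and appends a hint after each occurrence of the error. The function name and the hint wording are hypothetical; this is not how carbide-api reports errors, only an example of the desired clarity.

```bash
#!/usr/bin/env bash
# Sketch only: annotate_missing_key is a hypothetical helper, not part of
# carbide-api. It reads log lines on stdin and echoes them unchanged,
# adding a hint line after each "Missing key HostPrivilegeLevel" error.
annotate_missing_key() {
  awk '
    { print }
    index($0, "Missing key HostPrivilegeLevel") > 0 {
      print "hint: HostPrivilegeLevel is missing from the DPU BMC Bios resource; a ForceRestart of the DPU via Redfish has allowed ingestion to continue"
    }
  '
}
```

It could be used by piping the carbide-api log into annotate_missing_key; how the log is obtained depends on the deployment and is left out here.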

Metadata

Assignees

Labels

bug: A defect in existing software (deprecated - use issue type, but it's needed for reporting now)

Type

Projects

Status

Backlog

Milestone

No milestone
