Skip to content

Latest commit

 

History

History
260 lines (200 loc) · 11.8 KB

File metadata and controls

260 lines (200 loc) · 11.8 KB

AI Infrastructure Architect - References and Standards

Enterprise Architecture Frameworks

TOGAF (The Open Group Architecture Framework)

Zachman Framework

  • Website: zachman.com
  • Focus: Enterprise Architecture classification schema
  • Documentation: Framework white papers and guides

ITIL (Information Technology Infrastructure Library)

Cloud Provider Documentation

Amazon Web Services (AWS)

Google Cloud Platform (GCP)

Microsoft Azure

ML and AI Frameworks

MLOps

ML Frameworks

Model Optimization

Kubernetes and Cloud-Native

Kubernetes

CNCF (Cloud Native Computing Foundation)

Service Mesh

Data and Streaming

Apache Projects

Data Lakehouse

Security and Compliance Standards

Security Frameworks

Compliance Regulations

Zero Trust

Cost Optimization and FinOps

FinOps Foundation

Cloud Cost Management

Industry Standards

Networking

Software Engineering

  • ISO/IEC 25010: Software Quality Model
  • IEEE 1471: Architecture Description Standards

Professional Organizations

Architecture

Cloud and DevOps

Security

  • ISC2 (Information Systems Security Certification Consortium): isc2.org
  • ISACA: isaca.org

Research Institutions and Labs

AI Research Labs

Academic Resources

  • arXiv.org: arxiv.org - Pre-print server for research papers
  • Papers with Code: paperswithcode.com - ML papers with code implementations

Company Tech Blogs

Major Tech Companies

Architecture Patterns and Best Practices

Pattern Collections

Books Online

Visualization and Diagramming Standards

Diagram Standards

Conferences and Events

Major Conferences

Learning Platforms

Online Learning

Hands-On Labs

Newsletter and Aggregators

Architecture and Engineering

Cloud and DevOps

Open Source Communities

GitHub Organizations

Glossary of Terms

Architecture Terms

  • ADR: Architecture Decision Record
  • NFR: Non-Functional Requirement
  • RTO: Recovery Time Objective
  • RPO: Recovery Point Objective
  • SLA: Service Level Agreement
  • SLI: Service Level Indicator
  • SLO: Service Level Objective
  • TCO: Total Cost of Ownership
  • ROI: Return on Investment

Cloud Terms

  • IaaS: Infrastructure as a Service
  • PaaS: Platform as a Service
  • SaaS: Software as a Service
  • HA: High Availability
  • DR: Disaster Recovery

ML Terms

  • MLOps: Machine Learning Operations
  • LLM: Large Language Model
  • RAG: Retrieval-Augmented Generation
  • GPU: Graphics Processing Unit
  • TPU: Tensor Processing Unit

Updates: This reference list is maintained regularly.

Suggestions: Have a resource to add? Submit a pull request!