Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Table of Contents

Table of Contents

Architecture Tenets

Here we discuss the tenets of the architecture.  Which things are most important and why.  When we make architecture decisions below, they take these tenants into account.

The System Must be Manageable

We will have a distributed network using distributed ledger technology in the guise of Hyperledger Fabric.  The surrounding components such as APIs, UI Applications, Databases, Extraction Patterns and Data Transformation and Enrichment must all be managed.  That is, when we have the need to add some new functionality, we want all the participants to have the necessary components.  To make this work, the node must be manageable.  For this purpose we intend to use GitOps, where the configuration is managed by a configuration held in git which is used to update the node components automatically.

The System Must be Cloud Agnostic Whenever Possible

The openIDL components will most often be deployed to the cloud.  While we will use AWS, other participants may choose another cloud provider.  For this reason, we want the deployment to work in any one of the possible clouds without undo alteration or duplication.  There will be specific areas where the cloud provided services must differ.  In these cases, we will use a layer to normalize the interface if possible.  We must always be careful to balance cost and complexity with flexibility.

Infrastructure as a Service before Self Managed Infrastructure

Whenever possible when there is a choice between implementing infrastructure that we manage and that which is managed automatically by the cloud provider, lean toward cloud provider managed solutions.

The System Must be Transparent

The use of Infrastructure as Code will make they the system’s configuration self documenting.  Anywhere this does not fully explain, extra documentation must be provided.  This is generally found in the README.md files in git.

Architecture Decisions

SECURITY

SUBJECT

Secret Management

STATUSOpen

DECISION

The secrets are held in: GitHub Secrets and Vault?

The secrets are managed by:

The secrets are accessed from Iac …?

DISCUSSION

The management of secrets is complicated.  Below are some requirements for the solution.  If we can tick off all these, we’ll have a winner.

Must be able to manage:

-       carrier secrets

-       api keys

-       aais secrets

-       common secrets

-       cloud provider secrets

-       database secrets

-       hlf network secrets like certs

-       application secrets

-       distributed secrets

Must:

-       rotate passwords

-       be encrypted

-       permissioned so only visible to specific individuals or ci/cd

-       manageable - update / delete / create / view

-       auditable - know what changed and that no breaches have occurred

-       be accessible from IaC - terraform

-       be accessible from IaC - helm

-       be accessible during CI/CD

-       be cloud agnostic for use

-       be multi-cloud

-       have a health check of the system - at startup and intervals

-       provide logging and notifications of updates

-       exhibit CIA - confidentiality, integrity, access

-       have a user interface for managing the secrets

Options:

-       tools

o   vault

o   aws secrets manager

-        

...

SUBJECTMongoDB HDS Implementation
STATUSOpen

DECISION

Mongo DB is a cloud managed servicedeployed inside Kubernetes

DISCUSSION

The default implementation for MongoDB is a cloud managed service. 

In AWS this will be DocumentDB, as long as, it implements the API we need.

In other cloud providers it will be specific to that cloud.

For local deployments it will be inside the kubernetes cluster.

The Terraform to setup the mongo db will be pluggable so it can be replaced with the appropriate platform specific configuration.


Options discussed were:

-       self-managed cloud implementation

-       cloud-managed Infrastructure as a Service

-       mongo db atlas - a mongo provided infrastructure as a service (requires separate account, so it was dismissed)


DocumentDB does NOT support mapreduce which is how we run the extraction patterns.

So we have two options:

  • we use true mongodb compatible db that supports mapreduce
  • we refactor the extraction pattern functionality to not use mapreduce

So, for the migration project we switch to a mongodb self managed db

For the longer term we'll reimagine the extraction pattern processing.

APPLICATION DEVELOPMENT

SUBJECTCommon UI Code Management
STATUSOpen

DECISION

Single Application Angular UI Variations will utilize angular libraries

DISCUSSION

The library will be a different kind of angular app located in the same super library as all apps that use it. This is the approach for the data-call-ui. (openidl-ui and openidl-carrier-ui)

For common / shared libraries we will use an npm registry.

DEV / OPS

SUBJECTLocal Kubernetes Development
STATUSOpen

DECISION

Use Kind or Minikube for Local Kubernetes Runtime

DISCUSSION

We will use MinikubeThere are multiple options for local kubernetes deployment.  We chose Minikube over Kind because of it's simplicity.


SUBJECTInfrastructure as Code
STATUSOpen

DECISION

Use GitOps  a combination of solutions depending on application.

  • Cloud
    • Terraform for Infrastructure Provisioning in the Cloud
    • GitHub actions for CI/CD and execution of Terraform
    • Ansible for Deploying ??
    • Helm for managing Kubernetes
    • Flux for provisioning and managing distributed nodes
  • Local Reference
    • Bash scripts for provisioning local

DISCUSSION

Provide options for selection upon setup.

  • Terraform Cloud
  • Terraform Enterprise
  • GitHub Actions
  • Manual

All provisioning artifacts are managed in git

The customer will have a github / gitlab account that is private to them.

We may or may not have access to that repository.

To accept updates, the customer will accept a merge/pull request into their repository with our changes.

That update in git will automatically trigger the workflow.

The workflow may allow automatic provisioning or require an acceptance from the customer.

-       milestones

  1. start with github actions, forked repos and manual execution
  2. move to terraform cloud for aais node


target will have two options for node owners

  • use terraform cloud
  • use terraform on-prem (terraform enterprise)

...

DISCUSSION
SUBJECTInfrastructure as Code
STATUSOpen

DECISION

Use Helm Charts for Application and Network provisioning in Kubernetes

DISCUSSION

Helm is a very popular way to provision Kubernetes clusters

SUBJECTInfrastructure as Code
STATUSOpen

DECISION

Use Kustomize for managing variables inside and outside of Kubernetes

Kustomize can be used to manage generic scripts to work in different dev envts and for different customers




SUBJECTDevOps
STATUSOpen

DECISION

Publish Common Libs as images to NPM Registry in GitHub

DISCUSSION

Any common components should be packaged as images and published to the GitHub packages.

...

SUBJECTDevOps
STATUSOpen

DECISION

Images should be published in the GitHub packages npm container registry

DISCUSSION

Since we separate building of images from their deployment, we can build the images into the registry and then refer to that registry when deploying

...

DISCUSSION
SUBJECTSecrets Management
STATUSOpen

DECISION

secret management should be cloud agnostic

DISCUSSION

Notes
-       the secrets for the each cloud might be managed differently
-       hashicorp makes a popular opensource solution called vault
-       if cloud specific, we should have a layer that normalizes the access of secrets so the scripts / config files don’t need to change from cloud to cloud

SUBJECTDevOps
STATUSOpen

DECISION

Use an NPM registry to hold prebuilt and certified images

The NPM registry allows the images to be created and “certified” before they are deployed.  They are not built as part of the build.


SUBJECTSecret Management
STATUSOpen

DECISION

Secrets are applied during deployment, not in the image

DISCUSSION

The images used to create the pods in Kubernetes should not contain any private information.  This can all be applied during deployment by mounting the file from a secret held outside.

...

SUBJECTMongoDB
STATUSOpen

DECISION

The Harmonized Data Store will be deployed outsideside inside kubernetes

DISCUSSION

The best practice regarding databases and Kubernetes is to host them outside.  As long as the db is mongo and has a uri accessible to the insurance data manager and other apis, it is viable.

The terraform to set it up may need different flavors for the different clouds.

...

SUBJECTUI Deployment
STATUSOpen

DECISION

The UI will be deployed inside kubernetes

DISCUSSION

There are two main choices for deploying the ui.  Here is the discussion about the relative merits for the options.

Item

Cloud Specific

Cloud Agnostic

How

Using S3 and other AWS specific technologies

Deploy as pod inside Kubernetes

Performance

Very good performance

Less performant

Availability

The UI itself is more available, but the api isn’t any more available

The UI is subject to the same availability as the API

Cost

Very inexpensive

More cost, TBD

Scalability

Infinite scalability, subject to API

Not as scalable, but good

Complexity

More complex for multi-cloud

Less complex for multi-cloud

Managability

More difficult to manage in remote nodes

Less complex for remote nodes


Because manageability is a very high priority item for the ui components, this outweighs the differences in other aspects.


SUBJECTChannel Policy
STATUSOpen

DECISION

The Channel Policy will be set to ANY with a specific role required to allow new organizations to join the network

DISCUSSION

The channel policy controls how new organizations are joined to the network.  If set to Majority, many of the participants on the network must approve new organizations.  If set to Any, then just one is required.  We will create a role of Admin which will be required by the policy for any organization to approve new organizations.