Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

SUBJECTExtraction Processing
STATUSOpen

DECISION

TBD

DISCUSSION

In openIDL when a data call (or a stat reporting) is "consented to" by the carrier, the data must first be accessed from somewhere and then transformed into the result format and lastly converted into a report that the target party (usually a regulator) can access or be sent.

The transformation of the data from its "harmonized" state to the result state is called the "extraction", "extraction pattern" or "extraction process".

Since accessing the data can take multiple forms (see other architecture decision "Harmonized Data", there is some variability in this decision while that decision is undecided.  

We can assume that the data being accessed for the extraction is "harmonized", meaning for every execution of the extraction on a single node or multiple nodes for a single carrier or multiple carrier, the schema and semantics of the data are known and consistent.


SUBJECTHarmonized Data AccessFormat
STATUSOpenProposed

DECISION

Proposal:

data format / schema will be standardized across nodes, for a given use case

DISCUSSION

The data available for extraction must be normalized for multiple extractions across multiple use cases across multiple members.

Is this one single model?

Is the data at rest in the same model as the data in motion?


SUBJECTHarmonized Data Loading / Normalization
STATUSProposed

DECISION

Proposal:

If the HDS is at rest, the loading of that data is the responsiblity of the member owner of the node.

If the HDS is an API, the maintenance of that API and the mapping to other data sources is the responsiblity of the member owner of the node.

DISCUSSION

  1. Loading data will be via an API, IDM or ...? Will a direct SQL load be allowed?  This will use the ingestion model.


SUBJECTHarmonized Data Format Governance
STATUSProposed

DECISION

Proposal:

data schema, enumeration and the data dictionary will be standardized, and endorsed by the RRSC (and other groups per use case)

DISCUSSION



SUBJECTHarmonized Data Store
STATUSProposed

DECISION

Proposal:

  1. HDS can be persistent or transient. It's a member's decision. See next
  2. HDS can be persistent and can be used by member's as a "warehouse on the edge" for sharing data via openIDL

DISCUSSION

DECISION

??

DISCUSSION

The data available for extraction must be normalized for multiple extractions across multiple use cases across multiple carriersmembers.

Is this one single model?

Is this one database?

Is this data at rest and/or available through an API?


SUBJECTHarmonized Datastore DBMS Implementation
STATUSOpen

DECISION

Proposal:

HDS will be a relational database. It cannot be a noSQL, graph, document DB etc.

technical implementation of a HDS is non-prescriptive i.e. it can be MySQL, MS SQL, Oracle etc.  ??

DISCUSSION

If data is at rest in the harmonized datastore, what is the technology?

Does it need to be a single dbms?

Should it be noSQL?

Can it just be an interface?

...