2022-10-14 Meeting Agenda

Date

14 Oct 2022

A. Stat Plan

Recap
1. Mr. Antley discussed the Test Data Set following Dr. Harris's work on this set in Excel - calculation of EP, etc.
2. Mr. Antley is evaluating where Dr. Harris's records and his own work have diverged.
3. New business rule recently identified
Outstanding Records
1. Multiple records share the same coverage and the same Occurrence Identifier but have different outstanding values. Currently: picking the largest outstanding value (the latest value is unavailable).
2. Data is heavily modified but not duplicated. Not recent data.
3. Question raised: should we have business rules that identify issues with records? (Group Discussion)
  1. DH: we shouldn't accept it if we can't trace specific loss to a particular entity
  2. JT: If the claimant field isn't completed and 5 records can't be distinguished, these should be tagged as errors
  3. PA noted that this is older data from a company no longer in business - we can develop a rule to tag specific pieces of data that may be tied to SDMA issues
4. Dr. Harris noted that he is creating expected values - litmus test seeing if PA can match/align with his values.
5. PA: will define a rule to use for testing, a claimant, and also may define a rule to test per SDMA. We want to be able to do checks that span multiple rows
Conclusions
1. We will continue to use the largest outstanding value
2. We will develop rules that span multiple rows
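
The two conclusions above can be sketched in Python as follows. This is only an illustration under stated assumptions, not the team's implementation: the field names (`coverage`, `occurrence_id`, `claimant`, `outstanding`) and the sample values are hypothetical and do not come from the stat plan. The first function keeps the largest outstanding value per coverage and Occurrence Identifier; the second is one possible reading of JT's suggestion for a rule that spans multiple rows.

```python
from collections import defaultdict


def group_key(record):
    # Records are grouped by coverage and Occurrence Identifier
    # (hypothetical field names).
    return (record["coverage"], record["occurrence_id"])


def select_largest_outstanding(records):
    """Keep one record per group: the one with the largest outstanding value."""
    best = {}
    for record in records:
        key = group_key(record)
        if key not in best or record["outstanding"] > best[key]["outstanding"]:
            best[key] = record
    return list(best.values())


def tag_indistinguishable(records):
    """Return indices of records that share a group with another record
    and have no claimant, so they cannot be told apart."""
    groups = defaultdict(list)
    for index, record in enumerate(records):
        groups[group_key(record)].append(index)
    tagged = []
    for indices in groups.values():
        if len(indices) > 1:
            tagged.extend(i for i in indices if not records[i].get("claimant"))
    return sorted(tagged)


# Made-up sample rows for illustration only.
records = [
    {"coverage": "BI", "occurrence_id": "OCC-1", "claimant": "", "outstanding": 500},
    {"coverage": "BI", "occurrence_id": "OCC-1", "claimant": "", "outstanding": 1200},
    {"coverage": "PD", "occurrence_id": "OCC-2", "claimant": "C-7", "outstanding": 300},
]

kept = select_largest_outstanding(records)
errors = tag_indistinguishable(records)
```

Here `kept` holds one record per group, and `errors` lists the positions of the two rows that share a group but have no claimant.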

B. Work recap - AWG

Spike POC - focused POCs to validate various modules to be used in openIDL
Looking into moving to a relational database, using PostgreSQL
KS: decision making brought to TSC. Agreed that AWG will draw arch. schema - decisions will be made within this group (e.g., HDS/relational database). Supporting documents will underlie key decisions.
Large decisions will command a vote. All decisions will be documented.
PA discussed relational database - we will have 2 tables per line: 1 table for premium records and 1 for loss records. Clean tabular design. An index may be added to the tables as we are loading, with a primary key for each record. This will provide an easier tracking mechanism
PA: Premium Table
1. We're looking at adding Annual Statement Line (optional) - which raises key questions about how to handle the line number. (Raised as question for group)
2. KS: suggested taking the line number as a string (as is), not as a numeric value. (PA agreed).
3. PA: everything (including Zip code) is VARCHAR except for 'Date'
4. PA: for numbers, we're using numeric.
5. JM: sought to discuss nature of identifier in field, as this is a querying type table not a loading type one. This table may be what we query from. KS: This is the first level of basic structure that will be in HDS.
6. JM: the challenge w/numeric: if you make it a sequence, you get into parallelism problems, gaps in it, etc. Especially if carriers are providing the sequence. (JM: this is only looking ahead, not a problem necess. to deal with right this moment).
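
The premium table discussion above can be sketched as a small schema. This is a rough illustration only: it uses Python's built-in `sqlite3` as a stand-in for PostgreSQL so that it runs anywhere, and every column name is an assumption rather than an agreed field. Only the type choices follow the notes (VARCHAR for everything including Zip code, DATE for dates, NUMERIC for numbers, and a primary key per record).

```python
import sqlite3

# Hypothetical column names; only the type choices follow the notes.
PREMIUM_DDL = """
CREATE TABLE premium (
    id INTEGER PRIMARY KEY,          -- per-record key assigned at load time
    annual_statement_line VARCHAR,   -- optional; kept as text, as is
    zip_code VARCHAR,                -- text, so leading zeros survive
    accounting_date DATE,
    premium_amount NUMERIC
)
"""


def load_premium(rows):
    """Create an in-memory premium table and load the given rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute(PREMIUM_DDL)
    conn.executemany(
        "INSERT INTO premium "
        "(annual_statement_line, zip_code, accounting_date, premium_amount) "
        "VALUES (?, ?, ?, ?)",
        rows,
    )
    return conn


# Made-up sample rows for illustration only.
conn = load_premium([
    ("05.1", "02134", "2022-10-14", 1250.5),
    (None, "29201", "2022-10-14", 300),
])
stored = conn.execute(
    "SELECT id, annual_statement_line, zip_code FROM premium ORDER BY id"
).fetchall()
```

Storing the Annual Statement Line and Zip code as text keeps values such as "05.1" and "02134" exactly as supplied. The key here is assigned by the database at load time; JM's concern about gaps and parallelism applies if the sequence is instead provided by carriers.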

C. Review of postscript progress

Time	Item	Who	Notes