Methodology

Federated Transfer Learning

Automated Feature Selection

ARCH (Aggregated naRrative Codified Health records analysis) generates a large-scale knowledge graph for a comprehensive set of EHR codified and narrative features. It allows:

  • feature representation
  • information extraction
  • uncertainty quantification

The ARCH algorithm first derives embedding vectors from a co-occurrence matrix of all EHR concepts and then generates cosine similarities along with associated p-values to measure the strength of relatedness between clinical features with statistical certainty quantification. In the final step, ARCH performs a sparse embedding regression to remove indirect linkage between entity pairs.