Methodology
Federated Transfer Learning
Automated Feature Selection
ARCH (Aggregated naRrative Codified Health records analysis) generates a large-scale knowledge graph for a comprehensive set of EHR codified and narrative features. It allows:
- feature representation
- information extraction
- uncertainty quantification
The ARCH algorithm first derives embedding vectors from a co-occurrence matrix of all EHR concepts and then generates cosine similarities along with associated p-values to measure the strength of relatedness between clinical features with statistical certainty quantification. In the final step, ARCH performs a sparse embedding regression to remove indirect linkage between entity pairs.