Bio-IT World March 9, 2021
The problems are common in any large, data-centric organization. We don’t know what we have. We don’t know where it is. We need to be able to clean, combine, and search our data assets. A favorite solution is a data commons, an architecture for holding all of an organization’s data in common with well-defined connections. The idea is to make the data within an organization FAIR: findable, accessible, interoperable, and reusable.
The foundation of a data commons is the data dictionary—the map or model populated with all of the data within an organization and the relationships between them. The hardest part of FAIR is the “I”—interoperability, says Bill Van Etten, senior scientific consultant at BioTeam. The data dictionary is what...