Measuring Genealogical Similarity using the Jaccard Index
For some of the posts on this blog I’ll be using one way to measure the similarity of two sample sets of data. The statistic is called the Jaccard Index, or the Jaccard Similarity Coefficient. This post is a technical explanation of the calculation itself. The sets of data are the unique ancestral surnames of … Read more