Reference Number: 101.003
Graphs for Genealogists (GFG) is an open-source software package comprising an application front end, a Neo4j database, and a plugin, all designed and optimized for genealogy data management and analytics. It loads family tree data in GEDCOM format, a set of consumer DNA test results, and genealogist-curated files that provide links between graphs. The primary purpose of the analytics is to discover new insights and provide actionable recommendations for further genealogy research. GFG traversals collect concatenated strings to create Ahnentafel numbers and to enable filtering on X-linked inheritance and other patterns. Traversals from the family tree through DNA matches to chromosome segment data discover triangulation groups and monophyletic segments aligned with specific family tree branches. Graph algorithms from the Neo4j Graph Data Science plugin discover communities (clusters) aligned with family tree branches. Hierarchical trees include patrilineal and matrilineal trees, DNA haplotrees, ORDPATH-enhanced renderings, and hybrids linking these together. Chromosome painting and 3D renderings help users interpret the results. Recommendations include manageable sets of persons drawn from a pool of over 250,000 DNA matches. There are many opportunities for further development of graph analytics, including a paradigm shift to using stable elements aligned with a specific family tree branch.
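To illustrate how a concatenated path string can yield an Ahnentafel number and support an X-linked filter, here is a minimal Python sketch. It is not the GFG plugin code: the function names and the encoding (one sex code per ancestor along the path, `M` for male and `F` for female) are assumptions made for this example. It relies only on two standard rules: in Ahnentafel numbering the father of person n is 2n and the mother is 2n + 1, and an X chromosome is never passed from father to son.

```python
def ahnentafel(sex_path: str) -> int:
    """Return the Ahnentafel number for a path of ancestor sex codes.

    sex_path lists the sex of each ancestor walking up from the proband,
    e.g. "MF" = father, then father's mother. (Hypothetical encoding.)
    """
    number = 1  # the proband is always number 1
    for code in sex_path:
        if code == "M":
            number = 2 * number        # step to the father
        elif code == "F":
            number = 2 * number + 1    # step to the mother
        else:
            raise ValueError(f"unexpected sex code: {code!r}")
    return number


def can_carry_x(proband_sex: str, sex_path: str) -> bool:
    """True if the ancestor at the end of sex_path could have passed
    X-chromosome DNA to the proband.

    A father never gives his X to a son, so any path containing two
    consecutive males (including the proband) is excluded.
    """
    return "MM" not in proband_sex + sex_path


# Maternal grandfather of a male proband: Ahnentafel 6, X-linked path.
print(ahnentafel("FM"), can_carry_x("M", "FM"))
# Paternal grandfather of a female proband: Ahnentafel 4, not X-linked.
print(ahnentafel("MM"), can_carry_x("F", "MM"))
```

Because the filter is a substring test on the concatenated string, the same check can be applied to every path collected during a traversal without revisiting the graph.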
Due to the length of this article, the full text is not reproduced here. Please see the PDF link above for the article content.
The appendices and supplements attached to the original article can be downloaded from the Appendices and Supplements tab.
Appendix 1 | GFG PlugIn jar file | Link | 10 |
Appendix 2 | Unzip in c:/genealogy/neo4j/ | Link | 20 |
Appendix 3 | List of UDF in GFG-PI – See Supplement 1 Tab Functions | | 22 |
Appendix 4 | GEDCOM-Kit curation file template | Link | 30 |
Appendix 5 | Triangulation group template | Link | 40 |
Appendix 6 | Data Dictionary (Neo4j Schema) | Link | 60 |
Appendix 7 | Database indices | Link | 70 |
Appendix 8 | Grass files: unzip and place in a convenient folder | Link | 80 |
Appendix 9 | Database properties | Link | 90 |
Supplement 1 | Know your data | Link | 10 |
Supplement 2 | X-chr descendancy tree | Link | 20 |
Supplement 3 | DNA Painter MSS segments | Link | 39 |
Supplement 4 | Y-haplotree | Link | 50 |