Exploring family relations from online obituaries using text mining and data visualisation tools
Given a copy of all the obituaries published online by the Ahram newspaper from January 2002 to April 2008, Linux command-line tools (gawk, sed, bash) can be used to find family relations between individuals in certain professions. The example given here explores the family links among a sample of 456 Egyptian state security officers.
What follows is a very brief description of the method.
The first step is to convert the HTML files …
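As a rough illustration of what an HTML conversion step could look like with the tools named above, here is a minimal sketch. It assumes the goal is plain text and that every tag opens and closes on the same line. The function name `strip_html` and the `obituaries/` directory are hypothetical, and this is not necessarily the conversion used for the Ahram data.

```bash
#!/usr/bin/env bash
# Hypothetical helper: read HTML on stdin, write plain text on stdout.
# Assumption: each tag opens and closes on the same line; tags that span
# several lines would need a multi-line approach (for example gawk with a
# custom record separator).
strip_html() {
  sed -e 's/<[^>]*>//g' \
      -e 's/&nbsp;/ /g' \
      -e 's/^[[:space:]]*//' \
      -e 's/[[:space:]]*$//' \
      -e '/^$/d'
}

# Example usage on a hypothetical directory of downloaded pages:
# for f in obituaries/*.html; do
#   strip_html < "$f" > "${f%.html}.txt"
# done
```

The expressions run in order on each line: remove tags, turn non-breaking-space entities into spaces, trim leading and trailing whitespace, then drop lines that end up empty. Other character entities and any encoding conversion are left out of this sketch.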