Congruence Engine investigations

saltaire2_visualisation

Data, code and documentation

Saltaire Phase 2 - Household Genealogies and Education

Short summary

The visualisation design follows principles of ‘subject in transit’ (Butterworth 2022) and, as such, is conceived in relation to a suite of exploratory data visualisation formats representing contexts between which subject focalisation is carried, allowing fluid movement through complex semantic datascapes.

Research questions

People

Alex Butterworth

Visualisation, Conceptualisation, Formal Analysis, Investigation, Methodology, Writing – original draft

Colin Coates

Resources, Data Curation, Formal Analysis, Investigation, Writing – review & editing

Nayomi Kasturchi-Arachchi

Conceptualisation, Formal Analysis, Software, Validation, Investigation, Methodology, Writing – original draft

Jo Kent

Conceptualisation, Formal Analysis, Investigation, Writing – original draft

Denice Penrose

Data Curation

Felix Needham-Simpson

Data Curation

Andrew Richardson

Visualisation, Conceptualisation, Investigation, Methodology, Software, Writing – original draft

Data sources (used or developed)

Investigation methods/ tools/ code/ software (used or developed)

• Preprocessing routines for source data, developed in Python and contained in 2 Jupyter notebooks. • Neo4j graph construction routine developed in Python and contained in a third Jupyter notebook. • Neo4j graph of the census data plus additional information on occupations.

• Initial data analysis to identify occupations not represented in the Booth Armstrong class and industry mapping • Normalisation of occupations as listed and mapping to classes and industry sectors • Processing of data using Jupyter notebooks to create household class mappings over time

• Community and identify analysis of graphed data for individuals and households.

• Note: whilst the Jupyter notebooks are shared to expose methodology, the actual graph cannot be made publicly accessible as any downloading of it would expose the source data, the majority of which we do not own.

Outputs

Licence

This work is licensed under a Creative Commons Attribution 4.0 License - CC BY 4.0.