Networks (or graph) are visualisations of the connections/interactions between elements of a dataset. It’s often possible to reframe your data as a network and extract useful information from analysis of the graph. Here are some examples of where graph may be interesting:
It’s extremely important to understand that you must not interpet features of a graph based on how close nodes are to one another. In general, the absolute positions of nodes is unimportant as most layout algorithms are non-deterministic; i.e. final node positions are computed by running a computer simulation.
Nodes are the unique elements in your data (the individuals). They might well have a number of different properties which are important when visualising the data:
Edges communicate how two nodes are connected to one another, they have the following properties:
The networks below are undirected and directed, respectively. Select a node to see it’s first degree neighbours, note that in the second graph the direction of the edges affects which nodes are highlighted.
As mentioned above, in general the absolute positions of nodes is unimportant. Most layout algorithms are non-deterministic, i.e. final node positions are computed by running a computer simulation. Deciding on which layout algorithm (or graph embedding) to use for your visualisations is somewhat of an art as opposed to a science. The three graph below display the exact same dataset using three very different layout algorithms from the excellent igraph library. In the future this website will include a page dedicated to choosing layout algorithms.