Neutrality Centrality and structural bias in networks

social network analysis
Author

Oskar Kosch

Published

September 5, 2018

Below is R code to compute the neutrality centrality.

library(igraph)
getNeutralityCentrality <- function(net,
                                    directed=FALSE,
                                    standardized=FALSE) {
  l <- length(V(net))
  m <- if(directed) 'in' else 'all'
  dist <- distances(net, mode=m)
  dist[dist==0] <- NA
  res <- apply(dist, 1, sd, TRUE)
  if(standardized==TRUE) {
    dia <- diameter(net, directed = directed)
    res <- res/sd(1:(dia))
  }
  return(res)
}

Cite the page as: Kosch O. Neutrality Centrality and structural bias in networks. Oskar Kosch blog. http://oskarkosch.com/blog/neutrality-centrality-and-structural-bias-in-networks/. Published September 5, 2018.

Wayback permalink: https://web.archive.org/web/20180905235722/http://oskarkosch.com/neutrality-centrality-and-bias-in-networks.html


Introduction - purpose of this measure

This is concept paper (understood as novel ideas presentation1), and as such it is not deepened neither with multiple empirical cases, nor with sophisticated theoretical background. Instead, it presents an approach that first helped me with analysis of scientific publications co-citation network (to know which shouldn’t be biased - and in economics this is great deal!), and then evolved to idea of broader usage, with its possible applications in consensus decision making, unbiased location selection or every other situation that would require lowest structural bias of solution (maybe some social sciences problems?).

Neutrality Centrality formula

Given connected graph, equation for neutrality centrality (\(C_N\)) is as follows: \[C_N(v_i)=\sqrt{\frac{1}{\mid V \setminus \left\{ {v_i}\right\} \mid}\sum_{j=1, j \neq i}^{\mid V \mid}[d(v_i,v_j)-\overline{d}(v_i)]^2}\] where \(V\) is set of all vertices in network, \(d(v_i,v_j)\) is distance between vertex \(v_i\) and vertex \(v_j\) and \(\overline{d}(v_i)\) is mean distance of vertex \(i\) to other vertices.

Take an example of star typology network: The perfect non-biased vertex in that example is node no 1, and this is quite obvious; being equally connected to other nodes, there is no suspicion of bias. This is self explanatory. Other nodes are single-edged, connected with central node - and there is not a lot of bias, since they wouldn’t even be clustered separately (see Figure 1).

Figure 1: Simple star network example
Source: author

Neutrality centrality of nodes is presented in Table 1.

Table 1: Simple star network example vertices neutrality

measure \ node

1

2

3

4

5

6

7

neutrality

0

0.408

0.408

0.408

0.408

0.408

0.408

standardized.neutrality

0

0.577

0.577

0.577

0.577

0.577

0.577

Source: author

Next example is connected to the problem of normalization / standarization. Normalization in style that \(C_N \in <0;1>\), was not found. What was observed, is the fact that standarization based on connected graph diameter is possible and effective in terms of comparing graphs’ structure. \(D\) stands for diameter, and the formula is: \[C_{N_S}(v_i)=\frac{C_N(v_i)}{\sqrt{\frac{1}{D}\sum_{z=1}^{D}}(z-\overline{z})^2}\] Of course, both standardized and non-standardized version may be calculated in case of weighted graph, if we adjust that while calculating distances. This is inspired by graph with highest diameter possible, the one that is somewhat one-dimensional (see Figure 2):

Figure 2: Single dimension graph
Source: author

And, as can be seen (in Table 2), standard deviation from opposite vertices is equal to values achieved with same diameter, so standardized value is equal to 1.

Table 2: Single dimension example vertices neutrality

measure \ node

1

2

3

4

5

6

7

neutrality

1.87

1.633

1.169

0.894

1.169

1.633

1.87

standardized.neutrality

1.00

0.873

0.625

0.478

0.625

0.873

1.00

Source: author

Idea is that this is a graph that could be presented in single dimension, and therefore easily show two strictly opposite positions, but in linear mode. Based on graph like this, to increase \(C_{N_S}\) over 1, the only solution is to expand to two dimensions with adding two nodes connected either to first or last vertex (see Figure 3):

Figure 3: Graph with standardized vertex bias over 1
Source: author

With nodes 8 and 9 connected to first one, node no 7 has \(C_{N_S}\) over 1 (see Table 3) - and this formula is applying some bias to vertices that are distanced from “center”, more than to the ones in “crowd” (and for that reason adding nodes in middle wouldn’t result in standardized bias over 1). Presence of standardized values over 1 indicates, that graph has high internal structural bias diversity, and is likely to contain some visible differences - eg. two main parties, cities (by population and location) in country, etc.

Table 3: Two dimensions example vertices neutrality

measure \ node

1

2

3

4

5

6

7

8

9

neutrality

1.959

1.414

1.061

1.195

1.64

2.070

2.26

2.121

2.121

standardized.neutrality

0.907

0.655

0.491

0.553

0.76

0.958

1.05

0.982

0.982

Source: author

Adding more “clusters” to the ends lowers standardized neutrality, because relative bias within network is smaller. Usually bipolar structure will deliver highest relative bias. But in terms of what is most important - minimalizing bias, achieving neutrality - algorithm is not changing great and lowest bias still means highest structural neutrality.

Weighted version

This algorithm can have weighted version, only the distance should be adjusted to that. Only requirement is that a graph is connected one.

Clusters

Clustering can be performed in kind of voting style, where every vertex votes for the most biased one it is connected to (or create new cluster, if no other vertex is more biased than himself). Such kind of algorithm could use a starter list of cluster-making nodes and due to its almost linear process, quite good performance could be accomplished. It is based on single assumption, that neutral vertices are “convinced” by more biased ones - at least in terms of structure. Cut-off also can be added - many social entities may not bend to too biased nodes, constructing some sort of “mainstream” with “common sense” approach.

Uses

While there are plenty uses for this formula, its standardized version and clusters based on it, the most useful ones are tied to bias minimalization (\(min(C_N)\)). Some of them are:

  1. Unbiased location of things (items, services, etc. equal distribution)

  2. Checking for most neutral structural attitude (like in citations) and possible mediation role

  3. Consensus alghorithm - if there was a mixed network of ideas/decisions and social entities (people, groups, parties) each subgraph of idea and entites could be taken and studied, and the lowest bias one would be the consensus. This would probably be sub-optimal, but acceptable by players.

  4. Content delivery applications, where first is served the content that would be most neutral when added to one’s ego network.

Uses are both observational and normative. As observational we could look for mediation entity, eg. least biased publication or person, that should express most neutral opinions and could be engaged as middleman. Normative would be connected to governing social systems, like countries or cities.

As I wrote - reason why I used this approach was because out of plenty books and articles I wanted to get the most neutral one, at least judging from its cited references list. I achieved good results doing so and enhanced this idea a little, to be able to share it. You are most welcome to comment it and help to make it better! :)

Footnotes

  1. https://www.journals.elsevier.com/journal-of-chromatography-a/announcements/the-concept-paper↩︎