How Graph Theory Could Be Used to Analyze the News

Using graph theory, one could build a network of the entity relationships that appear in daily news articles and analyze the connections between those entities. Such a network can be constructed using the following method, as outlined by Ferencz (2020):

  • Extract entities from news articles and define them as the nodes of the graph. An entity is either a person or an organization.
  • Link each pair of entities that appear together in a news article, or increase the weight of the link if it already exists.

Using the above method, we would arrive at a network of people and organizations in which the edge weights denote the strength of their relationships. An example of such a network is the following:

Graph generated using news from the UK on March 26, 2020. Source: https://towardsdatascience.com/building-a-social-network-from-the-news-using-graph-theory-by-marcell-ferencz-9155d314e77f
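The two construction steps listed earlier can be sketched in a few lines of Python. This is only a minimal illustration, not the code used by Ferencz (2020): the `articles` data and the function name `build_cooccurrence_graph` are made up for this example, and the entity extraction step is assumed to have already been done.

```python
from collections import Counter
from itertools import combinations

# Hypothetical input: each article is already reduced to the set of
# entities (people or organizations) that an extraction step found in it.
articles = [
    {"Alice", "Bob", "Acme Corp"},
    {"Alice", "Bob"},
    {"Bob", "Carol"},
]

def build_cooccurrence_graph(articles):
    """Return edge weights keyed by an alphabetically sorted entity pair."""
    weights = Counter()
    for entities in articles:
        # Every unordered pair of entities in the same article gets a link,
        # or one more unit of weight if the link already exists.
        for pair in combinations(sorted(entities), 2):
            weights[pair] += 1
    return weights

edges = build_cooccurrence_graph(articles)
for (a, b), weight in sorted(edges.items()):
    print(f"{a} -- {b} (weight {weight})")
```

Sorting each article's entities before pairing them means that (Alice, Bob) and (Bob, Alice) are counted as the same undirected edge.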

Several questions can be answered by constructing such a network. For example, one could ask, "How are person X and person Y related?" Questions like this can be answered by analyzing the connections present in the graph. If there is a link between person X and person Y, we can infer that they were both mentioned in at least one news article, and so were probably involved in the same event. If there is no direct link, but a mutual entity connects person X and person Y, then it is plausible that X and Y know each other in real life.

Beyond describing the relationship between two entities, the connected components that form during the construction of the graph can be used to group related entities. Ferencz (2020), using the network derived from news articles published in the UK, was able to categorize the news topics into eight themes (US politics, Australian academic institutes, and so on). The categorization was derived by carefully visualizing the connected components present in the network.
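Both kinds of question can be sketched with a breadth-first search. The example below is an illustration under assumptions: the small `graph` adjacency list and the function names are hypothetical, and edge weights are ignored for simplicity.

```python
from collections import deque

# Hypothetical unweighted adjacency list: X and Y share the mutual entity M,
# while P and Q form a separate group.
graph = {
    "X": {"M"},
    "M": {"X", "Y"},
    "Y": {"M"},
    "P": {"Q"},
    "Q": {"P"},
}

def shortest_path(graph, start, goal):
    """Breadth-first search; returns the chain of entities, or None."""
    previous = {start: None}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            # Walk back through the recorded predecessors.
            path = []
            while node is not None:
                path.append(node)
                node = previous[node]
            return path[::-1]
        for neighbor in graph[node]:
            if neighbor not in previous:
                previous[neighbor] = node
                queue.append(neighbor)
    return None

def connected_components(graph):
    """Group entities so that each group is internally connected."""
    seen = set()
    components = []
    for start in graph:
        if start in seen:
            continue
        component = {start}
        queue = deque([start])
        while queue:
            node = queue.popleft()
            for neighbor in graph[node]:
                if neighbor not in component:
                    component.add(neighbor)
                    queue.append(neighbor)
        seen |= component
        components.append(component)
    return components

print(shortest_path(graph, "X", "Y"))   # X and Y are linked through M
print(shortest_path(graph, "X", "P"))   # no chain of articles links X and P
print(connected_components(graph))
```

A path of length one corresponds to a direct link, a path of length two to a mutual entity, and `None` means the two entities fall into different connected components.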

Furthermore, we could use the network to analyze the popularity or influence that a person has in the media. Such an analysis is straightforward: it looks at how well a given entity is connected to other entities. Specifically, if a given entity has paths leading to many other entities in the graph, we can conclude that the entity is quite popular in the news media. This ties back to the idea of degrees of separation discussed in the lectures. The Bacon number is a related concept that also measures degrees of separation, but specifically in Hollywood, where actors are linked to Kevin Bacon through co-appearances. It was found that only 12% of all Hollywood actors cannot be tied to Bacon through co-appearances. The same idea could be used to identify the most influential person in the news by analyzing the links between the entities in the constructed graph.
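One simple way to put a number on this connectivity is to count each entity's direct neighbors and add up the weights of its links. The sketch below uses that measure as a rough proxy for popularity. It is an assumption of this example rather than a method taken from the source, and the `edges` data and the function name `rank_by_connectivity` are hypothetical.

```python
from collections import defaultdict

# Hypothetical weighted edges: the weight is the number of shared articles.
edges = {
    ("Alice", "Bob"): 3,
    ("Alice", "Carol"): 1,
    ("Bob", "Dave"): 2,
}

def rank_by_connectivity(edges):
    """Rank entities by distinct neighbors, then by total edge weight."""
    neighbors = defaultdict(set)
    strength = defaultdict(int)
    for (a, b), weight in edges.items():
        neighbors[a].add(b)
        neighbors[b].add(a)
        strength[a] += weight
        strength[b] += weight
    return sorted(
        neighbors,
        key=lambda entity: (len(neighbors[entity]), strength[entity]),
        reverse=True,
    )

ranking = rank_by_connectivity(edges)
print(ranking)
```

Here Alice and Bob each have two neighbors, so the total edge weight breaks the tie in Bob's favor. Counting entities reachable within a few steps, in the spirit of the Bacon number, would be a natural alternative measure.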

In conclusion, graph theory could be applied to the news to analyze the relationships between the entities that appear in it. The relationship between two particular entities, the categorization of entities into news topics, and the degree of influence of an entity are only a few of the insights that can be drawn from the news network. In principle, the same construction could be extended to other industries to answer questions specific to each of them.

References:

  • Ferencz, Marcell (April 13, 2020). Building a Social Network from the News using Graph Theory. Retrieved October 17, 2020, from https://towardsdatascience.com/building-a-social-network-from-the-news-using-graph-theory-by-marcell-ferencz-9155d314e77f
