The Higgs dataset was built by monitoring how information spread on Twitter before, during and after the announcement, on 4th July 2012, of the discovery of a new particle with the features of the elusive Higgs boson. It covers the messages posted on Twitter about this discovery between 1st and 7th July 2012.
Source of dataset: https://snap.stanford.edu/data/higgs-twitter.html
This project involves a comprehensive analysis of Twitter mention networks with NetworkX, Gephi and Cytoscape.
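As a starting point, here is a minimal sketch of loading a mention network into NetworkX. It assumes the edge list is whitespace-separated with one `source target weight` triple per line; the helper name `load_mention_graph` and the sample lines are illustrative and not taken from the project.

```python
import networkx as nx


def load_mention_graph(lines):
    """Parse 'source target weight' lines into a weighted directed graph.

    Assumption: each line holds two integer user IDs and an integer weight
    (number of mentions), separated by whitespace.
    """
    return nx.parse_edgelist(
        lines,
        nodetype=int,
        data=(("weight", int),),
        create_using=nx.DiGraph,
    )


# Tiny hand-made sample standing in for the real edge list file.
sample_lines = ["1 2 3", "2 3 1", "3 1 2", "4 1 1"]
G = load_mention_graph(sample_lines)
print(G.number_of_nodes(), G.number_of_edges())
```

For the real data, the same function can be given an open file object for the downloaded edge list instead of `sample_lines`.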
-
Network Analysis:
- Topology
- Degree Distribution
- Clustering Coefficient
- Density
- Connected Components
- Average Path Length
- Diameter
- Assortativity
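The metrics listed above can be sketched with NetworkX roughly as follows. This is an illustration on a toy graph, not the project's own code: the function name `summarize` is hypothetical, path-based metrics are computed on the largest connected component of the undirected projection (an assumption about how disconnected graphs are handled), and the assortativity call needs NumPy installed.

```python
import networkx as nx


def summarize(G):
    """Return basic topology metrics for a directed graph G."""
    U = G.to_undirected()
    # Path lengths are only defined inside a connected component,
    # so use the largest one of the undirected projection.
    largest = max(nx.connected_components(U), key=len)
    giant = U.subgraph(largest)
    return {
        "density": nx.density(G),
        "average_clustering": nx.average_clustering(U),
        "components": nx.number_connected_components(U),
        "average_path_length": nx.average_shortest_path_length(giant),
        "diameter": nx.diameter(giant),
        "assortativity": nx.degree_assortativity_coefficient(U),
        # degree_histogram[d] is the number of nodes with degree d.
        "degree_histogram": nx.degree_histogram(U),
    }


# Toy graph: a triangle with one pendant node, plus a separate edge.
toy = nx.DiGraph([(1, 2), (2, 3), (3, 1), (3, 4), (5, 6)])
stats = summarize(toy)
for name, value in stats.items():
    print(name, value)
```

On a network the size of the Higgs dataset, exact average path length and diameter are expensive to compute, so sampling source nodes may be more practical.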
-
Centrality Algorithms:
- k-shell
- HITS (Hyperlink-Induced Topic Search)
- PageRank
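A minimal sketch of how these three scores can be obtained with NetworkX follows. The function name `centrality_scores`, the toy graph and the parameter values are illustrative assumptions; the k-shell index is taken from `nx.core_number`, and recent NetworkX versions need SciPy for HITS and PageRank.

```python
import networkx as nx


def centrality_scores(G):
    """Compute k-shell index, HITS hub/authority scores and PageRank."""
    # core_number rejects self-loops, so work on a copy without them.
    H = G.copy()
    H.remove_edges_from(list(nx.selfloop_edges(H)))
    shell_index = nx.core_number(H)

    hubs, authorities = nx.hits(G, max_iter=1000)
    pagerank = nx.pagerank(G, alpha=0.85)
    return shell_index, hubs, authorities, pagerank


# Toy directed graph: an edge u -> v means "u mentions v".
toy = nx.DiGraph([(2, 1), (3, 1), (4, 1), (1, 2), (2, 3), (3, 2)])
shell_index, hubs, authorities, pagerank = centrality_scores(toy)

# Rank users by PageRank, highest first.
ranking = sorted(pagerank, key=pagerank.get, reverse=True)
print(ranking)
```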
-
Community Detection Algorithms:
- k-core
- Louvain
- Infomap
- Label Propagation
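The sketch below shows how three of these methods can be run with NetworkX on a toy graph. It is an assumption-laden illustration rather than the project's code: the graph is treated as undirected, `louvain_communities` requires NetworkX 2.8 or later, and Infomap is left out because it is not part of NetworkX and needs a separate package.

```python
import networkx as nx

# Toy graph: two 4-cliques joined by one edge, plus a pendant node 8.
toy = nx.barbell_graph(4, 0)
toy.add_edge(0, 8)

# k-core: the maximal subgraph in which every node has degree >= k.
core3 = nx.k_core(toy, k=3)

# Louvain: greedy modularity optimisation (seed fixed for repeatability).
louvain = nx.community.louvain_communities(toy, seed=42)
louvain_q = nx.community.modularity(toy, louvain)

# Label propagation: NetworkX's built-in semi-synchronous variant.
lpa = list(nx.community.label_propagation_communities(toy))

print(sorted(core3.nodes()))
print(louvain, round(louvain_q, 3))
print(lpa)
```

The node-to-community mapping from any of these methods can be written to a node attribute and exported (for example as GEXF or GraphML) to colour the network in Gephi or Cytoscape.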
-
Tools and Libraries:
- NetworkX library in Python for network analysis.
- Gephi and Cytoscape for visualization.
-
Custom Implementation:
- Label Propagation algorithm implemented from scratch in Python.
- A detailed report in PDF format is available (in Persian).
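To give an idea of what a from-scratch Label Propagation implementation involves, here is a minimal sketch of the asynchronous variant in plain Python. It is not the project's implementation: the function name `label_propagation`, the adjacency-dictionary input and the tie-breaking rule (keep the current label when it is among the most frequent, otherwise pick at random) are all assumptions made for illustration.

```python
import random
from collections import Counter


def label_propagation(adjacency, seed=0, max_rounds=100):
    """Asynchronous label propagation on an undirected graph.

    adjacency maps each node to an iterable of its neighbours.
    Returns a list of communities (sets of nodes).
    """
    rng = random.Random(seed)
    labels = {node: node for node in adjacency}  # unique start labels
    nodes = list(adjacency)

    for _ in range(max_rounds):
        rng.shuffle(nodes)  # visit nodes in a fresh random order
        changed = False
        for node in nodes:
            if not adjacency[node]:
                continue  # isolated nodes keep their own label
            counts = Counter(labels[nbr] for nbr in adjacency[node])
            best = max(counts.values())
            candidates = [lab for lab, c in counts.items() if c == best]
            if labels[node] in candidates:
                continue  # already holds a most frequent label
            labels[node] = rng.choice(candidates)
            changed = True
        if not changed:
            break  # every node agrees with a majority of its neighbours

    communities = {}
    for node, lab in labels.items():
        communities.setdefault(lab, set()).add(node)
    return list(communities.values())


# Two disjoint triangles and one isolated node.
toy = {
    0: [1, 2], 1: [0, 2], 2: [0, 1],
    3: [4, 5], 4: [3, 5], 5: [3, 4],
    6: [],
}
communities = label_propagation(toy)
print(sorted(communities, key=min))
```

Because node order and tie-breaking are random, different seeds can give different partitions on less clear-cut graphs, which is a known property of label propagation.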