DynamicGEM: Dynamic graph to vector embedding

Learning graph representations is a fundamental task aimed at capturing various properties of graphs in vector space. Most recent methods learn such representations for static networks. However, real-world networks evolve over time and have varying dynamics. Capturing this evolution is key to predicting the properties of unseen networks. To understand how network dynamics affect prediction performance, various embedding approaches have been proposed. In the dynamicGEM package, we present several recently proposed algorithms: Incremental SVD, Rerun SVD, Optimal SVD, Dynamic TRIAD, Static AE, Dynamic AE, Dynamic RNN, and Dynamic AERNN. We have formatted the algorithms so that they can be easily compared with each other. This library is published as DynamicGEM: A Library for Dynamic Graph Embedding Methods [0].

Implemented Methods

dynamicGEM implements the following graph embedding techniques:

  • Incremental SVD: This method captures the dynamics of the graph with a perturbation matrix and performs additive modifications to the SVD. [1]
  • Rerun SVD: This method uses incremental SVD to create the dynamic graph embedding. In addition, it uses a tolerance threshold to restart the optimal SVD calculation and avoid error accumulation in the incremental embedding. [2]
  • Optimal SVD: This method decomposes the adjacency matrix of the graph at each time-step using Singular Value Decomposition (SVD) to represent each node using the d largest singular values. [3]
  • Dynamic TRIAD: This method utilizes the triadic closure process to generate a graph embedding that preserves structural and evolution patterns of the graph. [4]
  • Static AE: This method uses a deep autoencoder to learn the representation of each node in the graph. [5]
  • Dynamic AE: This method models the interconnection of nodes within and across time using multiple fully connected layers. It extends Static AE to dynamic graphs. [6]
  • Dynamic RNN: This method uses sparsely connected Long Short Term Memory (LSTM) networks to learn the embedding. [6]
  • Dynamic AERNN: This method uses a fully connected encoder to initially acquire a low-dimensional hidden representation and feeds this representation into LSTMs to capture network dynamics. [6]
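As a rough illustration of the SVD family above, the Optimal SVD idea can be sketched with NumPy: embed each node with its coordinates in the rank-d truncated SVD of the adjacency matrix at one time-step. This is a minimal sketch, not the library's implementation; the restart tolerance of Rerun SVD is only indicated in a comment.

```python
import numpy as np

def svd_embedding(adj, d):
    """Embed each node using the top-d singular vectors of the
    adjacency matrix (the Optimal SVD idea, sketched)."""
    U, s, Vt = np.linalg.svd(adj, full_matrices=False)
    # Scale singular vectors by sqrt of the singular values so that
    # source and target embeddings jointly reconstruct adj.
    source = U[:, :d] * np.sqrt(s[:d])
    target = Vt[:d, :].T * np.sqrt(s[:d])
    return source, target

# Toy adjacency matrix for a single time-step of a dynamic graph.
adj_t0 = np.array([[0, 1, 0],
                   [1, 0, 1],
                   [0, 1, 0]], dtype=float)
src, tgt = svd_embedding(adj_t0, d=2)
# Incremental SVD would update this factorization additively as edges
# change; Rerun SVD recomputes it from scratch only when the tracked
# approximation error exceeds a tolerance threshold.
```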

Graph Format

Due to variation in the graph formats used by different embedding algorithms, we have written custom utils: dataprep_util can convert various data types to the required dynamic graph format, stored as a list of DiGraph (directed weighted graph) objects, one per time-step. The networkx package is used to handle these graphs. The weight of each edge is stored as the attribute "weight". The graphs are saved using nx.write_gpickle and loaded using nx.read_gpickle. For datasets that do not have this structure, we provide methods (for example, get_graph_academic for the academic dataset) to convert them into the desired graph format.
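The expected format can be sketched as follows: a Python list of networkx DiGraph objects, one per time-step, with edge weights stored under the "weight" attribute. This is a minimal sketch; note that newer networkx releases removed write_gpickle/read_gpickle, so plain pickle is used here for persistence, which is equivalent.

```python
import pickle
import networkx as nx

# One directed weighted graph per time-step.
graphs = []
for t in range(3):
    g = nx.DiGraph()
    g.add_edge(0, 1, weight=1.0)
    g.add_edge(1, 2, weight=0.5 + t)  # an edge whose weight evolves over time
    graphs.append(g)

# The library persists each graph with nx.write_gpickle / nx.read_gpickle
# (networkx 1.x API); plain pickle produces the same result.
data = pickle.dumps(graphs)
restored = pickle.loads(data)
```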

Repository Structure

  • DynamicGEM/embedding: It consists of the most recent dynamic graph embedding approaches, with each file implementing a single embedding method. It also includes some static graph embedding approaches as baselines.
  • DynamicGEM/evaluation: Currently, graph reconstruction and link prediction are implemented for evaluation.
  • DynamicGEM/utils: It consists of various utility functions for graph data preparation, embedding formatting, plotting, etc.
  • DynamicGEM/graph_generation: It consists of functions to generate a dynamic stochastic block model with a diminishing community.
  • DynamicGEM/visualization: It consists of functions for plotting the static and dynamic embeddings of the dataset.
  • DynamicGEM/experiments: The functions for hyper-parameter tuning are located in this folder.
  • DynamicGEM/TIMERS: The MATLAB source code of TIMERS, along with added MATLAB modules for dataset preparation, is located in this folder.
  • DynamicGEM/dynamicTriad: It consists of the dynamicTriad source code.
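To illustrate what the graph-reconstruction evaluation measures, here is a simplified sketch: rank candidate node pairs by the dot product of their embeddings and compute precision@k against the true adjacency matrix. The helper name `precision_at_k` is hypothetical; the library's actual implementation lives in DynamicGEM/evaluation.

```python
import numpy as np

def precision_at_k(adj, emb, k):
    """Fraction of the top-k highest-scoring node pairs that are true
    edges; scores are dot products of node embeddings (a sketch,
    not the library's API)."""
    n = adj.shape[0]
    scores = emb @ emb.T
    pairs = [(scores[i, j], adj[i, j])
             for i in range(n) for j in range(n) if i != j]
    pairs.sort(key=lambda p: -p[0])  # highest-scoring pairs first
    hits = sum(truth for _, truth in pairs[:k])
    return hits / k

adj = np.array([[0, 1, 0],
                [1, 0, 1],
                [0, 1, 0]])
# A 2-d embedding that places neighbouring nodes close together.
emb = np.array([[1.0, 0.0],
                [0.8, 0.6],
                [0.0, 1.0]])
```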

Dependencies

dynamicgem is tested to work on Python 3.5. The modules with working dependency versions are listed as follows:

  • graph_tool>=2.27
  • Cython==0.29
  • decorator==4.3.0
  • dill==0.2.8.2
  • h5py==2.8.0
  • joblib==0.12.5
  • Keras==2.2.4
  • Keras-Applications==1.0.6
  • Keras-Preprocessing==1.0.5
  • matlabruntimeforpython===R2017a
  • matplotlib==3.0.1
  • networkx==1.11
  • numpy==1.15.3
  • pandas==0.23.4
  • scikit-learn==0.20.0
  • scipy==1.1.0
  • seaborn==0.9.0
  • six==1.11.0
  • sklearn==0.0
  • tensorflow==1.11.0 or tensorflow-gpu==1.11.0 (whichever is compatible with python 3.5)
  • Theano==1.0.3

Install

Before setting up DynamicGEM, we suggest that dynamicTriad and TIMERS be properly set up first.

  • TIMERS is originally written in MATLAB; in dynamicgem we have created Python modules for TIMERS using the MATLAB Library Compiler. We used MATLAB R2017a to generate modules that work with Python 3.5. To run them, configure the MATLAB Runtime by downloading it from "https://www.mathworks.com/products/compiler/matlab-runtime.html" and following the steps in "https://www.mathworks.com/help/compiler/install-the-matlab-runtime.html". The source code of TIMERS, along with the setup files, is located in the dynamicgem/TIMERS folder.
    • Do not forget to export the MATLAB Runtime library path if you did not use sudo to install it (i.e. sudo ./install):
         export LD_LIBRARY_PATH="/usr/local/MATLAB/MATLAB_Runtime/v92/bin/glnxa64:/usr/local/MATLAB/MATLAB_Runtime/v92/runtime/glnxa64:/usr/local/MATLAB/MATLAB_Runtime/v92/sys/os/glnxa64:$LD_LIBRARY_PATH"
    • Due to a bug in MATLAB Runtime R2017a, perform the following steps to resolve the issue:
       cd /<full-path-to-MATLAB_Runtime>/v92/bin/glnxa64
       mv libexpat.so.1 libexpat.so.1.NOFIND
    • To set up TIMERS, perform the following steps:
         cd dynamicgem/TIMERS/TIMERS_ALL/for_redistribution_files_only
         python setup.py install --user  
    • To install for all users in Unix/Linux:
         sudo python setup.py install
  • We have built dynamicTriad using Python 3.5. Please follow "https://github.com/luckiezhou/DynamicTriad" to install the libraries necessary for running dynamicTriad. You may also build it for a particular version of Python.
    • For the graph_tool setup, if you are using a virtual environment and did not use sudo to set up Python modules, make sure to perform the following:
        sudo find /usr/. -name graph_tool  # find the <path-to-graph_tool>
        export PYTHONPATH="<path-to-graph_tool>:$PYTHONPATH"
    • Also, for the compiled C module mygraph.so, edit the dynamicGEM/dynamicgem/dynamictriad/core/gconv.py file, replacing the path with the absolute path of the DynamicGEM folder.
  • For setting up the rest of the methods, the package uses setuptools, which is a common way of installing Python modules.
    • To install in your home directory, use:
        export PYTHONPATH="/<...>/python3.5/site-packages/:$PYTHONPATH"
        python setup.py install --user
    • To install for all users on Unix/Linux:
         sudo python setup.py install

Usage Example

import matplotlib.pyplot as plt
from time import time
import networkx as nx
import pickle
import numpy as np
import os

#import helper libraries
from dynamicgem.utils      import graph_util, plot_util, dataprep_util
from dynamicgem.evaluation import visualize_embedding as viz
from dynamicgem.visualization import plot_dynamic_sbm_embedding
from dynamicgem.evaluation import evaluate_graph_reconstruction as gr
from dynamicgem.graph_generation import dynamic_SBM_graph as sbm

#import the methods
from dynamicgem.embedding.ae_static    import AE
from dynamicgem.embedding.dynamicTriad import dynamicTriad
from dynamicgem.embedding.TIMERS       import TIMERS
from dynamicgem.embedding.dynAE        import DynAE
from dynamicgem.embedding.dynRNN       import DynRNN
from dynamicgem.embedding.dynAERNN     import DynAERNN


# Parameters for Stochastic block model graph
# Total of 1000 nodes
node_num           = 1000
# Test with two communities
community_num      = 2
# At each iteration, migrate 10 nodes from one community to another
node_change_num    = 10
# Length of total time steps the graph will dynamically change
length             = 7
# output directory for result
outdir = './output'
intr='./intermediate'
if not os.path.exists(outdir):
    os.mkdir(outdir)
if not os.path.exists(intr):
    os.mkdir(intr)  
testDataType = 'sbm_cd'
#Generate the dynamic graph
dynamic_sbm_series = list(sbm.get_community_diminish_series_v2(node_num, 
                                                          community_num, 
                                                          length, 
                                                          1, # community ID to perturb
                                                          node_change_num))
graphs     = [g[0] for g in dynamic_sbm_series]
# parameters for the dynamic embedding
# dimension of the embedding
dim_emb  = 128
lookback = 2


#AE Static
embedding = AE(d            = dim_emb, 
                 beta       = 5, 
                 nu1        = 1e-6, 
                 nu2        = 1e-6,
                 K          = 3, 
                 n_units    = [500, 300, ],
                 n_iter     = 200, 
                 xeta       = 1e-4,
                 n_batch    = 100,
                 modelfile  = ['./intermediate/enc_modelsbm.json',
                             './intermediate/dec_modelsbm.json'],
                 weightfile = ['./intermediate/enc_weightssbm.hdf5',
                             './intermediate/dec_weightssbm.hdf5'])
embs  = []
t1 = time()
#ae static
for temp_var in range(length):
    emb, _= embedding.learn_embeddings(graphs[temp_var])
    embs.append(emb)
print(embedding._method_name+':\n\tTraining time: %f' % (time() - t1))

viz.plot_static_sbm_embedding(embs[-4:], dynamic_sbm_series[-4:])   

The visualization of the embedding is as follows:

#TIMERS
datafile  = dataprep_util.prep_input_TIMERS(graphs, length, testDataType) 
embedding = TIMERS(K         = dim_emb, 
                 Theta         = 0.5, 
                 datafile      = datafile,
                 length        =  length,
                 nodemigration = node_change_num,
                 resultdir     = outdir,
                 datatype      = testDataType)
if not os.path.exists(outdir):
    os.mkdir(outdir)
outdir_tmp=outdir+'/sbm_cd'
if not os.path.exists(outdir_tmp):
    os.mkdir(outdir_tmp)
if not os.path.exists(outdir_tmp+'/incrementalSVD'):
    os.mkdir(outdir_tmp+'/incrementalSVD')
if not os.path.exists(outdir_tmp+'/rerunSVD'):
    os.mkdir(outdir_tmp+'/rerunSVD') 
if not os.path.exists(outdir_tmp+'/optimalSVD'):
    os.mkdir(outdir_tmp+'/optimalSVD') 

t1 = time()
embedding.learn_embedding()
embedding.get_embedding(outdir_tmp, 'optimalSVD')
print(embedding._method_name+':\n\tTraining time: %f' % (time() - t1))
embedding.plotresults(dynamic_sbm_series)  

The visualization of the embedding is as follows:

#dynAE
embedding= DynAE(d           = dim_emb,
                 beta           = 5,
                 n_prev_graphs  = lookback,
                 nu1            = 1e-6,
                 nu2            = 1e-6,
                 n_units        = [500, 300,],
                 rho            = 0.3,
                 n_iter         = 250,
                 xeta           = 1e-4,
                 n_batch        = 100,
                 modelfile      = ['./intermediate/enc_model_dynAE.json', 
                                   './intermediate/dec_model_dynAE.json'],
                 weightfile     = ['./intermediate/enc_weights_dynAE.hdf5', 
                                   './intermediate/dec_weights_dynAE.hdf5'],
                 savefilesuffix = "testing" )
embs = []
t1 = time()
for temp_var in range(lookback+1, length+1):
    emb, _ = embedding.learn_embeddings(graphs[:temp_var])
    embs.append(emb)
print(embedding._method_name+':\n\tTraining time: %f' % (time() - t1))
plt.figure()
plt.clf()    
plot_dynamic_sbm_embedding.plot_dynamic_sbm_embedding_v2(embs[-5:-1], dynamic_sbm_series[-5:])    
plt.show()

The visualization of the embedding is as follows:

#dynRNN
embedding= DynRNN(d        = dim_emb,
                beta           = 5,
                n_prev_graphs  = lookback,
                nu1            = 1e-6,
                nu2            = 1e-6,
                n_enc_units    = [500,300],
                n_dec_units    = [500,300],
                rho            = 0.3,
                n_iter         = 250,
                xeta           = 1e-3,
                n_batch        = 100,
                modelfile      = ['./intermediate/enc_model_dynRNN.json', 
                                  './intermediate/dec_model_dynRNN.json'],
                weightfile     = ['./intermediate/enc_weights_dynRNN.hdf5', 
                                  './intermediate/dec_weights_dynRNN.hdf5'],
                savefilesuffix = "testing"  )
embs = []
t1 = time()
for temp_var in range(lookback+1, length+1):
    emb, _ = embedding.learn_embeddings(graphs[:temp_var])
    embs.append(emb)
print(embedding._method_name+':\n\tTraining time: %f' % (time() - t1))
plt.figure()
plt.clf()    
plot_dynamic_sbm_embedding.plot_dynamic_sbm_embedding_v2(embs[-5:-1], dynamic_sbm_series[-5:])    
plt.show()

The visualization of the embedding is as follows:

#dynAERNN
embedding = DynAERNN(d   = dim_emb,
            beta           = 5,
            n_prev_graphs  = lookback,
            nu1            = 1e-6,
            nu2            = 1e-6,
            n_aeunits      = [500, 300],
            n_lstmunits    = [500,dim_emb],
            rho            = 0.3,
            n_iter         = 250,
            xeta           = 1e-3,
            n_batch        = 100,
            modelfile      = ['./intermediate/enc_model_dynAERNN.json', 
                              './intermediate/dec_model_dynAERNN.json'],
            weightfile     = ['./intermediate/enc_weights_dynAERNN.hdf5', 
                              './intermediate/dec_weights_dynAERNN.hdf5'],
            savefilesuffix = "testing")

embs = []
t1 = time()
for temp_var in range(lookback+1, length+1):
    emb, _ = embedding.learn_embeddings(graphs[:temp_var])
    embs.append(emb)
print(embedding._method_name+':\n\tTraining time: %f' % (time() - t1))
plt.figure()
plt.clf()    
plot_dynamic_sbm_embedding.plot_dynamic_sbm_embedding_v2(embs[-5:-1], dynamic_sbm_series[-5:])    
plt.show()

The visualization of the embedding is as follows:

#dynamicTriad
datafile  = dataprep_util.prep_input_dynTriad(graphs, length, testDataType)
embedding= dynamicTriad(niters     = 20,
                 starttime  = 0,
                 datafile   = datafile,
                 batchsize  = 1000,
                 nsteps     = length,
                 embdim     = dim_emb,
                 stepsize   = 1,
                 stepstride = 1,
                 outdir     = outdir,
                 cachefn    = '/tmp/'+ testDataType,
                 lr         = 0.1,
                 beta       = [0.1,0.1],
                 negdup     = 1,
                 datasetmod = 'core.dataset.adjlist',
                 trainmod   = 'dynamicgem.dynamictriad.core.algorithm.dynamic_triad',
                 pretrain_size = length,
                 sampling_args = {},
                 validation = 'link_reconstruction',
                 datatype   = testDataType,
                 scale      = 1,
                 classifier = 'lr',
                 debug      = False,
                 test       = 'link_predict',
                 repeat     = 1,
                 resultdir  = outdir,
                 testDataType = testDataType,
                 clname       = 'lr',
                 node_num     = node_num )
t1 = time()
embedding.learn_embedding()
print(embedding._method_name+':\n\tTraining time: %f' % (time() - t1))
embedding.get_embedding()
embedding.plotresults(dynamic_sbm_series)

The visualization of the embedding is as follows:

Cite

[0] Goyal, P., Chhetri, S. R., Mehrabi, N., Ferrara, E., & Canedo, A. (2018). DynamicGEM: A Library for Dynamic Graph Embedding Methods. arXiv preprint arXiv:1811.10734.

@article{goyal2018dynamicgem,
title={DynamicGEM: A Library for Dynamic Graph Embedding Methods},
author={Goyal, Palash and Chhetri, Sujit Rokka and Mehrabi, Ninareh and Ferrara, Emilio and Canedo, Arquimedes},
journal={arXiv preprint arXiv:1811.10734},
year={2018}
}

[1] Brand, M. (2006). Fast low-rank modifications of the thin singular value decomposition. Linear algebra and its applications, 415(1), 20-30.

@article{BRAND200620,
 title = "Fast low-rank modifications of the thin singular value decomposition",
 journal = "Linear Algebra and its Applications",
 volume = "415",
 number = "1",
 pages = "20 - 30",
 year = "2006",
 note = "Special Issue on Large Scale Linear and Nonlinear Eigenvalue Problems",
 issn = "0024-3795",
 doi = "https://doi.org/10.1016/j.laa.2005.07.021",
 url = "http://www.sciencedirect.com/science/article/pii/S0024379505003812",
 author = "Matthew Brand",
 keywords = "Singular value decomposition, Sequential updating, Subspace tracking"
 }

[2] Zhang, Z., Cui, P., Pei, J., Wang, X., & Zhu, W. (2017). TIMERS: Error-Bounded SVD Restart on Dynamic Networks. arXiv preprint arXiv:1711.09541.

 @misc{zhang2017timers,
 title={TIMERS: Error-Bounded SVD Restart on Dynamic Networks},
 author={Ziwei Zhang and Peng Cui and Jian Pei and Xiao Wang and Wenwu Zhu},
 year={2017},
 eprint={1711.09541},
 archivePrefix={arXiv},
 primaryClass={cs.SI}
 }

[3] Ou, M., Cui, P., Pei, J., Zhang, Z., & Zhu, W. (2016, August). Asymmetric transitivity preserving graph embedding. In Proceedings of the 22nd ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 1105-1114). ACM.

 @inproceedings{ou2016asymmetric,
 title={Asymmetric transitivity preserving graph embedding},
 author={Ou, Mingdong and Cui, Peng and Pei, Jian and Zhang, Ziwei and Zhu, Wenwu},
 booktitle={Proceedings of the 22nd ACM SIGKDD international conference on Knowledge discovery and data mining},
 pages={1105--1114},
 year={2016},
 organization={ACM}
 }

[4] Zhou, L. K., Yang, Y., Ren, X., Wu, F., & Zhuang, Y. (2018, February). Dynamic Network Embedding by Modeling Triadic Closure Process. In AAAI.

@inproceedings{zhou2018dynamic,
title={Dynamic Network Embedding by Modeling Triadic Closure Process.},
author={Zhou, Le-kui and Yang, Yang and Ren, Xiang and Wu, Fei and Zhuang, Yueting},
booktitle={AAAI},
year={2018}
}

[5] Goyal, P., Kamra, N., He, X., & Liu, Y. (2018). DynGEM: Deep Embedding Method for Dynamic Graphs. arXiv preprint arXiv:1805.11273.

@article{goyal2018dyngem,
  title={DynGEM: Deep Embedding Method for Dynamic Graphs},
  author={Goyal, Palash and Kamra, Nitin and He, Xinran and Liu, Yan},
  journal={arXiv preprint arXiv:1805.11273},
  year={2018}
}

[6] Goyal, P., Chhetri, S. R., & Canedo, A. (2018). dyngraph2vec: Capturing Network Dynamics using Dynamic Graph Representation Learning. arXiv preprint arXiv:1809.02657.

  @misc{goyal2018dyngraph2vec,
 title={dyngraph2vec: Capturing Network Dynamics using Dynamic Graph Representation Learning},
 author={Palash Goyal and Sujit Rokka Chhetri and Arquimedes Canedo},
 year={2018},
 eprint={1809.02657},
 archivePrefix={arXiv},
 primaryClass={cs.SI}
}
