load_gpt2() improvements
load_gpt2()
in a fresh session is much faster and uses much less memory when loaded. (for the 117M model, the system will stay under <2 GB RAM which is the critical point for cloud services)start_tf_sess()
now accepts athreads
parameter, which is useful if you know exactly how many threads will be used.