import pickle
data = pickle.load(open('/content/openstack_train.pkl', 'rb'))
ids, labels, msgs, codes = data
If we look at codes, it does't contain actual added and removed code
[['added _ code removed _ code',
'added _ code removed _ code',
'added _ code removed _ code',
'added _ code removed _ code',
'added _ code removed _ code',
'added _ code removed _ code',
'added _ code removed _ code'],
['added _ code removed _ code',
'added _ code removed _ code',
'added _ code removed _ code',
'added _ code removed _ code',
'added _ code removed _ code',
'added _ code removed _ code'],
['added _ code removed _ code', 'added _ code removed _ code'],