Excuse me. Could you please post more details on data preprocessing in README? I was puzzled about How you process source data into .txt.gz formats in folder data. It would be great if you could share the specific meaning of each field and the detailed process of preprocessing the data. Thank you so much!