I am confused about the input data format, i.e., encodings.npy and offsets.npy. Is each element in encodings.npy a one-hot vector? Could you provide a detailed demo of both files?