Skip to content

Train/Inference Random Model using this Library. #3

@xuann6

Description

@xuann6

Hi there,

I am trying to re-produce the result of your experiment, and the provided code seems only contains the function of error analysis. On the other hand, I found a function "create_trainer_engine", which seems used to train the model.

I am confused if the pack of code is only used for error analysis or it can be extended to generate the accuracy results of your paper. I understand this might need some efforts but I am trying to figure out if this can work. Thanks!

In addition, it seems that only Hopper arch from Nvidia (GH100) supports FP8 operation at the moment, is there any other options if I would like to train/inference the model using FP8?

Thanks again for reading the message.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions