batch inputs during inference

How can we batch multiple inputs when computing the embeddings using a trained model? The script `command/inference/get_embedding.py` demonstrates how to compute a single function's embedding, but it has slow throughput.

I tried to concatenate the tensors for each respective field in the dicts produced from `sample_emb = trex.process_token_dict(sample_tokens)`, but ran into a problem where the inputs needed to first be padded to the proper length, and I could not determine how to pad them or determine what the proper padding value is for each field.

Could you please give an example of computing embeddings for multiple functions at once?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

batch inputs during inference #25

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

batch inputs during inference #25

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions