Investigate whether embedding lookup with `tf.RaggedTensor` is slower than with `tf.Tensor` in later versions of TF

Previous experiments by @vysarge (June 2022) found that the `tf.RaggedTensor` representation is slower than a fixed-length dense `tf.Tensor` for embedding lookup, as shown in this [spreadsheet](https://docs.google.com/spreadsheets/d/1jlKDVeoMvpQfyCF9RFmR3VxckbPbrvBg2POwbF2p7RM/edit#gid=135622185).

This task is about benchmarking embedding lookup for dense vs. ragged multi-hot columns, as MM makes extensive use of `tf.RaggedTensor` for multi-hot features and for sequential / session-based recommendation.
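A minimal sketch of how such a comparison could be set up is shown here. It is not the benchmark from the earlier experiments: the table size, batch size, maximum length, sum pooling, and the helper names `lookup_ragged`, `lookup_dense`, and `time_ms` are all assumptions chosen for illustration, and id 0 is assumed to be the padding id in the dense case.

```python
import timeit

import numpy as np
import tensorflow as tf

# Illustrative sizes only (assumptions, not taken from the original experiments).
VOCAB_SIZE, EMB_DIM, BATCH_SIZE, MAX_LEN = 10_000, 64, 1024, 20

rng = np.random.default_rng(0)
table = tf.Variable(rng.normal(size=(VOCAB_SIZE, EMB_DIM)).astype("float32"))

# Variable-length multi-hot rows; id 0 is reserved for padding in the dense version.
row_lengths = rng.integers(1, MAX_LEN + 1, size=BATCH_SIZE)
flat_ids = rng.integers(1, VOCAB_SIZE, size=int(row_lengths.sum()))
ragged_ids = tf.RaggedTensor.from_row_lengths(flat_ids, row_lengths)
dense_ids = ragged_ids.to_tensor(default_value=0, shape=[BATCH_SIZE, MAX_LEN])


@tf.function
def lookup_ragged(ids):
    # Look up each id and sum-pool over the ragged (variable-length) axis.
    return tf.reduce_sum(tf.nn.embedding_lookup(table, ids), axis=1)


@tf.function
def lookup_dense(ids):
    # Look up padded ids, then mask out the padding positions before pooling.
    emb = tf.nn.embedding_lookup(table, ids)
    mask = tf.cast(tf.not_equal(ids, 0), emb.dtype)[..., tf.newaxis]
    return tf.reduce_sum(emb * mask, axis=1)


def time_ms(fn, ids, repeats=20):
    fn(ids)  # warm-up call so that function tracing is excluded from the timing
    seconds = timeit.timeit(lambda: fn(ids).numpy(), number=repeats)
    return 1000.0 * seconds / repeats


ragged_ms = time_ms(lookup_ragged, ragged_ids)
dense_ms = time_ms(lookup_dense, dense_ids)
print(f"TF {tf.__version__}: ragged {ragged_ms:.3f} ms, dense {dense_ms:.3f} ms")
```

Both functions return the same pooled embeddings, so the timings compare only the representation. Running the script under several TF versions, on the target device, and with the actual MM embedding layers would be needed before drawing conclusions.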

### Notes
- The Merlin dataloader outputs ragged tensors (`__values`, `__offsets` format) if `value_count.max` is `None` in the column schema, and outputs a `tf.Tensor` if `value_count.max == value_count.min`.
- You can find more information on ragged tensor padding in this [related PR](https://docs.google.com/document/d/1KcIDzEFjz-Bp4Y-s80__teervpt56yMvihphSXyXWaw/edit#).
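
To make the two representations in the notes above concrete, the sketch below builds a `tf.RaggedTensor` from a values/offsets pair and pads it to a dense tensor. The column name `item_ids` and the sample batch are hypothetical, and the offsets are assumed to be row splits (one more entry than the number of rows, starting at 0); the dataloader's actual convention should be checked.

```python
import tensorflow as tf

# Hypothetical batch for a multi-hot column named "item_ids" (not real dataloader output).
batch = {
    "item_ids__values": tf.constant([3, 7, 1, 4, 9, 2], dtype=tf.int64),
    # Assumption: offsets are row splits, i.e. row i spans values[offsets[i]:offsets[i + 1]].
    "item_ids__offsets": tf.constant([0, 2, 3, 6], dtype=tf.int64),
}

# Ragged representation: rows keep their own lengths.
ragged = tf.RaggedTensor.from_row_splits(
    batch["item_ids__values"], batch["item_ids__offsets"]
)

# Dense representation: rows are padded with 0 up to the longest row in the batch.
dense = ragged.to_tensor(default_value=0)

print(ragged.to_list())
print(dense.numpy().tolist())
```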




