How Dense layers work? #219

@miladdona

Hi,

Suppose we have a dense layer with shape (100, 100) and we factorize it with shape [[2, 2, 5, 5], [2, 5, 2, 5]] and max_tt_rank=4.
For this example we get these tt_cores:
(1, 2, 2, 4)
(4, 2, 5, 4)
(4, 5, 2, 4)
(4, 5, 5, 1)
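
As far as I understand, core k has shape (r_{k-1}, m_k, n_k, r_k), where m_k and n_k are the k-th entries of the two factor lists and the boundary ranks are 1. A minimal sketch of that reading (the helper name is my own, not a T3F function, and it assumes every interior rank equals max_tt_rank):

```python
def tt_matrix_core_shapes(row_factors, col_factors, max_tt_rank):
    """Return assumed 4-D core shapes (r_prev, m_k, n_k, r_next) of a TT-matrix.

    Hypothetical helper, not part of T3F. Assumes every interior rank
    equals max_tt_rank and the two boundary ranks are 1.
    """
    d = len(row_factors)
    if len(col_factors) != d:
        raise ValueError("both factor lists must have the same length")
    # ranks[0] and ranks[d] are the boundary ranks, the rest are interior ranks.
    ranks = [1] + [max_tt_rank] * (d - 1) + [1]
    return [
        (ranks[k], row_factors[k], col_factors[k], ranks[k + 1])
        for k in range(d)
    ]


shapes = tt_matrix_core_shapes([2, 2, 5, 5], [2, 5, 2, 5], max_tt_rank=4)
for shape in shapes:
    print(shape)
```

This reproduces the four shapes listed above.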

  1. Are the tt_cores always 4-D?
  2. How does a TT dense layer work internally? With an SVD decomposition, a dense layer is replaced by two thinner dense layers:
    for a dense layer with shape (100, 100) and rank=20 we get two dense layers with shapes (100, 20) and (20, 100).
    I want to know how the T3F library does the equivalent.
  3. Is there a way to get the number of operations for each layer? In the example above, the ordinary dense layer needs 100 * 100 = 10000 operations, but I cannot find how to get the corresponding number from the T3F library.
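
To make the numbers in the questions concrete, here is a minimal sketch that counts stored weights for the three variants. The function names are my own, not T3F API, and biases are ignored. For the ordinary and the SVD layers the weight count also equals the number of multiplications per input vector; for the TT layer the multiplication count depends on the order in which the cores are contracted, so it is not computed here.

```python
from math import prod


def dense_params(n_in, n_out):
    # One weight per (input, output) pair.
    return n_in * n_out


def svd_params(n_in, n_out, rank):
    # Two thinner layers: (n_in, rank) followed by (rank, n_out).
    return n_in * rank + rank * n_out


def tt_params(core_shapes):
    # Each core stores r_prev * m_k * n_k * r_next numbers.
    return sum(prod(shape) for shape in core_shapes)


# Core shapes copied from the example in this issue.
cores = [(1, 2, 2, 4), (4, 2, 5, 4), (4, 5, 2, 4), (4, 5, 5, 1)]

n_dense = dense_params(100, 100)
n_svd = svd_params(100, 100, rank=20)
n_tt = tt_params(cores)
print(n_dense, n_svd, n_tt)
```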

Thank you in advance.
Best regards,
Miladona
