Skip to content

sparsity definition invertedΒ #1

@pratyushtiw

Description

@pratyushtiw

When you say
"Each coordinate is independently chosen to be either uniform random in [0,1] (with probability 𝑠 ) or simple 0 (with probability 1βˆ’π‘  )"

I believe it should be
"Each coordinate is independently chosen to be either uniform random in [0,1] (with probability 1-𝑠 ) or simple 0 (with probability 𝑠 )"

so that s is the sparsity and is consistent with https://transformer-circuits.pub/2022/toy_model/index.html#demonstrating

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions