Skip to content

repo is humongous #46

@sneakers-the-rat

Description

@sneakers-the-rat

related to #36

#36 talks about adding test outputs to .gitignore, which looks like has already been done.

however the test outputs are still in the repo history, and so the repo is 700MB

If i list the objects in the git repo by size

git rev-list --objects --all | \
  git cat-file --batch-check='%(objectname) %(objecttype) %(objectsize) %(rest)' | \
  sort -nr -k 3 | \
  perl -ne 'm#^(\w+) blob (\d+) (.+)# or next; print "$1\t$2\t$3\n";' | \
  head -n 200 | \
  column -t -s $'\t'

then i get a big list like...

3eec84e14028bb78f907d45873181c159ab4fd77  53061904  tests/output/figs/func/mlspike_comparison/DS13-GCaMP6s-m-V1-neuropil-corrected.html
178473f76aaddc5a1935d777ae46525c417ca1b9  49326100  tests/output/figs/func/mlspike_comparison/DS10-GCaMP6f-m-V1-neuropil-corrected.html
e6cbce8b49589afe98fcadac4c03f2c695e05e8f  49310097  tests/output/figs/func/mlspike_comparison/X-DS09-GCaMP6f-m-V1.html
d7823d2a0be94c2331671b984a140c22cc557dde  37968583  tests/output/figs/func/mlspike_comparison/DS11-GCaMP6f-m-V1-neuropil-corrected.html
cad8d6c12ddd32d6b52756684018a36d74eab655  28770788  scripts/figs/dbg_ar/scal-tau_d=15, tau_r=5.html
08720fadcf814dda59dc7bd49c7116bc912994fb  28703801  scripts/figs/dbg_ar/err_scal-tau_d=15, tau_r=5.html
cda0488f9fdd6e73dc7fbce7641ae5061613e329  28531400  scripts/figs/dbg_ar/err-tau_d=15, tau_r=5.html
3a95c527f54a4ff5796b5ac79554331615c4b5c0  25915006  scripts/figs/dbg_ar/scal-tau_d=10, tau_r=3.html
97193a424d2634070938c372b22f3ea0b4834138  25460936  scripts/figs/dbg_ar/err_scal-tau_d=10, tau_r=3.html
b92027828a63bb57dda8459e2a689c3854a2f3aa  25277556  scripts/figs/dbg_ar/err-tau_d=10, tau_r=3.html
8f0fd37411b1ede2c1894179ea91acd558c04bad  24086644  scripts/figs/benchmark_realds/trs-X-DS09-GCaMP6f-m-V1-iter1.html
d0d262681d82b196c711c7e41cb38d2941cefdf5  23908450  scripts/figs/benchmark_realds/trs-X-DS09-GCaMP6f-m-V1-iter17.html
f14be7a9b1be188932b1421606ca69a3448156ac  23873877  scripts/figs/benchmark_realds/trs-X-DS09-GCaMP6f-m-V1-iter16.html
7bbc52588fd6ff38beaf4ee7efc720880a0c1189  23862384  scripts/figs/benchmark_realds/trs-X-DS09-GCaMP6f-m-V1-iter18.html
2bab34cc3ef88bb73bf0ebb92fa28a4763d1db2a  23825419  scripts/figs/benchmark_realds/trs-X-DS09-GCaMP6f-m-V1-iter15.html
4d54936c4f953736719d4d4e3af9e290cc2c2faf  23752858  scripts/figs/benchmark_realds/trs-X-DS09-GCaMP6f-m-V1-iter0.html
...

and so on.

the tests/output directory is still full of data, even though it's ignored with .gitignore, so need to do two things:

  • remove currently tracked files from tests/output
  • purge git history with git filter-repo

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions