See the tests at https://github.com/GNOME/gcab/tree/master/tests. Checking against them should keep this code fresh enough. See also these few test files: https://github.com/nexB/scancode-toolkit/tree/develop/tests/extractcode/data/archive/cab