LibZip and Cloning (APFS and Btrfs) #519
Hello! Could someone kindly explain how LibZip takes advantage of cloning on Apple APFS and Linux Btrfs? I know that on systems that support file cloning, LibZip doesn't have to rewrite the whole file on zip_close(), and so is much faster. But the only explicit information about this that I can find is the following:
However, in my tests, LibZip seems to be even faster than I expect on APFS given the above:
This is where I get confused. I would expect this save to take around 15 seconds as well. The updated text file appears above the 800MB movie file in the zip's entries, and given that the zip is supposed to be "rewritten starting with the first changed entry", I would expect this text file, the movie file, and the text file after it all to be rewritten, making the save just as slow as adding the 800MB movie file in the first place.

However, this is not the case. In my tests, when I update the text file above the movie file, the save only takes around 1-2 seconds: slower than updating the entry after the large movie file, but much faster than rewriting the entire movie file. This is great of course, but I'd love to understand how it works. Presumably cloning means that LibZip doesn't necessarily have to rewrite everything below an updated entry after all? Thanks!
Replies: 4 comments
Cloning does work the way I described. So if you change the entry before the video, the video is rewritten.

Your timing measurements are probably thrown off by the file system cache: the first time you add the video file, it has to be read in from disk. When you update the zip archive, it is already in RAM, so writing it back out is much faster.

Getting repeatable measurements of file access is hard. Making sure everything is in the cache first helps. You can usually get pretty close by repeating the measurement multiple times and discarding the first runs until the timings settle into similar values.
Great, thank you very much for the reply and the information. That all makes sense. I'm using LibZip to read and write the zip-based file format of my app, so there's always a copy of the archive around in memory for reading, which must be why I always see the faster times after the movie file is initially added.

The reason I wanted to clarify this, by the way, is that understanding it is really helpful for optimising save times in my app. It's a project-based app where projects are by default stored as zip files and users can import all sorts of files - PDF files, movie files and so on. They can also create text files, which are editable (whereas research files such as PDFs and movie files cannot be edited). Given how LibZip works on systems that support cloning, we can achieve much faster saves if we ensure text files are lower down in the zip's entries than research files.

When a user makes changes to a text file and I update its entry in the zip file, I was previously using zip_file_add with the "overwrite" flag, but this can make saves slower if the text file is above some big research files in the zip. On APFS it's therefore faster to use zip_delete and then zip_file_add to recreate the text file as the last entry in the zip file. (That won't speed up the save after the first edit, but as the user continues to make changes to the text file, subsequent saves will be blazing fast because it's now the final entry in the zip.)

In fact, I'm now thinking that whenever a user imports a research file such as a movie, I should probably move all text files below it in the zip at the same time. Given that there are no move or insert functions, I guess I will need to delete all of the text files and then re-add them below the imported research file.

Anyway, thanks again, this was really helpful!
Keeping all the small, changeable files at the end is probably best, yes. If you just want to move the files to the end, you can create a source with

However, I'm not sure using a zip archive to store your documents is the best option. libzip does not support cloning on Windows, which would make saving there inefficient. I would probably keep the files separate and use a directory to store the document. macOS has support for bundles, which basically treat directories as files in the UI.
This is what I'm going to do, thanks.
Unfortunately bundles bring their own problems. On Windows they appear as regular folders, tempting users to move things out of them (we have an existing cross-platform app that uses a bundle file format where this is occasionally a problem). The bigger problem, though, is iOS. On iOS, Files app and

Our solution is therefore to use a zip-based file format by default, since that causes less friction for users, and most projects are going to be quite small (importing an 800MB video file would be an unusual but not impossible use case). For this we use LibZip on macOS and iOS; on Windows we have to unzip into a temp folder, which is clunkier and slower. But (similar to the Pages app) we allow users to switch to a package-based format if they want, and we recommend doing so if saving gets too slow or they need to work with projects of hundreds of megabytes, especially on Windows. And LibZip is working brilliantly for our macOS and iOS versions.

Anyway, thanks again, knowing the details is really helpful.