Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enable all YAML text blocks of asdf to be zip compressed #279

Open
superwakk1 opened this issue Jan 19, 2021 · 1 comment
Open

Enable all YAML text blocks of asdf to be zip compressed #279

superwakk1 opened this issue Jan 19, 2021 · 1 comment

Comments

@superwakk1
Copy link

Enable all YAML text blocks of asdf to be zip compressed , such as comment block, tree block and index block.

This requirement comes from the fact that we want to use asdf file format as a type of compression format standard for some large data files (nomally 100GB400GB for a single file, but 10MB 20MB sometimes).

We hope that the asdf format file is as small as possible, and the part that can be compressed should be compressed as much as possible.

Can we add a field at the end of the header line to indicate whether it is zip compressed ?
like:

#ASDF 1.0.0 yaml:zip
In this way, the following comment, tree, index and other blocks are compressed with zip.

The header format of comment, tree and index blocks can be organized in the form of zipped_ block_ size + binary_ ziped_ Content mode.

@eslavich
Copy link
Contributor

Hi, can you tell us more about your use case? Are you expecting the comments + tree + index to be 100GB? Or does that number include binary blocks? The Python ASDF implementation handles large binary blocks gracefully, but a large YAML document would give it trouble. Parsing so much YAML would be unbearably slow.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants