Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

"meta-data as data" // document the needed extra columns for future AODN ocean in-situ observations parquet archives #37

Open
Thomas-Moore-Creative opened this issue May 8, 2024 · 2 comments
Assignees
Labels
CARSv2 branch for CARSv2 project

Comments

@Thomas-Moore-Creative
Copy link
Collaborator

see: #26 (comment)

In moving from NetCDF to more cloud optimised datasets like parquet we need to address the changes in how "meta-data" is addressed. The bottom line is that without the global attributes available in NetCDF we'll need to cary over, for each record in the dataset, some of this "meta-data" as "data" columns for each spatial and time point record.

The assumption is that while a duplication of bytes in the file and "wasteful" of storage that the real-world impact of the extra size in terms of resource costs or access time won't matter. (??)

@Thomas-Moore-Creative
Copy link
Collaborator Author

@BecCowley we should grab the headers from some of Chris's CODA headers here as a start?

@mhidas mhidas added the CARSv2 branch for CARSv2 project label May 8, 2024
@BecCowley
Copy link
Collaborator

BecCowley commented May 8, 2024

The list we want (if the information is available). We can add to this if anything obvious is missing.

lat
lon
date/time
probe_type
recorder
country
database origin
Project name
platform/instrument type
vehicle (eg vessel)
Institute

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CARSv2 branch for CARSv2 project
Projects
None yet
Development

No branches or pull requests

4 participants