We've been facing increasing requests to help researchers publish datasets.
There are many dimensions to this problem, but one of them is applying
appropriate metadata and mounting them so they can be explored with a
regular web browser or downloaded by expert users using specialized tools.
Datasets often are large. One that we used for a pilot project contained
well over 10,000 objects with a total size of about 1 TB. We've been asked
to help with much larger and more complex datasets.
The pilot was successful but our current process is neither scalable nor
sustainable. We have some ideas on how to proceed, but we're mostly making
things up. Are there methods/tools/etc you've found helpful? Also, where
should we look for ideas? Thanks,
kyle
|