-
Notifications
You must be signed in to change notification settings - Fork 19
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #7 from developmentseed/feat/rename-and-restructure
Feat/rename and restructure
- Loading branch information
Showing
21 changed files
with
469 additions
and
99 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,10 +1,10 @@ | ||
# Cloud-Optimized Data Guide (In development) | ||
# Cloud-Optimized Geospatial Formats Guide | ||
|
||
See the site [https://developmentseed.org/cloud-optimized-data-guide/](https://developmentseed.org/cloud-optimized-data-guide/) | ||
See the site [https://developmentseed.org/cloud-optimized-geospatial-formats-guide/](https://developmentseed.org/cloud-optimized-geospatial-formats-guide/) | ||
|
||
This site is built using [Quarto](https://quarto.org/docs/get-started/) | ||
To preview the site locally, install quarto and run: | ||
|
||
```sh | ||
quarto render --to html | ||
quarto preview | ||
``` |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,91 @@ | ||
{ | ||
"cells": [ | ||
{ | ||
"cell_type": "markdown", | ||
"id": "e84fbc0b", | ||
"metadata": {}, | ||
"source": [ | ||
"# Cloud-Optimized GeoTIFFs\n", | ||
"\n", | ||
"## What is a Cloud-Optimized GeoTIFF?\n", | ||
"\n", | ||
"Cloud-Optimized GeoTIFF (the COG) is a variant of the TIFF image format that specifies a particular layout of internal data in the GeoTIFF specification to allow for optimized (subsetted or aggregated) access over a network for display or data reading. The key components are overviews, and internal tiling.\n", | ||
"\n", | ||
"For more details see https://www.cogeo.org/\n", | ||
"\n", | ||
"<img alt=\"COG Diagram\" src=\"./images/cog-diagram-1.png\" width=300/>\n", | ||
"\n", | ||
"### Dimensions and Internal Blocks\n", | ||
"\n", | ||
"This attribute is also sometimes called **chunks** or **internal tiles**.\n", | ||
"\n", | ||
"Dimensions are the number of bands, rows and columns stored in a GeoTIFF. There is a tradeoff between storing lots of data in one GeoTIFF and storing less data in many GeoTIFFs. The larger a single file, the larger the GeoTIFF header and the multiple requests may be required just to read the spatial index before data retrieval. The opposite problem occurs if you make too many small files, then it takes many reads to retrieve data, and when rendering a combined visualization can greatly impact load time.\n", | ||
"\n", | ||
"If you plan to pan and zoom a large amount of data through a tiling service in a web browser, there is a tradeoff between 1 large file, or many smaller files. The current recommendation is to meet somewhere in the middle, a moderate amount of medium files.\n", | ||
"\n", | ||
"### Internal Blocks\n", | ||
"\n", | ||
"Internal blocks are required if the dimensions of data are over 512x512. However you can control the size of the internal blocks. 256x256 or 512x512 are recommended. When displaying data at full resolution, or doing partial reading of data this size will impact the number of reads required. A size of 256 will take less time to read, and read less data outside the desired bounding box, however for reading large parts of a file, it may take more total read requests. Some clients will aggregate neighboring block reads to reduce the total number of requests.\n", | ||
"\n", | ||
"### Overviews\n", | ||
"\n", | ||
"Overviews are downsampled (aggregated) data intended for visualization.\n", | ||
"The best resampling algorithm depends on the range, type, and distribution of the data.\n", | ||
"\n", | ||
"The smallest size overview should match the tiling components’ fetch size, typically 256x256. Due to aspect ratio variation just aim to have at least one dimension at or slightly less than 256. The COG driver in GDAL, or rio cogeo tools should do this.\n", | ||
"\n", | ||
"There are many resampling algorithms for generating overviews. When creating overviews several options should be compared before deciding which resampling method to apply." | ||
] | ||
}, | ||
{ | ||
"cell_type": "markdown", | ||
"id": "eb04e257-322a-4d42-b6d6-e75f1c587a69", | ||
"metadata": {}, | ||
"source": [ | ||
"## How to create and validate COGs\n", | ||
"\n", | ||
"1. [Rio-cogeo: GitHub - cogeotiff/rio-cogeo: Cloud Optimized GeoTIFF creation and validation plugin for rasterio](https://github.com/cogeotiff/rio-cogeo)\n", | ||
"2. [Gdal: COG – Cloud Optimized GeoTIFF generator — GDAL documentation](https://gdal.org/drivers/raster/cog.html)" | ||
] | ||
}, | ||
{ | ||
"cell_type": "markdown", | ||
"id": "d193ab02-bb69-455e-9b72-5b89728f086e", | ||
"metadata": {}, | ||
"source": [ | ||
"## Additional Resources\n", | ||
"\n", | ||
"* [Planet Blog: An Introduction to Cloud Optimized GeoTIFFS (COGs) Part 1: Overview](https://developers.planet.com/docs/planetschool/an-introduction-to-cloud-optimized-geotiffs-cogs-part-1-overview/)\n", | ||
"* [COG Talk — Part 1: What’s new?](https://medium.com/devseed/cog-talk-part-1-whats-new-941facbcd3d1)\n", | ||
"* [Development Seed Blog: Do you really want people using your data?](https://medium.com/@_VincentS_/do-you-really-want-people-using-your-data-ec94cd94dc3f)\n", | ||
"\n", | ||
"## How to visualize COGs\n", | ||
"\n", | ||
"* GDAL vis* drivers (vsicurl, vsis3, vsiaz,)\n", | ||
"* Titiler https://github.com/developmentseed/titiler\n", | ||
"* Rio-viz https://github.com/developmentseed/rio-viz" | ||
] | ||
} | ||
], | ||
"metadata": { | ||
"kernelspec": { | ||
"display_name": "Python [conda env:geospatial]", | ||
"language": "python", | ||
"name": "conda-env-geospatial-py" | ||
}, | ||
"language_info": { | ||
"codemirror_mode": { | ||
"name": "ipython", | ||
"version": 3 | ||
}, | ||
"file_extension": ".py", | ||
"mimetype": "text/x-python", | ||
"name": "python", | ||
"nbconvert_exporter": "python", | ||
"pygments_lexer": "ipython3", | ||
"version": "3.10.6" | ||
} | ||
}, | ||
"nbformat": 4, | ||
"nbformat_minor": 5 | ||
} |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,10 @@ | ||
--- | ||
title: Contributing | ||
subtitle: Guidelines for contributing | ||
--- | ||
|
||
We welcome contributions to this guide. If you wish to contribute, please consider the following tenets: | ||
|
||
* All examples should use open data. If an example uses Earthdata, it must include an example of how to provide credentials. | ||
* Landing pages with no code should be use [quarto markdown (`.qmd`)](https://quarto.org/docs/authoring/markdown-basics.html). | ||
* Pages with executable code should be [iPython Notebooks (`.ipynb`)](https://ipython.org/notebook.html) |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,13 @@ | ||
/*-- scss:defaults --*/ | ||
|
||
// fonts | ||
$presentation-font-size-root: 27px !default; | ||
|
||
// colors | ||
$body-color: #000 !default; | ||
$selection-bg: #26351c !default; | ||
|
||
// headings | ||
$presentation-heading-color: #333 !default; | ||
|
||
/*-- scss:rules --*/ |
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,12 +1,40 @@ | ||
--- | ||
title: "Cloud-Optimized Data Guide" | ||
subtitle: "Methods for Generating and Testing Cloud-Optimized Data" | ||
title: "Cloud-Optimized Geospatial Formats Guide" | ||
subtitle: "Methods for Generating and Testing Cloud-Optimized Geospatial Formats" | ||
--- | ||
|
||
The audience for this guide is any data provider wishing to make their data accessible without users needing to download entire files. | ||
# Who this guide is for | ||
|
||
If you wish to provide optimized access geospatial data, this guide is for you. Given the size of geospatial data, now and in the future, users can no longer rely on file download to achieve their science goals. | ||
|
||
## Built for the community, by the community. | ||
|
||
There is no one-size-fits-all approach to cloud-optimized data, but the community has developed many tools for creating and assessing cloud-optimized data formats that should be organized and shared. | ||
There is no one-size-fits-all approach to cloud-optimized data, but the community has developed many tools for creating and assessing geospatial formats that should be organized and shared. | ||
|
||
With this guide, we provide the landscape of cloud-optimized geospatial formats and provide the best-known answers to common questions. | ||
|
||
## The Opportunity | ||
|
||
Just putting data on the cloud does not solve the big geospatial data problem. Massive archives of data must be available via subsetting services in order for users to work with the data in-memory. Traditional geospatial formats are optimized for on-disk access via small internal chunks. The introduction of a network introduces latency and the number of requests must considered. The file format must support subsetted access via adressable chunks, internal tiling or both. These characteristics allow for parallelized and partial reading. | ||
|
||
## Table of Contents | ||
|
||
1. [Overview of Formats (slideshow)](./overview.qmd) | ||
2. Formats | ||
a. [Cloud-Optimized GeoTIFFs](./cloud-optimized-geotiffs.ipynb) | ||
b. Zarr and Kerchunk - COMING SOON | ||
c. Cloud-Optimized HDF5 - COMING SOON | ||
d. Geoparquet - COMING SOON | ||
e. Flatgeobuf - COMING SOON | ||
3. Cookbooks | ||
a. [Zarr Visualization Cookbook - IN DEVELOPMENT](https://nasa-impact.github.io/zarr-visualization-cookbook/) | ||
|
||
## Questions to ask when generating cloud-optimized geospatial data in any format | ||
|
||
1. What variable(s) should be included in the new data foramt | ||
2. Will you create copies to optimize for different needs? | ||
3. What is the intended use case or usage profile? Will this product be used for visualization, analysis or both? | ||
4. What is the expected access method? | ||
5. How much of your data is typically rendered or selected at once? All to very select subsets? | ||
|
||
|
||
With this guide, we provide the landscape of cloud-optimized data formats and provide the best-known answers to common questions. |
Oops, something went wrong.