Commit b2b38f0

authored
Update README.md
SCL link was deprecated. Wrote out table for SCL values in the readme to preserve the info.
1 parent 5652250 commit b2b38f0

1 file changed

+19
-4
lines changed

README.md

@@ -55,17 +55,32 @@ where each .npz file is a compressed numpy file containing the 32-bit float Bott
5555

5656
Note: The original Sentinel-2 data is stored as unsigned 16-bit integers. Our dataset converts these to 32-bit floats and applies the Sentinel-2 scaling factor (division by 10,000) to retrieve surface reflectance values. Although this should result in values from 0 to 1, some values will exceed 1 due to small errors in the data. We decided to keep these values greater than 1 for training robustness.
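
As a rough illustration of the conversion described in the note, the sketch below scales a few made-up unsigned 16-bit values to reflectance. The `raw` array is hypothetical and only stands in for real Sentinel-2 digital numbers; this is not the dataset's actual preprocessing code.

```python
import numpy as np

# Hypothetical digital numbers standing in for a Sentinel-2 band (uint16).
raw = np.array([[0, 2500], [10000, 10400]], dtype=np.uint16)

# Convert to 32-bit float and apply the scaling factor (division by 10,000).
reflectance = raw.astype(np.float32) / np.float32(10_000)

# Values slightly above 1 are left as they are rather than clipped.
exceeds_one = reflectance > 1.0
print(reflectance.dtype, int(exceeds_one.sum()))
```

If a downstream model needs values strictly within 0 to 1, `np.clip(reflectance, 0.0, 1.0)` is one option, though the dataset itself keeps the larger values.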
5757

58-
The "gsd_10" array bands have the order blue, green, red, and then NIR. The "gsd_20" bands have 4 vegetation red edge bands, followed by two SWIR bands. The "gsd_60" array consists of the coastal aerosol and water vapour bands. The exact corresponding bands from the Sentinel-2 platform are listed in the below table. Find more information about these spectral bands [here](https://gisgeography.com/sentinel-2-bands-combinations/).
58+
The "gsd_10" array bands have the order blue, green, red, and then NIR. The "gsd_20" bands have 4 vegetation red edge bands, followed by two SWIR bands. The "gsd_60" array consists of the coastal aerosol and water vapour bands. The exact corresponding bands from the Sentinel-2 platform are listed in the table below. Find more information about these spectral bands [here](https://gisgeography.com/sentinel-2-bands-combinations/).
5959

6060
| Data Key | Sentinel-2 Bands |
6161
| -------- | ---------------- |
6262
| gsd_10 | B02, B03, B04, B08 |
6363
| gsd_20 | B05, B06, B07, B8A, B11, B12 |
6464
| gsd_60 | B01, B09 |
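
A minimal sketch of looking up a band by its Sentinel-2 name, using the band order from the table above. The helper `get_band`, the file name, the array shapes, and the assumption that the band axis comes first are all hypothetical; check the shapes in the real files before relying on them.

```python
import os
import tempfile

import numpy as np

# Band order for each data key, copied from the table above.
BAND_ORDER = {
    "gsd_10": ["B02", "B03", "B04", "B08"],
    "gsd_20": ["B05", "B06", "B07", "B8A", "B11", "B12"],
    "gsd_60": ["B01", "B09"],
}


def get_band(sample, band_name, band_axis=0):
    """Return one band by name. band_axis=0 is an assumption, not a documented layout."""
    for key, names in BAND_ORDER.items():
        if band_name in names:
            return np.take(sample[key], names.index(band_name), axis=band_axis)
    raise KeyError(band_name)


# Build a small synthetic file so the sketch runs without the real dataset.
path = os.path.join(tempfile.mkdtemp(), "example.npz")
np.savez_compressed(
    path,
    gsd_10=np.random.rand(4, 12, 12).astype(np.float32),
    gsd_20=np.random.rand(6, 6, 6).astype(np.float32),
    gsd_60=np.random.rand(2, 2, 2).astype(np.float32),
)

with np.load(path) as sample:
    red = get_band(sample, "B04")   # third band of "gsd_10"
    swir = get_band(sample, "B11")  # fifth band of "gsd_20"
```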
6565

66-
The image files also contain an "scl" band and a "bad_percent" value. The "scl" band contains the Scene Classification Layer values, which inform the quality of each pixel at 20 m. resolution. These values are described in Figure 3 [here](https://sentinels.copernicus.eu/web/sentinel/technical-guides/sentinel-2-msi/level-2a/algorithm).
67-
68-
The "bad_percent" value is a float value between 0 and 1 which describes the percentage of pixels within the "scl" band which we've determined to be bad data. Currently we include images with up to 25% bad data. You can use this key to filter the dataset using a lower threshold.
66+
The image files contain an "scl" band. The "scl" band contains the Scene Classification Layer values, which indicate the quality of each pixel at 20 m resolution. These values are described in the table below.
67+
68+
| Label | Classification |
69+
| ----- | -------------- |
70+
| 0 | NO_DATA |
71+
| 1 | SATURATED_OR_DEFECTIVE |
72+
| 2 | CAST_SHADOWS |
73+
| 3 | CLOUD_SHADOWS |
74+
| 4 | VEGETATION |
75+
| 5 | NOT_VEGETATED |
76+
| 6 | WATER |
77+
| 7 | UNCLASSIFIED |
78+
| 8 | CLOUD_MEDIUM_PROBABILITY |
79+
| 9 | CLOUD_HIGH_PROBABILITY |
80+
| 10 | THIN_CIRRUS |
81+
| 11 | SNOW or ICE |
82+
83+
The image files also contain a "bad_percent" value, a float between 0 and 1 giving the fraction of pixels in the "scl" band that we've determined to be bad data. Currently we exclude images in which more than 25% of the pixels are bad data. You can use this key to filter the dataset with a different threshold.
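
One way to apply a stricter threshold is sketched below. The README does not state which "scl" labels count as bad data, so `ASSUMED_BAD_LABELS` is purely an illustrative guess, and `bad_fraction` and `keep_sample` are hypothetical helpers rather than part of this repository.

```python
import numpy as np

# ASSUMPTION: illustrative guess at "bad" labels (no data, defective,
# cloud shadows, clouds, cirrus). The dataset's actual choice may differ.
ASSUMED_BAD_LABELS = [0, 1, 3, 8, 9, 10]


def bad_fraction(scl, bad_labels=ASSUMED_BAD_LABELS):
    """Fraction of pixels in an "scl" array whose label is in bad_labels."""
    return float(np.isin(scl, bad_labels).mean())


def keep_sample(sample, max_bad=0.10):
    """Keep a sample only if its stored "bad_percent" is at most max_bad."""
    return float(sample["bad_percent"]) <= max_bad


# Synthetic 4x4 "scl" array: two cloud pixels (9) and one no-data pixel (0).
scl = np.array(
    [[4, 4, 9, 9], [5, 6, 4, 4], [4, 4, 4, 4], [4, 5, 5, 0]], dtype=np.uint8
)
sample = {"scl": scl, "bad_percent": np.float32(bad_fraction(scl))}

print(keep_sample(sample, max_bad=0.10), keep_sample(sample, max_bad=0.25))
```

With the real files, the stored "bad_percent" value can be read directly, so recomputing it from "scl" is only needed if you want a different definition of bad data.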
6984

7085
#### Annotations
7186
