You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardexpand all lines: README.md
+19-4
Original file line number
Diff line number
Diff line change
@@ -55,17 +55,32 @@ where each .npz file is a compressed numpy file containing the 32-bit float Bott
55
55
56
56
Note: The original Sentinel-2 data is stored as unsigned 16-bit integers. Our dataset converts to 32-bit floats and applies the Sentinel-2 scaling factor (divison by 10,000) to retrieve surface reflectance values. Although this should result in values from 0 to 1, some values will exceed 1 due to small errors in the data. We decided to keep these values greater than 1 for training robustness.
57
57
58
-
The "gsd_10" array bands have the order blue, green, red, and then NIR. The "gsd_20" bands have 4 vegetation red edge bands, followed by two SWIR bands. The "gsd_60" array consists of the coastal aerosol and water vapour bands. The exact corresponding bands from the Sentinel-2 platform are listed in the below table. Find more information about these spectral bands [here](https://gisgeography.com/sentinel-2-bands-combinations/).
58
+
The "gsd_10" array bands have the order blue, green, red, and then NIR. The "gsd_20" bands have 4 vegetation red edge bands, followed by two SWIR bands. The "gsd_60" array consists of the coastal aerosol and water vapour bands. The exact corresponding bands from the Sentinel-2 platform are listed in the table below. Find more information about these spectral bands [here](https://gisgeography.com/sentinel-2-bands-combinations/).
59
59
60
60
| Data Key | Sentinel-2 Bands |
61
61
| -------- | ---------------- |
62
62
| gsd_10 | B02, B03, B04, B08 |
63
63
| gsd_20 | B05, B06, B07, B8A, B11, B12 |
64
64
| gsd_60 | B01, B09 |
65
65
66
-
The image files also contain an "scl" band and a "bad_percent" value. The "scl" band contains the Scene Classification Layer values, which inform the quality of each pixel at 20 m. resolution. These values are described in Figure 3 [here](https://sentinels.copernicus.eu/web/sentinel/technical-guides/sentinel-2-msi/level-2a/algorithm).
67
-
68
-
The "bad_percent" value is a float value between 0 and 1 which describes the percentage of pixels within the "scl" band which we've determined to be bad data. Currently we include images with up to 25% bad data. You can use this key to filter the dataset using a lower threshold.
66
+
The image files contain an "scl" band and a . The "scl" band contains the Scene Classification Layer values, which inform the quality of each pixel at 20 m. resolution. These valures are described in the table below.
67
+
68
+
| Label | Classification |
69
+
| ----- | -------------- |
70
+
| 0 | NO_DATA |
71
+
| 1 | SATURATED_OR_DEFECTIVE |
72
+
| 2 | CAST_SHADOWS |
73
+
| 3 | CLOUD_SHADOWS |
74
+
| 4 | VEGETATION |
75
+
| 5 | NOT_VEGETATED |
76
+
| 6 | WATER |
77
+
| 7 | UNCLASSIFIED |
78
+
| 8 | CLOUD_MEDIUM_PROBABILITY |
79
+
| 9 | CLOUD_HIGH_PROBABILITY |
80
+
| 10 | THIN_CIRRUS |
81
+
| 11 | SNOW or ICE |
82
+
83
+
The image files also contain a "bad_percent" value, which is a float value between 0 and 1 describing the percentage of pixels within the "scl" band which we've determined to be bad data. Currently we filter images with more than 25% of their pixels having bad data. You can use this key to filter the dataset using a different threshold.
0 commit comments