Crop Type Classification with Sentinel-1 SAR Data

This project classifies crop types from multi-temporal radar imagery using machine learning approaches.

Data

The raw data sources used were:

Sentinel-1 C-band VV/VH backscatter intensity rasters over an agricultural site spanning 3 months
Point vector data with 10 crop labels randomly distributed over the site's farmland

Packages

The key Python packages used in this project are:

Data Manipulation and Analysis:

pandas
numpy
xarray
rioxarray
geopandas

Machine Learning:

scikit-learn
TensorFlow/Keras

Model Evaluation:

matplotlib
seaborn

Geospatial Data Handling:

planetary_computer
rasterio

Preprocessing:

MinMaxScaler
StandardScaler

Model Training:

LogisticRegression
RandomForestClassifier
RandomizedSearchCV (hyperparameter tuning)

Miscellaneous Utils:

warnings
time
itertools

The core modeling pipeline relies on Scikit-Learn and Pandas for manipulation and preprocessing of array data, along with geospatial packages like rioxarray to handle satellite raster time series. The deep learning components leverage TensorFlow/Keras.

Hyperparameters for the random forest were tuned with scikit-learn's RandomizedSearchCV. Models were evaluated by classification accuracy.
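As an illustration of the tuning step, here is a minimal sketch of a randomized search over random forest settings. The search space, the number of iterations, and the synthetic data are assumptions made for the example; the values used in the project are not documented here.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import RandomizedSearchCV

# Synthetic stand-in for the real features: 200 farms x 8 time steps.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 8))
y = (X[:, :4].mean(axis=1) > X[:, 4:].mean(axis=1)).astype(int)

# Hypothetical search space (assumption, not the project's actual ranges).
param_distributions = {
    "n_estimators": [50, 100, 200],
    "max_depth": [None, 5, 10, 20],
    "min_samples_leaf": [1, 2, 4],
    "max_features": ["sqrt", "log2"],
}

# Sample 5 random configurations and score each with 3-fold cross validation.
search = RandomizedSearchCV(
    estimator=RandomForestClassifier(random_state=0),
    param_distributions=param_distributions,
    n_iter=5,
    scoring="accuracy",
    cv=3,
    random_state=0,
)
search.fit(X, y)

best_params = search.best_params_
best_cv_accuracy = search.best_score_
print(best_params, round(best_cv_accuracy, 3))
```

After fitting, search.best_estimator_ holds a forest refit on all of the supplied data with the best configuration found, so it can be used directly for prediction.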

Data Wrangling

Spatial Join: A spatial join between the crop labels and radar data extracted average VV/VH values for each farm's time series.
Preprocessing: The extracted values were reshaped to a samples x timesteps array, and the radar vegetation index (RVI) was computed for each time series.
Train Test Split: An 80-20 stratified split of farms separated the train and test sets.
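The preprocessing and split steps above can be sketched as follows. This example assumes the common dual-polarization approximation RVI = 4 * VH / (VV + VH), computed on linear-power backscatter; the exact formula used in the notebook may differ. The array sizes and the synthetic values are placeholders, not the project's data.

```python
import numpy as np
from sklearn.model_selection import train_test_split


def radar_vegetation_index(vv, vh):
    """Dual-pol RVI approximation 4*VH / (VV + VH) on linear-power backscatter.

    Assumption: this simplified form stands in for whatever variant the
    project used.
    """
    vv = np.asarray(vv, dtype=float)
    vh = np.asarray(vh, dtype=float)
    return 4.0 * vh / (vv + vh)


# Synthetic stand-in: 100 farms, 12 acquisition dates (both hypothetical).
rng = np.random.default_rng(42)
n_farms, n_steps = 100, 12
vv = rng.uniform(0.02, 0.30, size=(n_farms, n_steps))
vh = rng.uniform(0.005, 0.08, size=(n_farms, n_steps))
labels = np.repeat(np.arange(10), n_farms // 10)  # 10 crop classes, 10 farms each

# One RVI time series per farm: shape (samples, timesteps).
rvi = radar_vegetation_index(vv, vh)

# 80-20 split that keeps class proportions equal in both sets.
X_train, X_test, y_train, y_test = train_test_split(
    rvi, labels, test_size=0.2, stratify=labels, random_state=42
)
print(X_train.shape, X_test.shape)
```

Stratifying on the labels matters most when some crop classes are rare, since an unstratified split could leave a class out of the test set entirely.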

Modeling

Four combinations of model and input features were tested:

1D CNN on RVI
1D CNN on radar backscatter
Random forest on backscatter
Random forest on RVI

Hyperparameter tuning and cross validation were used.
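To show how cross validation can compare feature sets, here is a minimal sketch that scores a random forest on raw backscatter and on RVI with the same folds. It covers only the two random forest variants, not the CNNs. The synthetic data, the three-class setup, the fold count, and the simplified RVI formula 4 * VH / (VV + VH) are all assumptions for illustration.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import StratifiedKFold, cross_val_score

# Synthetic stand-in: 120 farms, 10 dates, 3 classes (all hypothetical).
rng = np.random.default_rng(1)
n_farms, n_steps = 120, 10
labels = np.repeat(np.arange(3), n_farms // 3)
vv = rng.uniform(0.05, 0.30, size=(n_farms, n_steps))
# Make the VH level depend on the class so there is something to learn.
vh = rng.uniform(0.01, 0.03, size=(n_farms, n_steps)) * (1 + labels[:, None])

feature_sets = {
    "backscatter": np.hstack([vv, vh]),  # VV and VH series side by side
    "rvi": 4.0 * vh / (vv + vh),         # simplified dual-pol RVI (assumption)
}

# Reuse the same stratified folds so the two scores are comparable.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=1)
scores = {}
for name, X in feature_sets.items():
    model = RandomForestClassifier(n_estimators=100, random_state=1)
    fold_accuracy = cross_val_score(model, X, labels, cv=cv, scoring="accuracy")
    scores[name] = fold_accuracy.mean()

print(scores)
```

Scores on synthetic data say nothing about which feature set works better on real imagery; the point is only the evaluation pattern.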

Results

Models were compared by accuracy on a held-out EY test set: Sentinel-1 SAR data over locations whose crop types were withheld.

The best-performing model was a random forest classifier trained on engineered radar vegetation index (RVI) features, which achieved 85% accuracy on the EY test data.

The next best model, a 1D CNN operating directly on the VV/VH time series, scored 10 percentage points lower at 75% accuracy.

The high performance of the RF+RVI approach can be attributed to:

The derived polarization ratios captured in RVI provide meaningful geospatial features
Ensemble modeling reduces overfitting compared to deep CNNs
Interpretability of RFs allows analysis of important variables
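The point about interpretability can be illustrated with scikit-learn's impurity-based feature importances. This is a sketch on synthetic data in which only one time step carries signal; it is not the project's own analysis, and the sizes and column meaning are hypothetical.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

# Synthetic stand-in: 150 farms x 8 time steps, where only step 3 is informative.
rng = np.random.default_rng(7)
n_farms, n_steps = 150, 8
X = rng.normal(size=(n_farms, n_steps))
y = (X[:, 3] > 0).astype(int)

model = RandomForestClassifier(n_estimators=200, random_state=7).fit(X, y)

# Impurity-based importances, one per input column, summing to 1.
importances = model.feature_importances_
ranking = np.argsort(importances)[::-1]  # most important time step first

for step in ranking[:3]:
    print(f"time step {step}: importance {importances[step]:.3f}")
```

With real time series, a ranking like this indicates which acquisition dates the forest relies on most, which can be checked against the crop calendar.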

Given its 10-point accuracy margin over the next best model on the external EY test data, the RF+RVI approach appears to generalize well even with limited training samples. The trained model has been saved for use in crop type mapping.

Future iterations could use more recent hyperparameter optimization techniques, such as Bayesian optimization combined with Hyperband (BOHB), to further improve predictive power.

KNguyen37/EY_Rice_Mapping
