If you find this work helpful, please consider citing the following paper:
Bibtex entry:
@article{fu2024well,
title={Well-Log-Based Reservoir Property Estimation With Machine Learning: A Contest Summary},
author={Fu, Lei and Yu, Yanxiang and Xu, Chicheng and Ashby, Michael and McDonald, Andrew and Pan, Wen and Deng, Tianqi and Szab{\'o}, Istv{\'a}n and Hanzelik, P{\'a}l P and Kalm{\'a}r, Csilla and others},
journal={Petrophysics},
volume={65},
number={01},
pages={108--127},
year={2024},
publisher={SPWLA}
}
Winner Team | Contact | |
---|---|---|
1st Place | UTFE | Wen Pan [email protected] |
Tianqi Deng [email protected] | ||
2nd Place | MoLPhy | István Szabó [email protected] |
Pál Hanzelik [email protected] | ||
Csilla Kalmár [email protected] | ||
3rd Place | Tomsk | |
4th Place | Atwah_Analytics | Saleh Alatwah [email protected] |
5th Place | Jaehyuk Lee | Jaehyuk Lee [email protected] |
Rank | Team Name | Best Score | Best Solution | Notebook |
---|---|---|---|---|
1 | UTFE | 0.0525 | Linear Regression | Notebook |
2 | MoLPhy | 0.0602 | Ensemble model | Notebook |
3 | Tomsk | 0.0634 | Neural network | Notebook |
4 | Atwah_Analytics | 0.0667 | ExtraTree and Catboost regression | Notebook |
5 | Jaehyuk Lee | 0.0775 | LightGBM | Notebook |
6 | curioso | 0.0833 | Neural network | Notebook |
7 | GGAIS | 0.0975 | XGBoost | Notebook |
8 | Iron486 | 0.0982 | Neural network | Notebook |
9 | VE4F | 0.1006 | Regression boosted trees | Notebook |
10 | dirtycats | 0.1021 | Gradient boosting | Notebook |
11 | DeepPlus | 0.1053 | LightGBM | Notebook |
12 | PDDA (Benchmark) | 0.11718 | Random Forest | SPWLA_2021_ML_Tutorial.ipynb |
13 | Geolatinas | 0.1250 | Artificial neural network | Notebook |
14 | Geo_ML | 0.13071 | CatBoost | Notebook |
15 | Team_Zotrex | 0.3773 | Multilayer perceptron (MLP) | |
BroodingPixel | 0.0751 | |||
GeoData | 0.0779 | |||
Emerson_BRASAR | 0.0782 | |||
white_hats | 0.0826 | |||
PetroVirago | 0.0830 | |||
ThePandas | 0.0839 | |||
CUP_AI_Hogwarts | 0.0856 | |||
BraveTeam | 0.0920 | |||
ML_developers_2022 | 0.0935 | |||
CUP_AI_Bamboo | 0.0940 | |||
PetroMachine | 0.0942 | |||
MachineLearningWhileDrilling | 0.0958 | |||
Moyu | 0.0961 | |||
Lowest_RMSE | 0.0977 | |||
UESTC_BG | 0.0988 | |||
LEARNERS | 0.0997 | |||
Rocky-AI | 0.1003 | |||
Cup_Melon_Eaters | 0.1007 | |||
EarthAnalyst | 0.1008 | |||
BHKU | 0.1020 | |||
Schooners | 0.1025 | |||
OxyPetroML | 0.1030 | |||
I.DesRochesBot | 0.1062 | |||
Bigdatalogging | 0.1089 | |||
ProbePetrophysics | 0.1131 | |||
Fourieous_Transformers | 0.1155 | |||
North | 0.1167 | |||
ton_osk118 | 0.1172 | |||
AI logging team | 0.1174 | |||
outlier_detectives | 0.1175 | |||
YYDS | 0.1326 | |||
Quantum Energy | 0.2134 | |||
Gen_Zheng_Miao_Hong_Team | 0.3149 |
Please use the team leader's email to register. The link was sent in the email.
Please note that:
- Only one user can register for the competition per team.
- The user name has to be exactly the same as the team name. If space is not allowed, please replace space with underscore '_'.
- The submission file must be a zip file with whatever name. However, the name of the csv file inside the zip has to be "submission.csv". (the csv file should be in the same format as “example_1.csv” file on the GitHub page in terms of number of columns and rows, as well as exactly the same column names)
- The submission status might need a couple minutes to be updated, don't refresh the page too often.
- The user needs to manually submit their best results to the leaderboard. Click "Participate", " Submit / View Results", click the "+" symbol in your submission. See the red circles in the attached figure.
- Please use version-control properly, as we need to validate your code and reproduce the results of the final submitted score in order to rank your team in the final scoreboard.
- Max submissions per day: 3
- Max submissions total: 100
Lei Fu, Yanxiang Yu, Chicheng Xu, Michael Ashby, McDonald Andy, Bin Dai
Well logs are interpreted/processed to estimate the in-situ reservoir properties (petrophysical, geomechanical, and geochemical), which is essential for reservoir modeling, reserve estimation, and production forecasting. The modeling is often based on multi-mineral physics or empirical formulae. When sufficient amount of training data is available, machine learning solution provides an alternative approach to estimate those reservoir properties based on well log data and is usually with less turn-around time and human involvements.
The goal of this contest is to develop data-driven models to estimate reservoir properties including shale volume, porosity, and fluid saturation, based on a common set of well logs including gamma ray, bulk density, neutron porosity, resistivity, and sonic.
You will be provided with log data from about 10 wells from the same field together with the corresponding reservoir properties estimated by petrophysicists. You need to build a data-driven model using the provided training dataset. Following that, you will deploy the newly developed data-driven models on the test dataset to predict the reservoir properties based on the well log data.
Submissions are evaluated according to root mean squared error(RMSE) calculated from the shale volume (VSH), porosity (PHIF), and water saturation (SW) values of the hidden dataset. The value of the hidden dataset is between 0 and 1.
- Here \hat{y_i} is the predicted values of the true values y_i. Both \hat{y_i} and y_i are vectors with 3 elements: shale volume (VSH), porosity (PHIF), and water saturation (SW).
- m is sample size.
- October 15, 2021 - Registration deadline. You must email Lei Fu ([email protected]) with team information (team name, member names, affiliations, and emails) before this date in order to compete.
- November 1, 2021 - Competition starts and data releases on github.
- February 1, 2022 - Submission deadline.
- March 1, 2022 - Announce winners.
- March 23/24, 2022 - Award ceremony and presentations in the special session of the SPWLA Spring Topical Conference - Petrophysical Machine Learning.
All deadlines are at 11:59 PM UTC on the corresponding day unless otherwise noted. The competition organizers reserve the right to update the contest timeline if they deem it necessary.
- Contestant can be an individual or a group with the maximum size of 4.
- The contest focuses on data-driven methods, the use of additional data or petrophysical equations is not allowed.
- Privately sharing code or data outside of teams is not permitted. However, it's okay to share code if made available to all participants on the competition Github repository via submitting issues or pull requests.
- A contestant will submit the estimated reservoir properties for testing wells.
- A contestant will submit the source code and a brief report documenting the accuracy achieved in a few plots.
- The judges will review the source code.
- The performance of the model will be quantified in terms of root mean square error (RMSE).
- A leaderboard will be updating the rank of submissions from each team.
- The contestant with the best quality source code and the best performance will be declared the winner for this competition.
- 1st Place - $500
- 2nd Place - $400
- 3rd Place - $300
- 4th Place - $200
- 5th Place - $100
Top 5 winning teams will be awarded with prizes(NOT in cash).
Note: The winners will additionally be required to provide a detailed description of their method in order to claim the prize (minimum of 2 pages double-column) by February 15, 2022, which is two weeks after the competition has concluded.
Novel and practical algorithms will be recommended for a submission to the a SPWLA special issue by PDDA or a journal paper.
The data comes from VOLVE dataset owned by Equinor.
DATA ACCESS AND USE: Creative Commons Attribution-NonCommercial-ShareAlike license.
ENTRY IN THIS COMPETITION CONSTITUTES YOUR ACCEPTANCE OF THESE OFFICIAL COMPETITION RULES.
The Competition named above is a skills-based competition to promote and further the field of data science. You must submit your registration to [email protected] to enter. Your competition submissions ("Submissions") must conform to the requirements stated on the Competition Website. Your Submissions will be scored based on the evaluation metric described on the Competition Website. Subject to compliance with the Competition Rules, Prizes, if any, will be awarded to participants with the best scores, based on the merits of the data science models submitted. Check the competition website for the complete Competition Rules.
Lei Fu, Yanxiang Yu, Chicheng Xu, Andy McDonald, Michael Ashby.