Estimating Total Organic Carbon of Potential Source Rocks in the Espírito Santo Basin, SE Brazil, Using XGBoost

Marine and petroleum geology(2024)

Cited 0|Views13
No score
Abstract
Identifying and constraining source rocks is critical for petroleum system modeling and risk assessment. Traditional methods for total organic carbon (TOC) estimation based on well logs have limitations, leading to the emergence of machine learning techniques like XGBoost. This study compiled a comprehensive data set of well log and geochemical data from the Espírito Santo Basin, SE Brazil, and XGBoost, which was integrated with pySpark, was used for running multiple machine learning models to predict TOC. Parameter tuning was performed by randomly combining model configurations over multiple replication data frames. XGBoost effectively predicted TOC, yielding a coefficient of determination R2 of 0.71, RMSE of 0.55 and MAE of 0.30, based on the average of all 10-fold cross-validation test sets. Heteroscedasticity was observed, possibly related to the presence of outliers in the target TOC variable, which may be linked to variable organic-matter deposition and preservation processes through the geological time, such as during Oceanic Anoxic Events (OAEs). The results indicate the potential of machine learning for TOC prediction in large, heterogeneous data sets, outperforming the traditional ΔlogR method and offering a promising tool for the usage of available public data sets in similar applications, such as the oil and gas (O&G) industry's exploration phase or field reassessment.
More
Translated text
Key words
Geological database,Machine learning,XGBoost regression,Total organic carbon (TOC),Parameter tuning
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined