Biodentify\u2019s Patented Technology<\/strong><\/h3>\n\n\n\nBiodentify’s patented technology is based on the response elicited in the shallow soil (<30cm) microbial population by hydrocarbons which have migrated from depth to the surface through vertical microseepage. The trace hydrocarbons released are otherwise undetectable, but the bacterial population acts as an extremely sensitive indicator. Samples of 0.15 cm3<\/sup> are collected from the cuttings and analyzed for microbial DNA content leading to a characteristic fingerprint, from which final predictions can be made. Cutting samples originate from 10-3200 m in depth, in wells drilled between the 1960s-2010s. <\/p>\n\n\n\n <\/figure><\/div>\n\n\n\nProductivity predicted with cuttings samples using 69 DNA strings as bio-speicies markers. Total accuracy is 82% correct prediction, correct predictions denoted with white dots.<\/em><\/p>\n\n\n\n <\/figure><\/li> <\/figure><\/li><\/ul><\/figure>\n\n\n\nStored cuttings in core house at Zeist, the Netherlands.<\/em><\/p>\n\n\n\nPlanning and executing the project with our customers<\/strong><\/h3>\n\n\n\nThe project was set up in cooperation with three operators active in the Dutch North Sea. We concentrated on sampling locations with known production statistics, taking a total of 1080 samples. Three different sets of samples were taken: oil producing wells, gas producing wells, and dry wells. <\/p>\n\n\n\n
The selected wells were located across the entire Dutch offshore area with a 50\/50 distribution of wells that did and did not encounter any oil and or gas. Some wells have samples collected from various depths, which are collected at the Dutch core repository in Zeist. This is managed by TNO\/Geological Survey of the Netherlands.<\/p>\n\n\n\n
We analyzed the samples for microbes in our lab, separating the microbial DNA from the soil, ending up with a DNA fingerprint for each location. This data is very vast, hundreds of millions of numerical values, too vast for us humans to make sense of it alone. This is where supercomputing power comes in. The use of our unique and constantly updated DNA database, alongside big data algorithms, machine learning and our own metagenomics pipeline, ensure that the model is constantly enhanced and improved over time to enable us to accurately predict subsurface hydrocarbon presence.<\/p>\n\n\n\n
<\/figure><\/div>\n\n\n\nThree categories of DNA\/productivity data from the cutting samples: 1) training data used to build a model, 2) validation data to test if the model is accurate enough and 3) prediction data to test the prediction of 100 cuttings totally left out when model building.<\/em><\/p>\n\n\n\nFirstly 100 samples were set aside to use in prediction after training the model; the remaining samples were used to perform the training. For building a maximally predictive model, we use three loops, all making use of different data areas. <\/p>\n\n\n\n
Inner loop\/training:<\/strong> 70% of the available training samples are used to find a correlation which is shown in the innermost loop of the above figure.<\/li>Second loop\/cross validation: <\/strong>After constructing a predictive model, it is tested by cross-validation on the 30% left out of the training samples, shown in the second loop of the above figure. This is randomly repeated 100 times. Therefore, cross validation of the 100 predictive models can also be made 100 times. This leads to a total of 10,500 predictions from our samples, which means every sample will be left out on average 30 times. Statistics of these cross-validations are then made via scoring the number of times that a prediction is right vs wrong. <\/li>Third loop\/test: <\/strong>Within these 100 models, predictions can be made on the 100 test samples that were set aside, shown in the outer loop of the above figure. This is done 10 times with different random chosen test sets. Results of these predictions have been shown to be 82% accurate, on average, for the 10 test sets (see first figure above). In the case of an over fitted model, we would see good calibration but bad predictions.<\/li><\/ol>\n\n\n\nTesting causality of the models<\/strong><\/h3>\n\n\n\nThe high accuracy obtained means that there is a significant difference between the DNA fingerprints of oil and gas wells versus those of dry wells. This can be seen as statistical proof for microseepage. To link the model to microseepage by causality, we investigated what is known about the microbes selected by the machine learning model. These selected microbes have been well investigated in scientific literature. <\/p>\n\n\n\n
Scientific research performed worldwide sees relationships between the degradation of oil and gas and abundances of certain microbes. We can clearly see that the microbes found by the machine learning model which are found above oil and gas wells are described in literature as dominantly degrading hydrocarbons for their metabolism (hydrocarbon lovers). Vice versa that microbes which are more abundant above dry wells by the machine learning model find hydrocarbons toxic (hydrocarbon haters) concluded from independent scientific research. <\/p>\n\n\n\n <\/figure><\/li> <\/figure><\/li><\/ul><\/figure>\n\n\n\nSelected microbes in the model above oil and gas wells (left figure) vs selected microbes above dry wells (right figure) and their link to literature. The size of the contribution to the pie chart microbe indicates the weight the microbe gets in the model (measured for bio-species marker importance).<\/em><\/p>\n\n\n\nConclusions and future work<\/strong><\/h3>\n\n\n\nThe high accuracy of differences in DNA fingerprints from cuttings above oil and gas wells versus dry wells gives us great confidence that we can use these samples, stored in national or operator run core houses, to de-risk new well locations above prospects. Although this is an exciting and cost effective way of prediction, new areas do not have cuttings available. Because of this we need to be able to predict hydrocarbon presence based on seabed samples. <\/p>\n\n\n\n
Next step is to link the results from cuttings samples to seabed samples. This has to be systematically tested with a large seabed sample set. Currently this exercise is being discussed for the Dutch North Sea. Additionally, we are in proposal discussions on an extended seabed survey with a number of operators in the Norwegian Continental Shelf. This small consortium is still open for other operators to join \u2013 more on this will come out shortly but please reach out to us if you are keen to hear more on this exciting opportunity. <\/p>\n\n\n\n
About Biodentify<\/strong><\/h3>\n\n\n\nBiodentify was founded as a spin-out from Dutch R&D group TNO in December 2014. TNO is a 3000+ R&D organization and the largest microbiological research group in the Netherlands. Biodentify is owned and managed by three partners with extensive entrepreneurial experience bringing innovative new technologies to market in the Oil & Gas industry. The company has developed a novel and patented technology, predicting prospectivity before drilling, with > 70% accuracy, based on microbial DNA analysis of shallow soil or seabed samples through unique machine learning models. Biodentify was named the #58 most innovative company in 2019 by the KvK (Chamber of Commerce).<\/p>\n","protected":false},"excerpt":{"rendered":"
Biodentify predicts Oil & Gas productivity offshore with 82% accuracy. Another ‘proof of technology’ pilot, this time using cuttings from the Dutch North Sea.<\/p>\n","protected":false},"author":4,"featured_media":7782,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_seopress_robots_primary_cat":"none","_seopress_titles_title":"","_seopress_titles_desc":"","_seopress_robots_index":"","content-type":"","footnotes":""},"categories":[17],"tags":[],"_links":{"self":[{"href":"https:\/\/biodentify.ai\/wp-json\/wp\/v2\/posts\/7745"}],"collection":[{"href":"https:\/\/biodentify.ai\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/biodentify.ai\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/biodentify.ai\/wp-json\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/biodentify.ai\/wp-json\/wp\/v2\/comments?post=7745"}],"version-history":[{"count":0,"href":"https:\/\/biodentify.ai\/wp-json\/wp\/v2\/posts\/7745\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/biodentify.ai\/wp-json\/wp\/v2\/media\/7782"}],"wp:attachment":[{"href":"https:\/\/biodentify.ai\/wp-json\/wp\/v2\/media?parent=7745"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/biodentify.ai\/wp-json\/wp\/v2\/categories?post=7745"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/biodentify.ai\/wp-json\/wp\/v2\/tags?post=7745"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}