You are on page 1of 17

Leveraging Existing

Databases for Advanced


Analytics of the future
Chris Hanton
Perigon
James Miller
Anadarko Petroleum
Agenda
• Big Data – Here to Stay
• Leveraging Legacy data
• Autoloaders and Integration
• Case Study – Anadarko Corporate Core Repository
• Existing Challenges and goals of the solution
• Anadarko Customized Solution
• Conclusion

2
Big Data in Oil and Gas
• Growing consensus is that Oil and Gas is on the cusp of a new
era…. digital technologies looks set to reshape it – World Economic
Forum (2017)(1)
• 74% of companies investing in ‘Big Data/Advanced Analytics’ over
next 5 years – Accenture (2016)(2)
• $425 billion Potential value addition for the industry – World
Economic Forum (2017)(1)

3
Data Availability
• 650,000 Exploration and
Development Wells drilled
between 1970 and 2010 - EIA(3)
• 130,000+ wells since 2010 -
drillinginfo.com(4)

• Data coverage is not a problem


- But how to best utilize it?

Active wells in the USA 2017 – Washington Post(4)

4
Challenges of Legacy Data
• Paper copy vs Digital – and how digital is digital?
• Report format Consistency
• Lost/incorrect UWI’s
• Duplicate Data and conflicting vintages
• Accuracy Concerns

Clean data a necessity for building models

5
Leveraging Existing DB’s
• Majority of companies leveraging advanced analytics have pre-
existing legacy databases

3 Primary Challenges in utilizing these DB’s:


1. Ascertaining accuracy of stored data (for another presentation…)
2. Efficiently adding more data
3. Integration with Advanced Analytic tools

6
A commonly observed data workflow
A linear chain of data transfer results in a series of
bottlenecks and redundancies

Accessed by Applied to
Aggregation Load/QC Further QC
Geoscientists model

DM DM DM GEO GEO
Further QC
and
Data Data
Data loaded, manipulation
compiled provided to
naming and of data into Final load of
from multiple geoscientists
unit standard format data
sources prior upon request
applied required to
to initial load as flat files
load to
application

7
A commonly observed data workflow
Logs Geochem

Accessed by Applied to Accessed by Applied to


Aggregation Load/QC Further QC Aggregation Load/QC Further QC
Geoscientists model Geoscientists model

Core
Core Formation Tops

Accessed by Applied to Accessed by Applied to


Aggregation Load/QC Further QC Aggregation Load/QC Further QC
Geoscientists model Geoscientists model

PVT Other…

Accessed by Applied to Accessed by Applied to


Aggregation Load/QC Further QC Aggregation Load/QC Further QC
Geoscientists model Geoscientists model

8
A new approach to data transfer
• Conventional manual population workflows not scalable to ‘big data’

• We need efficient population of DB’s with clean standardized data

• Ability to feed directly into Advanced Analytics, allowing consumers


to choose type and variety of data

9
A Suite of Solutions
• Intelligent Autoloaders
• Drop box enabled loaders that allow implementation of business rules
on data as it’s loaded
• QC, Standardization, Renaming, Conversions
• GQL Connections
• Proprietary tool to connect with any JDBC compliant database
• Import, Compare, Visualize data from any connected source
• Database Views
• Exposing the database for querying by 3rd party apps for further
integration

10
Case Study – Anadarko Petroleum (APC)
• Requirement to utilize Anadarko’s extensive core records
• Digital Data, Documents, Images

• Data in no organized corporate repository


• In various hard-drives, applications, vendor sites, optical media

• Conflicting vintages and sources of data


• Acquisitions, re-runs of data, interpreted vs raw data

11
Data Management With APC

12
The Flaws of this System
Uncertainty in data No Naming Standards
quality used in modelling
Increased Cost Repeated steps of data
handling/formatting
Lack of Geotechs with specialized
Inconsistency between
knowledge of data trends
assets and techs
Time searching for data Lost Data
Inability to gather and Ever-growing alias tables
analyze proprietary data Maintaining redundant
databases systems No Process for data trades
No MDM Solutions
Time of manual data
transfers to applications
Lack of Competitive
advantage Repeated purchasing of data Lack of Efficiency
from vendor

13
Anadarko’s Aims
• True DB format for direct access to database for analytics
• Repository for interpreters
• Easy to search/view/retrieve data
• Tools for standardization
• Clear data structure
• Able to see all available data in a single view

14
Automated & Integrated

Autoloaders

iPointWeb
15
References
1. World Economic Forum - Digital Transformation Initiative Oil and Gas Industry
http://reports.weforum.org/digital-transformation/wp-content/blogs.dir/94/mp/files/pages/files/dti-oil-
and-gas-industry-white-paper.pdf
2. Accenture – The 2016 Upstream Oil and Gas Digital Trends Survey https://www.accenture.com/us-
en/insight-2016-upstream-oil-gas-digital-trends-survey
3. EIA – US Crude Oil and Exploratory and Developmental Wells Drilled
https://www.eia.gov/dnav/pet/hist/LeafHandler.ashx?n=pet&s=e_ertwo_xwc0_nus_c&f=a
4. Washington Post - The United States of oil and gas
https://www.washingtonpost.com/graphics/national/united-states-of-oil/

16
Thank You!
Chris Hanton

chanton@perigonsolutions.com

James Miller

james.miller@anadarko.com

17

You might also like