✓ Topic: Crude Oil Price Projection
Crude Oil Price Projection
✓ Reason why we selected this topic
For this project, we intend to develop a robust machine learning model that will deliver a one-year projection for the price of West Texas Intermediate (WTI) Crude Oil, one of the most well-known and widely produced blends of crude in the United States. To do so, we will be examining historical and current data from the Energy Information Agency (EIA), a federal agency that tracks production, sales, and spot & futures prices of WTI Crude. The price projection will be a function of oil production, refinery utilization and capacity, and sales.
✓ Questions we hope to answer with the data
Where will the data be sourced from?
How will the data be data be transformed and stored?
Which machine learning models and libraries should be used?
How will the results be visualized?
Which model produces the most accurate projections for WTI?
✓ Description of their source of data
Gathering data from U.S. Energy Information Administration (EIA)
Crude Oil Production dataset
The volume of crude oil produced from oil reservoirs during given periods of time. The amount of such production for a given period is measured as volumes delivered from lease storage tanks (i.e., the point of custody transfer) to pipelines, trucks, or other media for transport to refineries or terminals with adjustments for (1) net
differences between opening and closing lease inventories, and (2) basic sediment and water (BS&W).
Product Supplied dataset
Approximately represents consumption of petroleum products because it measures the disappearance of these products from primary sources, i.e., refineries, natural gas-processing plants, blending plants, pipelines, and bulk terminals. In general, Product Supplied of each product in any given period is computed as follows: Product supplied = field production + refinery production + imports + unaccounted for crude oil (+ net receipts when calculated on a PAD District basis) - stock change - crude oil losses - refinery inputs - exports
Refinery Utilization and Capacity dataset
Ratio of the total amount of crude oil, unfinished oils, and natural gas plant liquids run through crude oil distillation units to the operable capacity of these units.
NYMEX Futures Prices dataset
New York Mercantile Exchange, The price quoted for delivering a specified quantity of a commodity at a specified time and place in the future.
WTI Crude SPOT Prices Historical dataset
West Texas Intermediate, The price for a one-time open market transaction for immediate delivery of a specific quantity of product at a specific location where the commodity is purchased "on the spot" at current market rates.
✓ Technologies, languages, tools, and algorithms used throughout the project
Data Cleaning
Database Storage
Machine Learning
Dashboard
Slides
✓ Result of analysis
✓ Summary
✓ Recommendation for future analysis
✓ Anything the team would have done differently