Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 430 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 64.4 KiB |
| Average record size in memory | 153.3 B |
Variable types
| Categorical | 1 |
|---|---|
| Numeric | 11 |
Date has a high cardinality: 430 distinct values | High cardinality |
Year is highly correlated with Product_Supplied and 5 other fields | High correlation |
Product_Supplied is highly correlated with Year and 5 other fields | High correlation |
Refinery_Input is highly correlated with Year and 7 other fields | High correlation |
Operable_Dist_Capacity is highly correlated with Year and 5 other fields | High correlation |
Operating_Dist_Capacity is highly correlated with Year and 5 other fields | High correlation |
Idle_Dist_Capacity is highly correlated with Refinery_Input and 1 other fields | High correlation |
Percent_Util is highly correlated with Refinery_Input and 1 other fields | High correlation |
Future_Price is highly correlated with Year and 5 other fields | High correlation |
Spot_Price is highly correlated with Year and 5 other fields | High correlation |
Year is highly correlated with Product_Supplied and 5 other fields | High correlation |
Product_Supplied is highly correlated with Year and 3 other fields | High correlation |
Refinery_Input is highly correlated with Year and 6 other fields | High correlation |
Operable_Dist_Capacity is highly correlated with Year and 5 other fields | High correlation |
Operating_Dist_Capacity is highly correlated with Year and 5 other fields | High correlation |
Idle_Dist_Capacity is highly correlated with Percent_Util | High correlation |
Percent_Util is highly correlated with Refinery_Input and 1 other fields | High correlation |
Future_Price is highly correlated with Year and 4 other fields | High correlation |
Spot_Price is highly correlated with Year and 4 other fields | High correlation |
Year is highly correlated with Refinery_Input and 4 other fields | High correlation |
Product_Supplied is highly correlated with Refinery_Input | High correlation |
Refinery_Input is highly correlated with Year and 3 other fields | High correlation |
Operable_Dist_Capacity is highly correlated with Year and 4 other fields | High correlation |
Operating_Dist_Capacity is highly correlated with Year and 4 other fields | High correlation |
Idle_Dist_Capacity is highly correlated with Percent_Util | High correlation |
Percent_Util is highly correlated with Idle_Dist_Capacity | High correlation |
Future_Price is highly correlated with Year and 3 other fields | High correlation |
Spot_Price is highly correlated with Year and 3 other fields | High correlation |
Year is highly correlated with Total_Production and 7 other fields | High correlation |
Total_Production is highly correlated with Year and 7 other fields | High correlation |
Product_Supplied is highly correlated with Year and 6 other fields | High correlation |
Refinery_Input is highly correlated with Year and 7 other fields | High correlation |
Operable_Dist_Capacity is highly correlated with Year and 8 other fields | High correlation |
Operating_Dist_Capacity is highly correlated with Year and 7 other fields | High correlation |
Idle_Dist_Capacity is highly correlated with Operable_Dist_Capacity and 1 other fields | High correlation |
Percent_Util is highly correlated with Year and 5 other fields | High correlation |
Future_Price is highly correlated with Year and 6 other fields | High correlation |
Spot_Price is highly correlated with Year and 6 other fields | High correlation |
Date is uniformly distributed | Uniform |
Date has unique values | Unique |
Total_Production has unique values | Unique |
Reproduction
| Analysis started | 2022-02-01 00:06:59.529551 |
|---|---|
| Analysis finished | 2022-02-01 00:07:50.169279 |
| Duration | 50.64 seconds |
| Software version | pandas-profiling v3.1.1 |
| Download configuration | config.json |
| Distinct | 430 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 27.4 KiB |
| Apr-1987 | 1 |
|---|---|
| Nov-2011 | 1 |
| Aug-2021 | 1 |
| May-2001 | 1 |
| Jan-1991 | 1 |
| Other values (425) |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Characters and Unicode
| Total characters | 3440 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 430 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Jan-1986 |
|---|---|
| 2nd row | Feb-1986 |
| 3rd row | Mar-1986 |
| 4th row | Apr-1986 |
| 5th row | May-1986 |
Common Values
| Value | Count | Frequency (%) |
| Apr-1987 | 1 | 0.2% |
| Nov-2011 | 1 | 0.2% |
| Aug-2021 | 1 | 0.2% |
| May-2001 | 1 | 0.2% |
| Jan-1991 | 1 | 0.2% |
| Apr-2016 | 1 | 0.2% |
| Jan-1993 | 1 | 0.2% |
| Sep-2021 | 1 | 0.2% |
| Aug-2014 | 1 | 0.2% |
| May-2004 | 1 | 0.2% |
| Other values (420) | 420 |
Length
| Value | Count | Frequency (%) |
| may-1999 | 1 | 0.2% |
| oct-1994 | 1 | 0.2% |
| apr-1999 | 1 | 0.2% |
| may-2021 | 1 | 0.2% |
| oct-2018 | 1 | 0.2% |
| oct-2008 | 1 | 0.2% |
| nov-2013 | 1 | 0.2% |
| mar-2009 | 1 | 0.2% |
| jun-1987 | 1 | 0.2% |
| apr-2008 | 1 | 0.2% |
| Other values (420) | 420 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 430 | |
| 0 | 430 | |
| 9 | 336 | 9.8% |
| 1 | 334 | 9.7% |
| 2 | 320 | 9.3% |
| a | 108 | 3.1% |
| J | 108 | 3.1% |
| u | 108 | 3.1% |
| e | 107 | 3.1% |
| 8 | 96 | 2.8% |
| Other values (23) | 1063 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1720 | |
| Lowercase Letter | 860 | |
| Dash Punctuation | 430 | 12.5% |
| Uppercase Letter | 430 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 108 | |
| u | 108 | |
| e | 107 | |
| p | 72 | |
| r | 72 | |
| n | 72 | |
| c | 71 | |
| l | 36 | 4.2% |
| g | 36 | 4.2% |
| y | 36 | 4.2% |
| Other values (4) | 142 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 430 | |
| 9 | 336 | |
| 1 | 334 | |
| 2 | 320 | |
| 8 | 96 | 5.6% |
| 6 | 48 | 2.8% |
| 7 | 48 | 2.8% |
| 3 | 36 | 2.1% |
| 4 | 36 | 2.1% |
| 5 | 36 | 2.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 108 | |
| M | 72 | |
| A | 72 | |
| S | 36 | 8.4% |
| F | 36 | 8.4% |
| O | 36 | 8.4% |
| N | 35 | 8.1% |
| D | 35 | 8.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 430 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2150 | |
| Latin | 1290 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 108 | 8.4% |
| J | 108 | 8.4% |
| u | 108 | 8.4% |
| e | 107 | 8.3% |
| p | 72 | 5.6% |
| r | 72 | 5.6% |
| M | 72 | 5.6% |
| A | 72 | 5.6% |
| n | 72 | 5.6% |
| c | 71 | 5.5% |
| Other values (12) | 428 |
Common
| Value | Count | Frequency (%) |
| - | 430 | |
| 0 | 430 | |
| 9 | 336 | |
| 1 | 334 | |
| 2 | 320 | |
| 8 | 96 | 4.5% |
| 6 | 48 | 2.2% |
| 7 | 48 | 2.2% |
| 3 | 36 | 1.7% |
| 4 | 36 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3440 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 430 | |
| 0 | 430 | |
| 9 | 336 | 9.8% |
| 1 | 334 | 9.7% |
| 2 | 320 | 9.3% |
| a | 108 | 3.1% |
| J | 108 | 3.1% |
| u | 108 | 3.1% |
| e | 107 | 3.1% |
| 8 | 96 | 2.8% |
| Other values (23) | 1063 |
| Distinct | 36 |
|---|---|
| Distinct (%) | 8.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2003.418605 |
| Minimum | 1986 |
|---|---|
| Maximum | 2021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.5 KiB |
Quantile statistics
| Minimum | 1986 |
|---|---|
| 5-th percentile | 1987 |
| Q1 | 1994.25 |
| median | 2003 |
| Q3 | 2012 |
| 95-th percentile | 2019.55 |
| Maximum | 2021 |
| Range | 35 |
| Interquartile range (IQR) | 17.75 |
Descriptive statistics
| Standard deviation | 10.35552747 |
|---|---|
| Coefficient of variation (CV) | 0.005168928472 |
| Kurtosis | -1.200417279 |
| Mean | 2003.418605 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 0.001086441007 |
| Sum | 861470 |
| Variance | 107.2369491 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 2003 | 12 | 2.8% |
| 2020 | 12 | 2.8% |
| 2001 | 12 | 2.8% |
| 2000 | 12 | 2.8% |
| 1999 | 12 | 2.8% |
| 1998 | 12 | 2.8% |
| 1997 | 12 | 2.8% |
| 1996 | 12 | 2.8% |
| 1995 | 12 | 2.8% |
| 1994 | 12 | 2.8% |
| Other values (26) | 310 |
| Value | Count | Frequency (%) |
| 1986 | 12 | |
| 1987 | 12 | |
| 1988 | 12 | |
| 1989 | 12 | |
| 1990 | 12 | |
| 1991 | 12 | |
| 1992 | 12 | |
| 1993 | 12 | |
| 1994 | 12 | |
| 1995 | 12 |
| Value | Count | Frequency (%) |
| 2021 | 10 | |
| 2020 | 12 | |
| 2019 | 12 | |
| 2018 | 12 | |
| 2017 | 12 | |
| 2016 | 12 | |
| 2015 | 12 | |
| 2014 | 12 | |
| 2013 | 12 | |
| 2012 | 12 |
Month
Real number (ℝ≥0)
| Distinct | 12 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.476744186 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3.25 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 5.75 |
Descriptive statistics
| Standard deviation | 3.446990323 |
|---|---|
| Coefficient of variation (CV) | 0.5322103551 |
| Kurtosis | -1.211707003 |
| Mean | 6.476744186 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.005611039523 |
| Sum | 2785 |
| Variance | 11.88174229 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 36 | |
| 9 | 36 | |
| 8 | 36 | |
| 7 | 36 | |
| 6 | 36 | |
| 5 | 36 | |
| 4 | 36 | |
| 3 | 36 | |
| 2 | 36 | |
| 1 | 36 | |
| Other values (2) | 70 |
| Value | Count | Frequency (%) |
| 1 | 36 | |
| 2 | 36 | |
| 3 | 36 | |
| 4 | 36 | |
| 5 | 36 | |
| 6 | 36 | |
| 7 | 36 | |
| 8 | 36 | |
| 9 | 36 | |
| 10 | 36 |
| Value | Count | Frequency (%) |
| 12 | 35 | |
| 11 | 35 | |
| 10 | 36 | |
| 9 | 36 | |
| 8 | 36 | |
| 7 | 36 | |
| 6 | 36 | |
| 5 | 36 | |
| 4 | 36 | |
| 3 | 36 |
| Distinct | 430 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 220299.7558 |
| Minimum | 119208 |
|---|---|
| Maximum | 400219 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.5 KiB |
Quantile statistics
| Minimum | 119208 |
|---|---|
| 5-th percentile | 154744.3 |
| Q1 | 173878.25 |
| median | 202056.5 |
| Q3 | 255772.25 |
| 95-th percentile | 349502.25 |
| Maximum | 400219 |
| Range | 281011 |
| Interquartile range (IQR) | 81894 |
Descriptive statistics
| Standard deviation | 59441.36299 |
|---|---|
| Coefficient of variation (CV) | 0.2698203762 |
| Kurtosis | 0.4264539035 |
| Mean | 220299.7558 |
| Median Absolute Deviation (MAD) | 33502.5 |
| Skewness | 1.031730681 |
| Sum | 94728895 |
| Variance | 3533275634 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 200702 | 1 | 0.2% |
| 177488 | 1 | 0.2% |
| 355670 | 1 | 0.2% |
| 177372 | 1 | 0.2% |
| 249843 | 1 | 0.2% |
| 357344 | 1 | 0.2% |
| 342747 | 1 | 0.2% |
| 252374 | 1 | 0.2% |
| 180712 | 1 | 0.2% |
| 167272 | 1 | 0.2% |
| Other values (420) | 420 |
| Value | Count | Frequency (%) |
| 119208 | 1 | |
| 126417 | 1 | |
| 140894 | 1 | |
| 141200 | 1 | |
| 143301 | 1 | |
| 145708 | 1 | |
| 146698 | 1 | |
| 146868 | 1 | |
| 147061 | 1 | |
| 149297 | 1 |
| Value | Count | Frequency (%) |
| 400219 | 1 | |
| 397298 | 1 | |
| 396329 | 1 | |
| 395900 | 1 | |
| 388984 | 1 | |
| 386725 | 1 | |
| 377169 | 1 | |
| 376362 | 1 | |
| 371949 | 1 | |
| 369644 | 1 |
Product_Supplied
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 429 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 574218.2163 |
| Minimum | 436455 |
|---|---|
| Maximum | 671648 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.5 KiB |
Quantile statistics
| Minimum | 436455 |
|---|---|
| 5-th percentile | 499344.05 |
| Q1 | 537078.5 |
| median | 579533 |
| Q3 | 610163.5 |
| 95-th percentile | 641859.5 |
| Maximum | 671648 |
| Range | 235193 |
| Interquartile range (IQR) | 73085 |
Descriptive statistics
| Standard deviation | 45528.3279 |
|---|---|
| Coefficient of variation (CV) | 0.07928750187 |
| Kurtosis | -0.6464705951 |
| Mean | 574218.2163 |
| Median Absolute Deviation (MAD) | 36325.5 |
| Skewness | -0.2964651742 |
| Sum | 246913833 |
| Variance | 2072828641 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 597976 | 2 | 0.5% |
| 627838 | 1 | 0.2% |
| 634167 | 1 | 0.2% |
| 579867 | 1 | 0.2% |
| 617749 | 1 | 0.2% |
| 577827 | 1 | 0.2% |
| 622884 | 1 | 0.2% |
| 557349 | 1 | 0.2% |
| 529703 | 1 | 0.2% |
| 589098 | 1 | 0.2% |
| Other values (419) | 419 |
| Value | Count | Frequency (%) |
| 436455 | 1 | |
| 453209 | 1 | |
| 457486 | 1 | |
| 473425 | 1 | |
| 477269 | 1 | |
| 478339 | 1 | |
| 480896 | 1 | |
| 481482 | 1 | |
| 484163 | 1 | |
| 485354 | 1 |
| Value | Count | Frequency (%) |
| 671648 | 1 | |
| 666357 | 1 | |
| 664448 | 1 | |
| 662039 | 1 | |
| 658099 | 1 | |
| 655895 | 1 | |
| 651864 | 1 | |
| 651790 | 1 | |
| 651274 | 1 | |
| 649104 | 1 |
Refinery_Input
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 411 |
|---|---|
| Distinct (%) | 95.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15007.91395 |
| Minimum | 11759 |
|---|---|
| Maximum | 18041 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.5 KiB |
Quantile statistics
| Minimum | 11759 |
|---|---|
| 5-th percentile | 13074.9 |
| Q1 | 14049.25 |
| median | 15093.5 |
| Q3 | 15803.75 |
| 95-th percentile | 17042.1 |
| Maximum | 18041 |
| Range | 6282 |
| Interquartile range (IQR) | 1754.5 |
Descriptive statistics
| Standard deviation | 1238.984141 |
|---|---|
| Coefficient of variation (CV) | 0.08255538676 |
| Kurtosis | -0.5409443404 |
| Mean | 15007.91395 |
| Median Absolute Deviation (MAD) | 876 |
| Skewness | 0.01498363446 |
| Sum | 6453403 |
| Variance | 1535081.701 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 14538 | 3 | 0.7% |
| 14783 | 2 | 0.5% |
| 13383 | 2 | 0.5% |
| 13097 | 2 | 0.5% |
| 13356 | 2 | 0.5% |
| 14637 | 2 | 0.5% |
| 15638 | 2 | 0.5% |
| 15000 | 2 | 0.5% |
| 14693 | 2 | 0.5% |
| 15768 | 2 | 0.5% |
| Other values (401) | 409 |
| Value | Count | Frequency (%) |
| 11759 | 1 | |
| 12068 | 1 | |
| 12237 | 1 | |
| 12417 | 1 | |
| 12583 | 1 | |
| 12603 | 1 | |
| 12637 | 1 | |
| 12725 | 1 | |
| 12742 | 1 | |
| 12753 | 1 |
| Value | Count | Frequency (%) |
| 18041 | 1 | |
| 17969 | 1 | |
| 17833 | 1 | |
| 17749 | 1 | |
| 17699 | 1 | |
| 17688 | 1 | |
| 17687 | 1 | |
| 17659 | 1 | |
| 17562 | 1 | |
| 17527 | 1 |
Operable_Dist_Capacity
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 242 |
|---|---|
| Distinct (%) | 56.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16841.56279 |
| Minimum | 15028 |
|---|---|
| Maximum | 18976 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.5 KiB |
Quantile statistics
| Minimum | 15028 |
|---|---|
| 5-th percentile | 15186 |
| Q1 | 15686.25 |
| median | 16764 |
| Q3 | 17736 |
| 95-th percentile | 18619.2 |
| Maximum | 18976 |
| Range | 3948 |
| Interquartile range (IQR) | 2049.75 |
Descriptive statistics
| Standard deviation | 1161.536555 |
|---|---|
| Coefficient of variation (CV) | 0.06896845441 |
| Kurtosis | -1.331455769 |
| Mean | 16841.56279 |
| Median Absolute Deviation (MAD) | 1056 |
| Skewness | 0.06030327715 |
| Sum | 7241872 |
| Variance | 1349167.17 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 18808 | 12 | 2.8% |
| 16747 | 11 | 2.6% |
| 17736 | 10 | 2.3% |
| 17594 | 8 | 1.9% |
| 18598 | 7 | 1.6% |
| 16978 | 6 | 1.4% |
| 17672 | 6 | 1.4% |
| 17820 | 5 | 1.2% |
| 15722 | 5 | 1.2% |
| 18976 | 4 | 0.9% |
| Other values (232) | 356 |
| Value | Count | Frequency (%) |
| 15028 | 2 | |
| 15058 | 1 | |
| 15105 | 1 | |
| 15121 | 1 | |
| 15129 | 1 | |
| 15133 | 1 | |
| 15137 | 1 | |
| 15139 | 1 | |
| 15140 | 1 | |
| 15142 | 1 |
| Value | Count | Frequency (%) |
| 18976 | 4 | 0.9% |
| 18808 | 12 | |
| 18641 | 1 | 0.2% |
| 18622 | 3 | 0.7% |
| 18621 | 2 | 0.5% |
| 18617 | 2 | 0.5% |
| 18603 | 3 | 0.7% |
| 18601 | 2 | 0.5% |
| 18598 | 7 | |
| 18571 | 1 | 0.2% |
Operating_Dist_Capacity
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 364 |
|---|---|
| Distinct (%) | 84.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16419.06512 |
| Minimum | 14375 |
|---|---|
| Maximum | 18698 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.5 KiB |
Quantile statistics
| Minimum | 14375 |
|---|---|
| 5-th percentile | 14824.45 |
| Q1 | 15117 |
| median | 16564 |
| Q3 | 17227.5 |
| 95-th percentile | 18450.2 |
| Maximum | 18698 |
| Range | 4323 |
| Interquartile range (IQR) | 2110.5 |
Descriptive statistics
| Standard deviation | 1210.49159 |
|---|---|
| Coefficient of variation (CV) | 0.07372475723 |
| Kurtosis | -1.228651902 |
| Mean | 16419.06512 |
| Median Absolute Deviation (MAD) | 1110 |
| Skewness | 0.1045932304 |
| Sum | 7060198 |
| Variance | 1465289.889 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 16711 | 5 | 1.2% |
| 16921 | 4 | 0.9% |
| 16643 | 4 | 0.9% |
| 16904 | 4 | 0.9% |
| 16134 | 3 | 0.7% |
| 17150 | 3 | 0.7% |
| 15097 | 3 | 0.7% |
| 15081 | 3 | 0.7% |
| 17464 | 3 | 0.7% |
| 15117 | 3 | 0.7% |
| Other values (354) | 395 |
| Value | Count | Frequency (%) |
| 14375 | 1 | |
| 14411 | 1 | |
| 14517 | 1 | |
| 14538 | 1 | |
| 14550 | 1 | |
| 14607 | 1 | |
| 14639 | 1 | |
| 14649 | 1 | |
| 14662 | 1 | |
| 14691 | 1 |
| Value | Count | Frequency (%) |
| 18698 | 1 | |
| 18692 | 2 | |
| 18621 | 1 | |
| 18567 | 1 | |
| 18561 | 2 | |
| 18549 | 1 | |
| 18528 | 2 | |
| 18526 | 1 | |
| 18523 | 1 | |
| 18520 | 1 |
Idle_Dist_Capacity
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 314 |
|---|---|
| Distinct (%) | 73.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 422.527907 |
| Minimum | 32 |
|---|---|
| Maximum | 2651 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.5 KiB |
Quantile statistics
| Minimum | 32 |
|---|---|
| 5-th percentile | 75 |
| Q1 | 158.25 |
| median | 321.5 |
| Q3 | 617.75 |
| 95-th percentile | 949.2 |
| Maximum | 2651 |
| Range | 2619 |
| Interquartile range (IQR) | 459.5 |
Descriptive statistics
| Standard deviation | 337.8876697 |
|---|---|
| Coefficient of variation (CV) | 0.7996813088 |
| Kurtosis | 9.433540024 |
| Mean | 422.527907 |
| Median Absolute Deviation (MAD) | 186.5 |
| Skewness | 2.156327909 |
| Sum | 181687 |
| Variance | 114168.0773 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 135 | 5 | 1.2% |
| 139 | 5 | 1.2% |
| 138 | 5 | 1.2% |
| 146 | 5 | 1.2% |
| 152 | 4 | 0.9% |
| 153 | 4 | 0.9% |
| 75 | 4 | 0.9% |
| 81 | 4 | 0.9% |
| 57 | 4 | 0.9% |
| 35 | 4 | 0.9% |
| Other values (304) | 386 |
| Value | Count | Frequency (%) |
| 32 | 2 | |
| 35 | 4 | |
| 36 | 2 | |
| 37 | 2 | |
| 45 | 1 | 0.2% |
| 49 | 1 | 0.2% |
| 50 | 1 | 0.2% |
| 57 | 4 | |
| 73 | 2 | |
| 74 | 2 |
| Value | Count | Frequency (%) |
| 2651 | 1 | |
| 2569 | 1 | |
| 2331 | 1 | |
| 1488 | 1 | |
| 1483 | 1 | |
| 1478 | 1 | |
| 1283 | 1 | |
| 1244 | 1 | |
| 1107 | 1 | |
| 1103 | 1 |
| Distinct | 163 |
|---|---|
| Distinct (%) | 37.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 89.15976744 |
| Minimum | 70.2 |
|---|---|
| Maximum | 99.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.5 KiB |
Quantile statistics
| Minimum | 70.2 |
|---|---|
| 5-th percentile | 81.3 |
| Q1 | 86 |
| median | 89.5 |
| Q3 | 92.7 |
| 95-th percentile | 96.31 |
| Maximum | 99.9 |
| Range | 29.7 |
| Interquartile range (IQR) | 6.7 |
Descriptive statistics
| Standard deviation | 4.918490392 |
|---|---|
| Coefficient of variation (CV) | 0.05516490826 |
| Kurtosis | 0.5971587334 |
| Mean | 89.15976744 |
| Median Absolute Deviation (MAD) | 3.3 |
| Skewness | -0.5674442978 |
| Sum | 38338.7 |
| Variance | 24.19154773 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 90.6 | 10 | 2.3% |
| 86.5 | 8 | 1.9% |
| 92.5 | 7 | 1.6% |
| 92.6 | 7 | 1.6% |
| 88.9 | 6 | 1.4% |
| 95 | 6 | 1.4% |
| 90.2 | 6 | 1.4% |
| 85.8 | 6 | 1.4% |
| 87.1 | 6 | 1.4% |
| 89.1 | 6 | 1.4% |
| Other values (153) | 362 |
| Value | Count | Frequency (%) |
| 70.2 | 1 | |
| 70.8 | 1 | |
| 72 | 1 | |
| 74.6 | 1 | |
| 75.3 | 1 | |
| 75.9 | 1 | |
| 76.4 | 1 | |
| 76.9 | 1 | |
| 77.9 | 1 | |
| 78.6 | 1 |
| Value | Count | Frequency (%) |
| 99.9 | 1 | 0.2% |
| 99.6 | 1 | 0.2% |
| 99.2 | 1 | 0.2% |
| 99.1 | 1 | 0.2% |
| 98.9 | 1 | 0.2% |
| 98.4 | 1 | 0.2% |
| 97.8 | 1 | 0.2% |
| 97.5 | 3 | |
| 97.2 | 1 | 0.2% |
| 97.1 | 3 |
| Distinct | 306 |
|---|---|
| Distinct (%) | 71.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 44.57232558 |
| Minimum | 11.3 |
|---|---|
| Maximum | 134 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.5 KiB |
Quantile statistics
| Minimum | 11.3 |
|---|---|
| 5-th percentile | 14.945 |
| Q1 | 19.9 |
| median | 33.05 |
| Q3 | 63.5 |
| 95-th percentile | 100.555 |
| Maximum | 134 |
| Range | 122.7 |
| Interquartile range (IQR) | 43.6 |
Descriptive statistics
| Standard deviation | 28.69879024 |
|---|---|
| Coefficient of variation (CV) | 0.6438701562 |
| Kurtosis | -0.3805579035 |
| Mean | 44.57232558 |
| Median Absolute Deviation (MAD) | 15.65 |
| Skewness | 0.834881001 |
| Sum | 19166.1 |
| Variance | 823.620561 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 19.7 | 7 | 1.6% |
| 21.3 | 5 | 1.2% |
| 18.8 | 4 | 0.9% |
| 15.5 | 4 | 0.9% |
| 20 | 4 | 0.9% |
| 49.9 | 4 | 0.9% |
| 19.9 | 4 | 0.9% |
| 59.4 | 4 | 0.9% |
| 17.8 | 4 | 0.9% |
| 18 | 4 | 0.9% |
| Other values (296) | 386 |
| Value | Count | Frequency (%) |
| 11.3 | 1 | |
| 11.6 | 1 | |
| 12 | 1 | |
| 12.5 | 1 | |
| 12.6 | 1 | |
| 12.8 | 1 | |
| 13 | 1 | |
| 13.4 | 2 | |
| 13.7 | 1 | |
| 13.8 | 1 |
| Value | Count | Frequency (%) |
| 134 | 1 | |
| 133.5 | 1 | |
| 125.5 | 1 | |
| 116.7 | 1 | |
| 112.5 | 1 | |
| 110 | 1 | |
| 106.5 | 1 | |
| 106.2 | 2 | |
| 105.4 | 1 | |
| 105.1 | 1 |
| Distinct | 309 |
|---|---|
| Distinct (%) | 71.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 44.55837209 |
| Minimum | 11.3 |
|---|---|
| Maximum | 133.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.5 KiB |
Quantile statistics
| Minimum | 11.3 |
|---|---|
| 5-th percentile | 14.945 |
| Q1 | 19.9 |
| median | 33.3 |
| Q3 | 63.65 |
| 95-th percentile | 100.665 |
| Maximum | 133.9 |
| Range | 122.6 |
| Interquartile range (IQR) | 43.75 |
Descriptive statistics
| Standard deviation | 28.68330756 |
|---|---|
| Coefficient of variation (CV) | 0.6437243151 |
| Kurtosis | -0.3744934178 |
| Mean | 44.55837209 |
| Median Absolute Deviation (MAD) | 15.9 |
| Skewness | 0.8375564287 |
| Sum | 19160.1 |
| Variance | 822.7321325 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 21.3 | 6 | 1.4% |
| 20.1 | 6 | 1.4% |
| 19.7 | 5 | 1.2% |
| 17.9 | 5 | 1.2% |
| 18 | 5 | 1.2% |
| 19.9 | 5 | 1.2% |
| 20.3 | 4 | 0.9% |
| 59 | 3 | 0.7% |
| 71 | 3 | 0.7% |
| 49.8 | 3 | 0.7% |
| Other values (299) | 385 |
| Value | Count | Frequency (%) |
| 11.3 | 1 | |
| 11.6 | 1 | |
| 12 | 1 | |
| 12.5 | 1 | |
| 12.6 | 1 | |
| 12.8 | 1 | |
| 13 | 1 | |
| 13.4 | 1 | |
| 13.5 | 1 | |
| 13.7 | 1 |
| Value | Count | Frequency (%) |
| 133.9 | 1 | |
| 133.4 | 1 | |
| 125.4 | 1 | |
| 116.7 | 1 | |
| 112.6 | 1 | |
| 109.5 | 1 | |
| 106.6 | 1 | |
| 106.3 | 1 | |
| 106.2 | 1 | |
| 105.8 | 1 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| Date | Year | Month | Total_Production | Product_Supplied | Refinery_Input | Operable_Dist_Capacity | Operating_Dist_Capacity | Idle_Dist_Capacity | Percent_Util | Future_Price | Spot_Price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Jan-1986 | 1986 | 1 | 283248 | 498728 | 12583 | 15459 | 14639 | 820 | 81.4 | 23.0 | 22.9 |
| 1 | Feb-1986 | 1986 | 2 | 256855 | 453209 | 12068 | 15485 | 14538 | 947 | 77.9 | 15.5 | 15.5 |
| 2 | Mar-1986 | 1986 | 3 | 279413 | 504565 | 11759 | 15485 | 14517 | 968 | 75.9 | 12.6 | 12.6 |
| 3 | Apr-1986 | 1986 | 4 | 265917 | 478339 | 12603 | 15473 | 14550 | 923 | 81.5 | 12.8 | 12.8 |
| 4 | May-1986 | 1986 | 5 | 273964 | 495789 | 13314 | 15484 | 14805 | 679 | 86.0 | 15.3 | 15.4 |
| 5 | Jun-1986 | 1986 | 6 | 258700 | 481482 | 13347 | 15465 | 14649 | 816 | 86.3 | 13.4 | 13.4 |
| 6 | Jul-1986 | 1986 | 7 | 268448 | 505514 | 13009 | 15475 | 14607 | 868 | 84.1 | 11.6 | 11.6 |
| 7 | Aug-1986 | 1986 | 8 | 259580 | 515167 | 13392 | 15430 | 14807 | 624 | 86.8 | 15.1 | 15.1 |
| 8 | Sep-1986 | 1986 | 9 | 249843 | 477269 | 13191 | 15435 | 14870 | 565 | 85.5 | 14.9 | 14.9 |
| 9 | Oct-1986 | 1986 | 10 | 260984 | 514674 | 12753 | 15435 | 14827 | 608 | 82.6 | 14.9 | 14.9 |
Last rows
| Date | Year | Month | Total_Production | Product_Supplied | Refinery_Input | Operable_Dist_Capacity | Operating_Dist_Capacity | Idle_Dist_Capacity | Percent_Util | Future_Price | Spot_Price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 420 | Jan-2021 | 2021 | 1 | 342747 | 576457 | 14975 | 18143 | 17735 | 408 | 82.5 | 52.1 | 52.0 |
| 421 | Feb-2021 | 2021 | 2 | 273646 | 488438 | 12804 | 18090 | 17526 | 564 | 70.8 | 59.1 | 59.0 |
| 422 | Mar-2021 | 2021 | 3 | 345946 | 595319 | 14834 | 18090 | 17035 | 1055 | 82.0 | 62.4 | 62.3 |
| 423 | Apr-2021 | 2021 | 4 | 336905 | 583781 | 15633 | 18128 | 17553 | 574 | 86.2 | 61.7 | 61.7 |
| 424 | May-2021 | 2021 | 5 | 351346 | 622903 | 16130 | 18128 | 17843 | 285 | 89.0 | 65.2 | 65.2 |
| 425 | Jun-2021 | 2021 | 6 | 338645 | 616115 | 16743 | 18128 | 17910 | 218 | 92.4 | 71.4 | 71.4 |
| 426 | Jul-2021 | 2021 | 7 | 351228 | 616714 | 16482 | 18129 | 17943 | 187 | 90.9 | 72.4 | 72.5 |
| 427 | Aug-2021 | 2021 | 8 | 347393 | 635828 | 16377 | 18130 | 17914 | 216 | 90.3 | 67.7 | 67.7 |
| 428 | Sep-2021 | 2021 | 9 | 324654 | 606706 | 15797 | 18130 | 15800 | 2331 | 87.1 | 71.5 | 71.6 |
| 429 | Oct-2021 | 2021 | 10 | 355670 | 616639 | 15581 | 18132 | 17133 | 999 | 85.9 | 81.2 | 81.5 |