Dataset statistics
Number of variables | 12 |
---|---|
Number of observations | 430 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 64.4 KiB |
Average record size in memory | 153.3 B |
Variable types
Categorical | 1 |
---|---|
Numeric | 11 |
Date has a high cardinality: 430 distinct values | High cardinality |
Year is highly correlated with Product_Supplied and 5 other fields | High correlation |
Product_Supplied is highly correlated with Year and 5 other fields | High correlation |
Refinery_Input is highly correlated with Year and 7 other fields | High correlation |
Operable_Dist_Capacity is highly correlated with Year and 5 other fields | High correlation |
Operating_Dist_Capacity is highly correlated with Year and 5 other fields | High correlation |
Idle_Dist_Capacity is highly correlated with Refinery_Input and 1 other fields | High correlation |
Percent_Util is highly correlated with Refinery_Input and 1 other fields | High correlation |
Future_Price is highly correlated with Year and 5 other fields | High correlation |
Spot_Price is highly correlated with Year and 5 other fields | High correlation |
Year is highly correlated with Product_Supplied and 5 other fields | High correlation |
Product_Supplied is highly correlated with Year and 3 other fields | High correlation |
Refinery_Input is highly correlated with Year and 6 other fields | High correlation |
Operable_Dist_Capacity is highly correlated with Year and 5 other fields | High correlation |
Operating_Dist_Capacity is highly correlated with Year and 5 other fields | High correlation |
Idle_Dist_Capacity is highly correlated with Percent_Util | High correlation |
Percent_Util is highly correlated with Refinery_Input and 1 other fields | High correlation |
Future_Price is highly correlated with Year and 4 other fields | High correlation |
Spot_Price is highly correlated with Year and 4 other fields | High correlation |
Year is highly correlated with Refinery_Input and 4 other fields | High correlation |
Product_Supplied is highly correlated with Refinery_Input | High correlation |
Refinery_Input is highly correlated with Year and 3 other fields | High correlation |
Operable_Dist_Capacity is highly correlated with Year and 4 other fields | High correlation |
Operating_Dist_Capacity is highly correlated with Year and 4 other fields | High correlation |
Idle_Dist_Capacity is highly correlated with Percent_Util | High correlation |
Percent_Util is highly correlated with Idle_Dist_Capacity | High correlation |
Future_Price is highly correlated with Year and 3 other fields | High correlation |
Spot_Price is highly correlated with Year and 3 other fields | High correlation |
Year is highly correlated with Total_Production and 7 other fields | High correlation |
Total_Production is highly correlated with Year and 7 other fields | High correlation |
Product_Supplied is highly correlated with Year and 6 other fields | High correlation |
Refinery_Input is highly correlated with Year and 7 other fields | High correlation |
Operable_Dist_Capacity is highly correlated with Year and 8 other fields | High correlation |
Operating_Dist_Capacity is highly correlated with Year and 7 other fields | High correlation |
Idle_Dist_Capacity is highly correlated with Operable_Dist_Capacity and 1 other fields | High correlation |
Percent_Util is highly correlated with Year and 5 other fields | High correlation |
Future_Price is highly correlated with Year and 6 other fields | High correlation |
Spot_Price is highly correlated with Year and 6 other fields | High correlation |
Date is uniformly distributed | Uniform |
Date has unique values | Unique |
Total_Production has unique values | Unique |
Reproduction
Analysis started | 2022-02-01 00:06:59.529551 |
---|---|
Analysis finished | 2022-02-01 00:07:50.169279 |
Duration | 50.64 seconds |
Software version | pandas-profiling v3.1.1 |
Download configuration | config.json |
Distinct | 430 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 27.4 KiB |
Apr-1987 | 1 |
---|---|
Nov-2011 | 1 |
Aug-2021 | 1 |
May-2001 | 1 |
Jan-1991 | 1 |
Other values (425) |
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Characters and Unicode
Total characters | 3440 |
---|---|
Distinct characters | 33 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 430 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | Jan-1986 |
---|---|
2nd row | Feb-1986 |
3rd row | Mar-1986 |
4th row | Apr-1986 |
5th row | May-1986 |
Common Values
Value | Count | Frequency (%) |
Apr-1987 | 1 | 0.2% |
Nov-2011 | 1 | 0.2% |
Aug-2021 | 1 | 0.2% |
May-2001 | 1 | 0.2% |
Jan-1991 | 1 | 0.2% |
Apr-2016 | 1 | 0.2% |
Jan-1993 | 1 | 0.2% |
Sep-2021 | 1 | 0.2% |
Aug-2014 | 1 | 0.2% |
May-2004 | 1 | 0.2% |
Other values (420) | 420 |
Length
Value | Count | Frequency (%) |
may-1999 | 1 | 0.2% |
oct-1994 | 1 | 0.2% |
apr-1999 | 1 | 0.2% |
may-2021 | 1 | 0.2% |
oct-2018 | 1 | 0.2% |
oct-2008 | 1 | 0.2% |
nov-2013 | 1 | 0.2% |
mar-2009 | 1 | 0.2% |
jun-1987 | 1 | 0.2% |
apr-2008 | 1 | 0.2% |
Other values (420) | 420 |
Most occurring characters
Value | Count | Frequency (%) |
- | 430 | |
0 | 430 | |
9 | 336 | 9.8% |
1 | 334 | 9.7% |
2 | 320 | 9.3% |
a | 108 | 3.1% |
J | 108 | 3.1% |
u | 108 | 3.1% |
e | 107 | 3.1% |
8 | 96 | 2.8% |
Other values (23) | 1063 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1720 | |
Lowercase Letter | 860 | |
Dash Punctuation | 430 | 12.5% |
Uppercase Letter | 430 | 12.5% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 108 | |
u | 108 | |
e | 107 | |
p | 72 | |
r | 72 | |
n | 72 | |
c | 71 | |
l | 36 | 4.2% |
g | 36 | 4.2% |
y | 36 | 4.2% |
Other values (4) | 142 |
Decimal Number
Value | Count | Frequency (%) |
0 | 430 | |
9 | 336 | |
1 | 334 | |
2 | 320 | |
8 | 96 | 5.6% |
6 | 48 | 2.8% |
7 | 48 | 2.8% |
3 | 36 | 2.1% |
4 | 36 | 2.1% |
5 | 36 | 2.1% |
Uppercase Letter
Value | Count | Frequency (%) |
J | 108 | |
M | 72 | |
A | 72 | |
S | 36 | 8.4% |
F | 36 | 8.4% |
O | 36 | 8.4% |
N | 35 | 8.1% |
D | 35 | 8.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 430 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 2150 | |
Latin | 1290 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 108 | 8.4% |
J | 108 | 8.4% |
u | 108 | 8.4% |
e | 107 | 8.3% |
p | 72 | 5.6% |
r | 72 | 5.6% |
M | 72 | 5.6% |
A | 72 | 5.6% |
n | 72 | 5.6% |
c | 71 | 5.5% |
Other values (12) | 428 |
Common
Value | Count | Frequency (%) |
- | 430 | |
0 | 430 | |
9 | 336 | |
1 | 334 | |
2 | 320 | |
8 | 96 | 4.5% |
6 | 48 | 2.2% |
7 | 48 | 2.2% |
3 | 36 | 1.7% |
4 | 36 | 1.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 3440 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 430 | |
0 | 430 | |
9 | 336 | 9.8% |
1 | 334 | 9.7% |
2 | 320 | 9.3% |
a | 108 | 3.1% |
J | 108 | 3.1% |
u | 108 | 3.1% |
e | 107 | 3.1% |
8 | 96 | 2.8% |
Other values (23) | 1063 |
Distinct | 36 |
---|---|
Distinct (%) | 8.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2003.418605 |
Minimum | 1986 |
---|---|
Maximum | 2021 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.5 KiB |
Quantile statistics
Minimum | 1986 |
---|---|
5-th percentile | 1987 |
Q1 | 1994.25 |
median | 2003 |
Q3 | 2012 |
95-th percentile | 2019.55 |
Maximum | 2021 |
Range | 35 |
Interquartile range (IQR) | 17.75 |
Descriptive statistics
Standard deviation | 10.35552747 |
---|---|
Coefficient of variation (CV) | 0.005168928472 |
Kurtosis | -1.200417279 |
Mean | 2003.418605 |
Median Absolute Deviation (MAD) | 9 |
Skewness | 0.001086441007 |
Sum | 861470 |
Variance | 107.2369491 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
2003 | 12 | 2.8% |
2020 | 12 | 2.8% |
2001 | 12 | 2.8% |
2000 | 12 | 2.8% |
1999 | 12 | 2.8% |
1998 | 12 | 2.8% |
1997 | 12 | 2.8% |
1996 | 12 | 2.8% |
1995 | 12 | 2.8% |
1994 | 12 | 2.8% |
Other values (26) | 310 |
Value | Count | Frequency (%) |
1986 | 12 | |
1987 | 12 | |
1988 | 12 | |
1989 | 12 | |
1990 | 12 | |
1991 | 12 | |
1992 | 12 | |
1993 | 12 | |
1994 | 12 | |
1995 | 12 |
Value | Count | Frequency (%) |
2021 | 10 | |
2020 | 12 | |
2019 | 12 | |
2018 | 12 | |
2017 | 12 | |
2016 | 12 | |
2015 | 12 | |
2014 | 12 | |
2013 | 12 | |
2012 | 12 |
Month
Real number (ℝ≥0)
Distinct | 12 |
---|---|
Distinct (%) | 2.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.476744186 |
Minimum | 1 |
---|---|
Maximum | 12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3.25 |
median | 6 |
Q3 | 9 |
95-th percentile | 12 |
Maximum | 12 |
Range | 11 |
Interquartile range (IQR) | 5.75 |
Descriptive statistics
Standard deviation | 3.446990323 |
---|---|
Coefficient of variation (CV) | 0.5322103551 |
Kurtosis | -1.211707003 |
Mean | 6.476744186 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 0.005611039523 |
Sum | 2785 |
Variance | 11.88174229 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10 | 36 | |
9 | 36 | |
8 | 36 | |
7 | 36 | |
6 | 36 | |
5 | 36 | |
4 | 36 | |
3 | 36 | |
2 | 36 | |
1 | 36 | |
Other values (2) | 70 |
Value | Count | Frequency (%) |
1 | 36 | |
2 | 36 | |
3 | 36 | |
4 | 36 | |
5 | 36 | |
6 | 36 | |
7 | 36 | |
8 | 36 | |
9 | 36 | |
10 | 36 |
Value | Count | Frequency (%) |
12 | 35 | |
11 | 35 | |
10 | 36 | |
9 | 36 | |
8 | 36 | |
7 | 36 | |
6 | 36 | |
5 | 36 | |
4 | 36 | |
3 | 36 |
Distinct | 430 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 220299.7558 |
Minimum | 119208 |
---|---|
Maximum | 400219 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.5 KiB |
Quantile statistics
Minimum | 119208 |
---|---|
5-th percentile | 154744.3 |
Q1 | 173878.25 |
median | 202056.5 |
Q3 | 255772.25 |
95-th percentile | 349502.25 |
Maximum | 400219 |
Range | 281011 |
Interquartile range (IQR) | 81894 |
Descriptive statistics
Standard deviation | 59441.36299 |
---|---|
Coefficient of variation (CV) | 0.2698203762 |
Kurtosis | 0.4264539035 |
Mean | 220299.7558 |
Median Absolute Deviation (MAD) | 33502.5 |
Skewness | 1.031730681 |
Sum | 94728895 |
Variance | 3533275634 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
200702 | 1 | 0.2% |
177488 | 1 | 0.2% |
355670 | 1 | 0.2% |
177372 | 1 | 0.2% |
249843 | 1 | 0.2% |
357344 | 1 | 0.2% |
342747 | 1 | 0.2% |
252374 | 1 | 0.2% |
180712 | 1 | 0.2% |
167272 | 1 | 0.2% |
Other values (420) | 420 |
Value | Count | Frequency (%) |
119208 | 1 | |
126417 | 1 | |
140894 | 1 | |
141200 | 1 | |
143301 | 1 | |
145708 | 1 | |
146698 | 1 | |
146868 | 1 | |
147061 | 1 | |
149297 | 1 |
Value | Count | Frequency (%) |
400219 | 1 | |
397298 | 1 | |
396329 | 1 | |
395900 | 1 | |
388984 | 1 | |
386725 | 1 | |
377169 | 1 | |
376362 | 1 | |
371949 | 1 | |
369644 | 1 |
Product_Supplied
Real number (ℝ≥0)
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
Distinct | 429 |
---|---|
Distinct (%) | 99.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 574218.2163 |
Minimum | 436455 |
---|---|
Maximum | 671648 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.5 KiB |
Quantile statistics
Minimum | 436455 |
---|---|
5-th percentile | 499344.05 |
Q1 | 537078.5 |
median | 579533 |
Q3 | 610163.5 |
95-th percentile | 641859.5 |
Maximum | 671648 |
Range | 235193 |
Interquartile range (IQR) | 73085 |
Descriptive statistics
Standard deviation | 45528.3279 |
---|---|
Coefficient of variation (CV) | 0.07928750187 |
Kurtosis | -0.6464705951 |
Mean | 574218.2163 |
Median Absolute Deviation (MAD) | 36325.5 |
Skewness | -0.2964651742 |
Sum | 246913833 |
Variance | 2072828641 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
597976 | 2 | 0.5% |
627838 | 1 | 0.2% |
634167 | 1 | 0.2% |
579867 | 1 | 0.2% |
617749 | 1 | 0.2% |
577827 | 1 | 0.2% |
622884 | 1 | 0.2% |
557349 | 1 | 0.2% |
529703 | 1 | 0.2% |
589098 | 1 | 0.2% |
Other values (419) | 419 |
Value | Count | Frequency (%) |
436455 | 1 | |
453209 | 1 | |
457486 | 1 | |
473425 | 1 | |
477269 | 1 | |
478339 | 1 | |
480896 | 1 | |
481482 | 1 | |
484163 | 1 | |
485354 | 1 |
Value | Count | Frequency (%) |
671648 | 1 | |
666357 | 1 | |
664448 | 1 | |
662039 | 1 | |
658099 | 1 | |
655895 | 1 | |
651864 | 1 | |
651790 | 1 | |
651274 | 1 | |
649104 | 1 |
Refinery_Input
Real number (ℝ≥0)
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
Distinct | 411 |
---|---|
Distinct (%) | 95.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15007.91395 |
Minimum | 11759 |
---|---|
Maximum | 18041 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.5 KiB |
Quantile statistics
Minimum | 11759 |
---|---|
5-th percentile | 13074.9 |
Q1 | 14049.25 |
median | 15093.5 |
Q3 | 15803.75 |
95-th percentile | 17042.1 |
Maximum | 18041 |
Range | 6282 |
Interquartile range (IQR) | 1754.5 |
Descriptive statistics
Standard deviation | 1238.984141 |
---|---|
Coefficient of variation (CV) | 0.08255538676 |
Kurtosis | -0.5409443404 |
Mean | 15007.91395 |
Median Absolute Deviation (MAD) | 876 |
Skewness | 0.01498363446 |
Sum | 6453403 |
Variance | 1535081.701 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
14538 | 3 | 0.7% |
14783 | 2 | 0.5% |
13383 | 2 | 0.5% |
13097 | 2 | 0.5% |
13356 | 2 | 0.5% |
14637 | 2 | 0.5% |
15638 | 2 | 0.5% |
15000 | 2 | 0.5% |
14693 | 2 | 0.5% |
15768 | 2 | 0.5% |
Other values (401) | 409 |
Value | Count | Frequency (%) |
11759 | 1 | |
12068 | 1 | |
12237 | 1 | |
12417 | 1 | |
12583 | 1 | |
12603 | 1 | |
12637 | 1 | |
12725 | 1 | |
12742 | 1 | |
12753 | 1 |
Value | Count | Frequency (%) |
18041 | 1 | |
17969 | 1 | |
17833 | 1 | |
17749 | 1 | |
17699 | 1 | |
17688 | 1 | |
17687 | 1 | |
17659 | 1 | |
17562 | 1 | |
17527 | 1 |
Operable_Dist_Capacity
Real number (ℝ≥0)
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
Distinct | 242 |
---|---|
Distinct (%) | 56.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 16841.56279 |
Minimum | 15028 |
---|---|
Maximum | 18976 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.5 KiB |
Quantile statistics
Minimum | 15028 |
---|---|
5-th percentile | 15186 |
Q1 | 15686.25 |
median | 16764 |
Q3 | 17736 |
95-th percentile | 18619.2 |
Maximum | 18976 |
Range | 3948 |
Interquartile range (IQR) | 2049.75 |
Descriptive statistics
Standard deviation | 1161.536555 |
---|---|
Coefficient of variation (CV) | 0.06896845441 |
Kurtosis | -1.331455769 |
Mean | 16841.56279 |
Median Absolute Deviation (MAD) | 1056 |
Skewness | 0.06030327715 |
Sum | 7241872 |
Variance | 1349167.17 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
18808 | 12 | 2.8% |
16747 | 11 | 2.6% |
17736 | 10 | 2.3% |
17594 | 8 | 1.9% |
18598 | 7 | 1.6% |
16978 | 6 | 1.4% |
17672 | 6 | 1.4% |
17820 | 5 | 1.2% |
15722 | 5 | 1.2% |
18976 | 4 | 0.9% |
Other values (232) | 356 |
Value | Count | Frequency (%) |
15028 | 2 | |
15058 | 1 | |
15105 | 1 | |
15121 | 1 | |
15129 | 1 | |
15133 | 1 | |
15137 | 1 | |
15139 | 1 | |
15140 | 1 | |
15142 | 1 |
Value | Count | Frequency (%) |
18976 | 4 | 0.9% |
18808 | 12 | |
18641 | 1 | 0.2% |
18622 | 3 | 0.7% |
18621 | 2 | 0.5% |
18617 | 2 | 0.5% |
18603 | 3 | 0.7% |
18601 | 2 | 0.5% |
18598 | 7 | |
18571 | 1 | 0.2% |
Operating_Dist_Capacity
Real number (ℝ≥0)
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
Distinct | 364 |
---|---|
Distinct (%) | 84.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 16419.06512 |
Minimum | 14375 |
---|---|
Maximum | 18698 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.5 KiB |
Quantile statistics
Minimum | 14375 |
---|---|
5-th percentile | 14824.45 |
Q1 | 15117 |
median | 16564 |
Q3 | 17227.5 |
95-th percentile | 18450.2 |
Maximum | 18698 |
Range | 4323 |
Interquartile range (IQR) | 2110.5 |
Descriptive statistics
Standard deviation | 1210.49159 |
---|---|
Coefficient of variation (CV) | 0.07372475723 |
Kurtosis | -1.228651902 |
Mean | 16419.06512 |
Median Absolute Deviation (MAD) | 1110 |
Skewness | 0.1045932304 |
Sum | 7060198 |
Variance | 1465289.889 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
16711 | 5 | 1.2% |
16921 | 4 | 0.9% |
16643 | 4 | 0.9% |
16904 | 4 | 0.9% |
16134 | 3 | 0.7% |
17150 | 3 | 0.7% |
15097 | 3 | 0.7% |
15081 | 3 | 0.7% |
17464 | 3 | 0.7% |
15117 | 3 | 0.7% |
Other values (354) | 395 |
Value | Count | Frequency (%) |
14375 | 1 | |
14411 | 1 | |
14517 | 1 | |
14538 | 1 | |
14550 | 1 | |
14607 | 1 | |
14639 | 1 | |
14649 | 1 | |
14662 | 1 | |
14691 | 1 |
Value | Count | Frequency (%) |
18698 | 1 | |
18692 | 2 | |
18621 | 1 | |
18567 | 1 | |
18561 | 2 | |
18549 | 1 | |
18528 | 2 | |
18526 | 1 | |
18523 | 1 | |
18520 | 1 |
Idle_Dist_Capacity
Real number (ℝ≥0)
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
Distinct | 314 |
---|---|
Distinct (%) | 73.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 422.527907 |
Minimum | 32 |
---|---|
Maximum | 2651 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.5 KiB |
Quantile statistics
Minimum | 32 |
---|---|
5-th percentile | 75 |
Q1 | 158.25 |
median | 321.5 |
Q3 | 617.75 |
95-th percentile | 949.2 |
Maximum | 2651 |
Range | 2619 |
Interquartile range (IQR) | 459.5 |
Descriptive statistics
Standard deviation | 337.8876697 |
---|---|
Coefficient of variation (CV) | 0.7996813088 |
Kurtosis | 9.433540024 |
Mean | 422.527907 |
Median Absolute Deviation (MAD) | 186.5 |
Skewness | 2.156327909 |
Sum | 181687 |
Variance | 114168.0773 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
135 | 5 | 1.2% |
139 | 5 | 1.2% |
138 | 5 | 1.2% |
146 | 5 | 1.2% |
152 | 4 | 0.9% |
153 | 4 | 0.9% |
75 | 4 | 0.9% |
81 | 4 | 0.9% |
57 | 4 | 0.9% |
35 | 4 | 0.9% |
Other values (304) | 386 |
Value | Count | Frequency (%) |
32 | 2 | |
35 | 4 | |
36 | 2 | |
37 | 2 | |
45 | 1 | 0.2% |
49 | 1 | 0.2% |
50 | 1 | 0.2% |
57 | 4 | |
73 | 2 | |
74 | 2 |
Value | Count | Frequency (%) |
2651 | 1 | |
2569 | 1 | |
2331 | 1 | |
1488 | 1 | |
1483 | 1 | |
1478 | 1 | |
1283 | 1 | |
1244 | 1 | |
1107 | 1 | |
1103 | 1 |
Distinct | 163 |
---|---|
Distinct (%) | 37.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 89.15976744 |
Minimum | 70.2 |
---|---|
Maximum | 99.9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.5 KiB |
Quantile statistics
Minimum | 70.2 |
---|---|
5-th percentile | 81.3 |
Q1 | 86 |
median | 89.5 |
Q3 | 92.7 |
95-th percentile | 96.31 |
Maximum | 99.9 |
Range | 29.7 |
Interquartile range (IQR) | 6.7 |
Descriptive statistics
Standard deviation | 4.918490392 |
---|---|
Coefficient of variation (CV) | 0.05516490826 |
Kurtosis | 0.5971587334 |
Mean | 89.15976744 |
Median Absolute Deviation (MAD) | 3.3 |
Skewness | -0.5674442978 |
Sum | 38338.7 |
Variance | 24.19154773 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
90.6 | 10 | 2.3% |
86.5 | 8 | 1.9% |
92.5 | 7 | 1.6% |
92.6 | 7 | 1.6% |
88.9 | 6 | 1.4% |
95 | 6 | 1.4% |
90.2 | 6 | 1.4% |
85.8 | 6 | 1.4% |
87.1 | 6 | 1.4% |
89.1 | 6 | 1.4% |
Other values (153) | 362 |
Value | Count | Frequency (%) |
70.2 | 1 | |
70.8 | 1 | |
72 | 1 | |
74.6 | 1 | |
75.3 | 1 | |
75.9 | 1 | |
76.4 | 1 | |
76.9 | 1 | |
77.9 | 1 | |
78.6 | 1 |
Value | Count | Frequency (%) |
99.9 | 1 | 0.2% |
99.6 | 1 | 0.2% |
99.2 | 1 | 0.2% |
99.1 | 1 | 0.2% |
98.9 | 1 | 0.2% |
98.4 | 1 | 0.2% |
97.8 | 1 | 0.2% |
97.5 | 3 | |
97.2 | 1 | 0.2% |
97.1 | 3 |
Distinct | 306 |
---|---|
Distinct (%) | 71.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 44.57232558 |
Minimum | 11.3 |
---|---|
Maximum | 134 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.5 KiB |
Quantile statistics
Minimum | 11.3 |
---|---|
5-th percentile | 14.945 |
Q1 | 19.9 |
median | 33.05 |
Q3 | 63.5 |
95-th percentile | 100.555 |
Maximum | 134 |
Range | 122.7 |
Interquartile range (IQR) | 43.6 |
Descriptive statistics
Standard deviation | 28.69879024 |
---|---|
Coefficient of variation (CV) | 0.6438701562 |
Kurtosis | -0.3805579035 |
Mean | 44.57232558 |
Median Absolute Deviation (MAD) | 15.65 |
Skewness | 0.834881001 |
Sum | 19166.1 |
Variance | 823.620561 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
19.7 | 7 | 1.6% |
21.3 | 5 | 1.2% |
18.8 | 4 | 0.9% |
15.5 | 4 | 0.9% |
20 | 4 | 0.9% |
49.9 | 4 | 0.9% |
19.9 | 4 | 0.9% |
59.4 | 4 | 0.9% |
17.8 | 4 | 0.9% |
18 | 4 | 0.9% |
Other values (296) | 386 |
Value | Count | Frequency (%) |
11.3 | 1 | |
11.6 | 1 | |
12 | 1 | |
12.5 | 1 | |
12.6 | 1 | |
12.8 | 1 | |
13 | 1 | |
13.4 | 2 | |
13.7 | 1 | |
13.8 | 1 |
Value | Count | Frequency (%) |
134 | 1 | |
133.5 | 1 | |
125.5 | 1 | |
116.7 | 1 | |
112.5 | 1 | |
110 | 1 | |
106.5 | 1 | |
106.2 | 2 | |
105.4 | 1 | |
105.1 | 1 |
Distinct | 309 |
---|---|
Distinct (%) | 71.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 44.55837209 |
Minimum | 11.3 |
---|---|
Maximum | 133.9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.5 KiB |
Quantile statistics
Minimum | 11.3 |
---|---|
5-th percentile | 14.945 |
Q1 | 19.9 |
median | 33.3 |
Q3 | 63.65 |
95-th percentile | 100.665 |
Maximum | 133.9 |
Range | 122.6 |
Interquartile range (IQR) | 43.75 |
Descriptive statistics
Standard deviation | 28.68330756 |
---|---|
Coefficient of variation (CV) | 0.6437243151 |
Kurtosis | -0.3744934178 |
Mean | 44.55837209 |
Median Absolute Deviation (MAD) | 15.9 |
Skewness | 0.8375564287 |
Sum | 19160.1 |
Variance | 822.7321325 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
21.3 | 6 | 1.4% |
20.1 | 6 | 1.4% |
19.7 | 5 | 1.2% |
17.9 | 5 | 1.2% |
18 | 5 | 1.2% |
19.9 | 5 | 1.2% |
20.3 | 4 | 0.9% |
59 | 3 | 0.7% |
71 | 3 | 0.7% |
49.8 | 3 | 0.7% |
Other values (299) | 385 |
Value | Count | Frequency (%) |
11.3 | 1 | |
11.6 | 1 | |
12 | 1 | |
12.5 | 1 | |
12.6 | 1 | |
12.8 | 1 | |
13 | 1 | |
13.4 | 1 | |
13.5 | 1 | |
13.7 | 1 |
Value | Count | Frequency (%) |
133.9 | 1 | |
133.4 | 1 | |
125.4 | 1 | |
116.7 | 1 | |
112.6 | 1 | |
109.5 | 1 | |
106.6 | 1 | |
106.3 | 1 | |
106.2 | 1 | |
105.8 | 1 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
Date | Year | Month | Total_Production | Product_Supplied | Refinery_Input | Operable_Dist_Capacity | Operating_Dist_Capacity | Idle_Dist_Capacity | Percent_Util | Future_Price | Spot_Price | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | Jan-1986 | 1986 | 1 | 283248 | 498728 | 12583 | 15459 | 14639 | 820 | 81.4 | 23.0 | 22.9 |
1 | Feb-1986 | 1986 | 2 | 256855 | 453209 | 12068 | 15485 | 14538 | 947 | 77.9 | 15.5 | 15.5 |
2 | Mar-1986 | 1986 | 3 | 279413 | 504565 | 11759 | 15485 | 14517 | 968 | 75.9 | 12.6 | 12.6 |
3 | Apr-1986 | 1986 | 4 | 265917 | 478339 | 12603 | 15473 | 14550 | 923 | 81.5 | 12.8 | 12.8 |
4 | May-1986 | 1986 | 5 | 273964 | 495789 | 13314 | 15484 | 14805 | 679 | 86.0 | 15.3 | 15.4 |
5 | Jun-1986 | 1986 | 6 | 258700 | 481482 | 13347 | 15465 | 14649 | 816 | 86.3 | 13.4 | 13.4 |
6 | Jul-1986 | 1986 | 7 | 268448 | 505514 | 13009 | 15475 | 14607 | 868 | 84.1 | 11.6 | 11.6 |
7 | Aug-1986 | 1986 | 8 | 259580 | 515167 | 13392 | 15430 | 14807 | 624 | 86.8 | 15.1 | 15.1 |
8 | Sep-1986 | 1986 | 9 | 249843 | 477269 | 13191 | 15435 | 14870 | 565 | 85.5 | 14.9 | 14.9 |
9 | Oct-1986 | 1986 | 10 | 260984 | 514674 | 12753 | 15435 | 14827 | 608 | 82.6 | 14.9 | 14.9 |
Last rows
Date | Year | Month | Total_Production | Product_Supplied | Refinery_Input | Operable_Dist_Capacity | Operating_Dist_Capacity | Idle_Dist_Capacity | Percent_Util | Future_Price | Spot_Price | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
420 | Jan-2021 | 2021 | 1 | 342747 | 576457 | 14975 | 18143 | 17735 | 408 | 82.5 | 52.1 | 52.0 |
421 | Feb-2021 | 2021 | 2 | 273646 | 488438 | 12804 | 18090 | 17526 | 564 | 70.8 | 59.1 | 59.0 |
422 | Mar-2021 | 2021 | 3 | 345946 | 595319 | 14834 | 18090 | 17035 | 1055 | 82.0 | 62.4 | 62.3 |
423 | Apr-2021 | 2021 | 4 | 336905 | 583781 | 15633 | 18128 | 17553 | 574 | 86.2 | 61.7 | 61.7 |
424 | May-2021 | 2021 | 5 | 351346 | 622903 | 16130 | 18128 | 17843 | 285 | 89.0 | 65.2 | 65.2 |
425 | Jun-2021 | 2021 | 6 | 338645 | 616115 | 16743 | 18128 | 17910 | 218 | 92.4 | 71.4 | 71.4 |
426 | Jul-2021 | 2021 | 7 | 351228 | 616714 | 16482 | 18129 | 17943 | 187 | 90.9 | 72.4 | 72.5 |
427 | Aug-2021 | 2021 | 8 | 347393 | 635828 | 16377 | 18130 | 17914 | 216 | 90.3 | 67.7 | 67.7 |
428 | Sep-2021 | 2021 | 9 | 324654 | 606706 | 15797 | 18130 | 15800 | 2331 | 87.1 | 71.5 | 71.6 |
429 | Oct-2021 | 2021 | 10 | 355670 | 616639 | 15581 | 18132 | 17133 | 999 | 85.9 | 81.2 | 81.5 |