Predictions of Book Checkouts
MAT 259, 2016
Guanyu Chen
Concept
For Project 2, it was a extension of my project 1 that building time-series models to make predictions of numbers of checkouts in different dewey classes (total classes are 10). The primary model used in project is ARIMA which is a classical predictive model in statistics.
Due to consistent time factor, the predicting process is to use numbers of checkouts in different dewey classes from 2006 to 2010 as training sets and build ARIMA models based on the original training set. Then, I applied model to make a prediction for Jan.2011. After that, I add true number of checkout in Jan.2011 into the training set and make a prediction for Feb.2011. So on and so forth, it is possible for me to collect comparsions of predicted data and true data points. During this process, I found that there were two missing values and one outlier in 2018. To deal with missing problems, I used Kalman Smoothing to perform imputation.
Query
Please put the queries or codes to grab the data here.
Preliminary sketches
My first sketch of imaging plot is all circle lines of true data and prediction data with prediction boundaries. After discussing with Prof.Legrady, I decided to create the shape from the second plot. It is kind of like a sphere. We can map all data points in 3D space and connect them with lines.
Process
I did not save too many drafts when processing my graph. At the begining of drawing curves, I only used line to connect different points. However, it is too ugly to compare predicted checkouts and true checkouts. Thus, to use curveVextex() function jumped into my mind. The plot shown below is my first successful draw with smoothing curves.
Final result
Code