External Correlation
MAT 259, 2013
Giovanni Sgambato

Introduction
I would like to compare the trend in the number of items with “Europe” in the title checked out from the Seattle Public Library (SPL) with the number of articles containing the word “Europe” published in the New York Times (NYT).

Process
I produced a double bar chart to visualize the SPL and NYT data and compare their trends over time.
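Because the two sources count different things, the bars are only comparable if each series is drawn on its own scale. A minimal sketch of one way to do that, written in Python rather than the Processing used for the project: the function name, the 200-pixel height, and the sample counts are all hypothetical, and scaling each series by its own maximum is an assumption, not necessarily what the final sketch does.

```python
def scale_to_height(values, max_height):
    """Scale values linearly so the largest one maps to max_height pixels."""
    peak = max(values) if values else 0
    if peak == 0:
        # Empty or all-zero series: draw nothing.
        return [0.0 for _ in values]
    return [v * max_height / peak for v in values]


# Hypothetical monthly counts, for illustration only.
spl_checkouts = [120, 150, 90]
nyt_articles = [400, 380, 500]

# Each series gets its own scale, so the shapes of the trends can be
# compared even though the absolute counts differ.
spl_bars = scale_to_height(spl_checkouts, 200)
nyt_bars = scale_to_height(nyt_articles, 200)
```

With this choice the chart compares the shape of the two trends, not their absolute sizes.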

Query
SELECT year(cout), month(cout), count(*)
FROM inraw
WHERE title LIKE '%europe%'
  AND cout > '2006-01-01'
  AND cout < '2012-01-01'
GROUP BY year(cout), month(cout)
ORDER BY year(cout), month(cout);
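The query returns one row per month that has at least one matching checkout, so a month with no matches would simply be missing. Before the SPL counts can be lined up with the NYT counts month by month, such gaps have to be filled. The following Python sketch shows one way to do that; the helper names and the sample rows are hypothetical, and it assumes the rows arrive as (year, month, count) tuples.

```python
def month_range(start_year, start_month, end_year, end_month):
    """Yield (year, month) pairs from start to end, inclusive."""
    year, month = start_year, start_month
    while (year, month) <= (end_year, end_month):
        yield (year, month)
        month += 1
        if month > 12:
            year, month = year + 1, 1


def fill_months(rows, start, end):
    """Turn (year, month, count) rows into a gap-free list of monthly counts."""
    counts = {(y, m): c for y, m, c in rows}
    # Months absent from the query result get a count of zero.
    return [counts.get(key, 0) for key in month_range(*start, *end)]


# Hypothetical rows in the shape the query returns (March 2006 is missing).
rows = [(2006, 1, 14), (2006, 2, 9), (2006, 4, 11)]
series = fill_months(rows, (2006, 1), (2006, 4))
```

For the date bounds in the query, the full range would run from (2006, 1) to (2011, 12), which is 72 months.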

Results and Analysis
In this project I explored methods to compare and correlate data from different sources and to display them effectively. I also spent time on data selection, filtering, and sorting, producing two queries to fit the data to the aim of the project. From the data I was able to get the number of checked-out items whose title contains the word “Europe” and compare it with the number of articles published in the New York Times.
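The comparison in this project is visual. If a single number were wanted as well, one common option is the Pearson correlation coefficient of the two monthly series. The Python sketch below is an illustration under that assumption, not something the project itself computes; the function name is hypothetical and the two series are assumed to be aligned month by month.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long series."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two series of equal length, at least 2 points")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / math.sqrt(var_x * var_y)
```

A value near 1 would mean the two counts rise and fall together, a value near -1 that they move in opposite directions, and a value near 0 that there is no linear relationship.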

Early Version

Final Version


Code
I used Processing and the New York Times Article Search API (http://developer.nytimes.com/docs/article_search_api/).


Run in Browser

Source Code


Control
Hovering the mouse over the data selects and highlights a specific period and displays the number of items checked out and the number of articles for that period.
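The hover selection comes down to mapping the mouse's x position to a period. A minimal Python sketch of that mapping, assuming equally wide bars laid out left to right; the function and parameter names are hypothetical and the actual Processing source may do this differently.

```python
def bar_index_at(mouse_x, chart_left, bar_width, bar_count):
    """Return the index of the bar under mouse_x, or None if outside the chart."""
    if bar_width <= 0:
        raise ValueError("bar_width must be positive")
    offset = mouse_x - chart_left
    if offset < 0:
        return None  # mouse is left of the chart
    index = int(offset // bar_width)
    # Anything at or beyond bar_count is right of the chart.
    return index if index < bar_count else None
```

The returned index can then be used to look up the same month in both the SPL and NYT series and to highlight the corresponding bars.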