Good afternoon all! First let us find out what our working directory is.

#getwd()

Today we are going to query the table quiz2 by using Transact-SQL inside the R chunks. First, we are going to need to install the package sqldf.

#install.packages("sqldf")
library(sqldf)
data1<-read.csv("quiz2.csv",header = TRUE,sep = ",")
summary(data1)

Let us run a query that returns every single record from the table data1.

sqldf("SELECT * FROM data1")

Now I would like you to create a query that returns every single record (row) for the fields region and population.

Let us now create a query that returns the first 10 records of the data1 table.

sqldf('SELECT * FROM data1 LIMIT 10')
sqldf("SELECT * FROM data1 ORDER BY infections DESC  LIMIT 5")

Interpret the output of the query ran above and run a similar query (limit 7) including the attributes ipadddr , ufo2010 ,and infections.

sqldf('SELECT region,infections FROM data1 WHERE infections > 1000 order by 2 desc')

Now I would like you to inspect (create a query) income,region and population when the condition is income>=50,000. Make comments.

sqldf('SELECT * FROM data1 WHERE (income > 50000 AND infections > 2000) OR pop > 100000')

Interpret the query ran above and create a query that recreates a similar case scenario (one that is critical for the success of this mini-project).

Let us run a query that returns every region and population with a number of infections between 1000 and 7000.

sqldf('SELECT region,pop FROM data1 WHERE infections between  1000 and 7000')

Interpret the query below and list a hypothetical case scenario in which this query would be useful.

sqldf('SELECT * FROM data1 WHERE subregion LIKE "A%"')

Last,let us find the average number of infections in our table.

sqldf("SELECT AVG(infections) FROM data1")

Let us inspect the new table defined below data2.

data2<-read.csv("finalexam.csv",header = TRUE,sep = ",")

Let us apply summary to data2 and examine the output.

summary(data2)

Next, I would like you to complete the following steps:

1-Run a query that returns full details for every record from the west region.

2-Run a query that returns full details for every record from the east region where the number of infections is greater than 1500.

3-Run a few queries that can potentially help us to make the point that: a.Region affects the number of infections b.Population affects the number of infections c.Income affects the number of infections d.ufo2010 affects the number of infections e. The different interactions of region and income, and income and population affects the number of infections.

Explain every single query you include in this part of the handout.

LS0tDQp0aXRsZTogIlIgU1FMIg0Kb3V0cHV0OiBodG1sX25vdGVib29rDQotLS0NCg0KR29vZCBhZnRlcm5vb24gYWxsISBGaXJzdCBsZXQgdXMgZmluZCBvdXQgd2hhdCBvdXIgd29ya2luZyBkaXJlY3RvcnkgaXMuDQoNCmBgYHtyfQ0KI2dldHdkKCkNCmBgYA0KDQpUb2RheSB3ZSBhcmUgZ29pbmcgdG8gcXVlcnkgdGhlIHRhYmxlIHF1aXoyIGJ5IHVzaW5nIFRyYW5zYWN0LVNRTCBpbnNpZGUgdGhlIFIgY2h1bmtzLiBGaXJzdCwgd2UgYXJlIGdvaW5nIHRvIG5lZWQgdG8gaW5zdGFsbCB0aGUgcGFja2FnZSAqKnNxbGRmKiouDQoNCmBgYHtyfQ0KI2luc3RhbGwucGFja2FnZXMoInNxbGRmIikNCmBgYA0KDQoNCmBgYHtyfQ0KbGlicmFyeShzcWxkZikNCmBgYA0KDQoNCmBgYHtyfQ0KZGF0YTE8LXJlYWQuY3N2KCJxdWl6Mi5jc3YiLGhlYWRlciA9IFRSVUUsc2VwID0gIiwiKQ0KYGBgDQoNCg0KYGBge3J9DQpzdW1tYXJ5KGRhdGExKQ0KYGBgDQoNCkxldCB1cyBydW4gYSBxdWVyeSB0aGF0IHJldHVybnMgZXZlcnkgc2luZ2xlIHJlY29yZCBmcm9tIHRoZSB0YWJsZSBkYXRhMS4NCg0KYGBge3J9DQpzcWxkZigiU0VMRUNUICogRlJPTSBkYXRhMSIpDQpgYGANCg0KTm93IEkgd291bGQgbGlrZSB5b3UgdG8gY3JlYXRlIGEgcXVlcnkgdGhhdCByZXR1cm5zIGV2ZXJ5IHNpbmdsZSByZWNvcmQgKHJvdykgZm9yIHRoZSBmaWVsZHMgcmVnaW9uIGFuZCBwb3B1bGF0aW9uLg0KDQpMZXQgdXMgbm93IGNyZWF0ZSBhIHF1ZXJ5IHRoYXQgcmV0dXJucyB0aGUgZmlyc3QgMTAgcmVjb3JkcyBvZiB0aGUgZGF0YTEgdGFibGUuDQoNCmBgYHtyfQ0Kc3FsZGYoJ1NFTEVDVCAqIEZST00gZGF0YTEgTElNSVQgMTAnKQ0KYGBgDQoNCg0KYGBge3J9DQpzcWxkZigiU0VMRUNUICogRlJPTSBkYXRhMSBPUkRFUiBCWSBpbmZlY3Rpb25zIERFU0MgIExJTUlUIDUiKQ0KYGBgDQoNCkludGVycHJldCB0aGUgb3V0cHV0IG9mIHRoZSBxdWVyeSByYW4gYWJvdmUgYW5kIHJ1biBhIHNpbWlsYXIgcXVlcnkgKGxpbWl0IDcpIGluY2x1ZGluZyB0aGUgYXR0cmlidXRlcyAqKmlwYWRkZHIqKiAsICoqdWZvMjAxMCoqICxhbmQgKippbmZlY3Rpb25zKiouDQoNCmBgYHtyfQ0Kc3FsZGYoJ1NFTEVDVCByZWdpb24saW5mZWN0aW9ucyBGUk9NIGRhdGExIFdIRVJFIGluZmVjdGlvbnMgPiAxMDAwIG9yZGVyIGJ5IDIgZGVzYycpDQpgYGANCg0KTm93IEkgd291bGQgbGlrZSB5b3UgdG8gaW5zcGVjdCAoY3JlYXRlIGEgcXVlcnkpIGluY29tZSxyZWdpb24gYW5kIHBvcHVsYXRpb24gd2hlbiB0aGUgY29uZGl0aW9uIGlzIGluY29tZT49NTAsMDAwLiBNYWtlIGNvbW1lbnRzLg0KDQpgYGB7cn0NCnNxbGRmKCdTRUxFQ1QgKiBGUk9NIGRhdGExIFdIRVJFIChpbmNvbWUgPiA1MDAwMCBBTkQgaW5mZWN0aW9ucyA+IDIwMDApIE9SIHBvcCA+IDEwMDAwMCcpDQpgYGANCg0KSW50ZXJwcmV0IHRoZSBxdWVyeSByYW4gYWJvdmUgYW5kIGNyZWF0ZSBhIHF1ZXJ5IHRoYXQgcmVjcmVhdGVzIGEgc2ltaWxhciBjYXNlIHNjZW5hcmlvIChvbmUgdGhhdCBpcyBjcml0aWNhbCBmb3IgdGhlIHN1Y2Nlc3Mgb2YgdGhpcyBtaW5pLXByb2plY3QpLg0KDQpMZXQgdXMgcnVuIGEgcXVlcnkgdGhhdCByZXR1cm5zIGV2ZXJ5IHJlZ2lvbiBhbmQgcG9wdWxhdGlvbiB3aXRoIGEgbnVtYmVyIG9mIGluZmVjdGlvbnMgYmV0d2VlbiAxMDAwIGFuZCA3MDAwLiANCg0KYGBge3J9DQpzcWxkZignU0VMRUNUIHJlZ2lvbixwb3AgRlJPTSBkYXRhMSBXSEVSRSBpbmZlY3Rpb25zIGJldHdlZW4gIDEwMDAgYW5kIDcwMDAnKQ0KYGBgDQoNCkludGVycHJldCB0aGUgcXVlcnkgYmVsb3cgYW5kIGxpc3QgYSBoeXBvdGhldGljYWwgY2FzZSBzY2VuYXJpbyBpbiB3aGljaCB0aGlzIHF1ZXJ5IHdvdWxkIGJlIHVzZWZ1bC4NCg0KYGBge3J9DQpzcWxkZignU0VMRUNUICogRlJPTSBkYXRhMSBXSEVSRSBzdWJyZWdpb24gTElLRSAiQSUiJykNCmBgYA0KDQpMYXN0LGxldCB1cyBmaW5kIHRoZSBhdmVyYWdlIG51bWJlciBvZiBpbmZlY3Rpb25zIGluIG91ciB0YWJsZS4NCg0KYGBge3J9DQpzcWxkZigiU0VMRUNUIEFWRyhpbmZlY3Rpb25zKSBGUk9NIGRhdGExIikNCmBgYA0KDQpMZXQgdXMgaW5zcGVjdCB0aGUgbmV3IHRhYmxlIGRlZmluZWQgYmVsb3cgZGF0YTIuDQoNCmBgYHtyfQ0KZGF0YTI8LXJlYWQuY3N2KCJmaW5hbGV4YW0uY3N2IixoZWFkZXIgPSBUUlVFLHNlcCA9ICIsIikNCmBgYA0KDQpMZXQgdXMgYXBwbHkgc3VtbWFyeSB0byBkYXRhMiBhbmQgZXhhbWluZSB0aGUgb3V0cHV0Lg0KYGBge3J9DQpzdW1tYXJ5KGRhdGEyKQ0KYGBgDQoNCk5leHQsIEkgd291bGQgbGlrZSB5b3UgdG8gY29tcGxldGUgdGhlIGZvbGxvd2luZyBzdGVwczoNCg0KMS1SdW4gYSBxdWVyeSB0aGF0IHJldHVybnMgZnVsbCBkZXRhaWxzIGZvciBldmVyeSByZWNvcmQgZnJvbSB0aGUgd2VzdCByZWdpb24uDQoNCjItUnVuIGEgcXVlcnkgdGhhdCByZXR1cm5zIGZ1bGwgZGV0YWlscyBmb3IgZXZlcnkgcmVjb3JkIGZyb20gdGhlIGVhc3QgcmVnaW9uIHdoZXJlIHRoZSBudW1iZXIgb2YgaW5mZWN0aW9ucyBpcyBncmVhdGVyIHRoYW4gMTUwMC4NCg0KMy1SdW4gYSBmZXcgcXVlcmllcyB0aGF0IGNhbiBwb3RlbnRpYWxseSBoZWxwIHVzIHRvIG1ha2UgdGhlIHBvaW50IHRoYXQ6DQogICBhLlJlZ2lvbiBhZmZlY3RzIHRoZSBudW1iZXIgb2YgaW5mZWN0aW9ucw0KICAgYi5Qb3B1bGF0aW9uIGFmZmVjdHMgdGhlIG51bWJlciBvZiBpbmZlY3Rpb25zDQogICBjLkluY29tZSBhZmZlY3RzIHRoZSBudW1iZXIgb2YgaW5mZWN0aW9ucw0KICAgZC51Zm8yMDEwIGFmZmVjdHMgdGhlIG51bWJlciBvZiBpbmZlY3Rpb25zDQogICBlLiBUaGUgZGlmZmVyZW50IGludGVyYWN0aW9ucyBvZiByZWdpb24gYW5kIGluY29tZSwgYW5kIGluY29tZSBhbmQgcG9wdWxhdGlvbiBhZmZlY3RzIHRoZSBudW1iZXIgb2YgaW5mZWN0aW9ucy4NCiAgIA0KRXhwbGFpbiBldmVyeSBzaW5nbGUgcXVlcnkgeW91IGluY2x1ZGUgaW4gdGhpcyBwYXJ0IG9mIHRoZSBoYW5kb3V0Lg0KDQoNCg0K