Assignment – Working with XML and JSON in R

Overview

The goal of today’s lab is to load the same data from three different sources. From this we can learn the differences between each file type and find out if there are any differences in what the loaded data looks like.

library(tidyverse)
library(openintro)
library(xml2)

## Warning: package 'xml2' was built under R version 4.4.3

library(XML)
library(rvest)
library(rjson)
library(readr)
library(jsonlite)

## Warning: package 'jsonlite' was built under R version 4.4.3

library(plyr)
library(arsenal)

## Warning: package 'arsenal' was built under R version 4.4.3

library(reshape2)

Load files

The data extraction portion of this lab is important because it depends on the form of the data: html, xml, or json. Loading the html and using “html_table” on it creates an untidy, but usable data frame. Loading the xml using “read_xml”, “xmlParse”, and “xmlToList” turns it into a wide data frame. The “xmlToDataFrame” function did not work here because of the two author tags. The json file turned into a properly formed data frame at first glance, but it actually combined any books with multiple authors into rows where authors is a vector.

html_url <- 'https://raw.githubusercontent.com/Megabuster/Data607/refs/heads/main/data/assignment6/books.html'
xml_url <- 'https://raw.githubusercontent.com/Megabuster/Data607/refs/heads/main/data/assignment6/books.xml'
json_url <- 'https://raw.githubusercontent.com/Megabuster/Data607/refs/heads/main/data/assignment6/books.json'

raw_html_df <- read_html(html_url) %>% html_table() %>% as.data.frame()
raw_html_df

##                                                       title
## 1 American Woman's Home: Or, Principles of Domestic Science
## 2                                                          
## 3                          The handbook of soap manufacture
## 4                                                          
## 5                      Elevator Systems of the Eiffel Tower
##                     author language ebook_release_year         copyright_status
## 1 Catharine Esther Beecher  English               2004 Public domain in the USA
## 2    Harriet Beecher Stowe                          NA                         
## 3            W. H. Simmons  English               2007 Public domain in the USA
## 4           H. A. Appleton                          NA                         
## 5          Robert M. Vogel  English               2010 Public domain in the USA

raw_xml <- read_xml(xml_url)  %>% xmlParse(xml_url)
raw_xml_df <- xmlToList(raw_xml) %>% ldply(data.frame)
raw_xml_df

##   .id                                                     title
## 1 row American Woman's Home: Or, Principles of Domestic Science
## 2 row                          The handbook of soap manufacture
## 3 row                      Elevator Systems of the Eiffel Tower
##                     author              author.1 language ebook_release_year
## 1 Catharine Esther Beecher Harriet Beecher Stowe  English               2004
## 2            W. H. Simmons        H. A. Appleton  English               2007
## 3          Robert M. Vogel                  <NA>  English               2010
##           copyright_status
## 1 Public domain in the USA
## 2 Public domain in the USA
## 3 Public domain in the USA

raw_json_df <- fromJSON(json_url, flatten=TRUE)
raw_json_df

##                                                       title
## 1 American Woman's Home: Or, Principles of Domestic Science
## 2                          The handbook of soap manufacture
## 3                      Elevator Systems of the Eiffel Tower
##                                            author language ebook_release_year
## 1 Catharine Esther Beecher, Harriet Beecher Stowe  English               2004
## 2                   W. H. Simmons, H. A. Appleton  English               2007
## 3                                 Robert M. Vogel  English               2010
##           copyright_status
## 1 Public domain in the USA
## 2 Public domain in the USA
## 3 Public domain in the USA

Compare initial data frames

Let’s compare the loaded data. I’m choosing now to do a comparison because this is roughly the first instance where all three instances of the data can be compared since they are now data frames. There are three data frames, thus three comparisons to compare them all.

samp_summary <- summary(comparedf(raw_html_df, raw_xml_df))
head(samp_summary)

## $frame.summary.table
##   version         arg ncol nrow
## 1       x raw_html_df    5    5
## 2       y  raw_xml_df    7    3
## 
## $comparison.summary.table
##                                                      statistic value
## 1                                       Number of by-variables     0
## 2                         Number of non-by variables in common     5
## 3                                 Number of variables compared     4
## 4                           Number of variables in x but not y     0
## 5                           Number of variables in y but not x     2
## 6        Number of variables compared with some values unequal     4
## 7           Number of variables compared with all values equal     0
## 8                             Number of observations in common     3
## 9                        Number of observations in x but not y     2
## 10                       Number of observations in y but not x     0
## 11 Number of observations with some compared variables unequal     2
## 12    Number of observations with all compared variables equal     1
## 13                                    Number of values unequal     6
## 
## $vars.ns.table
##   version variable position     class
## 7       y      .id        1 character
## 8       y author.1        4 character
## 
## $vars.nc.table
##                var.x pos.x class.x              var.y pos.y   class.y
## 4 ebook_release_year     4 integer ebook_release_year     6 character
## 
## $obs.table
##   version ..row.names.. observation
## 4       x             4           4
## 5       x             5           5
## 
## $diffs.byvar.table
##              var.x            var.y n NAs
## 1            title            title 2   0
## 2           author           author 2   0
## 3         language         language 1   0
## 4 copyright_status copyright_status 1   0

comparedf(raw_html_df, raw_xml_df)

## Compare Object
## 
## Function Call: 
## comparedf(x = raw_html_df, y = raw_xml_df)
## 
## Shared: 5 non-by variables and 3 observations.
## Not shared: 2 variables and 2 observations.
## 
## Differences found in 4/4 variables compared.
## 0 variables compared have non-identical attributes.

comparedf(raw_html_df, raw_json_df)

## Compare Object
## 
## Function Call: 
## comparedf(x = raw_html_df, y = raw_json_df)
## 
## Shared: 5 non-by variables and 3 observations.
## Not shared: 0 variables and 2 observations.
## 
## Differences found in 4/4 variables compared.
## 0 variables compared have non-identical attributes.

comparedf(raw_xml_df, raw_json_df)

## Compare Object
## 
## Function Call: 
## comparedf(x = raw_xml_df, y = raw_json_df)
## 
## Shared: 5 non-by variables and 3 observations.
## Not shared: 2 variables and 0 observations.
## 
## Differences found in 0/3 variables compared.
## 0 variables compared have non-identical attributes.

The results of the comparisons are heavily dependent on the loading processes chosen for each file format. In this case, it is not surprising that all three data frames are very different. However, there are some similarities. “Shared: 5 non-by variables and 3 observations.” The observations were consistently found. A snippet of a summary is shown as well. For larger data frames where it may be difficult to see differences, the “summary” function in tandem with “comparef” provides far more details that can be analyzed.

Fix data frames

In order to be complete with this example, let’s see the process of fixing the data frames so that they are tidy and actually comparable.

The html data frame needs values to be imputed to not have missing data if some books have multiple authors.

html_df <- raw_html_df %>% mutate_at(c('title', 'language', 'copyright_status'), ~na_if(., '')) %>% 
  fill(c(title, language, ebook_release_year, copyright_status), .direction = 'down')
html_df

##                                                       title
## 1 American Woman's Home: Or, Principles of Domestic Science
## 2 American Woman's Home: Or, Principles of Domestic Science
## 3                          The handbook of soap manufacture
## 4                          The handbook of soap manufacture
## 5                      Elevator Systems of the Eiffel Tower
##                     author language ebook_release_year         copyright_status
## 1 Catharine Esther Beecher  English               2004 Public domain in the USA
## 2    Harriet Beecher Stowe  English               2004 Public domain in the USA
## 3            W. H. Simmons  English               2007 Public domain in the USA
## 4           H. A. Appleton  English               2007 Public domain in the USA
## 5          Robert M. Vogel  English               2010 Public domain in the USA

Change the xml data frame to a longer form using “melt” and exclude rows that expect extra authors.

xml_df <- raw_xml_df %>%
  select(colnames(raw_xml_df)[2:7]) %>%
  melt(id.vars = c('title', 'language', 'ebook_release_year', 'copyright_status'), value.name = 'author', na.rm = TRUE) 
xml_df$variable = NULL 
xml_df

##                                                       title language
## 1 American Woman's Home: Or, Principles of Domestic Science  English
## 2                          The handbook of soap manufacture  English
## 3                      Elevator Systems of the Eiffel Tower  English
## 4 American Woman's Home: Or, Principles of Domestic Science  English
## 5                          The handbook of soap manufacture  English
##   ebook_release_year         copyright_status                   author
## 1               2004 Public domain in the USA Catharine Esther Beecher
## 2               2007 Public domain in the USA            W. H. Simmons
## 3               2010 Public domain in the USA          Robert M. Vogel
## 4               2004 Public domain in the USA    Harriet Beecher Stowe
## 5               2007 Public domain in the USA           H. A. Appleton

Simply unnest the author column for the json data frame to tidy it.

json_df <- raw_json_df %>% unnest(cols = c(author))
json_df

## # A tibble: 5 × 5
##   title                      author language ebook_release_year copyright_status
##   <chr>                      <chr>  <chr>                 <int> <chr>           
## 1 American Woman's Home: Or… Catha… English                2004 Public domain i…
## 2 American Woman's Home: Or… Harri… English                2004 Public domain i…
## 3 The handbook of soap manu… W. H.… English                2007 Public domain i…
## 4 The handbook of soap manu… H. A.… English                2007 Public domain i…
## 5 Elevator Systems of the E… Rober… English                2010 Public domain i…

Compare final data frames

The data frames are now effectively the same for most meaningful analyses. The html and json based data frames are equal. The xml data frame shows a slight difference from the other two because the column order is different.

samp_summary_final <- summary(comparedf(html_df, xml_df))
head(samp_summary_final)

## $frame.summary.table
##   version     arg ncol nrow
## 1       x html_df    5    5
## 2       y  xml_df    5    5
## 
## $comparison.summary.table
##                                                      statistic value
## 1                                       Number of by-variables     0
## 2                         Number of non-by variables in common     5
## 3                                 Number of variables compared     4
## 4                           Number of variables in x but not y     0
## 5                           Number of variables in y but not x     0
## 6        Number of variables compared with some values unequal     2
## 7           Number of variables compared with all values equal     2
## 8                             Number of observations in common     5
## 9                        Number of observations in x but not y     0
## 10                       Number of observations in y but not x     0
## 11 Number of observations with some compared variables unequal     4
## 12    Number of observations with all compared variables equal     1
## 13                                    Number of values unequal     8
## 
## $vars.ns.table
## [1] version  variable position class   
## <0 rows> (or 0-length row.names)
## 
## $vars.nc.table
##                var.x pos.x class.x              var.y pos.y   class.y
## 4 ebook_release_year     4 integer ebook_release_year     3 character
## 
## $obs.table
## [1] version       ..row.names.. observation  
## <0 rows> (or 0-length row.names)
## 
## $diffs.byvar.table
##              var.x            var.y n NAs
## 1            title            title 4   0
## 2           author           author 4   0
## 3         language         language 0   0
## 4 copyright_status copyright_status 0   0

comparedf(html_df, xml_df)

## Compare Object
## 
## Function Call: 
## comparedf(x = html_df, y = xml_df)
## 
## Shared: 5 non-by variables and 5 observations.
## Not shared: 0 variables and 0 observations.
## 
## Differences found in 2/4 variables compared.
## 0 variables compared have non-identical attributes.

comparedf(html_df, json_df)

## Compare Object
## 
## Function Call: 
## comparedf(x = html_df, y = json_df)
## 
## Shared: 5 non-by variables and 5 observations.
## Not shared: 0 variables and 0 observations.
## 
## Differences found in 0/5 variables compared.
## 0 variables compared have non-identical attributes.

comparedf(xml_df, json_df)

## Compare Object
## 
## Function Call: 
## comparedf(x = xml_df, y = json_df)
## 
## Shared: 5 non-by variables and 5 observations.
## Not shared: 0 variables and 0 observations.
## 
## Differences found in 2/4 variables compared.
## 0 variables compared have non-identical attributes.

Conclusions

There are many ways to load data stored in different file types into R. Some methods are able to create data frames immediately. The “xmlToDataFrame” had some issues dealing with there being two author tags. All options loaded the data, just in different forms. With some tidying, each data file could eventually become equivalent data frames.

There were also multiple forms the data could have within the original files. These were choices that I had to make myself. I could have nested some of the data deeper into the HTML or XML files. This would be akin to the work needed to derive such data from websites where I do not have control over the form of the data hosted. Further work could be to practice this process with various websites.

LS0tDQp0aXRsZTogIkFzc2lnbm1lbnQg4oCTIFdvcmtpbmcgd2l0aCBYTUwgYW5kIEpTT04gaW4gUiINCmF1dGhvcjogIkxhd3JlbmNlIFl1Ig0KZGF0ZTogImByIFN5cy5EYXRlKClgIg0Kb3V0cHV0OiBvcGVuaW50cm86OmxhYl9yZXBvcnQNCi0tLQ0KDQojIyMgT3ZlcnZpZXcNCg0KVGhlIGdvYWwgb2YgdG9kYXkncyBsYWIgaXMgdG8gbG9hZCB0aGUgc2FtZSBkYXRhIGZyb20gdGhyZWUgZGlmZmVyZW50IHNvdXJjZXMuIEZyb20gdGhpcyB3ZSBjYW4gbGVhcm4gdGhlIGRpZmZlcmVuY2VzIGJldHdlZW4gZWFjaCBmaWxlIHR5cGUgYW5kIGZpbmQgb3V0IGlmIHRoZXJlIGFyZSBhbnkgZGlmZmVyZW5jZXMgaW4gd2hhdCB0aGUgbG9hZGVkIGRhdGEgbG9va3MgbGlrZS4NCg0KYGBge3IgbG9hZC1wYWNrYWdlcywgbWVzc2FnZT1GQUxTRX0NCmxpYnJhcnkodGlkeXZlcnNlKQ0KbGlicmFyeShvcGVuaW50cm8pDQpsaWJyYXJ5KHhtbDIpDQpsaWJyYXJ5KFhNTCkNCmxpYnJhcnkocnZlc3QpDQpsaWJyYXJ5KHJqc29uKQ0KbGlicmFyeShyZWFkcikNCmxpYnJhcnkoanNvbmxpdGUpDQpsaWJyYXJ5KHBseXIpDQpsaWJyYXJ5KGFyc2VuYWwpDQpsaWJyYXJ5KHJlc2hhcGUyKQ0KYGBgDQoNCg0KIyMjIExvYWQgZmlsZXMNCg0KVGhlIGRhdGEgZXh0cmFjdGlvbiBwb3J0aW9uIG9mIHRoaXMgbGFiIGlzIGltcG9ydGFudCBiZWNhdXNlIGl0IGRlcGVuZHMgb24gdGhlIGZvcm0gb2YgdGhlIGRhdGE6IGh0bWwsIHhtbCwgb3IganNvbi4gTG9hZGluZyB0aGUgaHRtbCBhbmQgdXNpbmcgImh0bWxfdGFibGUiIG9uIGl0IGNyZWF0ZXMgYW4gdW50aWR5LCBidXQgdXNhYmxlIGRhdGEgZnJhbWUuIExvYWRpbmcgdGhlIHhtbCB1c2luZyAicmVhZF94bWwiLCAieG1sUGFyc2UiLCBhbmQgInhtbFRvTGlzdCIgdHVybnMgaXQgaW50byBhIHdpZGUgZGF0YSBmcmFtZS4gVGhlICJ4bWxUb0RhdGFGcmFtZSIgZnVuY3Rpb24gZGlkIG5vdCB3b3JrIGhlcmUgYmVjYXVzZSBvZiB0aGUgdHdvIGF1dGhvciB0YWdzLiBUaGUganNvbiBmaWxlIHR1cm5lZCBpbnRvIGEgcHJvcGVybHkgZm9ybWVkIGRhdGEgZnJhbWUgYXQgZmlyc3QgZ2xhbmNlLCBidXQgaXQgYWN0dWFsbHkgY29tYmluZWQgYW55IGJvb2tzIHdpdGggbXVsdGlwbGUgYXV0aG9ycyBpbnRvIHJvd3Mgd2hlcmUgYXV0aG9ycyBpcyBhIHZlY3Rvci4gDQpgYGB7ciBsb2FkLWZpbGVzfQ0KaHRtbF91cmwgPC0gJ2h0dHBzOi8vcmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbS9NZWdhYnVzdGVyL0RhdGE2MDcvcmVmcy9oZWFkcy9tYWluL2RhdGEvYXNzaWdubWVudDYvYm9va3MuaHRtbCcNCnhtbF91cmwgPC0gJ2h0dHBzOi8vcmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbS9NZWdhYnVzdGVyL0RhdGE2MDcvcmVmcy9oZWFkcy9tYWluL2RhdGEvYXNzaWdubWVudDYvYm9va3MueG1sJw0KanNvbl91cmwgPC0gJ2h0dHBzOi8vcmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbS9NZWdhYnVzdGVyL0RhdGE2MDcvcmVmcy9oZWFkcy9tYWluL2RhdGEvYXNzaWdubWVudDYvYm9va3MuanNvbicNCg0KcmF3X2h0bWxfZGYgPC0gcmVhZF9odG1sKGh0bWxfdXJsKSAlPiUgaHRtbF90YWJsZSgpICU+JSBhcy5kYXRhLmZyYW1lKCkNCnJhd19odG1sX2RmDQoNCnJhd194bWwgPC0gcmVhZF94bWwoeG1sX3VybCkgICU+JSB4bWxQYXJzZSh4bWxfdXJsKQ0KcmF3X3htbF9kZiA8LSB4bWxUb0xpc3QocmF3X3htbCkgJT4lIGxkcGx5KGRhdGEuZnJhbWUpDQpyYXdfeG1sX2RmDQoNCnJhd19qc29uX2RmIDwtIGZyb21KU09OKGpzb25fdXJsLCBmbGF0dGVuPVRSVUUpDQpyYXdfanNvbl9kZg0KYGBgDQoNCiMjIyBDb21wYXJlIGluaXRpYWwgZGF0YSBmcmFtZXMNCg0KTGV0J3MgY29tcGFyZSB0aGUgbG9hZGVkIGRhdGEuIEknbSBjaG9vc2luZyBub3cgdG8gZG8gYSBjb21wYXJpc29uIGJlY2F1c2UgdGhpcyBpcyByb3VnaGx5IHRoZSBmaXJzdCBpbnN0YW5jZSB3aGVyZSBhbGwgdGhyZWUgaW5zdGFuY2VzIG9mIHRoZSBkYXRhIGNhbiBiZSBjb21wYXJlZCBzaW5jZSB0aGV5IGFyZSBub3cgZGF0YSBmcmFtZXMuIFRoZXJlIGFyZSB0aHJlZSBkYXRhIGZyYW1lcywgdGh1cyB0aHJlZSBjb21wYXJpc29ucyB0byBjb21wYXJlIHRoZW0gYWxsLiANCmBgYHtyIGNvbXBhcmUtZGZ9DQpzYW1wX3N1bW1hcnkgPC0gc3VtbWFyeShjb21wYXJlZGYocmF3X2h0bWxfZGYsIHJhd194bWxfZGYpKQ0KaGVhZChzYW1wX3N1bW1hcnkpDQpjb21wYXJlZGYocmF3X2h0bWxfZGYsIHJhd194bWxfZGYpDQpjb21wYXJlZGYocmF3X2h0bWxfZGYsIHJhd19qc29uX2RmKQ0KY29tcGFyZWRmKHJhd194bWxfZGYsIHJhd19qc29uX2RmKQ0KYGBgDQoNClRoZSByZXN1bHRzIG9mIHRoZSBjb21wYXJpc29ucyBhcmUgaGVhdmlseSBkZXBlbmRlbnQgb24gdGhlIGxvYWRpbmcgcHJvY2Vzc2VzIGNob3NlbiBmb3IgZWFjaCBmaWxlIGZvcm1hdC4gSW4gdGhpcyBjYXNlLCBpdCBpcyBub3Qgc3VycHJpc2luZyB0aGF0IGFsbCB0aHJlZSBkYXRhIGZyYW1lcyBhcmUgdmVyeSBkaWZmZXJlbnQuIEhvd2V2ZXIsIHRoZXJlIGFyZSBzb21lIHNpbWlsYXJpdGllcy4gIlNoYXJlZDogNSBub24tYnkgdmFyaWFibGVzIGFuZCAzIG9ic2VydmF0aW9ucy4iIFRoZSBvYnNlcnZhdGlvbnMgd2VyZSBjb25zaXN0ZW50bHkgZm91bmQuIEEgc25pcHBldCBvZiBhIHN1bW1hcnkgaXMgc2hvd24gYXMgd2VsbC4gRm9yIGxhcmdlciBkYXRhIGZyYW1lcyB3aGVyZSBpdCBtYXkgYmUgZGlmZmljdWx0IHRvIHNlZSBkaWZmZXJlbmNlcywgdGhlICJzdW1tYXJ5IiBmdW5jdGlvbiBpbiB0YW5kZW0gd2l0aCAiY29tcGFyZWYiIHByb3ZpZGVzIGZhciBtb3JlIGRldGFpbHMgdGhhdCBjYW4gYmUgYW5hbHl6ZWQuIA0KDQojIyMgRml4IGRhdGEgZnJhbWVzDQoNCkluIG9yZGVyIHRvIGJlIGNvbXBsZXRlIHdpdGggdGhpcyBleGFtcGxlLCBsZXQncyBzZWUgdGhlIHByb2Nlc3Mgb2YgZml4aW5nIHRoZSBkYXRhIGZyYW1lcyBzbyB0aGF0IHRoZXkgYXJlIHRpZHkgYW5kIGFjdHVhbGx5IGNvbXBhcmFibGUuDQoNClRoZSBodG1sIGRhdGEgZnJhbWUgbmVlZHMgdmFsdWVzIHRvIGJlIGltcHV0ZWQgdG8gbm90IGhhdmUgbWlzc2luZyBkYXRhIGlmIHNvbWUgYm9va3MgaGF2ZSBtdWx0aXBsZSBhdXRob3JzLg0KYGBge3IgZml4LWh0bWx9DQpodG1sX2RmIDwtIHJhd19odG1sX2RmICU+JSBtdXRhdGVfYXQoYygndGl0bGUnLCAnbGFuZ3VhZ2UnLCAnY29weXJpZ2h0X3N0YXR1cycpLCB+bmFfaWYoLiwgJycpKSAlPiUgDQogIGZpbGwoYyh0aXRsZSwgbGFuZ3VhZ2UsIGVib29rX3JlbGVhc2VfeWVhciwgY29weXJpZ2h0X3N0YXR1cyksIC5kaXJlY3Rpb24gPSAnZG93bicpDQpodG1sX2RmDQpgYGANCg0KQ2hhbmdlIHRoZSB4bWwgZGF0YSBmcmFtZSB0byBhIGxvbmdlciBmb3JtIHVzaW5nICJtZWx0IiBhbmQgZXhjbHVkZSByb3dzIHRoYXQgZXhwZWN0IGV4dHJhIGF1dGhvcnMuDQpgYGB7ciBmaXgteG1sfQ0KeG1sX2RmIDwtIHJhd194bWxfZGYgJT4lDQogIHNlbGVjdChjb2xuYW1lcyhyYXdfeG1sX2RmKVsyOjddKSAlPiUNCiAgbWVsdChpZC52YXJzID0gYygndGl0bGUnLCAnbGFuZ3VhZ2UnLCAnZWJvb2tfcmVsZWFzZV95ZWFyJywgJ2NvcHlyaWdodF9zdGF0dXMnKSwgdmFsdWUubmFtZSA9ICdhdXRob3InLCBuYS5ybSA9IFRSVUUpIA0KeG1sX2RmJHZhcmlhYmxlID0gTlVMTCANCnhtbF9kZg0KYGBgDQoNClNpbXBseSB1bm5lc3QgdGhlIGF1dGhvciBjb2x1bW4gZm9yIHRoZSBqc29uIGRhdGEgZnJhbWUgdG8gdGlkeSBpdC4NCmBgYHtyIGZpeC1qc29ufQ0KanNvbl9kZiA8LSByYXdfanNvbl9kZiAlPiUgdW5uZXN0KGNvbHMgPSBjKGF1dGhvcikpDQpqc29uX2RmDQpgYGANCg0KIyMjIENvbXBhcmUgZmluYWwgZGF0YSBmcmFtZXMNCg0KVGhlIGRhdGEgZnJhbWVzIGFyZSBub3cgZWZmZWN0aXZlbHkgdGhlIHNhbWUgZm9yIG1vc3QgbWVhbmluZ2Z1bCBhbmFseXNlcy4gVGhlIGh0bWwgYW5kIGpzb24gYmFzZWQgZGF0YSBmcmFtZXMgYXJlIGVxdWFsLiBUaGUgeG1sIGRhdGEgZnJhbWUgc2hvd3MgYSBzbGlnaHQgZGlmZmVyZW5jZSBmcm9tIHRoZSBvdGhlciB0d28gYmVjYXVzZSB0aGUgY29sdW1uIG9yZGVyIGlzIGRpZmZlcmVudC4gDQpgYGB7ciBjb21wYXJlLWZpbmFsfQ0Kc2FtcF9zdW1tYXJ5X2ZpbmFsIDwtIHN1bW1hcnkoY29tcGFyZWRmKGh0bWxfZGYsIHhtbF9kZikpDQpoZWFkKHNhbXBfc3VtbWFyeV9maW5hbCkNCmNvbXBhcmVkZihodG1sX2RmLCB4bWxfZGYpDQpjb21wYXJlZGYoaHRtbF9kZiwganNvbl9kZikNCmNvbXBhcmVkZih4bWxfZGYsIGpzb25fZGYpDQpgYGANCg0KIyMjIENvbmNsdXNpb25zDQoNClRoZXJlIGFyZSBtYW55IHdheXMgdG8gbG9hZCBkYXRhIHN0b3JlZCBpbiBkaWZmZXJlbnQgZmlsZSB0eXBlcyBpbnRvIFIuIFNvbWUgbWV0aG9kcyBhcmUgYWJsZSB0byBjcmVhdGUgZGF0YSBmcmFtZXMgaW1tZWRpYXRlbHkuIFRoZSAieG1sVG9EYXRhRnJhbWUiIGhhZCBzb21lIGlzc3VlcyBkZWFsaW5nIHdpdGggdGhlcmUgYmVpbmcgdHdvIGF1dGhvciB0YWdzLiBBbGwgb3B0aW9ucyBsb2FkZWQgdGhlIGRhdGEsIGp1c3QgaW4gZGlmZmVyZW50IGZvcm1zLiBXaXRoIHNvbWUgdGlkeWluZywgZWFjaCBkYXRhIGZpbGUgY291bGQgZXZlbnR1YWxseSBiZWNvbWUgZXF1aXZhbGVudCBkYXRhIGZyYW1lcy4NCg0KVGhlcmUgd2VyZSBhbHNvIG11bHRpcGxlIGZvcm1zIHRoZSBkYXRhIGNvdWxkIGhhdmUgd2l0aGluIHRoZSBvcmlnaW5hbCBmaWxlcy4gVGhlc2Ugd2VyZSBjaG9pY2VzIHRoYXQgSSBoYWQgdG8gbWFrZSBteXNlbGYuIEkgY291bGQgaGF2ZSBuZXN0ZWQgc29tZSBvZiB0aGUgZGF0YSBkZWVwZXIgaW50byB0aGUgSFRNTCBvciBYTUwgZmlsZXMuIFRoaXMgd291bGQgYmUgYWtpbiB0byB0aGUgd29yayBuZWVkZWQgdG8gZGVyaXZlIHN1Y2ggZGF0YSBmcm9tIHdlYnNpdGVzIHdoZXJlIEkgZG8gbm90IGhhdmUgY29udHJvbCBvdmVyIHRoZSBmb3JtIG9mIHRoZSBkYXRhIGhvc3RlZC4gRnVydGhlciB3b3JrIGNvdWxkIGJlIHRvIHByYWN0aWNlIHRoaXMgcHJvY2VzcyB3aXRoIHZhcmlvdXMgd2Vic2l0ZXMuDQo=