Service platforms enable customers and users to download data in several open formats (CSV, JSON, etc.), so there is no risk of lock-in at the data level.
Socrata is a software-as-a-service platform that provides a cloud-based solution for open data publishing and visualization.
CKAN is an open source project, developed by the Open Knowledge Foundation, that lets users provision open data catalogs and, in some cases, visualizations and APIs.
Successful cases: Colombia open data portal, Mexico open data portal, Paraguay open data portal, etc.
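To make the point about open formats concrete, here is a minimal sketch of how a single dataset could be pulled from a Socrata portal as CSV or JSON. It assumes the usual Socrata resource URL pattern (`/resource/<id>.<format>`); the helper name `socrata_resource_url`, the domain `data.example.org` and the dataset ID `abcd-1234` are placeholders for illustration, not real identifiers.

```r
# Build a download URL for one dataset on a Socrata domain.
# `domain` and `dataset_id` are hypothetical placeholders.
socrata_resource_url <- function(domain, dataset_id, format = c("csv", "json")) {
  format <- match.arg(format)  # only "csv" or "json" are accepted here
  sprintf("https://%s/resource/%s.%s", domain, dataset_id, format)
}

csv_url  <- socrata_resource_url("data.example.org", "abcd-1234", "csv")
json_url <- socrata_resource_url("data.example.org", "abcd-1234", "json")

# With a real domain and dataset ID, base R can read the CSV directly:
# df <- read.csv(csv_url, stringsAsFactors = FALSE)
```

Because the same dataset is reachable in more than one open format, moving the data to another tool only requires changing the file extension in the URL.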
get_socrata_metada: Fetches detailed metadata for open datasets (extracts the JSON and a basic list of the datasets available from a Socrata domain).
search_data: Returns a list of the available datasets that match the given keywords/tags. The list contains four data frames, one each for tabular, geo, href and blobby data.
eda_opendata: Generates a basic report in R Markdown or Shiny for selected datasets from an open data portal.
Description Fetches detailed metadata for open datasets (extracts the JSON and a basic list of the datasets available from a Socrata domain).
Usage
get_socrata_metada(url)
Arguments
Details An R data frame containing a listing of datasets along with detailed metadata. The following fields are preserved for every open dataset:
Value The function returns a data frame with detailed information about the datasets.
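As a rough illustration of the idea behind this function (not its actual implementation), the sketch below turns a catalog JSON document into a data frame with one row per dataset. It assumes the `jsonlite` package is installed and that the catalog has a top-level `dataset` array whose entries carry `identifier`, `title` and `keyword` fields. Those field names, the helper `catalog_to_df` and the sample records are all assumptions for illustration.

```r
library(jsonlite)  # assumed to be installed; not named in the text above

# Tiny stand-in for the catalog JSON a portal might return (made-up records).
sample_json <- '{
  "dataset": [
    {"identifier": "abcd-1234", "title": "School enrolment",
     "keyword": ["education", "schools"]},
    {"identifier": "efgh-5678", "title": "Hospital beds",
     "keyword": ["health"]}
  ]
}'

# Hypothetical helper: flatten the catalog into one row per dataset.
catalog_to_df <- function(json_text) {
  catalog  <- fromJSON(json_text, simplifyVector = FALSE)
  datasets <- catalog$dataset
  data.frame(
    id   = vapply(datasets, function(d) d$identifier, character(1)),
    name = vapply(datasets, function(d) d$title, character(1)),
    # Collapse each keyword list into a single ";"-separated string.
    tags = vapply(datasets,
                  function(d) paste(unlist(d$keyword), collapse = ";"),
                  character(1)),
    stringsAsFactors = FALSE
  )
}

metadata <- catalog_to_df(sample_json)
```

For a live portal the JSON text would come from the portal's catalog URL instead of an inline string; the exact endpoint is not stated here, so it is left out of the sketch.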
Description Returns a list of the available datasets that match the given keywords/tags. The list contains four data frames, one each for tabular, geo, href and blobby data.
Usage
search_data(metadata, tags)
Arguments
Value The function returns a list of four data frames. The four data frames contain information on the tabular, geo, href and blobby data of the open data portal.
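The return shape described above can be sketched in base R as follows. This is an illustration under stated assumptions, not the package's code: the helper `search_by_tags`, the column names (`id`, `name`, `tags`, `viewType`) and the sample rows are all hypothetical.

```r
# Hypothetical metadata table; columns and rows are made up for illustration.
metadata <- data.frame(
  id       = c("aaaa-0001", "bbbb-0002", "cccc-0003", "dddd-0004", "eeee-0005"),
  name     = c("School enrolment", "Road network", "Budget report",
               "External health portal", "Hospital beds"),
  tags     = c("education;schools", "transport;maps", "budget;finance",
               "health", "health;hospitals"),
  viewType = c("tabular", "geo", "blobby", "href", "tabular"),
  stringsAsFactors = FALSE
)

search_by_tags <- function(metadata, tags) {
  # Keep rows whose tag string contains any requested tag as a substring
  # (case-insensitive). Tags are treated as regular expressions here.
  pattern <- paste(tags, collapse = "|")
  hits <- metadata[grepl(pattern, metadata$tags, ignore.case = TRUE), , drop = FALSE]

  # Split the hits into one data frame per view type, in a fixed order.
  view_types <- c("tabular", "geo", "href", "blobby")
  result <- lapply(view_types, function(v) hits[hits$viewType == v, , drop = FALSE])
  names(result) <- view_types
  result
}

found <- search_by_tags(metadata, c("health"))
```

Returning all four data frames even when some are empty keeps the result predictable: callers can always use `found$tabular` or `found$geo` without first checking which view types matched.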
Description Generates a basic report in R Markdown or Shiny for selected datasets from an open data portal.
Usage
eda_opendata(metadata, ids, tags)
Arguments
Value The function returns a list of four data frames. The four data frames contain information on the tabular, geo, href and blobby data of the open data portal.
viewType | count | percent |
---|---|---|
tabular | 3543 | 82.0 |
href | 574 | 13.3 |
blobby | 162 | 3.7 |
geo | 44 | 1.0 |
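
The percent column follows directly from the counts. A short check in base R, using only the figures in the table (the variable names are illustrative):

```r
# Counts per view type, copied from the table above.
counts <- c(tabular = 3543, href = 574, blobby = 162, geo = 44)

total   <- sum(counts)                      # total number of datasets
percent <- round(100 * counts / total, 1)   # share of each view type, in %
```

Here `total` is 4323, which matches `nbr.val` in the summary statistics, and `percent` reproduces the 82.0, 13.3, 3.7 and 1.0 shown in the table.
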
Stats | |
---|---|
nbr.val | 4323.0 |
nbr.null | 0.0 |
nbr.na | 0.0 |
min | 1.0 |
max | 25932.0 |
range | 25931.0 |
sum | 375981.0 |
median | 26.0 |
mean | 87.0 |
SE.mean | 10.9 |
CI.mean.0.95 | 21.4 |
var | 513229.0 |
std.dev | 716.4 |
coef.var | 8.2 |
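
Several rows of this table are derived from the others, which gives a quick consistency check. The sketch below recomputes them in base R from the reported values. It assumes that `CI.mean.0.95` is the half-width of a t-based 95% confidence interval for the mean, which the table itself does not state; the variable names are illustrative.

```r
# Values copied from the table above.
n       <- 4323      # nbr.val
total   <- 375981    # sum
min_v   <- 1         # min
max_v   <- 25932     # max
std_dev <- 716.4     # std.dev

range_v  <- max_v - min_v                       # range
mean_v   <- total / n                           # mean
se_mean  <- std_dev / sqrt(n)                   # SE.mean
ci_half  <- qt(0.975, df = n - 1) * se_mean     # CI.mean.0.95 (assumed definition)
variance <- std_dev^2                           # var, up to rounding of std.dev
coef_var <- std_dev / mean_v                    # coef.var

round(c(mean = mean_v, SE.mean = se_mean,
        CI.mean.0.95 = ci_half, coef.var = coef_var), 1)
```

Rounded to one decimal, these give 87.0, 10.9, 21.4 and 8.2, in line with the table. The median (26) sits far below the mean (87) and the maximum is 25932, so the distribution is strongly right-skewed.
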