ČSÚ

Náhled katalogu

czso_cat <- czso_get_catalogue()
ℹ Reading data from data.gov.cz
✓ Done downloading and reading data
ℹ Transforming data
glimpse(czso_cat, width = 110)
Rows: 723
Columns: 13
$ dataset_iri <chr> "https://data.gov.cz/zdroj/datové-sady/00025593/946049754/c420f3c02fcdb5bf0550bc5a7b95dd…
$ dataset_id  <chr> "https://vdb.czso.cz/pll/eweb/lkod_ld.datova_sada?nazev=Volby_do_Poslanecke_snemovny_Par…
$ title       <chr> "Volby do Poslanecké sněmovny Parlamentu ČR 2017 - výsledky za okres Mladá Boleslav a je…
$ provider    <chr> "Český statistický úřad", "Český statistický úřad", "Český statistický úřad", "Český sta…
$ description <chr> NA, "Číselník politické příslušnosti kandidátů", "Číselník zemí (CZEM) - agregace", "Har…
$ spatial     <chr> "https://linked.cuzk.cz/resource/ruian/stat/1", "https://linked.cuzk.cz/resource/ruian/s…
$ temporal    <chr> "https://data.gov.cz/zdroj/datové-sady/00025593/946049754/c420f3c02fcdb5bf0550bc5a7b95dd…
$ modified    <dttm> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA,…
$ page        <chr> "https://volby.cz/opendata/ps2017/PS2017_XML.htm", "https://www.volby.cz/opendata/ps2021…
$ periodicity <chr> "http://publications.europa.eu/resource/authority/frequency/NEVER", "http://publications…
$ start       <date> 2017-10-20, 2021-08-31, 1900-01-01, 1991-01-01, 2016-10-07, 2001-03-01, 2006-01-01, 190…
$ end         <date> 2017-10-21, 2021-08-31, 9999-09-09, 9999-09-09, 2016-10-08, 9999-09-09, 2006-12-31, 999…
$ keywords    <chr> "volby; výsledky voleb; Poslanecká sněmovna; okres Mladá Boleslav", "volby; Poslanecká s…
head(czso_cat, n = 30)
czso_cat %>% 
  filter(str_detect(title, "[Zz]emř")) %>% 
  select(dataset_id, title)

Načtení datové sady

czso_tbl <- czso_get_table("110080")
ℹ File already in '/Users/petr/czso_data/110080/', not downloading.
  Set `force_redownload = TRUE` if needed.
head(czso_tbl, n = 30)
glimpse(czso_tbl, width = 110)
Rows: 810
Columns: 14
$ idhod         <chr> "745958789", "745958790", "745958791", "745958792", "745958793", "745958794", "7459589…
$ hodnota       <dbl> 26211, 29026, 22729, 22266, 23955, 20271, 26033, 28873, 22496, 21997, 23652, 20042, 22…
$ stapro_kod    <chr> "5958", "5958", "5958", "5958", "5958", "5958", "5958", "5958", "5958", "5958", "5958"…
$ SPKVANTIL_cis <chr> NA, NA, NA, "7636", "7636", "7636", NA, NA, NA, "7636", "7636", "7636", "7636", NA, NA…
$ SPKVANTIL_kod <chr> NA, NA, NA, "Q5", "Q5", "Q5", NA, NA, NA, "Q5", "Q5", "Q5", "Q5", NA, NA, NA, "Q5", "Q…
$ POHLAVI_cis   <chr> NA, "102", "102", NA, "102", "102", NA, "102", "102", NA, "102", "102", NA, NA, "102",…
$ POHLAVI_kod   <chr> NA, "1", "2", NA, "1", "2", NA, "1", "2", NA, "1", "2", NA, NA, "1", "2", "1", "2", NA…
$ rok           <int> 2013, 2013, 2013, 2013, 2013, 2013, 2012, 2012, 2012, 2012, 2012, 2012, 2014, 2014, 20…
$ uzemi_cis     <chr> "97", "97", "97", "97", "97", "97", "97", "97", "97", "97", "97", "97", "97", "97", "9…
$ uzemi_kod     <chr> "19", "19", "19", "19", "19", "19", "19", "19", "19", "19", "19", "19", "19", "19", "1…
$ STAPRO_TXT    <chr> "Průměrná hrubá mzda na zaměstnance", "Průměrná hrubá mzda na zaměstnance", "Průměrná …
$ uzemi_txt     <chr> "Česká republika", "Česká republika", "Česká republika", "Česká republika", "Česká rep…
$ SPKVANTIL_txt <chr> NA, NA, NA, "medián", "medián", "medián", NA, NA, NA, "medián", "medián", "medián", "m…
$ POHLAVI_txt   <chr> NA, "muž", "žena", NA, "muž", "žena", NA, "muž", "žena", NA, "muž", "žena", NA, NA, "m…

Eurostat

Kódy odpovídají kódům tabulek zobrazených v https://ec.europa.eu/eurostat/data/database.

Kódy tabulek jsou pak v citacích jako odkazy, viz např. zdroj pod tabulkou v tomto dokumentu nebo zdroj pod grafem na této stránce.

Kódy navíc označují příslušnost sady ke skupině a další informace - jednotlivé části kódu odpovídají skupině („folder“), datové sadě, metrice, regionálnímu rozpadu: např. nama_10r_2gdp je HDP z národních účtů v regionálním rozpadu na NUTS2.

Katalog

es_toc <- get_eurostat_toc()
head(es_toc, n = 30)
glimpse(es_toc)
Rows: 10,499
Columns: 8
$ title                         <chr> "Database by themes", "General and regional statistics", "European and…
$ code                          <chr> "data", "general", "euroind", "ei_bcs", "ei_bcs_cs", "ei_bsco_m", "ei_…
$ type                          <chr> "folder", "folder", "folder", "folder", "folder", "dataset", "dataset"…
$ `last update of data`         <chr> NA, NA, NA, NA, NA, "30.08.2021", "30.08.2021", NA, "30.08.2021", "30.…
$ `last table structure change` <chr> NA, NA, NA, NA, NA, "30.08.2021", "29.07.2021", NA, "30.08.2021", "29.…
$ `data start`                  <chr> NA, NA, NA, NA, NA, "1980M01", "1990Q1", NA, "1980M01", "1980Q1", "198…
$ `data end`                    <chr> NA, NA, NA, NA, NA, "2021M08", "2021Q3", NA, "2021M08", "2021Q3", "202…
$ values                        <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA…
es_toc %>% 
  filter(str_detect(title, "Gross domestic")) %>% 
  select(code, title)

Datová sada

es_tbl <- get_eurostat("nama_10r_2gdp")
Table nama_10r_2gdp cached at /var/folders/c8/pj33jytj233g8vr0tw4b2h7m0000gn/T//RtmpwvDHfn/eurostat/nama_10r_2gdp_date_code_FF.rds
head(es_tbl, n = 30)

Katalog - python

import eurostat
import pandas as pd

es_toc = eurostat.get_toc_df()

es_toc
                                                   title  ... data end
0                                     Database by themes  ...         
1                        General and regional statistics  ...         
2      European and national indicators for short-ter...  ...         
3       Business and consumer surveys (source: DG ECFIN)  ...         
4                    Consumer surveys (source: DG ECFIN)  ...         
...                                                  ...  ...      ...
10494  Enterprises that provided training to develop/...  ...     2020
10495  Participation in education and training - cont...  ...         
10496  Enterprises providing training by type of trai...  ...     2015
10497  Participants in CVT courses by sex and size cl...  ...     2015
10498  Main skills targeted by CVT courses by type of...  ...     2015

[10499 rows x 7 columns]
es_toc[es_toc.title.str.contains("Gross domestic")][['code', 'title']]
                     code                                              title
193          reg_eco10gdp                  Gross domestic product indicators
194         nama_10r_2gdp  Gross domestic product (GDP) at current market...
196         nama_10r_3gdp  Gross domestic product (GDP) at current market...
452          met_10r_3gdp  Gross domestic product (GDP) at current market...
500          urt_10r_3gdp  Gross domestic product (GDP) at current market...
781      enps_nama_10_gdp            Gross domestic product at market prices
782   enps_nama_10_gdp_ea  Gross domestic product at market prices by exp...
943      enpe_nama_10_gdp            Gross domestic product at market prices
944   enpe_nama_10_gdp_ea  Gross domestic product at market prices by exp...
1027   enpe_rd_e_gerdfund           Gross domestic expenditure on R&D (GERD)
1034              med_ec1                             Gross domestic product
1200         nama_10r_gdp                  Gross domestic product indicators
1201        nama_10r_2gdp  Gross domestic product (GDP) at current market...
1202        nama_10r_3gdp  Gross domestic product (GDP) at current market...
7109                 rd_e  Gross domestic expenditure on R&D (GERD) at na...
7756             tgs00028  Gross domestic product, Candidate countries an...
7764             tec00001            Gross domestic product at market prices
7793             teina010             Gross domestic product, current prices
7797             teina011                    Gross domestic product, volumes
8852               tipsgd                       Gross domestic product (GDP)
8853             tipsau10  Gross domestic product (GDP) at market prices ...
8854             tipsau20  Gross domestic product (GDP) at market prices ...
8857             tipsst10  Gross domestic expenditure on research and dev...
8977            teina_gdp                             Gross domestic product
8978             teina010             Gross domestic product, current prices
8979             teina011                    Gross domestic product, volumes
9073             t2020_20           Gross domestic expenditure on R&D (GERD)
9242            sdg_09_10        Gross domestic expenditure on R&D by sector

Sada - python

eurostat.get_data_df('nama_10r_2gdp')
                   unit geo\time    2019    2018  ...  2003  2002  2001  2000
0               EUR_HAB       AL  4800.0  4500.0  ...   NaN   NaN   NaN   NaN
1               EUR_HAB      AL0  4800.0  4500.0  ...   NaN   NaN   NaN   NaN
2               EUR_HAB     AL01  3900.0  3600.0  ...   NaN   NaN   NaN   NaN
3               EUR_HAB     AL02  5700.0  5400.0  ...   NaN   NaN   NaN   NaN
4               EUR_HAB     AL03  4300.0  4000.0  ...   NaN   NaN   NaN   NaN
...                 ...      ...     ...     ...  ...   ...   ...   ...   ...
3073  PPS_HAB_EU27_2020     TRB2    25.0    25.0  ...   NaN   NaN   NaN   NaN
3074  PPS_HAB_EU27_2020      TRC    30.0    32.0  ...   NaN   NaN   NaN   NaN
3075  PPS_HAB_EU27_2020     TRC1    38.0    41.0  ...   NaN   NaN   NaN   NaN
3076  PPS_HAB_EU27_2020     TRC2    23.0    24.0  ...   NaN   NaN   NaN   NaN
3077  PPS_HAB_EU27_2020     TRC3    30.0    31.0  ...   NaN   NaN   NaN   NaN

[3078 rows x 22 columns]