Welcome to the REACH Team! We are delighted to have you on board as our new Data Officer/Assistant. Your role is vital in ensuring the effective management and utilization of data.
This welcome package is designed to provide you with essential information and resources to kick-start your journey with us. Please take the time to read through these materials and familiarize yourself with our organization and your role in it.
REACH was born in 2010 as a joint initiative of two International NGOs (IMPACT Initiatives and ACTED) and the United Nations Operational Satellite Applications Programme (UNOSAT). REACH’s purpose is to promote and facilitate the development of information products that enhance the humanitarian community’s decision making and planning capacity for emergency, reconstruction and development contexts. REACH facilitates information management for aid actors through three complementary services: (a) need and situation assessments facilitated by REACH teams; (b) situation analysis using satellite imagery; (c) provision of related database and (web)-mapping facilities and expertise.
IMPACT Initiatives is a humanitarian NGO based in Geneva, Switzerland. The organisation manages several initiatives, including the REACH Initiative. The IMPACT team comprises specialists in data collection, management and analysis, and GIS. IMPACT was launched at the initiative of ACTED, an international NGO headquartered in Paris and present in thirty countries. The two organizations have a strong complementarity, formalized in a global partnership that enables IMPACT to benefit from ACTED's operational support in its fields of intervention.
During your first weeks with IMPACT/REACH, several Unit/Team focal points will reach out to you to arrange induction meetings. Below are the main induction meetings and their facilitators.
Supervision of all data-related processes before, during and after data collection - questionnaire coding, monitoring of the data collection process, data quality control, and communication with the field and assessment teams on the progress and potential issues encountered;
Providing data analysis support to the IMPACT office including but not limited to data cleaning, data processing, data analysis and data visualization;
Ensuring timely delivery of cleaned datasets and overall data quality control;
Supporting the development of internal Information management procedures and guidelines.
The package is devoted to a number of functions used within the standard REACH Ukraine cleaning processes. To install it, run devtools::install_github('https://github.com/REACH-WoU/utilityR', build_vignettes = T). The package is composed of a number of function 'families', each dealing with a specific aspect of data manipulation. Please read the vignettes and the individual function documentation to discover more about each of the families in greater detail; you can browse the vignettes of this package by calling vignette(package='utilityR'). The text below presents a comprehensive overview of the cleaning process and how each of the functions is used within its respective framework. It follows the structure of the cleaning template presented in the markdown templates of this package.
1_cleaning.R file. Prior to running anything, fill in the directory_dictionary list with the relevant names. Load the API key file and run the init.R and load_Data.R files. Usually, no input from your side is needed for these two bits of script.
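For orientation, a directory_dictionary might look roughly like the sketch below. This is illustrative only: apart from enum_comments, dir.audits.check and dir.requests, which are referenced later in this walkthrough, the element names and paths are assumptions.

```r
# Illustrative sketch only - paths and most element names are assumptions
directory_dictionary <- list(
  enum_comments    = 'comment_enumerator',   # name of the enumerator comments column
  dir.audits.check = 'output/audit_checks/', # where audit-check outputs are written
  dir.requests     = 'output/requests/'      # where translation/_other requests are written
)
```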
The raw data are saved in the kobo.raw.main and kobo.raw.loopx objects (x is the number of your loop, e.g. kobo.raw.loop1), which are later duplicated into the raw.main and raw.loopx objects. This is convenient for cases where you want to re-run your cleaning procedure from scratch and don't want to waste time on reading your dataframes again: just run the rows that create the raw.main and raw.loopx objects and you'll be set.
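In practice the reset amounts to re-running assignments like the following (the loop names depend on your tool):

```r
# Re-create the working copies from the untouched Kobo objects
raw.main  <- kobo.raw.main
raw.loop1 <- kobo.raw.loop1  # repeat for every loop in your data
```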
If you need to modify your kobo tool in any way, please do so within
the src/sections/tool_modification.R file. This is reserved
for the cases where the tool was changed in the middle of data
collection.
All entries marked for deletion are kept in the
deletion.log.new object - this is our deletion log.
The initial step that the script goes through is the removal of duplicates. The algorithm for finding them in raw.main is standard: find the duplicated uuid entries and drop them. We cannot do this for the raw.loopx objects, as their unique identifier loop_index works as a row index, so we will never find any duplicates by investigating this column. However, each individual loop entry is assigned to some entry from the raw.main object through the uuid and parent_index columns. Each set of household members should have one unique uuid and parent_index. The script finds those entries that have more than one parent_index per unique uuid and marks them as duplicates.
At the end of this process, those entries are added into the
deletion.log.dupl object.
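A minimal dplyr sketch of this idea (not the package's exact implementation) could look like:

```r
library(dplyr)

# Flag loop entries whose uuid is linked to more than one parent_index -
# by the logic above, these are duplicated loop blocks
loop.duplicates <- raw.loop1 %>%
  group_by(uuid) %>%
  filter(n_distinct(parent_index) > 1) %>%
  ungroup()
```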
Important
The deletion.log.new and deletion.log.dupl objects are kept separate until we have added everything we need into the deletion.log.new object and deleted all of the needed entries from our dataset. These objects are merged only prior to writing the deletion.log excel file, after Section 3 of the cleaning script.
This is done because most other checks that we run test for the general invalidity of the survey and delete entries that match the uuid. If we were to merge the two objects together, the script would delete all entries that match the duplicate uuid index while ignoring the fact that:
1. While there may be two such entries in the data, that doesn't mean the entry itself is invalid; we just don't need two of them.
2. Deleting entries by uuid within the loop will cause the deletion of all loop entries that match this uuid, even if they're not duplicated.
No consent entries
The next bit of the script checks the no-consent entries in the data and requires the user's input. The user has to define the condition that classifies an entry as a no-consent entry and filter the dataframe by it, thus creating the no_consents data.frame object.
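For example, assuming the consent question is stored in a column called consent (an assumption; your tool may name it differently):

```r
library(dplyr)

# Hypothetical no-consent condition - adapt the column name and value to your tool
no_consents <- raw.main %>%
  filter(consent == 'no_consent')
```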
The last bit of this script tries to find test entries by parsing the enumerator's comments column that you've specified in the directory_dictionary$enum_comments element of the list, looking for entries that say 'test' in Ukrainian or English.
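Conceptually, this boils down to a pattern match along these lines (a sketch, not the script's exact code):

```r
library(dplyr)

# Find 'test' in English or Ukrainian ('тест') in the enumerator comments column
test.entries <- raw.main %>%
  filter(grepl('test|тест', .data[[directory_dictionary$enum_comments]],
               ignore.case = TRUE))
```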
If you want to add any other checks for general validity of the data,
you can add them here and merge the resulting cleaning log files into
the deletion.log.new object.
Audit checks
Prior to running the script, you'll have to specify the minimum and maximum time that the respondent can spend answering the questions. All interviews above/below these thresholds will be marked as suspicious. Additionally, some enumerators can spend too much time on a single question (consent, location, etc.) to make the interview seem longer than it actually was. You can smooth these interview times by passing the pre_process_audit_files = T argument and setting the max_length_answer_1_question parameter. This will make the script run the pre.process.audits function, which replaces these long times with the sample average time for answering the given question, excluding outliers. The uuid-question pairs that had their time values replaced will be tagged in the tag column, so that the user knows that something was wrong in that interview-question pair. This is a best practice, so we encourage users to do so.
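The relevant inputs might be set as follows. The threshold values and the min/max variable names are placeholders, not recommendations:

```r
# Thresholds for flagging suspicious interview durations (assumed variable names)
min_interview_time <- 10   # minutes - placeholder value
max_interview_time <- 60   # minutes - placeholder value

# Smooth unusually long single-question times
pre_process_audit_files      <- TRUE
max_length_answer_1_question <- 120  # seconds - placeholder value
```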
The analysis of audits will create an audits_summary excel file in the directory_dictionary$dir.audits.check directory. This file is your survey data plus audit-check columns such as:
- n.iteration - the number of iterations per interview (the number of times the user had to stop and then continue the interview)
- tot.t - total time of the interview, calculated as the start of the last form.exit event minus the start of the form.start event
- tot.rt - the total response time of the interview: the sum of the duration column in the loaded audits dataframe
- tot.rt.inter - the total time between questions in the interview: the sum of the inter_q_duration column in the loaded audits dataframe
- t - time of each iteration, calculated as the start of the iteration's form.exit event minus the start of the iteration's form.start event
- rt - response time of each iteration: the sum of the duration column in the loaded audits dataframe for the iteration
- q - number of questions per iteration
- j - number of jump events per iteration
- e - number of edits per iteration, calculated as the number of non-NA entries in the old.value column
- w - waiting time: the start column of the iteration's form.resume event minus the start column of the previous iteration's form.exit event
- tag - if you've pre-processed files, this column tags the uuid-question pairs that were outside of the set duration threshold

as well as NA, DK, and _other (open-text answer) columns.
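As an illustration of how the headline timings are derived (a sketch, assuming a loaded audits dataframe with the event, start and duration columns described above):

```r
# tot.t: start of the last form.exit event minus start of the form.start event
tot.t <- max(audits$start[audits$event == 'form.exit']) -
         min(audits$start[audits$event == 'form.start'])

# tot.rt: total response time - the sum of the duration column
tot.rt <- sum(audits$duration, na.rm = TRUE)
```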
All of the suspicious surveys will be written into the survey_durations file. After the script is done analysing these, you can browse the audits_summary excel file. If you decide to keep an entry despite it being in this file, delete the relevant excel row. Everything left within this file will be deleted when you run section_2_run_audit_decisions.R.
Soft Duplicates
The only input needed from the user for this bit of the script is the min_num_diff_questions parameter: the minimum number of differing columns that makes us confident that an entry is not a soft duplicate. The soft-duplicate algorithm is based on the Gower distance calculated for subsets of each individual enumerator's surveys.
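The underlying idea can be sketched with cluster::daisy, which computes Gower distances. This illustrates the concept only, not the package's code; the enumerator column name is an assumption, and character columns may need to be converted to factors first:

```r
library(dplyr)
library(cluster)

# Pairwise Gower distances between one enumerator's surveys
enum.subset <- raw.main %>% filter(enum_id == 'enum_01')  # 'enum_id' is assumed
gower.dist  <- as.matrix(daisy(select(enum.subset, -uuid), metric = 'gower'))

# Entries whose nearest neighbour differs in very few questions are
# candidate soft duplicates
```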
This analysis produces 4 outputs:
1. soft.duplicates excel - the dataframe that contains the most similar surveys per enumerator.
2. soft_duplicates_analysis excel - the summary file with statistics for soft duplicates per enumerator.
3. soft_duplicates_outliers excel - outlier enumerators that have the most soft duplicates.
4. enumerators_surveys pdf - a visualisation of the enumerators with outlier values in terms of similarity of surveys.
Once again, if you’re fine with some of these duplicates, remove them
from the soft_duplicates excel file in the
directory_dictionary$dir.audits.check directory. Everything
that is left in the excel will be deleted when you run the
section_2_run_audit_decisions.R.
Once you’ve looked through the excel files, double-checked everything
and left only those entries that you’d like to delete in audit and soft
duplicate files, run the section_2_run_audit_decisions.R
line in the cleaning script.
The only bit of manual entry that needs to be done when running this
file is filling the ids_incompl object. It’s supposed to
host the uuids of those surveys that are incomplete. If you don’t have
any such surveys, you can leave it blank.
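For example:

```r
# uuids of incomplete surveys; leave as c() if there are none
ids_incompl <- c('example-uuid-1', 'example-uuid-2')  # placeholders
```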
The spatial checks section checks for interviews with 0 geo coordinate precision. If these are present in the data, this may mean that the interviewer has installed a fake gps app onto their phone and has used it to fake the interview.
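As a rough sketch of what this check looks for (the precision column name is an assumption; the actual name depends on your tool's geopoint question):

```r
library(dplyr)

# Interviews whose reported GPS precision is exactly 0 are suspicious
zero.precision <- raw.main %>%
  filter(gps_precision == 0)  # 'gps_precision' is an assumed column name
```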
After this check is done, the deletion log is written into a geospatial_check excel file. Look through it and remove those uuids that you'd like to keep in the data. After you're done, run section_3_spatial_decisions.R and we're done with the deletion bit of the cleaning.
This section is the most hands-on part of this script. It is also the most complex one, so please take your time running it and be wary of any bugs, errors and warnings that you may get. Please go into the scripts themselves when running them instead of just sourcing them.
section_4_create_other_requests_files.R is the bit of the script that gathers all of the text columns from your kobo questionnaire and translates them. It creates two files, each having a different procedure applied to it. One file is dedicated to the _other requests; the other one works with the open-ended questions.
NOTE
Each of the abovementioned files is generated by its respective function in the get family - get.other.db and get.trans.db. If one of the functions didn't find the respective entries (meaning that you didn't follow the best practices when designing the Kobo tool), you can run the get.text.db function, which will get you all of the text questions in your kobo form. From that point you can transfer the omitted columns to either the other.db or trans.db objects prior to running the find.responses functions that follow them.
The first file that this script produces is the other_requests_final file. To produce the list of text questions that have _other response options, the script uses the get.other.db function. This function relies on the fact that in our data these questions have the _other suffix and have only one relevancy - their ref.name column - in the following form: selected(${ref.name}, 'other'). If there are multiple relevancies for a given _other column, or if the text column doesn't have the _other suffix, the variable may be omitted from the analysis.
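For instance, the ref.name behind an _other column can be pulled out of such a relevancy string with a simple regex. This illustrates the convention only, not the function's code; the question name is invented:

```r
relevancy <- "selected(${fuel_type}, 'other')"        # hypothetical relevancy string
ref.name  <- sub('.*\\$\\{([^}]+)\\}.*', '\\1', relevancy)
ref.name  # "fuel_type"
```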
This output file has the following structure:

| uuid | loop_index | name | ref.name | full.label | ref.type | choices.label | choices | response.uk |
|---|---|---|---|---|---|---|---|---|
| ID | loop_ID | variable_name_other | variable_name | variable label | select_one or select_multiple | the labels of all available choices | respondent's choices | the response in Ukrainian/Russian |

| response.en | true | existing | invalid | true_elsewhere | true_column | true_column_parent |
|---|---|---|---|---|---|---|
| The translated response in English | Whether the _other response is appropriate | Whether the _other response already exists within the choices.label column | Whether the response is invalid | If the response is appropriate but answers another question | The name of the _other question that it answers | The ref.name column for true_column |
After the file is created, the user's task is to open the excel file and look through the response.en column to see if the translation and the answer itself are appropriate.
The regular cases

Most of the time the user will be engaging with the true, existing and invalid columns.
- If the translation is good and the answer is appropriate to what was asked in the question stored in the full.label column, put the correct translation into the true column.
- If the answer that the respondent has given is already present in the choices.label column (meaning that the respondent didn't understand that such an option was already available), fill the existing column by pasting the exact appropriate option from the choices.label column. If you're working with a select_multiple question and the answer is appropriate for a few of the options in choices.label, you can add a few of them if you separate them with a semicolon - ;. Be careful when filling this column and double-check the choices column in your excel file. Sometimes the option that you want to put into the existing column has already been chosen by the respondent. This won't break the script, but you will get a warning.
- If the answer is invalid - as in, it's not related to the question being asked - type YES into the invalid column.
The elsewhere cases

The elsewhere case is reserved for occurrences when the response.en is inappropriate for the question asked in the full.label but can be appropriate for some other question in the survey, and you want to transfer that response into a new column. If you want to do this, you have to ensure the following:
1. The invalid column is filled with YES for this row.
2. You've inserted the correct translation into the true_elsewhere column.
3. You've inserted the correct _other column into the true_column column.
4. You've inserted the correct parent column into the true_column_parent column.
When you’re done with this, you can save the excel file and move on to the translation requests.
Prior to running the translation of the text responses, the user needs to specify two parameters:
- trans_cols_to_skip - a vector of columns that need to be omitted from the process and the translations. These may be columns of enumerator comments, names of interview locations, personal data of the respondent, etc. After these are specified, the user can run the get.trans.db function, which will return the trans.db object - a dataframe of variable names that are to be extracted from the data. This function is similar to the abovementioned get.other.db function, but it omits the _other questions.
- missing_vars - a dataframe containing the variables that are not present in trans.db and should be added to it. The user needs to specify the variable and its label.
After this, the user can continue running and translating the responses. This will produce the text_requests_final document in the directory_dictionary$dir.requests directory with the following structure:
| uuid | loop_index | name | responses | response.en | true | invalid |
|---|---|---|---|---|---|---|
| ID | loop_ID | variable_name | the response in Ukrainian/Russian | The translated response in English | Whether the response is appropriate | Whether the response is invalid |
After the file is created, the user's task is to open the excel file and look through the response.en column to see if the translation and the answer itself are appropriate. As with the _other requests: if the response is appropriate, put the correct translation into the true column; if it is invalid, type YES into the invalid column. When you're done with this, you can save the excel file and move on to applying the changes to the dataset.
When the user starts running the section_4_apply_changes_to_requests file, the script will go through a round of checks to see whether the other_requests_final file was filled properly. It will check:
- Whether the choices that the user has added in the existing column weren't already chosen by the respondent within the choices column (if they were, those entries will be removed from the requests file).
- Whether the entries were filled properly (only one column out of existing, true, invalid is filled). If the _other response is such that you need to recode the response into both existing and true, you can fill two columns at once, but you will get a warning to make sure you know what you're doing.
- Whether there are any empty rows.
- Whether the choices that the user has added in the existing column are actually present in the tool.choices object.
- Whether the user has any entries within the existing column that match the None criterion. These values include: None, Don't know, Prefer not to answer, etc. These entries need special treatment, as the user entering them means that all other replies to the given question are invalid except for the None reply. The check looks if any of your entries are similar to these and asks the user to make sure that the 'name' values of these choices (from the tool.choices object) are present in the none_selection object. By default, the object includes c('do_not_know', 'prefer_not_to_answer', 'none', 'none_of_the_above', 'dont_know', 'do_not_want_to_answer') and is passed to the recode_others function, which passes it internally to the recode.others_select_multiple function. If you need to add some other cases, or remove them, feel free to modify this object.
If those checks have passed, the script will split the requests file depending on whether the questions belong to the main or a loop dataframe. Each pair of objects (the dataframe and its relevant recode requests) will be passed through the recode.others function, which will create a cleaning log with the following set of changes for each case:
- If the reply is true, it'll replace the Ukrainian/Russian version of the _other response with the translated version.
- If the reply is invalid, the function will replace the text column value with an NA value. The value in the cumulative column will be changed to NA if the ref.name is a select_one. If ref.name is a select_multiple and some other choices were also chosen besides other, it'll remove the other response from the cumulative column and replace the value in the binary column ref.name/other with a 0. If the respondent has only chosen other when responding to ref.name, the function will replace the value of the cumulative column with NA and change the values of all binary columns to NA as well.
- If the reply is existing, the function will replace the text column value with an NA value, replace the other response in the cumulative column with the existing choice, recode the ref.name/other binary column to 0 and change the ref.name/existing column to 1.
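To make the existing case concrete, here is a hypothetical before/after for a select_multiple question fuel_type whose _other answer "gas" was recoded to the existing choice gas (all names are invented for illustration):

```r
# Before the 'existing' recode:
#   fuel_type        = "wood other"   (cumulative column)
#   fuel_type/other  = 1              (binary column for 'other')
#   fuel_type/gas    = 0              (binary column for the existing choice)
#   fuel_type_other  = "gas"          (text column)
#
# After the 'existing' recode:
#   fuel_type        = "wood gas"
#   fuel_type/other  = 0
#   fuel_type/gas    = 1
#   fuel_type_other  = NA
```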
Once this is done, the next bit of code deals with the elsewhere cases. This bit of the script creates another cleaning log that does the following:
- As the row is already marked as invalid, no changes need to be applied to the _other and ref.name columns.
- The value of the _other column specified in true_column will be changed to the value specified in true_elsewhere.
- The value of other will be added to true_column_parent.
- If the value of true_column_parent is a select_multiple, the binary column true_column_parent/other will also be changed to 1.
This process is applied to the raw.main and each
of the loops that you have in your environment.
Please note that the output of these functions is just the cleaning log. These changes haven't been applied to the data yet.
After this is done, we move on to the recode.relevancy
framework.
The recode.relevancy framework is created for cases where the ref.name you are recoding is a select_multiple that directs the respondent to other questions depending on their answers. For example, ref.name is a select_multiple asking the respondent to tell us what types of humanitarian aid they have received. After they reply, they are directed to a set of questions asking about the quality of said aid. If we are recoding the ref.name, it inevitably influences these relevant questions, so we have to recode them as well. This is where the recode.relevancy framework comes in.
The first step in this framework is filling up the vector of
select_multiple_list_relevancies. It stores the names of
select_multiple variables that open up other relevant
questions. The script then calls the find.relevances
function that searches for the binary columns of the select multiple
that open up each sub-question within the relevancy. This function
creates relevancy_dictionary - a table of relationships
between the variables in the following form.
| name | relevancy |
|---|---|
| variable/option1 | variable_option1_detail |
| variable/option2 | variable_option2_detail |
This object, together with the dataframe and its relevant cleaning log, is fed to the recode.other.relevances function. This function creates a cleaning log documenting the following changes:
- If the binary variable of _other was recoded to 0, it replaces the _detail variable with NA.
- If the _other response was classified as existing, the _detail variable for the _other choice will be replaced with NA and its answer will be transferred to the existingOption_detail variable.
Please note that as of now, this feature is experimental; should you encounter any bugs, please report them to the package's maintainer.
This process is applied to the raw.main and each
of the loops that you have in your environment.
Please note that the output of these functions is just the cleaning log. These changes haven't been applied to the data yet.
After all of the cleaning logs have been created, the changes outlined in those objects are applied to the datasets through the apply.changes function.
Since the changes needed for the translation requests are pretty basic, the cleaning log is created out of them at the stage of loading the dataset, through the recode.trans.requests function. If the response is deemed invalid, it is replaced with NA; if it's true, then the Ukrainian/Russian text is changed to English.
After these cleaning logs are created, the changes outlined in those
objects are applied to the datasets through the
apply.changes function.
The next bit of the recoding script checks your dataframe for leftover non-English characters. After the steps above, you shouldn't have any non-English characters left in your dataframe. First you need to specify the columns that will be omitted from this check in the vars_to_omit vector. The use of this object is the same as that of the trans_cols_to_skip object.
If any non-English characters have been found in your dataframes, they'll be stored in the cyrillic.main or cyrillic.loopx objects. It is up to you to decide what to do with them.
The final bit of the script runs select.multiple.check, which tries to find inconsistencies between the cumulative columns of select_multiple questions and their respective binary columns. The resulting file will show the differences between what is expected from the cumulative column and what is present in the binaries. It is left up to the user to decide what to do with these inconsistencies.
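The idea behind the check can be sketched as follows. This is a simplification (NAs are ignored) using a hypothetical select_multiple question fuel_type; the package's own logic may differ:

```r
# Compare the space-separated cumulative column with its binary columns
binary.cols <- grep('^fuel_type/', names(raw.main), value = TRUE)

expected <- strsplit(raw.main$fuel_type, ' ')  # choices per the cumulative column
actual   <- lapply(seq_len(nrow(raw.main)), function(i) {
  # choices per the binary columns equal to 1
  sub('^fuel_type/', '', binary.cols[raw.main[i, binary.cols] == 1])
})

# Rows where the two sets of choices disagree
inconsistent <- !mapply(setequal, expected, actual)
```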
The next bit of the script checks the dataframes for 99 and 999 values in numerical columns, as well as any other values that you're suspicious of. You can specify these values in the code_for_check vector. These values are suspicious because they are a relic of SPSS-based sociological research. As .sav files store values as numerics, it became necessary to assign NA or DK values a single code so that they are easily recognisable and can be recoded quickly. Usually, these are marked as 99, 98 or 999. This is not applicable for us, as we're not working in SPSS. This bit of the script produces a cl_log_999 object and an output/checking/999_diferences.xlsx excel file that document these entries. You can look through them and keep the rows that need to be recoded in your opinion.
Once you've deleted everything that needs to be deleted, you can apply these changes to the data and add them to the cleaning log by setting the apply_999_changes parameter to Yes and running section_5_finish_999_checks.R.
This section is dedicated to user-made checks for the general logical consistency across different columns. This is left for the user to fill in on their own as we cannot universalise this.
This section runs an algorithm over all numeric columns in the dataframe to see if any of the values are outside of the expected margins. The user needs to specify the following parameters:
- n.sd - the number of standard deviations used to determine if an entry is an outlier
- method - which method should be used to locate outliers. The function allows for the following methods:
  - method = 'o1' - method based on the Z-score and logarithmization of the values
  - method = 'o2' - modified Z-score based on the median absolute deviation, recommended n.sd (see the sketch after this list)
  - method = 'o3' - method based on the interquartile range
  - method = 'o4' - method based on the median absolute deviation
- ignore_0 - whether 0 should be ignored when calculating outliers (needs to be true for methods using logarithmic transformations of variables)
- cols.integer - a set of parameters specifying exactly which variables should be checked in a given dataframe. All numeric variables will be checked if these are left blank.
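A minimal sketch of the modified Z-score idea behind method 'o2' (the standard Iglewicz-Hoaglin formula, not necessarily the package's exact implementation; the column name is hypothetical):

```r
# Modified Z-score: 0.6745 * (x - median) / MAD
modified.z <- function(x) {
  0.6745 * (x - median(x, na.rm = TRUE)) /
    mad(x, constant = 1, na.rm = TRUE)
}

x <- raw.main$hh_income                    # 'hh_income' is a hypothetical column
outlier.rows <- which(abs(modified.z(x)) > n.sd)
```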
The script then runs the selected outlier detection algorithm and
writes the suspicious responses into cleaning.log.outliers
object and the outlier_analysis_ excel file. It also
creates a nice visualization of the distribution of the responses with
outliers highlighted in red in the outlier_analysis_ pdf
file.
The excel file will have a new column checked. It exists to allow the user to let HQ know that the outlier value was checked even if it's not fixed. If the outlier value is accurate and you wish to keep it in the dataset, set the value of the checked column to value checked; if you want to change the old value to a new one, specify it within the new.value column and set the checked column value to value corrected.
Now you can load up the clean excel file as the
cleaning.log.outliers_full object and run
section_6_finish_outlier_check.R.
The last section goes through the removal of PII columns, gathering the cleaning and deletion logs, building the submission excel for HQ validation and writing the submission package. Outside of specifying the pii.to.remove_main object that holds the names of the PII columns, barely any interaction is needed from the user's side. The file for HQ submission is written as a Cleaning_logbook excel in the output/Cleaning_logbook directory.
This is a template repository containing scripts and the necessary folder structure to get you started with building your tabular analysis.
- strings in run_analysis.R and format_dataset.R
- plenty of small improvements & additions, for example: for each question, the percentage of people who gave a response
- some functionality has been moved to separate R scripts:
  - fixing bugged choices and variable names -> fix_bugged_names.R
  - converting columns -> convert_cols_with_daf.R
  - adding grouping variables -> format_dataset.R

Hopefully this will make it easier to debug this script!
Default values are set in the init.R file when the DAF is being loaded, but some are also calculated per-entry in the load_entry function.
As before, provide the variable name which will be used to disaggregate. You can specify more than one by separating the names with a semicolon ( ; ).
Same behavior as before: all data will be grouped by the provided admin column, unless admin == "overall".
Major change: 'mean' and 'median' have been combined and renamed to 'numeric'. This function also outputs the min and max values for a variable.
'select_one' and 'select_multiple' provide the same tables as previously, with an option for providing the Totals if the variable was disaggregated (example below).
You can set specific flags here which affect the outputted tables. You can provide any or all of these keywords:
- include_na - to include NAs as one of the factor levels. Applicable only for 'select_one/multiple'
- add_total - to include a row with Totals. Applicable only when the table is disaggregated.
- join - not implemented yet
So, for example, if you’d like to include NAs for some table, and also have the Totals row, you can put into the calculation column this string: “add_total, include_na”.
“include_naadd_total” would also work, because
load_entry simply detects the presence of a regex pattern
in this column
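In other words, the detection is roughly equivalent to:

```r
# Flags are detected by simple pattern matching on the calculation column
# ('entry' stands for a DAF row as loaded by load_entry)
include_na <- grepl('include_na', entry$calculation)
add_total  <- grepl('add_total',  entry$calculation)
```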
Here is the link to a training in R at basic and intermediate levels. (Mandatory)
Here is the link to a training in Kobo at basic and intermediate levels.
Here is the link to a training in Excel at basic and intermediate levels.
Under this link, you will find the most frequently asked questions related to R/Kobo/GIS/Coding within the organization.