• Skip to main content
  • Skip to footer
  • Home

The May 13 Group

the next day for evaluation

  • Get Involved
  • Our Work
  • About Us
You are here: Home / allblogs / evalacademy / Data Dictionary: the what, why and how

Oct 12 2021

Data Dictionary: the what, why and how

 

Technological advances have resulted in the collection of large amounts of data and the availability of data continues to skyrocket. To give you a perspective, more data has been collected in the past two years (2019 & 2020) than the entire human history before that.

This post isn’t about big data – we know there are more than enough articles about big data out there. Here, we’ll focus on how evaluators can (and should) clarify details about the data being used for evaluation. In other words, how and why build an evaluation-specific data dictionary.


What is a Data Dictionary?

Definitions of “data dictionary” vary but it is generally understood to be a common language for quantitative data. Data dictionaries provide a precise vocabulary for specific data elements and help to standardize a dataset and ensure that the relevance, and quality of data elements, are the same for all users. Data dictionaries describe the meaning and purpose of data elements within the context of a project and provide guidance on interpretation.

Why Use a Data Dictionary?

It is ideal to have a data dictionary whenever you have quantitative data that will be used and shared by multiple people or groups. Without precise definitions, it is very easy to arrive at different results while using the same dataset. Confusions can be avoided by documenting data definitions and parameters and sharing them with all stakeholders.

Although creating a data dictionary is time-consuming, having precise documentation that can be used by all stakeholders promotes efficiency. Look at the following example for an online health education program:

  • The program team defines the “number of program participants per week” as the total number of participants that completed the online module per week.

  • The IT team defines the “number of program participants per week” as the total number of participants that accessed the online module per week.

As evaluators, if we didn’t examine the definition of “number of program participants per week” meant, we might draw some incorrect conclusions, and risk making unreliable or even dangerous recommendations.

A data dictionary that is prepared collaboratively between the evaluator and stakeholders can prevent confusion and promote alignment. In short, data dictionaries can:

  • Provide consistency in the collection and use of data across multiple users;

  • Make data analysis easier;

  • Promote usability of data; and

  • Increase confidence in the data, results, and decisions.

How to Prepare a Data Dictionary

Before embarking on the task of creating a data dictionary, ask the program team if there’s an existing data dictionary for the dataset. It is a common practice to share a dataset with the data dictionary if there is one. However, the project team/client might not think to share the data dictionary with you. If there is a common, vetted, and documented data dictionary, it may not be necessary to create a new one.  

 The built-in active data dictionary can be used in most data management systems including MS Access and SPSS to generate documents as needed. Below is an image of a simple SPSS codebook output.  

Image Source: https://libguides.library.kent.edu/SPSS/Codebooks

Image Source: https://libguides.library.kent.edu/SPSS/Codebooks

Alternatively, if your data set is in MS Excel, you can use MS Excel or Word for documentation. Creating and managing a data dictionary is an iterative process; the definitions for the data dictionary categories and the relationships need to be revised regularly. Often data dictionaries in program evaluation contain the following:  

  • A list of data objects: names, metrics (measurement units) and definitions; 

  • Inclusion and exclusion criteria: specify cases to be included or excluded; 

  • Data Source(s): specify the source of data;  

  • Data Update: state how frequently the data is updated and available (e.g., weekly, monthly, annually);  

  • Limitations: specify any considerations that would impact the use of the indicator (can comment on reliability and validity of the data and include any other detail); 

  • Missing data: state if there are any missing values and how they were handled;  

  • Technical notes: provide technical details which help interpret the data presented; and 

  • Approval and sign-off: a data dictionary should be created collaboratively with approval from all those that will use the dictionary. After revisions and edits, and it should be signed off by the team leads to finalize the document.   

Screen Shot 2021-10-12 at 3.02.26 PM.png

In summary, a data dictionary is a great evaluation tool for projects with quantitative data. A data dictionary is time-consuming to prepare; however, it can promote efficiency and accuracy in the long run. Try building one for your next evaluation project.

While you’re creating definitions, check out our Performance Measures Definitions template.  


Sign up for our newsletter

We’ll let you know about our new content, and curate the best new evaluation resources from around the web!


We respect your privacy.

Thank you!


 

Written by cplysy · Categorized: evalacademy

Related Posts

You may be interested in these posts from the same category.

[grid content=”post” taxonomy=”category” terms=”current” exclude_current=”true” number=”12″ gutter=”10″ align=”center” slider=”true” center_mode=”true”]

Footer

Follow our Work

The easiest way to stay connected to our work is to join our newsletter. You’ll get updates on projects, learn about new events, and hear stories from those evaluators whom the field continues to actively exclude and erase.

Get Updates

Want to take further action or join a pod? Click here to learn more.

Copyright © 2026 · The May 13 Group · Log in

en English
af Afrikaanssq Shqipam አማርኛar العربيةhy Հայերենaz Azərbaycan dilieu Euskarabe Беларуская моваbn বাংলাbs Bosanskibg Българскиca Catalàceb Cebuanony Chichewazh-CN 简体中文zh-TW 繁體中文co Corsuhr Hrvatskics Čeština‎da Dansknl Nederlandsen Englisheo Esperantoet Eestitl Filipinofi Suomifr Françaisfy Fryskgl Galegoka ქართულიde Deutschel Ελληνικάgu ગુજરાતીht Kreyol ayisyenha Harshen Hausahaw Ōlelo Hawaiʻiiw עִבְרִיתhi हिन्दीhmn Hmonghu Magyaris Íslenskaig Igboid Bahasa Indonesiaga Gaeilgeit Italianoja 日本語jw Basa Jawakn ಕನ್ನಡkk Қазақ тіліkm ភាសាខ្មែរko 한국어ku كوردی‎ky Кыргызчаlo ພາສາລາວla Latinlv Latviešu valodalt Lietuvių kalbalb Lëtzebuergeschmk Македонски јазикmg Malagasyms Bahasa Melayuml മലയാളംmt Maltesemi Te Reo Māorimr मराठीmn Монголmy ဗမာစာne नेपालीno Norsk bokmålps پښتوfa فارسیpl Polskipt Portuguêspa ਪੰਜਾਬੀro Românăru Русскийsm Samoangd Gàidhligsr Српски језикst Sesothosn Shonasd سنڌيsi සිංහලsk Slovenčinasl Slovenščinaso Afsoomaalies Españolsu Basa Sundasw Kiswahilisv Svenskatg Тоҷикӣta தமிழ்te తెలుగుth ไทยtr Türkçeuk Українськаur اردوuz O‘zbekchavi Tiếng Việtcy Cymraegxh isiXhosayi יידישyo Yorùbázu Zulu