The V-Dem codebook (version 11) as a tibble for easier use in R. The original codebook is available at https://www.v-dem.net/en/data/data-version-11/

vdem_codebook

Format

An object of class tbl_df (inherits from tbl, data.frame) with 761 rows and 22 columns.

Variable Descriptions

section

The section number in the codebook, as an integer. For example, section 2 contains the main indexes of democracy, section 3 the mid-level indexes, etc.

name

The name of the variable. Does not include suffixes (e.g., _codehigh, _osp)

label

The label extracted from the codebook.

type

The type of variable. These are, according to the codebook:

  • A*: Coding by Research Assistants. This is based on extant sources and is factual in nature. Country Experts indicate their confidence for this pre-coded data. * A: Coding by Project Managers and Research Assistants. This is based on extant sources and is factual in nature. * B: Coding by Country Coordinators. This is typically a graduate student or recent graduate who is from the country. These questions are also factual in nature. * C: Coding by Country Experts. This is typically a scholar or professional with deep knowledge of a country and perhaps of a particular political institution. Generally, that person is a citizen or resident of the country being coded. Multiple experts (usually 5 or more) code each variable. * D: Indices, i.e., composite variables composed of type A, B, or C variables. This may be accomplished by adding a denominator (e.g., per capita), by creating a cumulative scale (Total number of ...), or by aggregating larger concepts (e.g., components or indices of democracy). * E: Non-V-Dem. If we import a variable from another source without doing any original coding, except for perhaps imputing missing data, it is not considered a V-Dem product. Many of these variables are found in the section of the Codebook labeled "Background Factors.: If, however, we gather data from a number of sources and combine them in a more than purely mechanical fashion (requiring some judgment on our part), we regard this as a V-Dem product and classify it as type A, B, C, or D. All "E" variables are included in the codebook except those drawn from sources that have more than 30 variables such as Archigos (Goemans et al.), BDM (Bueno de Mesquita et al.), Henisz/POLCON (2000, 2002), Miller (Democratic Pieces), Performance of Nations (Kugler and Tammen), PEI (Norris et al.), PIPE (Przeworski et al.), QoG (Quality of Government). For these variables, we ask users to consult separate codebooks, as listed above.

project_manager

The project manager listed in the codebook.

additional_versions

Indicates if the variable is also available in the following versions; _osp, _ord, _codelow, _codehigh, _sd, _mean and/or _nr. Detailed information about the different versions can be found in section 1.5 of the codebook (Variable Versions and Suffixes).

available_versions

Lists the available variable types (Only applicable for ordinalized versions of indices).

question

If available, the question that the variable attempts to measure.

clarification

If available, the clarification in the codebook.

responses

If available, the possible responses.

answer_type

Multiple-choice: Where a coder can select only one answer. This is the usual protocol and is therefore not noted. Multiple-selection: Where a coder can select more than one answer. For most multiple-selection variables, the dataset contains both the original variable as well as a set of dummies for each of the responses.

ordering

If available, the order in which the variable is coded (e.g., only if previous quesiton is answered).

aggregation

If available, the type of aggregation for the index (e.g., Bayesian factor analysis, average of indicators). Only applicable to indices.

scale

If available, the scale of the responses (e.g., nominal, categorical).

cross_coder

If available, the method for aggregating multiple coders' judgments. Only applicable to C variables.

data_release

If available, the versions of the V-Dem data where the variable has been included.

source

If available, the source listed in the codebook. Citations for type-A* and A variables are listed, wherever possible, with complete references in the Bibliography. Note that this coding sometimes rests on numerous country-specific sources, in which case it has not been possible to include all citations. Composite indices (type-D) build on other variables in the V-Dem database, which are therefore listed as the source for that index.

cleaning

Specifies if observations are set to missing based on values on other variables.

citation

Suggested citation when using the specific variable.

years

Available coverage for the respective variable. For more information on country-specific year coverage, see the country table in the codebook.

note

If available, any notes in the codebook with additional info about the variable.

References

Coppedge, Michael, John Gerring, Carl Henrik Knutsen, Staffan I. Lindberg, Jan Teorell, David Altman, Michael Bernhard, Agnes Cornell, M. Steven Fish, Lisa Gastaldi, Haakon Gjerløw, Adam Glynn, Allen Hicken, Anna Lührmann, Seraphine F. Maerz, Kyle L. Marquardt, Kelly McMann, Valeriya Mechkova, Pamela Paxton, Daniel Pemstein, Johannes von Römer, Brigitte Seim, Rachel Sigman, Svend-Erik Skaaning, Jeffrey Staton, Aksel Sundtr¨om, Eitan Tzelgov,Luca Uberti, Yi-ting Wang, Tore Wig, and Daniel Ziblatt. 2021. "V-Dem Codebook v11" Varieties of Democracy (V-Dem) Project. Available at https://www.v-dem.net/en/data/data-version-11/