vdem_codebook.Rd
The V-Dem codebook (version 11) as a tibble for easier use in R. The original codebook is available at https://www.v-dem.net/en/data/data-version-11/
vdem_codebook
An object of class tbl_df
(inherits from tbl
, data.frame
) with 761 rows and 22 columns.
The section number in the codebook, as an integer. For example, section 2 contains the main indexes of democracy, section 3 the mid-level indexes, etc.
The name of the variable. Does not include suffixes (e.g.,
_codehigh
, _osp
)
The label extracted from the codebook.
The type of variable. These are, according to the codebook:
A*: Coding by Research Assistants. This is based on extant sources and is factual in nature. Country Experts indicate their confidence for this pre-coded data. * A: Coding by Project Managers and Research Assistants. This is based on extant sources and is factual in nature. * B: Coding by Country Coordinators. This is typically a graduate student or recent graduate who is from the country. These questions are also factual in nature. * C: Coding by Country Experts. This is typically a scholar or professional with deep knowledge of a country and perhaps of a particular political institution. Generally, that person is a citizen or resident of the country being coded. Multiple experts (usually 5 or more) code each variable. * D: Indices, i.e., composite variables composed of type A, B, or C variables. This may be accomplished by adding a denominator (e.g., per capita), by creating a cumulative scale (Total number of ...), or by aggregating larger concepts (e.g., components or indices of democracy). * E: Non-V-Dem. If we import a variable from another source without doing any original coding, except for perhaps imputing missing data, it is not considered a V-Dem product. Many of these variables are found in the section of the Codebook labeled "Background Factors.: If, however, we gather data from a number of sources and combine them in a more than purely mechanical fashion (requiring some judgment on our part), we regard this as a V-Dem product and classify it as type A, B, C, or D. All "E" variables are included in the codebook except those drawn from sources that have more than 30 variables such as Archigos (Goemans et al.), BDM (Bueno de Mesquita et al.), Henisz/POLCON (2000, 2002), Miller (Democratic Pieces), Performance of Nations (Kugler and Tammen), PEI (Norris et al.), PIPE (Przeworski et al.), QoG (Quality of Government). For these variables, we ask users to consult separate codebooks, as listed above.
The project manager listed in the codebook.
Indicates if the variable is also available in the following versions; _osp, _ord, _codelow, _codehigh, _sd, _mean and/or _nr. Detailed information about the different versions can be found in section 1.5 of the codebook (Variable Versions and Suffixes).
Lists the available variable types (Only applicable for ordinalized versions of indices).
If available, the question that the variable attempts to measure.
If available, the clarification in the codebook.
If available, the possible responses.
Multiple-choice: Where a coder can select only one answer. This is the usual protocol and is therefore not noted. Multiple-selection: Where a coder can select more than one answer. For most multiple-selection variables, the dataset contains both the original variable as well as a set of dummies for each of the responses.
If available, the order in which the variable is coded (e.g., only if previous quesiton is answered).
If available, the type of aggregation for the index (e.g., Bayesian factor analysis, average of indicators). Only applicable to indices.
If available, the scale of the responses (e.g., nominal, categorical).
If available, the method for aggregating multiple coders' judgments. Only applicable to C variables.
If available, the versions of the V-Dem data where the variable has been included.
If available, the source listed in the codebook. Citations for type-A* and A variables are listed, wherever possible, with complete references in the Bibliography. Note that this coding sometimes rests on numerous country-specific sources, in which case it has not been possible to include all citations. Composite indices (type-D) build on other variables in the V-Dem database, which are therefore listed as the source for that index.
Specifies if observations are set to missing based on values on other variables.
Suggested citation when using the specific variable.
Available coverage for the respective variable. For more information on country-specific year coverage, see the country table in the codebook.
If available, any notes in the codebook with additional info about the variable.
Coppedge, Michael, John Gerring, Carl Henrik Knutsen, Staffan I. Lindberg, Jan Teorell, David Altman, Michael Bernhard, Agnes Cornell, M. Steven Fish, Lisa Gastaldi, Haakon Gjerløw, Adam Glynn, Allen Hicken, Anna Lührmann, Seraphine F. Maerz, Kyle L. Marquardt, Kelly McMann, Valeriya Mechkova, Pamela Paxton, Daniel Pemstein, Johannes von Römer, Brigitte Seim, Rachel Sigman, Svend-Erik Skaaning, Jeffrey Staton, Aksel Sundtr¨om, Eitan Tzelgov,Luca Uberti, Yi-ting Wang, Tore Wig, and Daniel Ziblatt. 2021. "V-Dem Codebook v11" Varieties of Democracy (V-Dem) Project. Available at https://www.v-dem.net/en/data/data-version-11/