Wave 1 (2008-2013) Public Use Data

Wave 1 Public Use Dataset

The Survey of the Health of Wisconsin (SHOW) is a novel statewide infrastructure for population health data collection that was established in Wisconsin in 2008. The survey includes surveys of a representative sample of state residents including individual interviews, a physical exam, and biospecimen collection, and assessments of the community environment. This dataset is a statewide representative, cross-sectional sample of Wisconsin focusing on health data and contextual factors.

Access the Data

Questions?

Contact: data@show.wisc.edu

This is an accordion element with a series of buttons that open and close related content panels.

SHOW Cohort

The 2008-2013 SHOW cohort contains 3,380 individuals from many distinct cities and towns across 62 Wisconsin Counties, as shown in the maps below.

This Wave 1 dataset is the SHOW cohort “baseline” for the longitudinal follow-up, and includes individuals between 21 and 74 years of age.

Sampling Methodologies

The SHOW sampling frame is Wisconsin non-institutionalized/non-active duty adult residents ages 21-74 at the time of initial selection (longitudinal evaluation follows subjects beyond age 74). However, all household members (including children and older adults) are enumerated (see below), and may be the focus of targeted ancillary studies.

Presently, the goal is to continue this survey indefinitely, recruiting an independent sample of 800-1,000 survey participants each year. This target sample size is driven by a broad range of potential outcomes and research questions to be addressed with SHOW and in anticipation that researchers will use these data in multiple ways with varying sample size requirements.

Sampling procedures

Every year, survey participants are selected from a random sample of households using a two-stage, probability-based cluster sampling approach

Stage 1

The initial sampling frame is constructed using Census 2000 data to generate 4,388 Census Block Groups (CBGs) or clusters of CBGs for use as the primary sampling units.  In order to ensure a representative distribution of the sample across the entire spatial and sociodemographic range of the state population, the PSUs are stratified according to two criteria: 1) congressional district (8 strata); and 2) percentage of the population living below 100% of poverty level.

Stage 2

SHOW used a variety of data resources and geographic information system (GIS) technology to create its household sampling frame. A list of household addresses by CBG is generated from which 12-28 addresses are randomly selected using simple random sampling.

As a final quality assurance step in the household sampling frame development, the field teams conduct a modified half-open interval procedure to identify households that may have been missed in the original sampling frame.

 

 

 

 

 

 

 

For detailed information on the sampling design, recruitment methods and participation of the 2008-2013 study sample, please see Nieto et al. 2010 (https://doi.org/10.1186/1471-2458-10-785) & Malecki et al. 2022 (https://doi.org/10.3389/fpubh.2022.818777) methods papers.

Collection Methodologies

The SHOW survey in 2008-2013 consisted of a series of independent annual surveys gathering health-related information on representative samples of adult state residents and communities. Two-stage cluster sampling was used to select households and recruit approximately 800-1,000 adult participants (21-74 years old) each year. Recruitment and initial interviews were done at the household; additional interviews and physical exams were conducted at permanent or mobile examination centers. Individual survey data includes physical, mental, and oral health history, health literacy, demographics, behavioral, lifestyle, occupational, and household characteristics as well as health care access and utilization.

The physical exam included blood pressure, anthropometry, bioimpedance, spirometry, urine collection and blood draws. Serum and plasma were stored in the SHOW biorepository for future studies. Every household was geocoded for linkage with existing contextual data including community-level measures of the social and physical environment; local neighborhood characteristics were also recorded using an audit tool.

For detailed information on the sampling design, recruitment methods and participation of the 2008-2013 study sample, please see Nieto et al. 2010 (https://doi.org/10.1186/1471-2458-10-785) & Malecki et al. 2022 (https://doi.org/10.3389/fpubh.2022.818777) methods papers.

Data Topics

The SHOW Core Survey in 2008-2013 covered the following list of topics, for more information on all of the topics the SHOW Survey has covered across all years, please see our topics page.

Analytic Notes

All data in SHOW 2008-2013 public use datasets were masked and deidentified. A complete list of identification steps can be found within documentation accompanying the full public use dataset.

Sampling Weights

The SHOW core data was collected using a sampling frame across the state of Wisconsin.  Due to the deidentified nature of the data, no sampling weights were provided.  Data is still usable; the difference is that statements indicating that results are weighted to be representative of the residents of state of Wisconsin cannot be made.  What can be said is that SHOW surveyed the state of Wisconsin; responses represent WI residents who participated in SHOW.

Clustering

All eligible adults within a selected household were invited to participate in SHOW. There are participants in SHOW who have one more additional household members who also participated in SHOW and in the data set. For some analyses, correlations may exist among household members, and this may violate the assumption of regression analyses that all events are independent of one another. The variable Household Identification (HHID) is available upon request and will require a data request and IRB approval.

Submit a Data Request

Linkages - Connecting this Dataset to other SHOW Data or SHOW Biospecimen

Many topics covered within this dataset were also asked in other timepoints of the SHOW cohort. Learn about all the topics we’ve covered and our full range of services to enhance the insights this dataset offers.

The 2008-2013 dataset covers the SHOW survey baseline for longitudinal follow-up, as show in the following figure:

Request Data or a Research Consultation

SHOW Fee Structure

Non-public, restricted use variables from 2008-2013 include those that were modified (categorized or collapsed responses) from being continuous or were removed from the public use data set. You may request restricted use 2008-2013 data by submitting a data request via our consultation form.

All requests for non-public, restricted use variables require IRB approval letter and application.

These additional variables are FREE:

  • Household Identification number (HHID)
  • Sampling weights, strata and cluster variables

These data are NOT subject to the $3000+ restructured use data request fee if, but subject to the $103/hr analytic rate:

  • Specific additional variables
    •  Another example may be that you need a specific health outcomes variable in from health history questionnaire (HHQ) in order to run your analyses.
  • Unmodified variables
    • For example, final regression models and results have been run and presented, but you need continuous variable, not collapsed responses, for publication.

These data are subject to the $3000+ restricted-use data request fee:

  • Broad linkages with other years of SHOW data
  • Requesting other waves of collected SHOW data

Generally, the $3000 fee will be incurred if a specific research aim has not been sufficiently identified and analyzed with the public use data provided first. If you would like to discuss your research and data needs, please use the button below to request a free 1-hour consultation with our team.

Request Data or a Research Consultation