Norwegian Citizen Panel Methodology Report 31th wave

Background
Technical Aspects of the Survey
Previous Waves of Recruitment
Data Collection
- Platforms
- Time Usage
Representativity
- Factors Explaining Representativity
- Representativity of the Norwegian Citizen Panel
Weighting
Appendix

To cite this report, please use the following:
Wettergreen, J., Bjørnebekk, O., & Stokke, G. (2024). Norwegian Citizen Panel methodology report, wave 31 [Produced by Ideas2evidence].

Background

This report describes the procedures of data collection in the 31th wave of The Norwegian Citizen Panel. Technical aspects of data collection are discussed, along with the representativity of the panel, and how survey weights are calculated.

The Norwegian Citizen Panel (NCP) is one of the main components of Digital Social Science Core Facility (DIGSSCORE) at the University of Bergen. NCP is as a collaboration between several departments at the Faculty of Social Sciences at the University of Bergen and NORCE.

ideas2evidence is responsible for the panel recruitment, the administration of the panel, and the technical solutions regarding data collection and computing.

Technical Aspects of the Survey

Software

The surveys are administrated through the web-based survey software Confirmit (now part of the company Forsta). Confirmit is a “Software-as-a-Service” solution, where all software runs on Confirmit’s continuously monitored server park, and where survey respondents and developers interact with the system through various web-based interfaces. This software provides very high data security and operational stability. The security measures are the most stringent in the industry, and Confirmit guarantees 99.7 percent uptime. ideas2evidence does the programming of the survey in Confirmit on behalf of The Norwegian Citizen Panel.

Pilot, Soft Launch, and Distribution

The survey went through small-N pilot testing before data collection. In addition, the survey was tested extensively during the development phase by ideas2evidence and the researchers involved in the project.

The field period started by inviting a random sample of high participation respondents (soft launch). This was done in order to minimize the consequences if the questionnaire contained technical errors. No such errors were located/reported, and remaining panel members were invited shortly after.

Throughout the field period respondents was randomly allocated into one of five subgroups, all of which were exposed to different questions sets.

Randomisation Procedures

Each wave of NCP has an extensive use of randomisation procedures. The context of each randomisation procedure may vary¹, but they all share some common characteristics that will be described below.

All randomisation procedures are executed live in the questionnaire. This means that the randomisation takes place while the respondent is in the questionnaire, as opposed to pre-defined randomisations that are uploaded to the questionnaire. All randomisations are independent from another, unless the documentation states otherwise.

Randomisation functions are written in JavaScript. Math.random()² is widely used in combination with Math.Floor()³. These functions are deployed to achieve:

Randomly select one value from a vector
Randomly shuffle the contents of an array

The first procedure is typically used to determine a random sample of respondents to separate groups, for instance a treatment group within an experiment. As an example, consider an environment where we want to separate all respondents in two groups: group 1 and group 2. All respondents are randomly assigned the value 1 or 2, where each randomisation is independent from respondent to respondent. When N is sufficiently large, the groups will be of equal size.

Here is an example of the JavaScript code executed in Confirmit:

var form = f("x1");
if(!form.toBoolean()) { //If no previous randomisation on x1
  var precodes = x1.domainValues(); //copies the length of x1
  var randomNumber: float = Math.random()*precodes.length;
  var randomIndex: int = Math.floor(randomNumber);
  var code = precodes[randomIndex];
  form.set(code);
}

The second procedure is typically used when defining the order of an answer list as random. This can be useful when asking for the respondent’s party preference or in a list experiment. However, as a party cannot be listed twice, the procedure must take into account that the array of parties is reduced by 1 for each randomisation. Here is an example⁴:

function shuffle(array){
  var currentIndex = array.length, temporaryValue, randomIndex;
  
  //While there remain elements to shuffle
  while (0 != currentIndex) {
    randomIndex = Math.floor(Math.random() * currentIndex);
    currentIndex -= 1;
    
    // And swap it with the current element
    temporaryValue = array[currentIndex];
    array[currentIndex] = array[randomIndex];
    array[randomIndex] = temporaryValue;
  }
  return array;
}

Previous Waves of Recruitment

Existing panel members were recruited in wave 1, wave 3, wave 8, wave 11, wave 14, wave 16, wave 18, wave 22, wave 25, and wave 30. All samples were drawn from the National Population Registry of Norway. This registry holds information on everyone born in Norway, as well as former and current inhabitants. The Norwegian Tax Administration holds the formal responsibility for the registry.

Samples consist of people over the age of 18 who were randomly drawn from the registry. The extracted information was a) last name, b) first name, c) address, d) gender, e) year of birth, and f) phone number (the latter was not included in wave 1 or wave 30). Phone number was collected from Data Factory for respondents recruited in wave 30. Samples exclude people without a permanent address in Norway.

Table 1 outlines a short summary of previous recruitment efforts. Note that there are some differences between the recruitment processes. For a detailed description of each recruitment process, please refer to the respective methodology reports.

Table 1: Summary of recruitment
Event	Sample size	Mode	Contacts	Returned letters	Response rate (%)
Recruitment 1 (wave 1)	25 000	Postal	2	546	20.1
Recruitment 2 (wave 3)	25 000	Postal, phone/SMS	4	543	23.0
Recruitment 3 (wave 8)	22 000	Postal/SMS	3	479	19.4
Recruitment 4 (wave 11)	14 000	Postal/SMS	2	334	15.1
Recruitment 5 (wave 14)	14 000	Postal/SMS	2	389	15.0
Recruitment 6 (wave 16)	34 000	Postal/SMS	2	994	14.9
Recruitment 7 (wave 18)	15 000	Postal/SMS	2	381	14.0
Recruitment 8 (wave 22)	23 000	Postal/SMS	2	623	14.5
Recruitment 9 (wave 25)	18 000	Postal/SMS	2	562	13.9
Recruitment 10 (wave 30)	25 000	Postal/SMS	4	989	15.0

The response rate of recruitment in recruitment 4 and onwards were lower than recruitment in previous waves. The most important explanation were restrictions enforced by the Norwegian Tax Administration with regards to how many times the Citizen Panel can contact the invitees. Respondents in recruitments 4-9 were contacted twice at most. Recruitment 1 also had a maximum of two contact points, but achieved a response rate of 20 percent. One explanation for why we cannot replicate a response rate of 20 percent in recruitments 4-10 might be that NCP did a lot of promotion of the panel through media outlets leading up to and during recruitment 1. Additional promotion of the panel was carried out in relation to the Norwegian Parliamentary election the same fall. We also observe a slow, declining, recruitment rate after the fourth recruitment until recruitment 10 where the restriction on maximum contact points was lifted.

Data Collection

Responses by mode of contact

The survey was distributed to 29 759 panel members on the 4th of November 2024 for the softlaunch and on the 5th of November for the main launch. The invitation contained information on the Norwegian Citizen Panel, unique URLs for each panel member that led to the questionnaire, and unique access code which the panel members could use to log in to the survey by accessing a link on www.uib.no/medborger.

The invitation, first reminder, and third reminder were all distributed by e-mail. The second reminder was, depending on whether the panel member had a registered mobile phone number or not, distributed via SMS or e-mail. Prior to wave 31, 53.4 percent of the panel members were registered with a mobile phone number.

Table 2: Responses and response rate for panel members during data collection
Event	Response	Cumulative responses	Response rate	Cumulative response rate
Invitation (4th/5th of November)	5 555	5 555	34.6 %	34.6 %
First reminder (8th of November)	2 806	8 361	17.5 %	52.1 %
Second reminder - email (13th of November)	515	8 876	3.2 %	55.3 %
Second reminder - SMS (13th of November)	1 437	10 313	9 %	64.3 %
Third reminder (19th of November)	1 202	11 515	7.5 %	71.8 %

In total 11 515 existing panel members filled out the questionnaire. A response rate of 34.6 % was achieved between the invitation and the first reminder. Following a pattern observed in previous waves, the initial invitation produced a higher number of respondents than subsequent reminders. See table 2 for further details on number of respondents after reminders.

Using the same methodology as in previous waves for calculating response rate, respondents who have not participated in any of the last three waves are excluded. This leaves us with 16 054 eligible respondents. The overall response rate, as reported in table 2, is 71.8 %.

Approximately 1 700 of the initial invitations were reported as not delivered by Confirmit, which rounds to 5 percent. Measures are taken to ensure email deliverability, but are unable to accurately estimate how many of the delivered emails ended up as spam with the recipient.

Response of Existing Panel Members Over Time

Comparing the number of wave 31 respondents (11 515) to the number of respondents in the previous wave 30 (13 509), gives an overall wave-to-wave retention rate of 85 percent. Figure 1 illustrates each wave of recruitment by individual lines, and shows how many respondents that are preserved for each data collection. NCP has carried out 32 waves of data collection. Depending on when the respondents were recruited, the current wave is highlighted with a red circle. For respondents recruited in wave 1, the current wave is the 32th data collection (t32).

The wave-to-wave retention rate increases substantially after the first three waves (t1 - t3), until it stabilizes around a mean of 95 percent.

Figure 1: Wave-to-wave retention rate

Platforms

The questionnaire was prepared for input via smart phones, tablets, and other units capable of running web-browsers. In order to enhance the respondents’ experience, the questionnaire is responsive. Respondents on smaller devices, measured in pixels per inch (PPI), are exposed to slightly different visual representations of some questions. Question grids are presented as a set of individual questions on the same page, which is different from the desktop presentation where it would be presented in a table. 50 percent of all survey respondents that opened the questionnaire used a mobile phone.

A set number of survey questions must be answered for a person to be included as a respondent. 7 percent of the mobile users did not reach this minimum requirement, compared to 20 percent for non-mobile users.

The share of mobile users is high among respondents between 18 and 45 years of age. As shown in figure 2, the share of mobile users decline with age.

Figure 2: Share of mobile users by gender and age

Time Usage

The average respondent used 18.4 minutes to complete the questionnaire. Measuring average time usage is a challenge, as respondents may leave the questionnaire open in order to complete the survey later. This idle time causes an artificially high average for completing the survey. The average therefore includes only the respondents that spent 60 minutes or less completing the survey.

Figure 3: Time usage distribution of survey respondents

Table 3: Average time usage in minutes for each subset
Type	All	Group 1	Group 2	Group 3	Group 4	Group 5
All users	18.4	17.1	17.8	17.5	16.8	23.0
Non-mobile users	19.6	18.3	19.0	18.4	18.2	24.2
Mobile users	17.3	16.0	16.8	16.7	15.6	21.8

Representativity

In this section, we describe the representativity of the panel as a whole. First, we will discuss factors explaining representativity. Thereafter we apply demographic variables to present data on representativity by different strata. The data on representativity is the foundation for the section on weighting.

Factors Explaining Representativity

There are two main points that can serve as explanations to non-response and lack of representativity when recruiting and maintaining panel members:

access to and familiarity with the internet (given that a web-based questionnaire was the only response mode made available)
the motivation and interest of the respondents

The first challenge is strongly related to the age composition of the survey respondents. Although Norway has a very high computer and internet density, the probability of having an e-mail address, and the skills required to access and fill in an online questionnaire, normally decrease with increasing age. The second challenge, motivation and interest, is often explained by the respondents’ level of education. In addition to age and education, we added the variables of geography and gender in order to test the representativity of the survey respondents. The variables have the following categories:

Age: 18-29 years, 30-59 years, 60 and above.
Highest completed education: no education/elementary school, upper secondary, university/university college.
Geography: Oslo, Eastern Norway, Southern Norway, Western Norway, Trøndelag, Northern Norway.

Please note that starting wave twenty-one, the former county of Akershus is part of Eastern Norway, rather than being part of the traditional Akershus/Oslo stratum.

Representativity of the Norwegian Citizen Panel

The sampling frame of the survey equals to the Norwegian population above the age of 18, comprising a population of approximately 4.4 million individuals. Earlier reports have documented a systematic underrepresentation of respondents belonging to the two lowest educational groups, independent of gender and age. The underrepresentation is particularly strong for young men. As expected, individuals with education from universities or university colleges are overrepresented. All of these observations hold true for wave 31.

Table 4: Age distribution in the population and the net sample
Type	18 - 29 years	30 - 59 years	60 years and above
Population	18.3 %	49.2 %	32.6 %
Net sample	9 %	39.9 %	51.1 %

From the age distribution presented in table 4, we see that both the youngest age groups are underrepresented in the net sample of wave 31 when compared to the distribution we find in the population. Inversely, respondents age 60 years and more are overrepresented in the net sample when compared to the population.

Over time, and until wave 30, the panel exhibited a drift from perfect representativity with regard to age. As figure 4 shows, the oldest respondents started out underrepresented in wave 1, but have become increasingly overrepresented over time. The youngest respondents have become increasingly underrepresented at the same time. This has been explained by a difference in panel membership loyalty; younger panel members are less likely to participate in new waves of NCP after having been an active member of the panel. Recruitment in wave 30 introduced stratified sampling, and the youngest group was no longer underrepresented. Figure 4 shows a steep drop in representativity for the youngest age group from wave 30 to wave 31, highlighting how difficult it is to activate this age group.

Figure 4: Representativity of age groups

Table 5 breaks down population and the net sample by age and gender. This reveals a slight gender-age interaction in the panel representativity. Across the two youngest age groups, women are less undererepresented than men. Among the oldest in the panel men are more overrepresented than women in the same age bracket.

Table 5: Combined distribution of age and gender in the population and the net sample
	18 - 29 years		30 - 59 years		60 years and above
Type	Men	Women	Men	Women	Men	Women
Population	9.4 %	8.9 %	24.8 %	24.3 %	15.7 %	16.9 %
Net sample	3.7 %	5.3 %	19.3 %	20.6 %	27.8 %	23.3 %

The inclusion of educational level in table 6 reveals a systematic underrepresentation of respondents with little or no education, independent of age and gender. The underrepresentation is present in all age brackets, but is especially strong for young respondents.

Table 6: Combined distribution of age, gender and education in the population and the net sample
		Population		Net sample
		Men	Women	Men	Women
18 - 29 years	No education/elementary school	3.5 %	2.7 %	0.3 %	0.5 %
	Upper secondary education	3.9 %	3 %	1.9 %	2.5 %
	University/university college	2 %	3.1 %	1.4 %	2.2 %
30 - 59 years	No education/elementary school	4.8 %	3.7 %	0.5 %	0.4 %
	Upper secondary education	10.4 %	7.1 %	6 %	4.1 %
	University/university college	9.6 %	13.5 %	12.9 %	16.1 %
60 years and above	No education/elementary school	3.3 %	4.3 %	1.6 %	1.2 %
	Upper secondary education	7.8 %	7.8 %	9.5 %	6.9 %
	University/university college	4.6 %	4.8 %	16.8 %	15.2 %

Respondents who have completed upper secondary education or have not completed any level of education are in general underrepresented, while respondents with university/university college as their highest level of education are overrepresented. Those who have university or university college education are clearly overrepresented in the two oldest age brackets, irrespective of gender.

Figure 5: Representativity of education groups

Figure 5 illustrates the representation of education groups since wave one. The general trend is that the highly educated are overrepresented compared to those with less or no education. Except for slight improvements in representativity of the education groups when new respondents are recruited (wave 1, 3, 8, 11, 14, 16, 18, 22, 25 and 30), the overall pattern has remained stable throughout all waves.

Table 7: Combined distribution of age, gender and geography in the population and the net sample
		Population			Net sample
		Men	Women	Total	Men	Women	Total
Oslo	18-29 years	1.3%	1.4%	2.8%	0.7 %	1 %	1.7 %
	30-59 years	3.6%	3.5%	7%	3.3 %	3.9 %	7.2 %
	60 years and above	1.4%	1.6%	3.1%	3.4 %	3.3 %	6.7 %
	Total	6.3%	6.5%	12.9%	7.4 %	8.2 %	15.6 %
Eastern Norway	18-29 years	3.3%	3%	6.3%	1.2 %	1.6 %	2.8 %
	30-59 years	9.3%	9.3%	18.6%	6.4 %	7.1 %	13.4 %
	60 years and above	6.4%	7%	13.4%	10.9 %	9 %	19.9 %
	Total	19%	19.3%	38.3%	18.4 %	17.7 %	36.2 %
Southern Norway	18-29 years	0.6%	0.5%	1.1%	0.2 %	0.2 %	0.4 %
	30-59 years	1.4%	1.4%	2.8%	0.9 %	1 %	1.9 %
	60 years and above	0.9%	1%	1.9%	1.3 %	1.3 %	2.6 %
	Total	2.9%	2.9%	5.8%	2.5 %	2.5 %	4.9 %
Western Norway	18-29 years	2.5%	2.3%	4.8%	1 %	1.5 %	2.5 %
	30-59 years	6.3%	6.1%	12.4%	5.7 %	5.3 %	11 %
	60 years and above	4%	4.2%	8.2%	7.7 %	6.1 %	13.8 %
	Total	12.8%	12.6%	25.4%	14.5 %	12.9 %	27.3 %
Trøndelag	18-29 years	0.9%	0.8%	1.7%	0.4 %	0.6 %	1 %
	30-59 years	2.1%	2.1%	4.2%	1.6 %	1.8 %	3.4 %
	60 years and above	1.4%	1.5%	2.9%	2.2 %	1.8 %	4 %
	Total	4.4%	4.4%	8.8%	4.2 %	4.2 %	8.4 %
Northern Norway	18-29 years	0.8%	0.8%	1.6%	0.3 %	0.3 %	0.6 %
	30-59 years	2.1%	2%	4.1%	1.4 %	1.5 %	2.9 %
	60 years and above	1.6%	1.6%	3.2%	2.3 %	1.8 %	4.1 %
	Total	4.5%	4.4%	8.9%	3.9 %	3.7 %	7.6 %

We observe that the representation of panel members living in Trøndelag are nearly on level with the population, while respondents from Northern and Eastern Norway are underrepresented. Respondents from Western Norway and Oslo are overrepresented. Respondents aged 60 years and above are overrepresented in all parts of the country, but only slightly so in Southern and Northern Norway. We find the biggest overrepresentation in Oslo.

Figure 6: Representativity of regions

For wave twenty-one, population data stratified on the new regions was available for the first time since the regional reform of 2020. While this data eliminates some small uncertainty in the representativity analyses⁵, it also introduces a break in time series for Oslo (previously including Akershus) and Eastern Norway (now including Akershus). In the 2024 regional reform, Akershus, among other former counties, were reinstituted as an independent county. In order to preserve the regional division established in 2020, Akershus remains a part of Eastern Norway in our analysis. Compared to age and education, geography does, however, not seem to be a strong determinant of survey participation. Apart from effects from the regional reform, the geographic representativity is more or less stable over time.

Weighting

A weight has been calculated to compensate for observed biases. The weight is equal to the relation between a given strata in the population and the total population, divided by the relation between a given strata in the net sample and the total net sample⁶. This procedure returns values around 1, and always more than 0. Respondents belonging to a stratum that is underrepresented will receive a weight above 1 and respondents belonging to an overrepresented stratum will receive a weight below 1. We have listed the weights of the different strata in the table in the appendix.

When calculating the weight, information regarding the respondents’ geographical location, gender, and age is based on registry data. Information on these variables was included in the sample file received from the Norwegian National Population Registry upon recruitment. Information on the level of education is collected from NCP surveys. 1 percent of the 31th wave net sample have not answered the question about level of education.

Weight 4 combines demographic variables with education. Respondents with missing data on the education variable are only weighted on demography (the education component of the weight is in these cases set to 1). Level of education is recoded into 1) upper secondary and lower levels of education, and 2) university/university college. Weight values are capped to a minimum of 0.2 and a maximum of 5. Capping limits the influence of highly overrepresented and underrepresented strata on the weighted statistics, and reduces the likelihood of the weights being overfitted with respect to the demographic composition of each wave.

The following demographic variables are applied in weight4:

Age: 18-29 years, 30-59 years, 60 and above.
Gender: Male, female.
Highest level of completed education: upper secondary and lower, university/university college.
Geography: Oslo, Eastern Norway, Southern Norway, Western Norway, Trøndelag, Northern Norway.

When applied, the weight will provide a weighted N equal to the number of cases in the dataset. In other words, the weight is calculated using the whole dataset. NCP has an extensive use of (randomized) sub-groups, which might alter the demographic profile of the sub-group compared to the whole dataset.

Consequently, the weight might be less precise for some sub-groups. Note that the dataset is provided with necessary information⁷ to calculate custom weights if needed, following the procedure described above.

Table 8 shows the effects of weight 4 on the distribution of self-reported level of education in the net sample. As we can observe, the weight gives the sample a distribution closer to the population. It is, however, important to stress that the unweighted distribution is not ideal, with a clear underrepresentation of people with lower levels of education, and young people.

Table 8: Effect of weight4 on self-reported level of education
	Sample - standard	Sample - weighted	Population	Difference between sample and population	Difference between weighted sample and population
Upper secondary school and lower	35.5 %	62.3 %	62.3 %	-26.9	0
University/university college	64.5 %	37.7 %	37.7 %	26.9	0

Appendix

Table 9: Weight4 applied to different strata
	Age	Education	Men	Women		Age	Education	Men	Women
Oslo	18-29	Upper secondary and lower	2.9	1.8	Western Norway	18-29	Upper secondary and lower	3.4	1.5
		University/university college	1.3	1.3			University/university college	1.3	1.7
	30-59	Upper secondary and lower	2.4	2.6		30-59	Upper secondary and lower	2	2.3
		University/university college	0.8	0.7			University/university college	0.6	0.8
	60+	Upper secondary and lower	1	1.1		60+	Upper secondary and lower	0.8	1.3
		University/university college	0.3	0.3			University/university college	0.3	0.3
Eastern Norway	18-29	Upper secondary and lower	3.5	2.2	Trøndelag	18-29	Upper secondary and lower	2.5	1.8
		University/university college	1.6	1.4			University/university college	1.8	1.1
	30-59	Upper secondary and lower	2.5	2.5		30-59	Upper secondary and lower	2.7	2.4
		University/university college	0.8	0.9			University/university college	0.7	0.8
	60+	Upper secondary and lower	1	1.5		60+	Upper secondary and lower	1.1	2.1
		University/university college	0.3	0.3			University/university college	0.3	0.3
Southern Norway	18-29	Upper secondary and lower	4.4	2.4	Northern Norway	18-29	Upper secondary and lower	3.2	2.6
		University/university college	1.4	1.7			University/university college	1.8	1.6
	30-59	Upper secondary and lower	2.1	2.2		30-59	Upper secondary and lower	2.8	2.3
		University/university college	0.9	1			University/university college	0.8	1
	60+	Upper secondary and lower	1.4	2.1		60+	Upper secondary and lower	1.1	1.8
		University/university college	0.3	0.3			University/university college	0.3	0.4

Some examples: sorting respondents in different thematic subsets, randomly allocate treatment value in experiments, randomize order of an answer list/array, order a sequence of questions by random, ask a given question to a subset of the respondents.↩︎
Please see following resource: https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Math/random ↩︎
Please see following resource: https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Math/floor ↩︎
Collected from Mike Bostocks visualisation: https://bost.ocks.org/mike/shuffle/↩︎
Note that Oslo (including Akershus), and Eastern Norway diverge in wave eighteen, due to the regional reform implemented 1st of January 2020.↩︎
The applied formula for weight wi for element i, in strata h is: $w_i=\frac{N_h/N}{n_h/n}$ ↩︎
See columns r31_Weight4_stratapop and r31_generic_stratapop↩︎