To cite this report, please use the following:
Wettergreen, J.,
Bjørnebekk, O., & Stokke, G. (2024). Norwegian Citizen Panel
methodology report, wave 31 [Produced by Ideas2evidence].
This report describes the procedures of data collection in the 31th wave of The Norwegian Citizen Panel. Technical aspects of data collection are discussed, along with the representativity of the panel, and how survey weights are calculated.
The Norwegian Citizen Panel (NCP) is one of the main components of Digital Social Science Core Facility (DIGSSCORE) at the University of Bergen. NCP is as a collaboration between several departments at the Faculty of Social Sciences at the University of Bergen and NORCE.
ideas2evidence is responsible for the panel recruitment, the administration of the panel, and the technical solutions regarding data collection and computing.
The surveys are administrated through the web-based survey software Confirmit (now part of the company Forsta). Confirmit is a “Software-as-a-Service” solution, where all software runs on Confirmit’s continuously monitored server park, and where survey respondents and developers interact with the system through various web-based interfaces. This software provides very high data security and operational stability. The security measures are the most stringent in the industry, and Confirmit guarantees 99.7 percent uptime. ideas2evidence does the programming of the survey in Confirmit on behalf of The Norwegian Citizen Panel.
The survey went through small-N pilot testing before data collection. In addition, the survey was tested extensively during the development phase by ideas2evidence and the researchers involved in the project.
The field period started by inviting a random sample of high participation respondents (soft launch). This was done in order to minimize the consequences if the questionnaire contained technical errors. No such errors were located/reported, and remaining panel members were invited shortly after.
Throughout the field period respondents was randomly allocated into one of five subgroups, all of which were exposed to different questions sets.
Each wave of NCP has an extensive use of randomisation procedures. The context of each randomisation procedure may vary1, but they all share some common characteristics that will be described below.
All randomisation procedures are executed live in the questionnaire. This means that the randomisation takes place while the respondent is in the questionnaire, as opposed to pre-defined randomisations that are uploaded to the questionnaire. All randomisations are independent from another, unless the documentation states otherwise.
Randomisation functions are written in JavaScript. Math.random()2 is widely used in combination with Math.Floor()3. These functions are deployed to achieve:
The first procedure is typically used to determine a random sample of respondents to separate groups, for instance a treatment group within an experiment. As an example, consider an environment where we want to separate all respondents in two groups: group 1 and group 2. All respondents are randomly assigned the value 1 or 2, where each randomisation is independent from respondent to respondent. When N is sufficiently large, the groups will be of equal size.
Here is an example of the JavaScript code executed in Confirmit:
var form = f("x1");
if(!form.toBoolean()) { //If no previous randomisation on x1
var precodes = x1.domainValues(); //copies the length of x1
var randomNumber: float = Math.random()*precodes.length;
var randomIndex: int = Math.floor(randomNumber);
var code = precodes[randomIndex];
form.set(code);
}
The second procedure is typically used when defining the order of an answer list as random. This can be useful when asking for the respondent’s party preference or in a list experiment. However, as a party cannot be listed twice, the procedure must take into account that the array of parties is reduced by 1 for each randomisation. Here is an example4:
function shuffle(array){
var currentIndex = array.length, temporaryValue, randomIndex;
//While there remain elements to shuffle
while (0 != currentIndex) {
randomIndex = Math.floor(Math.random() * currentIndex);
currentIndex -= 1;
// And swap it with the current element
temporaryValue = array[currentIndex];
array[currentIndex] = array[randomIndex];
array[randomIndex] = temporaryValue;
}
return array;
}
Existing panel members were recruited in wave 1, wave 3, wave 8, wave 11, wave 14, wave 16, wave 18, wave 22, wave 25, and wave 30. All samples were drawn from the National Population Registry of Norway. This registry holds information on everyone born in Norway, as well as former and current inhabitants. The Norwegian Tax Administration holds the formal responsibility for the registry.
Samples consist of people over the age of 18 who were randomly drawn from the registry. The extracted information was a) last name, b) first name, c) address, d) gender, e) year of birth, and f) phone number (the latter was not included in wave 1 or wave 30). Phone number was collected from Data Factory for respondents recruited in wave 30. Samples exclude people without a permanent address in Norway.
Table 1 outlines a short summary of previous recruitment efforts. Note that there are some differences between the recruitment processes. For a detailed description of each recruitment process, please refer to the respective methodology reports.
Event | Sample size | Mode | Contacts | Returned letters | Response rate (%) |
---|---|---|---|---|---|
Recruitment 1 (wave 1) | 25 000 | Postal | 2 | 546 | 20.1 |
Recruitment 2 (wave 3) | 25 000 | Postal, phone/SMS | 4 | 543 | 23.0 |
Recruitment 3 (wave 8) | 22 000 | Postal/SMS | 3 | 479 | 19.4 |
Recruitment 4 (wave 11) | 14 000 | Postal/SMS | 2 | 334 | 15.1 |
Recruitment 5 (wave 14) | 14 000 | Postal/SMS | 2 | 389 | 15.0 |
Recruitment 6 (wave 16) | 34 000 | Postal/SMS | 2 | 994 | 14.9 |
Recruitment 7 (wave 18) | 15 000 | Postal/SMS | 2 | 381 | 14.0 |
Recruitment 8 (wave 22) | 23 000 | Postal/SMS | 2 | 623 | 14.5 |
Recruitment 9 (wave 25) | 18 000 | Postal/SMS | 2 | 562 | 13.9 |
Recruitment 10 (wave 30) | 25 000 | Postal/SMS | 4 | 989 | 15.0 |
The response rate of recruitment in recruitment 4 and onwards were lower than recruitment in previous waves. The most important explanation were restrictions enforced by the Norwegian Tax Administration with regards to how many times the Citizen Panel can contact the invitees. Respondents in recruitments 4-9 were contacted twice at most. Recruitment 1 also had a maximum of two contact points, but achieved a response rate of 20 percent. One explanation for why we cannot replicate a response rate of 20 percent in recruitments 4-10 might be that NCP did a lot of promotion of the panel through media outlets leading up to and during recruitment 1. Additional promotion of the panel was carried out in relation to the Norwegian Parliamentary election the same fall. We also observe a slow, declining, recruitment rate after the fourth recruitment until recruitment 10 where the restriction on maximum contact points was lifted.
The survey was distributed to 29 759 panel members on the 4th of November 2024 for the softlaunch and on the 5th of November for the main launch. The invitation contained information on the Norwegian Citizen Panel, unique URLs for each panel member that led to the questionnaire, and unique access code which the panel members could use to log in to the survey by accessing a link on www.uib.no/medborger.
The invitation, first reminder, and third reminder were all distributed by e-mail. The second reminder was, depending on whether the panel member had a registered mobile phone number or not, distributed via SMS or e-mail. Prior to wave 31, 53.4 percent of the panel members were registered with a mobile phone number.
Event | Response | Cumulative responses | Response rate | Cumulative response rate |
---|---|---|---|---|
Invitation (4th/5th of November) | 5 555 | 5 555 | 34.6 % | 34.6 % |
First reminder (8th of November) | 2 806 | 8 361 | 17.5 % | 52.1 % |
Second reminder - email (13th of November) | 515 | 8 876 | 3.2 % | 55.3 % |
Second reminder - SMS (13th of November) | 1 437 | 10 313 | 9 % | 64.3 % |
Third reminder (19th of November) | 1 202 | 11 515 | 7.5 % | 71.8 % |
In total 11 515 existing panel members filled out the questionnaire. A response rate of 34.6 % was achieved between the invitation and the first reminder. Following a pattern observed in previous waves, the initial invitation produced a higher number of respondents than subsequent reminders. See table 2 for further details on number of respondents after reminders.
Using the same methodology as in previous waves for calculating response rate, respondents who have not participated in any of the last three waves are excluded. This leaves us with 16 054 eligible respondents. The overall response rate, as reported in table 2, is 71.8 %.
Approximately 1 700 of the initial invitations were reported as not delivered by Confirmit, which rounds to 5 percent. Measures are taken to ensure email deliverability, but are unable to accurately estimate how many of the delivered emails ended up as spam with the recipient.
Comparing the number of wave 31 respondents (11 515) to the number of respondents in the previous wave 30 (13 509), gives an overall wave-to-wave retention rate of 85 percent. Figure 1 illustrates each wave of recruitment by individual lines, and shows how many respondents that are preserved for each data collection. NCP has carried out 32 waves of data collection. Depending on when the respondents were recruited, the current wave is highlighted with a red circle. For respondents recruited in wave 1, the current wave is the 32th data collection (t32).
The wave-to-wave retention rate increases substantially after the first three waves (t1 - t3), until it stabilizes around a mean of 95 percent.
Figure 1: Wave-to-wave retention rate
The questionnaire was prepared for input via smart phones, tablets, and other units capable of running web-browsers. In order to enhance the respondents’ experience, the questionnaire is responsive. Respondents on smaller devices, measured in pixels per inch (PPI), are exposed to slightly different visual representations of some questions. Question grids are presented as a set of individual questions on the same page, which is different from the desktop presentation where it would be presented in a table. 50 percent of all survey respondents that opened the questionnaire used a mobile phone.
A set number of survey questions must be answered for a person to be included as a respondent. 7 percent of the mobile users did not reach this minimum requirement, compared to 20 percent for non-mobile users.
The share of mobile users is high among respondents between 18 and 45 years of age. As shown in figure 2, the share of mobile users decline with age.
Figure 2: Share of mobile users by gender and age
The average respondent used 18.4 minutes to complete the questionnaire. Measuring average time usage is a challenge, as respondents may leave the questionnaire open in order to complete the survey later. This idle time causes an artificially high average for completing the survey. The average therefore includes only the respondents that spent 60 minutes or less completing the survey.
Figure 3: Time usage distribution of survey respondents
Type | All | Group 1 | Group 2 | Group 3 | Group 4 | Group 5 |
---|---|---|---|---|---|---|
All users | 18.4 | 17.1 | 17.8 | 17.5 | 16.8 | 23.0 |
Non-mobile users | 19.6 | 18.3 | 19.0 | 18.4 | 18.2 | 24.2 |
Mobile users | 17.3 | 16.0 | 16.8 | 16.7 | 15.6 | 21.8 |
In this section, we describe the representativity of the panel as a whole. First, we will discuss factors explaining representativity. Thereafter we apply demographic variables to present data on representativity by different strata. The data on representativity is the foundation for the section on weighting.
There are two main points that can serve as explanations to non-response and lack of representativity when recruiting and maintaining panel members:
The first challenge is strongly related to the age composition of the survey respondents. Although Norway has a very high computer and internet density, the probability of having an e-mail address, and the skills required to access and fill in an online questionnaire, normally decrease with increasing age. The second challenge, motivation and interest, is often explained by the respondents’ level of education. In addition to age and education, we added the variables of geography and gender in order to test the representativity of the survey respondents. The variables have the following categories:
Please note that starting wave twenty-one, the former county of Akershus is part of Eastern Norway, rather than being part of the traditional Akershus/Oslo stratum.
The sampling frame of the survey equals to the Norwegian population above the age of 18, comprising a population of approximately 4.4 million individuals. Earlier reports have documented a systematic underrepresentation of respondents belonging to the two lowest educational groups, independent of gender and age. The underrepresentation is particularly strong for young men. As expected, individuals with education from universities or university colleges are overrepresented. All of these observations hold true for wave 31.
Type | 18 - 29 years | 30 - 59 years | 60 years and above |
---|---|---|---|
Population | 18.3 % | 49.2 % | 32.6 % |
Net sample | 9 % | 39.9 % | 51.1 % |
From the age distribution presented in table 4, we see that both the youngest age groups are underrepresented in the net sample of wave 31 when compared to the distribution we find in the population. Inversely, respondents age 60 years and more are overrepresented in the net sample when compared to the population.
Over time, and until wave 30, the panel exhibited a drift from perfect representativity with regard to age. As figure 4 shows, the oldest respondents started out underrepresented in wave 1, but have become increasingly overrepresented over time. The youngest respondents have become increasingly underrepresented at the same time. This has been explained by a difference in panel membership loyalty; younger panel members are less likely to participate in new waves of NCP after having been an active member of the panel. Recruitment in wave 30 introduced stratified sampling, and the youngest group was no longer underrepresented. Figure 4 shows a steep drop in representativity for the youngest age group from wave 30 to wave 31, highlighting how difficult it is to activate this age group.
Figure 4: Representativity of age groups
Table 5 breaks down population and the net sample by age and gender. This reveals a slight gender-age interaction in the panel representativity. Across the two youngest age groups, women are less undererepresented than men. Among the oldest in the panel men are more overrepresented than women in the same age bracket.
Type | Men | Women | Men | Women | Men | Women |
---|---|---|---|---|---|---|
Population | 9.4 % | 8.9 % | 24.8 % | 24.3 % | 15.7 % | 16.9 % |
Net sample | 3.7 % | 5.3 % | 19.3 % | 20.6 % | 27.8 % | 23.3 % |
The inclusion of educational level in table 6 reveals a systematic underrepresentation of respondents with little or no education, independent of age and gender. The underrepresentation is present in all age brackets, but is especially strong for young respondents.
Men | Women | Men | Women | ||
---|---|---|---|---|---|
18 - 29 years | No education/elementary school | 3.5 % | 2.7 % | 0.3 % | 0.5 % |
Upper secondary education | 3.9 % | 3 % | 1.9 % | 2.5 % | |
University/university college | 2 % | 3.1 % | 1.4 % | 2.2 % | |
30 - 59 years | No education/elementary school | 4.8 % | 3.7 % | 0.5 % | 0.4 % |
Upper secondary education | 10.4 % | 7.1 % | 6 % | 4.1 % | |
University/university college | 9.6 % | 13.5 % | 12.9 % | 16.1 % | |
60 years and above | No education/elementary school | 3.3 % | 4.3 % | 1.6 % | 1.2 % |
Upper secondary education | 7.8 % | 7.8 % | 9.5 % | 6.9 % | |
University/university college | 4.6 % | 4.8 % | 16.8 % | 15.2 % |
Respondents who have completed upper secondary education or have not completed any level of education are in general underrepresented, while respondents with university/university college as their highest level of education are overrepresented. Those who have university or university college education are clearly overrepresented in the two oldest age brackets, irrespective of gender.
Figure 5: Representativity of education groups
Figure 5 illustrates the representation of education groups since wave one. The general trend is that the highly educated are overrepresented compared to those with less or no education. Except for slight improvements in representativity of the education groups when new respondents are recruited (wave 1, 3, 8, 11, 14, 16, 18, 22, 25 and 30), the overall pattern has remained stable throughout all waves.
Men | Women | Total | Men | Women | Total | ||
---|---|---|---|---|---|---|---|
Oslo | 18-29 years | 1.3% | 1.4% | 2.8% | 0.7 % | 1 % | 1.7 % |
30-59 years | 3.6% | 3.5% | 7% | 3.3 % | 3.9 % | 7.2 % | |
60 years and above | 1.4% | 1.6% | 3.1% | 3.4 % | 3.3 % | 6.7 % | |
Total | 6.3% | 6.5% | 12.9% | 7.4 % | 8.2 % | 15.6 % | |
Eastern Norway | 18-29 years | 3.3% | 3% | 6.3% | 1.2 % | 1.6 % | 2.8 % |
30-59 years | 9.3% | 9.3% | 18.6% | 6.4 % | 7.1 % | 13.4 % | |
60 years and above | 6.4% | 7% | 13.4% | 10.9 % | 9 % | 19.9 % | |
Total | 19% | 19.3% | 38.3% | 18.4 % | 17.7 % | 36.2 % | |
Southern Norway | 18-29 years | 0.6% | 0.5% | 1.1% | 0.2 % | 0.2 % | 0.4 % |
30-59 years | 1.4% | 1.4% | 2.8% | 0.9 % | 1 % | 1.9 % | |
60 years and above | 0.9% | 1% | 1.9% | 1.3 % | 1.3 % | 2.6 % | |
Total | 2.9% | 2.9% | 5.8% | 2.5 % | 2.5 % | 4.9 % | |
Western Norway | 18-29 years | 2.5% | 2.3% | 4.8% | 1 % | 1.5 % | 2.5 % |
30-59 years | 6.3% | 6.1% | 12.4% | 5.7 % | 5.3 % | 11 % | |
60 years and above | 4% | 4.2% | 8.2% | 7.7 % | 6.1 % | 13.8 % | |
Total | 12.8% | 12.6% | 25.4% | 14.5 % | 12.9 % | 27.3 % | |
Trøndelag | 18-29 years | 0.9% | 0.8% | 1.7% | 0.4 % | 0.6 % | 1 % |
30-59 years | 2.1% | 2.1% | 4.2% | 1.6 % | 1.8 % | 3.4 % | |
60 years and above | 1.4% | 1.5% | 2.9% | 2.2 % | 1.8 % | 4 % | |
Total | 4.4% | 4.4% | 8.8% | 4.2 % | 4.2 % | 8.4 % | |
Northern Norway | 18-29 years | 0.8% | 0.8% | 1.6% | 0.3 % | 0.3 % | 0.6 % |
30-59 years | 2.1% | 2% | 4.1% | 1.4 % | 1.5 % | 2.9 % | |
60 years and above | 1.6% | 1.6% | 3.2% | 2.3 % | 1.8 % | 4.1 % | |
Total | 4.5% | 4.4% | 8.9% | 3.9 % | 3.7 % | 7.6 % |
We observe that the representation of panel members living in Trøndelag are nearly on level with the population, while respondents from Northern and Eastern Norway are underrepresented. Respondents from Western Norway and Oslo are overrepresented. Respondents aged 60 years and above are overrepresented in all parts of the country, but only slightly so in Southern and Northern Norway. We find the biggest overrepresentation in Oslo.
Figure 6: Representativity of regions
For wave twenty-one, population data stratified on the new regions was available for the first time since the regional reform of 2020. While this data eliminates some small uncertainty in the representativity analyses5, it also introduces a break in time series for Oslo (previously including Akershus) and Eastern Norway (now including Akershus). In the 2024 regional reform, Akershus, among other former counties, were reinstituted as an independent county. In order to preserve the regional division established in 2020, Akershus remains a part of Eastern Norway in our analysis. Compared to age and education, geography does, however, not seem to be a strong determinant of survey participation. Apart from effects from the regional reform, the geographic representativity is more or less stable over time.
A weight has been calculated to compensate for observed biases. The weight is equal to the relation between a given strata in the population and the total population, divided by the relation between a given strata in the net sample and the total net sample6. This procedure returns values around 1, and always more than 0. Respondents belonging to a stratum that is underrepresented will receive a weight above 1 and respondents belonging to an overrepresented stratum will receive a weight below 1. We have listed the weights of the different strata in the table in the appendix.
When calculating the weight, information regarding the respondents’ geographical location, gender, and age is based on registry data. Information on these variables was included in the sample file received from the Norwegian National Population Registry upon recruitment. Information on the level of education is collected from NCP surveys. 1 percent of the 31th wave net sample have not answered the question about level of education.
The following demographic variables are applied in weight4:
When applied, the weight will provide a weighted N equal to the number of cases in the dataset. In other words, the weight is calculated using the whole dataset. NCP has an extensive use of (randomized) sub-groups, which might alter the demographic profile of the sub-group compared to the whole dataset.
Consequently, the weight might be less precise for some sub-groups. Note that the dataset is provided with necessary information7 to calculate custom weights if needed, following the procedure described above.
Table 8 shows the effects of weight 4 on the distribution of self-reported level of education in the net sample. As we can observe, the weight gives the sample a distribution closer to the population. It is, however, important to stress that the unweighted distribution is not ideal, with a clear underrepresentation of people with lower levels of education, and young people.
Sample - standard | Sample - weighted | Population | Difference between sample and population | Difference between weighted sample and population | |
---|---|---|---|---|---|
Upper secondary school and lower | 35.5 % | 62.3 % | 62.3 % | -26.9 | 0 |
University/university college | 64.5 % | 37.7 % | 37.7 % | 26.9 | 0 |
Age | Education | Men | Women | Age | Education | Men | Women | ||
---|---|---|---|---|---|---|---|---|---|
Oslo | 18-29 | Upper secondary and lower | 2.9 | 1.8 | Western Norway | 18-29 | Upper secondary and lower | 3.4 | 1.5 |
University/university college | 1.3 | 1.3 | University/university college | 1.3 | 1.7 | ||||
30-59 | Upper secondary and lower | 2.4 | 2.6 | 30-59 | Upper secondary and lower | 2 | 2.3 | ||
University/university college | 0.8 | 0.7 | University/university college | 0.6 | 0.8 | ||||
60+ | Upper secondary and lower | 1 | 1.1 | 60+ | Upper secondary and lower | 0.8 | 1.3 | ||
University/university college | 0.3 | 0.3 | University/university college | 0.3 | 0.3 | ||||
Eastern Norway | 18-29 | Upper secondary and lower | 3.5 | 2.2 | Trøndelag | 18-29 | Upper secondary and lower | 2.5 | 1.8 |
University/university college | 1.6 | 1.4 | University/university college | 1.8 | 1.1 | ||||
30-59 | Upper secondary and lower | 2.5 | 2.5 | 30-59 | Upper secondary and lower | 2.7 | 2.4 | ||
University/university college | 0.8 | 0.9 | University/university college | 0.7 | 0.8 | ||||
60+ | Upper secondary and lower | 1 | 1.5 | 60+ | Upper secondary and lower | 1.1 | 2.1 | ||
University/university college | 0.3 | 0.3 | University/university college | 0.3 | 0.3 | ||||
Southern Norway | 18-29 | Upper secondary and lower | 4.4 | 2.4 | Northern Norway | 18-29 | Upper secondary and lower | 3.2 | 2.6 |
University/university college | 1.4 | 1.7 | University/university college | 1.8 | 1.6 | ||||
30-59 | Upper secondary and lower | 2.1 | 2.2 | 30-59 | Upper secondary and lower | 2.8 | 2.3 | ||
University/university college | 0.9 | 1 | University/university college | 0.8 | 1 | ||||
60+ | Upper secondary and lower | 1.4 | 2.1 | 60+ | Upper secondary and lower | 1.1 | 1.8 | ||
University/university college | 0.3 | 0.3 | University/university college | 0.3 | 0.4 |
Some examples: sorting respondents in different thematic subsets, randomly allocate treatment value in experiments, randomize order of an answer list/array, order a sequence of questions by random, ask a given question to a subset of the respondents.↩︎
Please see following resource: https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Math/random↩︎
Please see following resource: https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Math/floor↩︎
Collected from Mike Bostocks visualisation: https://bost.ocks.org/mike/shuffle/↩︎
Note that Oslo (including Akershus), and Eastern Norway diverge in wave eighteen, due to the regional reform implemented 1st of January 2020.↩︎
The applied formula for weight wi for element i, in strata h is: wi=Nh/Nnh/n↩︎
See columns r31_Weight4_stratapop and r31_generic_stratapop↩︎