A methodological report from the Malmö Diet and Cancer study: development and evaluation of altered routines in dietary data processing
Nutrition Journal volume 1, Article number: 3 (2002)
In the Malmö Diet and Cancer study, information on dietary habits was obtained through a modified diet history method, combining a 7-day menu book for cooked meals and a diet questionnaire for foods with low day-to-day variation. Half way through the baseline data collection, a change of interview routines was implemented in order to reduce interview time.
Changes concentrated on portion-size estimation and recipe coding of mixed dishes reported in the menu book. All method development and tests were carefully monitored, based on experiential knowledge, and supplemented with empirical data. A post hoc evaluation study using "real world" data compared observed means of selected dietary variables before and after the alteration of routines handling dietary data, controlling for potential confounders.
These tests suggested that simplified coding rules and standard portion-sizes could be used on a limited number of foods, without distortions of the group mean nutrient intakes, or the participants' ranking. The post hoc evaluation suggested that mean intakes of energy-adjusted fat were higher after the change in routines. The impact appeared greater in women than in men.
Future descriptive studies should consider selecting subsets assessed with either method version to avoid distortion of observed mean intakes. The impact in analytical studies may be small, because method version and diet assistant explained less than 1 percent of total variation. The distribution of cases and non-cases across method versions should be monitored.
Diet history methods are interviewer administered quantitative diet methods which typically use cross-check frequency lists to estimate usual food consumption frequencies, and photographic aids, food models or household measuring devices to estimate usual portion-sizes [1, 2]. The assessment methodology is relatively time consuming, and the period of participant accrual will inevitably be long in large-scale studies [2, 3]. Therefore, variations in interview routines or in changes of dietary data collection procedures over time, could potentially affect observed nutrient intakes. Studies of "usual diet" methods have indicated that different portion-size alternatives mainly influence the estimated group mean intakes of energy and nutrients, but have smaller effect on the ability to rank individuals on specific nutrient intakes [4–6]. Differential reports may depend on personal characteristics of study participants like obesity [7–11], socio-economic status , education and ethnicity , or gender . However, food selection is also known to vary by season , and general food consumption trends over long time-periods are observed in most populations [15–17].
In the Malmö Diet and Cancer (MDC) study, a prospective cohort study in the third largest city of Sweden , food habit information was obtained through a modified diet history method [12, 19, 20]. During the six-year baseline examination period, a total of seventeen interviewers conducted the dietary interviews. Several measures were taken to facilitate standardized dietary data collection procedures: a continuous in-service training of interviewers; a computer software for standardized entering and coding of data; an extensive set of coding rules for food items and mixed dishes; and a quality control program of collected data. In the spring of 1993 an unforeseen reduction of grants initiated measures to simplify dietary data collection routines and to make faster interviews possible. The change in routines was preceded by a phase evaluating possible options for simplifying the procedures, and a series of tests to examine the effect of different coding and portion-size alternatives on dietary intakes. Because the total and "real world" effect of the altered routines could not be evaluated prior to change, a separate evaluation project was called for. It is, for instance, plausible that, when implemented in full the altered routines would affect dietary interviewers and study-participants, and subsequently, observed intakes in unpredicted ways. Observed intakes at different points in time, could also depend on factors like characteristics and lifestyle of study-participants, or on the year and season of data collection rather than on the handling of data.
This paper therefore presents two separate studies. Study I describes the method development procedures before routine change. First, the amount of time spent on different parts of the dietary interview, and the contribution of total nutrient intake from the different components of the diet history method was examined. Secondly, a series of tests was conducted using two samples of the MDC study population to examine the effect of different coding and portion-size estimation alternatives on the ability to estimate group mean intakes, and to rank individuals on nutrient intakes. Study II examines MDC baseline data collected before and after the change in routines, and evaluates whether the alteration in handling dietary data influenced the observed mean intakes of selected food-groups and nutrients, independently of lifestyle and characteristics of study-participants.
The Malmö Diet and Cancer study
Population and baseline examinations
The baseline examinations of the MDC study started in March 1991 and ended in October 1996. Eligible participants were men in the age-range 46 to 73 years, and women in the age-range 45 to 73 years, living in the City of Malmö and with Swedish reading and writing skills. When recruitment closed, 28 098 persons had completed all baseline examinations. The data collection included dietary habits, socio-economics, medical history, and lifestyle habits using questionnaires and interview. Anthropometrics, body composition, and blood pressure were collected through direct measurements. Blood samples were collected, frozen, and stored for biochemical analysis at a later stage. Participants visited the study centre twice. During the first visit, the study procedures and questionnaires were explained, direct measurements made and blood samples collected. Two weeks later, the questionnaires completed at home were reviewed and the diet history interview conducted.
The modified diet history method
The modified diet history method of the MDC  consisted of a menu book and a diet history questionnaire. In the menu book, participants recorded cooked meals, cold beverages (i.e., milk, juice, soft drinks, water and alcoholic beverages), drugs, natural remedies, and dietary supplements during seven consecutive days. In the diet history questionnaire the general meal pattern and the frequency and portion-size information of foods consumed regularly and with low day-to-day variation (i.e., hot beverages, sandwiches, edible fats, breakfast cereals, yoghurt, milk, fruits, cakes, candies and snacks) were recorded. The reference period of the questionnaire was the preceding year. The choice of methodology was guided by the need to assess total diet in a middle-aged and older urban population where the daily eating habits included cooked meals and mixed dishes. The participant at home estimated the usual portion-sizes of foods reported in the questionnaire from a booklet with 48 black and white photographs. A more extensive book of photographs was used during the dietary interview to estimate usual portion-sizes of dishes and foods in the menu book. During the interview, the questionnaire and the menu book were checked, according to predefined rules, so that reported food consumption did not overlap and were in concordance with the overall meal pattern reported by the participant.
The specific food information obtained from the questionnaire, the menu book and during the diet history interview was coded, entered and converted into nutrient intake data by use of the interactive computer software KOSTSVAR (AIVO AB), and the Swedish Food Database PC KOST2-93 of the Swedish National Food Administration. PC KOST2-93 contains approximately 1600 basic foods; additional recipes and food codes were added specifically for the MDC study.
Portion-sizes were estimated with photographic aids during the interview. Typically, a set of 4 photos (A-D), displaying 4 different portion-sizes of the same dish, was shown to the participants. One set of photos was shown for each dish, or food, registered in the menu book. The participants were not limited to the amounts indicated by the photos, but were encouraged to describe their usual portion-sizes as exact as possible. Thus, portion-sizes could be expressed in several ways i. e. "half the size of C", "between B and C", "D plus A" etc. Information on portion-sizes was entered into the computer and converted into grams.
When coding foods and mixed dishes recorded in the menu book, the software guided the interviewer through a system of "recipe identifiers". These specifically helped identify preparation methods and ingredients in mixed dishes. A "recipe identifier" indicating the type of dish (e.g., casserole with meat) was first entered. The following menu on the screen listed potential codes, indicating the specific constituents of different casseroles. The interviewers choose the most appropriate code and made necessary adaptations of the recipe concurrently, depending on the information given by the participant. The MDC method included the option of exchanging a maximum of four ingredients in standard (default) recipes. The ingredient changes focused on the amount and quality of fat (type of dietary fat, liquid in sauces and casseroles, meat, fish) and vegetables. In addition, the MDC method included the option of creating new individual recipes during the dietary interview. This procedure was used (by judgement of the dietary interviewer) when standard recipes, with ingredient exchanges, did not cover the recipe described by the participant.
Extensive in-service training, the interactive computer software with specific coding rules and a continuous quality control program of collected data ensured standardisation of dietary data collection across dietary interviewers. Weekly training sessions and bi-yearly workshops were conducted to discuss and solve problems related to coding and entering of dietary data. In addition, the two head nutritionists (I.M. and U.J.) conducted weekly inspections of questionnaires and menu books (randomly selected from each dietary interviewer), and regularly listened in on dietary interviews. Extreme portion-sizes were identified through a monthly, computerised quality control routine, and were either verified or corrected if erroneous. In addition, the extreme and median values of total energy, all nutrients, and major food groups were regularly inspected, and erroneous values attended to. Finally, the age and gender specific ratios of total energy intake to basal metabolic rate (EI/BMR) was computed , using the formula for BMR identified by a joint FAO/WHO/UNU expert consultation . Extreme and median values were identified, and the dietary reports of these individuals were checked for errors.
The concurrent validity of the diet assessment method was previously tested against 18 days of weighed food records, collected during one year, as the reference . The validation study included 241 Malmö residents (126 men and 115 women) in the age range 50–69 years. The energy and nutrient correlations were amongst the highest compared to those found in validation studies of other "usual" diet instruments, performed in other populations [23–27].
Study I: Development of new routines to handle dietary data
Study I describes the development procedures undertaken during the autumn of 1993 and spring of 1994, prior to implementing the change of routines in dietary data handling.
Six dietary interviewers with long interviewing experience recorded the amount of time required for the different parts of the diet history interview (i.e., information given to participants, general meal pattern, menu book, and diet history questionnaire). The time records were kept during one week in the autumn of 1993 and included interviews with all study-participants (n = 64), seen by the six interviewers that week.
Nutrient intakes from the different components of the diet history method
Dietary data from all participants joining the study during 1992 was used to examine the origin of nutrient intake information. Total nutrient intake was first partitioned into intakes estimated from the diet history questionnaire, from the menu book excluding beverages, and from beverages in the menu book. Secondly, the menu book excluding beverages was examined separately, to identify the types of foods and mixed dishes that were the major contributors of specific nutrients. Mixed dishes and foods were aggregated into groups defined by the type of dish (e.g. all sauces in one group). The choice of nutrient variables was guided by (1) the potential relations between foods, nutrients and cancer development described in the literature, and (2) the wish to examine changes in nutrient-markers of specific food groups (e.g. B12 for meat and eicosapentaenoic acid for fish). The percentage contribution of different food sources in the menu book was calculated for the following nutrients: Total fat, saturated fatty acids (SFA), monounsaturated fatty acids (MUFA), polyunsaturated fatty acids (PUFA), linoleic acid (18:2), α-linolenic acid (18:3), eicosapentaenoic acid (EPA), cholesterol, β-carotene, tocopherol, selenium, dietary fibre, B12, and folic acid.
Tests of different options for reducing interview time
Two samples from the MDC cohort were selected for the testing of interview simplifications. All study participants who joined the study during 1992 (2660 women and 1769 men) and 156 study participants (88 women and 68 men) who joined the study in November and December 1993.
Standard portion-sizes and simplified coding
Dietary data collected from individuals of the larger sample was used in three steps testing the use of standard portion-sizes and simplified coding. First, median portion-sizes were defined, separately for men and women; these were used as "standard portion sizes" in the tests. Secondly, intake data assessed with the original individually estimated portion-sizes were compared with intake data using the standard portion-sizes for a limited number of dishes and foods (see appendix 1). Thirdly, intake data assessed with the original recipe coding (with ingredient exchange options) were compared with intake data obtained with recipes in the default format (standard recipes with no exchanges). These tests focused on recipes that required large time input during coding, and dishes with low nutrient contributions (see appendix 1). Nutrient intake estimates of test data were compared with original data.
The combined effect of simplified coding and standard portion-sizes was finally examined. In the same data set, the individually estimated portion-sizes were converted to standard portion-sizes, and the recipes with exchange options were converted to default recipes. The combined effect was examined by comparing converted data with original data (i.e., data including both individually estimated portion-sizes and exchange of ingredients in recipes).
Fewer portion-size photos and individual recipes
Individuals of the smaller sample participated in an experiment of using fewer portion-size photos. In this test, one set of photos was selected to represent several foods within a food group (e.g. photos of one type of boiled vegetables for all types of boiled vegetables). The number of photo sets was reduced from 180 to 14. Participants were asked to select portion-sizes twice: first with to the original, complete photo sets, and then using the reduced number photo sets. In addition, data from the smaller sample were used to examine the effect of not using individual recipes, by re-coding all individual recipes to the "best choice" among the standard recipes in the existing database. The two head nutritionists undertook this re-coding procedure after completion of the dietary interviews.
The nutrients examined when testing the effect of different portion-size and coding alternatives were: Total energy (kcal), total fat (g), linoleic acid, 18:2 (g), α-linolenic acid, 18:3 (g), eicosapentaenoic acid, EPA (g), tocopherol (g), selenium (μg), β-carotene (mg), ascorbic acid (mg), dietary fibre (g) and B12 (μg).
Paired t-tests were used to compare mean nutrient intakes between test data and original data. The effect of changed routines on ranking of individuals was examined with two approaches. First, quintiles of energy and nutrient intakes estimated from the original data were cross-classified against quintiles of the test data. Secondly, correlation analysis was conducted comparing data from the original data with the test data. The Pearson product moment correlation coefficients were calculated for energy and all examined nutrients. In all tests, all variables were logarithm transformed and nutrient intakes were energy-adjusted according to the residual method . Tests with the larger sample of standardised portion-sizes and of simplified coding, were gender specific. In addition, these tests first examined the specific effect of each subset of codes (appendix 1), and secondly, the total effects of all subsets. The combined effect of standard portion-sizes and simplified coding was examined with all subsets of codes.
Results and Discussion
The mean time required for the diet history interview was 65 (SD 8) minutes. Approximately 50 percent of the time was spent on the menu book, 25 percent on the dietary questionnaire and the rest on the meal pattern and on information given to the participants. The menu book (exclusive beverages) contributed approximately 30 percent of total mean intake for most nutrients (see Table 1,2). However, more than 60 percent of the total intake of eicosapentaenoic acid, approximately 50 percent of cholesterol, β-carotene, and selenium, and almost 50 percent of B12 originated from the menu book. Fish-dishes contributed most to the total intake of eicosapentaenoic acid, while boiled vegetables, salads as side dishes, casseroles, and soups were major contributors of β-carotene. Dishes with meat, fish, or egg contributed most to selenium intakes, and meat and fish dishes to B12.
Tests of different options for reducing interview time
Because handling of dietary information from the menu book, specifically the use of certain portion-size aids and coding routines, proved to be the most time-consuming parts of the dietary interview, these were given priority in method development. Although, the overall intake contribution of the menu book was smaller compared to the diet questionnaire, a concern was raised for some nutrients (from vegetables, meat and fish) of potential importance in diet-cancer studies . This concern prompted the extensive testing of the potential changes in dietary data handling.
Because the menu book also contributed substantially to the assessment of important foods and nutrients, the possible interview simplifications were limited to two procedures: portion-size estimation and coding of foods and dishes assessed through the menu book. The results from tests examining the combined effects are presented.
When examining data from the larger sample and the use of standard portion-sizes compared to individual portion-sizes, the correlation coefficients were very high, in both men and women, table 3. The lowest correlation was observed in women for linoleic acid (r = 0.982) and α-linolenic acid (r = 0.983). In cross-classification of corresponding quintiles, the exact agreement was above 90 % for β-carotene, vitamin C, fibre, and B12 both in men and women. For energy, α-linolenic acid, tocopherol and selenium the agreement was between 85 and 90 percent. Linoleic acid had the lowest exact agreement, 84 percent in women. Crude data had slightly lower agreement in some nutrients, but for most nutrients, the results were the same. Differences in mean intakes of energy and nutrients were significant, original data had slightly higher (i.e., <2% for men and <1% for women) levels compared to test data (data not shown).
Also, with the larger sample, simplified coding only marginally affected the ranking ability of observed intakes, table 3. The correlations were very high (r > 0.990) for energy and all nutrients, except β-carotene, in both gender groups. The exact agreement of corresponding quintiles was well above 90 percent for energy and most nutrients, in both men and women. In women, the exact agreement for β-carotene was 91 percent, and in men 89 percent. Crude data had slightly (1–2%) higher agreement. Differences in mean energy and nutrient intakes were overall extremely small, original data had somewhat higher (<1% for both men and women) intake levels). Differences were not significant for ascorbic acid in women and for fibre in men (data not shown).
The combined effect of standard portion-sizes and simplified coding showed slightly lower correlations and agreements compared to standard portion-sizes or simplified coding only. Overall correlations were higher than 0.980, except for linoleic acid in women (r = 0.972). The exact agreement in classification was 85 percent or higher for energy and all nutrients except selenium (83 percent in women, 81 percent in men), linoleic acid, and α-linolenic acid in women (82 and 83 percent), data not shown.
When testing the effect of fewer sets of portion-size photos, in the smaller sample, compared to complete sets of photos, the correlations were high (r > 0.950) for all nutrients. In cross-classification the exact agreement was between 85 and 90 percent for fibre, eicosapentaenoic acid and α-linolenic acid. For most other nutrients, the agreement ranged between 75 and 85 percent, table 4. The lowest agreement (74 percent) was seen in β-carotene. Differences in mean intakes of energy, total fat, selenium and fibre were significant but small (1,5–3,5%), original data having the highest level (data not shown). Differences in mean intakes of linoleic acid, α-linolenic, eicosapentaenoic acid, tocopherol, β-carotene, ascorbic acid, and B12 were not significant.
When re-coding individual recipes to standard recipes, also with the smaller sample, the observed correlations were extremely high (r > 0.995) for all nutrients, table 4. The exact agreement of corresponding quintiles was 97 percent or more for most nutrients. For vitamin B12 the exact agreement was 95 percent. No significant differences were seen in mean intakes of energy or in any of the examined nutrients (data not shown).
The influences on dietary intakes, ranking especially, of routine changes was small. Using standardised portion-sizes or reduced number of photo sets influenced the observed intakes to a larger extent, than using standard recipes instead of individual recipes or recipes with fewer ingredient exchanges.
Overall, the changes to use of portion-size aids influenced the observed intakes more than changes in coding routines. Since the effect on nutrient intakes was considered to be too large (table 4), the tested reduction of portion-size photo sets was not implemented. The use of standard portion-sizes was accepted, but to a smaller extent than in, the tests performed. For instance, individual portion-sizes were kept for major food sources of polyunsaturated fatty acids and selenium (table 3), and for all vegetables to minimise effect on observed intakes. The procedure of creating individual recipes was time consuming. Since the tests showed that removing this procedure only marginally influenced nutrient intakes, this option was removed. The effect of reducing the number of recipes with exchange options was more complex. This option was retained for recipes that were major sources of β-carotene, total fat and specific fatty acids. When these changes (standard portion-sizes on a limited number of foods, fewer recipes with exchange options, and no individual recipes) were carried through, the number of questions asked during the interview was substantially reduced. Other tasks (i.e., check of the socio-economic questionnaires) were also removed from the dietary interview. No changes were made to the interview routine of the diet history questionnaire, or to coding of reported beverages in the menu book. There were no changes in the information given to participants on their first visit to the screening centre, nor was the menu book or diet history questionnaire changed in layout, number of questions, or in any other way. Taken together the time allocated to each interview was reduced from 75 minutes to 45 or 60 minutes, depending on logistics at the screening centre.
It is not uncommon that large-scale studies modify methods over time. For instance, the well-known Nurses Health Study has expanded its food frequency questionnaire (FFQ) several times. A 61-item FFQ was used in 1982, when dietary data collection first was implemented . However, a 121-item and a 134-item FFQ have been used in later follow up studies [30, 31]. As pointed out by Block, a FFQ should be validated in a population with the same age and sex distribution as the study population. If modifications are made, the ideal procedure is to validate the modified method together with the original in a new validation study .
All method development and tests described in this paper were carefully monitored, based on experiential knowledge, and supplemented with empirical data. The study design was under strict control and the tests illustrate the ideal situation. However, "in the real world" a number of other factors would also influence the effects of the method change. Most of these were not possible to examine before the change of data handling. For instance, the effect of the new coding rules when implemented in total was not examined. It is also possible that the dietary interviewer's decisions on choice of codes were influenced in ways not predicted. In addition, the introduction of new routines was preceded by intensified training sessions, which possibly could influence the dietary interviewers to follow all rules more consistently. Long interviewing experience could have had the opposite effect. In addition, the shorter interview time could potentially produce unintended influences on the complex interview. Both the participants and the interviewers might for instance experience the situation as more stressful. The interviewers met more participants each day and, although the number of questions asked was reduced, seeing more individuals could be perceived as an added burden. Many participants wanted to talk about details in recipes and food choices. With the reduced interview time and simplified coding procedures, there was neither time nor need for details, which could have been negatively perceived by the participants. In addition, it is possible that the shorter interview had non-intended influences on the check-up of diet history questionnaires.
In summary, the tests indicated that a change of dietary data handling routines potentially was possible without major effects on ranking of individuals on dietary intakes. However, the tests could not fully evaluate the impact of the change in dietary data handling. Such an evaluation was only possible after implementing the altered routines in the "real world" setting.
Study II: Comparison of intakes estimated before and after routine change
Study II is a post hoc study that uses data from the MDC baseline examinations, and examines observed mean intakes before and after change in routines.
Study II uses of two samples selected from the MDC cohort to include individuals joining the study before and after the alteration of interview routines (i.e., the original routines were in use until August 31 1994, and the new routines started on September 1 1994. Also, see Table 5). A study design with paired comparisons (i.e., data collected with both methods in the same individuals) was not possible. One sample included 672 individuals who joined the study in July and August 1994 (i.e., just before the alteration) and in September 1994 (i.e., right after the alteration). The other sample, selected to avoid seasonal influences, consisted of 621 individuals who joined the study in the month of September during four consecutive years (i.e., 1992 and 1993 before the alteration, 1994 and 1995 after the alteration).
A dichotomous method variable was constructed in each of the two data sets. Individuals that joined the study before September 1 1994 were categorised as "One", and those that joined in September 1994 or later as "Two".
The two samples were selected with regard to interview month and year, but without specifying any particular dietary interviewer. As a result, study-participants were not uniformly distributed across the six dietary interviewers and the two method versions of sample I. However, participants belonging to sample II were interviewed by two dietary interviewers only (i.e., those that interviewed a sufficient number of participants for all four study periods), and were evenly distributed across both dietary interviewers and method versions.
Food groups, energy and nutrients
The method development and tests before alteration of routines indicated that the menu book contributed about 30 percent of the intake information for most nutrients, but for some nutrients, more than 50% of the information came from the menu book. Therefore, this study selected eight food groups to represent foods assessed mainly through the menu book (i.e., vegetables, meat, fish and milk), or the diet history questionnaire (i.e., fruits, bread, dietary fats and cheese). Energy and nutrient variables were also selected so that both those with smaller and larger contributions from the menu book would be represented, that is total energy (kcal), total fat (g), dietary fibre (g), polyunsaturated fatty acids, PUFA (g), β-carotene (mg), B12 (μg), selenium (μg), and eicosapentaenoic acid, EPA (g).
Socio-economic, demographic and lifestyle information was in the MDC collected through a self-administrated questionnaire. The influence of age, gender, and socio-economic status were in this study examined as covariates and cofactors in the final multivariate analysis.
Leisure time physical activity was assessed by a list of activities in the questionnaire (18 items), modified from of the Minnesota Leisure Time Physical Activity Instrument . Participants were asked to report how many minutes per week on average, and for each of the four seasons, they spend on a specific activity. A physical activity score was obtained by multiplying the number of minutes for each activity with an activity-specific factor, and four category variable was defined by the participants' quartile ranking.
This study used Body Mass Index (BMI) and Waist-Hip Ratio (WHR), computed from direct measurements, as indicators of obesity. Leisure time physical activity, BMI and WHR were included as covariates in the final multivariate analysis.
The statistical package SPSS was used in all analytical procedures . All continuous variables were log-transformed before analysis to normalise distributions. Intakes of selected food groups, energy, and nutrients were compared between the first and second method versions using the general factorial analysis of variance including two-way interactions. The relations between intakes and method versions were first examined in a gender specific fully factorial design, including dietary interviewer as a cofactor. Secondly, gender specific models using main effects designs were constructed, including dietary interviewer-method interaction terms if significant. Finally, nutrient intakes were compared between method version while simultaneously controlling for energy intake, dietary interviewer, socio-economic status, age, obesity indicators (BMI and WHR), and leisure time physical activity. In order to account for the fact that dietary interviewer represents a stochastic (random) effect, the model was also formulated as a mixed general factorial model . However, the analysis indicated that the stochastic effects assumption of dietary interviewers was not valid (the estimated variance components were negative), and therefore only the fixed model analysis is reported.
The short term effects of change in data collection routines involving six dietary interviewers are illustrated by comparison of intakes using sample I. As indicated in Table 6 women underestimated fish, fruits and milk with the second method version compared to the first, but both men and women overestimated dietary fats. In women, a dietary interviewer-method interaction was seen for vegetables.
In women, unadjusted energy and nutrient intakes were mostly underestimated with the second method version compared to the first, except for polyunsaturated fatty acids and β-carotene. In men, only energy, selenium, and fibre were underestimated with the second method version. Significant dietary interviewer-method interactions were seen for β-carotene and B12 in women, and for energy, fibre, and selenium in men (data not shown).
When additional variables were added to the models (Table 7) energy intakes remained significantly lower for both gender groups. However, energy-adjusted fat intakes were higher with the second method version. Intake differences remained significant with adjustments for energy, fat, selenium, and B12 in women, and for energy, fat, and fibre in men. The only dietary interviewer-method interaction that remained significant was that for differences in β-carotene in women.
Similar comparisons using sample II, which illustrate the long term effects of altered routines involving only two dietary interviewers, indicate that fruit intakes were underestimated in women with version two, but dietary fats were overestimated (Table 8). No food-group differences between method versions were seen in men, but significant dietary interviewer-method interactions were observed for cheese, dietary fats, and bread.
When examining unadjusted intakes in women, eicosapentaenoic acid, fibre, selenium, and B12 were underestimated with the second version, but no differences were seen in energy and other nutrients. In men, energy and B12 were significantly underestimated, but β-carotene was overestimated with the second version compared to the first. Significant dietary interviewer-method interactions were observed in women for eicosapentaenoic acid, and in men for energy, fat and PUFA.
When including additional variables in the models (Table 9), most differences remained significant. All dietary interviewer-method interactions disappeared, except for energy in men. Energy-adjusted fat was overestimated in both gender groups.
Thus, differences in energy-adjusted total fat estimates were consistent in both data sets. After altered interview routines energy-adjusted fat intakes were overestimated. Additional adjustment did not change these relationships, but the dietary interviewer effects and interactions were removed (Table 7 and 9). In addition, estimates of poly-unsaturated fatty acids and eicosapentaenoic acid showed some consistency in both data sets. With energy adjusted data there were no differences in poly-unsaturated fatty acid intakes in either data set. Intakes of eicosapentaenoic acid were significantly underestimated for women in both data sets without energy adjustment, but only in sample II when adjusting for energy.
Although the relations with fat appear consistent, dietary interviewer and method version explained a very small proportion of the total variation in fat intake. The R-squared was 0.023 for women in sample I, and 0.004 for men, when the model included method version and dietary interviewer only. With additional adjustment (i.e., energy, obesity indicators, age, socio-economic status), the R-squared was 0.777 in women and 0.812 in men. The corresponding figures for sample II are 0.005 for women and 0.033 for men (when the model also included the dietary interviewer-method version interaction term). With additional adjustment, the R-squared was 0.804 for women in sample II and 0.773 for men. When the multivariate relation between fat intake and adjusting variables was examined, excluding method version and dietary interviewer, the R-squared was in sample I 0.773 for women and 0.806 for men. In sample II, the corresponding numbers were 0.801 for women and 0.769 for men. Thus dietary interviewer, method version and dietary interviewer-method interactions explain less than one percent of the total variation in fat intake.
In summary, after alteration of interview routines intakes of energy and several of the examined nutrients were lower, but energy-adjusted intakes of total fat appear overestimated. However, method version and dietary interviewer explained a very small proportion of the total variation of fat intake. The altered routines also appeared to affect intake estimates more in women than in men. For instance, fruits were consistently underestimated and dietary fats overestimated in women. Dietary interviewer-method interactions, contributed significantly to observed differences between method versions in several food groups and unadjusted nutrient intakes, but most of these did not remain significant in multivariate analysis. It should be noted that, due to the multitude of comparisons, some of the observed differences between method versions might be due to type I errors.
Study II does not compare two dietary assessment methodologies, but evaluates whether two different approaches in handling of dietary data collected with the same MDC dietary history method produce similar mean intake estimates. The MDC study used a detailed dietary history method to enhance precision of dietary intake estimates. The high concurrent validity of the MDC diet history method has previously been documented [3, 36]. Other studies, which have opted for the less costly food frequency questionnaire method appear to show weak or inconsistent results , and have attracted strong criticism [28, 38]. Different types of methodologies differ greatly in details of food information and precision. A common threat to all nutrition epidemiological studies is random non-differential misclassification, because it commonly results in attenuated diet-disease relations. The source of such misclassification is often measurement "errors" in the dietary assessment process [38–42]. Systematic misclassification between population sub-groups is a serious problem in descriptive studies when mean nutrient intakes of specific population groups are estimated in order to evaluate the health status of the population. For instance, mean nutrient intakes are commonly compared with recommended daily intakes of specific nutrients. Regardless of choice of assessment methodology, it is essential for nutrition epidemiologists and public health nutritionists to understand the specific features of the dietary assessment process so that these can be accounted for in analysis and interpretation of results. For instance, diet history methods may not be robust to changes in the interviewer . Also, the ability of individuals to estimate portion-sizes and common consumption frequencies in usual diet methods may depend on the specific assessment aid [22, 43, 44], the interview technique [45, 46], or the organisation of the food list . Studies also suggest that usual diet reports may be affected by diverse factors like season of data collection , ethnicity and education , degree of obesity [7, 49], socio-economic status , and the perception of societal norms [50, 51].
Impact of method change
The altered routines of dietary data handling were only implemented for foods estimated with the menu book. No changes were made to the routines of the diet history questionnaire, or to coding of reported beverages in the menu book. The implemented changes in coding and portion size estimation were selected so that assessment of vegetables, of major selenium sources, of total fat and polyunsaturated fatty acids would not be compromised. There was a concern that foods (like vegetables, meat and fish) and nutrients (like β-carotene, selenium and EPA) with hypothesized importance for cancer development would be most affected. The different results observed in this study for women and men, the elevated estimates of energy-adjusted fat, and the dietary interviewer-method interactions (i.e., bread, cheese and dietary fats in men) were not expected. The findings may indicate that the individual portion sizes in version one produced estimates with higher precision in women, and that lower precision was obtained with the standardized portion-sizes of version two. In men, the two method versions may not have produced such differences in precision. This observation is supported by findings from another methodological study within the MDC, which concluded that women were better compared to men at estimating the amount of fat on bread when using photographic aids . Researchers have observed that women are likely to respond differently to dietary assessment than men, although it is hard to fully conceptualise what it is about gender that cause these differences . Studies imply that gender differences may be population specific , and depend on personal characteristics .
The observed differences in mean intakes estimated before and after change in routines needs to be considered in studies using MDC data to describe and compare food and nutrient intakes between population groups. For instance, when comparing mean intakes with dietary recommendations, and between population groups, erroneous conclusions about healthfulness of diet could be made, if method versions are not randomly distributed across intake levels and population sub-groups. Descriptive studies should consider selecting samples assessed with either method version to avoid distorted intakes. Depending on the specific research question adjustment for year and season of dietary interview, and dietary interviewer, should also be considered.
Some seasonal influences on observed intakes are expected in the MDC study. The menu book requests "current diet" information from seven consecutive days, while the questionnaire asked participants about their "usual diets" during the past year. Current diet methods (like diet records and 24-hour recalls) reflect dietary intakes on singular days, and are influenced by seasonal variation in food selection [3, 52]. However, usual diet reports also tend to be influenced by the season of data collection . This study controlled for season of data collection in the design, i.e., selecting individuals examined in the month of September during four consecutive years removed the influence of season. However, seasonal differences could be expected when comparing participants examined in the summer or early autumn with those examined during winter or early spring. Future studies therefore may need to consider seasonal adjustment in analysis.
The elevated fat estimates in relation to total energy with the second method was an unexpected finding. It is plausible that when the interview time was reduced comparatively more attention was given by dietary interviewers to fat providing foods, which resulted in an overestimation of fat relative to other macronutrients. This is illustrated by the overestimation of dietary fats and underestimation of fruits with version two. Both food groups are important energy contributors in this population. These observations could have implications for future studies of the relation between dietary fat and disease. Due to the latency period of chronic disease, it is likely that many cases will be assessed with the first version of the dietary assessment method during the early follow-up period. A greater proportion of non-cases would then erroneously be assigned to higher energy-adjusted fat intakes. If this were to happen in a study examining the relation between dietary fat and disease, the interpretation would be that dietary fat protects against disease even if there was no "true" relation. It should, however, be noted that method version explained a very limited proportion of the overall variation in fat intakes. Because method version, dietary interviewer and dietary interviewer-method version interactions accounted for less than one percent of the total variation, the impact of method change may be small in analytical studies. However, the distribution of cases and non-cases across methods version should be monitored in future analytical studies, and its impact on outcomes evaluated further.
This study could only compare group mean intakes in groups of individuals participating in the MDC baseline examinations either before or after the change in methodology. The two method versions were not administrated in parallel and therefore it was not possible to compare means or ranking of estimates in the same individuals (i.e., paired comparisons). This approach probably exaggerated observed differences between method versions. It may be the major reason why the results are discrepant from those of Study I. The development work either used paired comparisons in the same individuals or dealt with recoded data. Other studies of usual diet methods have, however, found that portion size estimation have greater influence on estimated group means than on the ranking ability [4–6]. It is therefore plausible that the apparent underestimation of the second version compared to the first does not affect the ranking ability to the same degree. Since studies of diet and disease are examining the difference in risk between extreme exposure groups , the influence from the two method versions on ranking of nutrient intakes would be of major interest, but the design of this study did not allow for such examinations.
Another limitation is the non-random distribution of individuals and dietary interviewers across comparison groups. The multivariate analysis used both fixed and mixed models, but the results presented are those from the fixed model analysis. The overall effect of dietary interviewer appeared to be very small and the assumption of stochastic (random) effects of dietary interviewers was not valid. It can be argued that this study underestimated the overall dietary interviewer effects, because only a few dietary interviewers conducted interviews in the data sets selected for analysis. However, small differences across dietary interviewers are also an indication that the extensive efforts to standardise interview and coding had the intended effect. Regardless, dietary interviewer, and dietary interviewer-method interactions, contributed significantly to differences between method versions for food groups, and for energy and several unadjusted nutrient intakes. Most of these effects seemed to disappear either with energy-adjustment, or with additional variables included in the models. Therefore, the influence of dietary interviewers appears to depend on personal characteristic of study-participants.
Although, a change in dietary data collection routines is not recommended during the active data collection phase, reality sometimes forces such undesired changes. Findings of Study I suggest that alterations in the handling of dietary data (when examined under carefully controlled situations) appear possible without substantial impact on the ranking ability or mean nutrient intake levels. However, as has been discussed, Study I could not assess the total impact of altered routines. Study II, which used "real world" data collected during the baseline examinations, examined mean dietary estimates before and after the alteration of routines. The latter study suggests that future descriptive studies using the MDC data should preferably select subsets of the population assessed with either method version to avoid distortion of observed intakes. Depending on the specific research question, adjustment in analysis for year and season of data collection, and dietary interviewer may need to be considered. However, the impact of altered interview routines on the outcome of analytical studies probably is small. Although, the impact of dietary data collection procedures on risk estimates of disease in the MDC study is not yet evaluated, Study II suggests that method version and dietary interviewer explained a very small proportion of total variation. Method version differences between cases and non-cases need, however, to be monitored.
Appendix 1. List of dishes/foods included in different test
The effect of standard portion size was first analysed separately for each of the four groups, and then the effect of all groups was estimated.
Group 1: Pickled vegetables, boiled legumes, salads as main dish, shellfish, smoked fish, black-pudding/black sausage, fried potato dishes, egg dishes, porridge, pasta sauce, fast food, cakes, dressing, condiments, smoked meat, gratin/pudding, soufflé, pizza.
Group 2: Corn on the cob, tomato (preserved), artichoke, garlic, avocado, tomato, sweet pepper, onion/leek, dill/parsley/chive, salads with mayonnaise (as side dish), fried vegetables, pickled herring, bacon, spareribs, pork bone, chicken, liver, pigs trotters, snails, cold sauces, stewed macaroni.
Group 4: Rice, pasta, mashed potatoes, French fries, fried potatoes, mashed turnips.
The effect of simplified coding was first analysed separately for each of the six groups, and then the effect of all groups was estimated.
Group1 (included dishes giving <1 percent of energy and key nutrients): Fried vegetables, baked vegetables, deep fried vegetables, deep fried fish, deep fried meat, smoked meat/poultry, boiled poultry, fried black sausage, fried potato dishes, porridge, pasta sauce, pie, pate/mousse, crepes, pirogues, pizza, miscellaneous small dishes.
Group 2: Desserts, puddings, gratins, complete dishes, salad as main dish.
Group 3: Sauces
Group 4: Soups
Group 5: Casseroles
Group 6: Minced meat dishes (if not already tested in group 1–5)
Malmö Diet and Cancer
Ratio of Energy Intake to Basal Metabolic Rate
Basal Metabolic Rate
Food and Agricultural Organization/World Health Organization/United Nations University
saturated fatty acids
monounsaturated fatty acids
polyunsaturated fatty acids
2 linoleic acid
3 linolenic acids
Food Frequency Questionnaire
Body Mass Index
Waist Hip Ratio
Dwyer JT: Assessment of dietary intake. In: odern nutrition in health and disease. Edited by: Shils ME, Olson JA, Shike M. 1994, Philadelphia, Lea & Febiger, 842-860.
Margetts BM, Nelson M: Design Concepts in Nutritional Epidemiology. Oxford New York Tokyo, Oxford University Press Oxford Medical Publications. 1997, 2
Willett W: Nutritional Epidemiology. Monographs in Epidemiology and Biostatistics. 1998, New York Oxford, Oxford University Press, 30: 101-147. 2
Flegal KM, Larkin FA: Partitioning Macronutrient Intake Estimates from a Food Frequency Questionnaire. Am J Epidemiol. 1990, 131: 1046-1058.
Flegal KM, Larkin FA, Metzner HL, Thompson FE, Guire KE: Counting Calories: Partitioning Energy Intake Estimates from a Food Frequency Questionnaire. Am J Epidemiol. 1988, 128: 749-760.
Hunter DJ, Sampson L, Stampfer MJ, Colditz GA, Rosner B, Willett WC: Variability in portion sizes of commonly consumed foods among a population of women in the United States. Am J Epidemiol. 1988, 127: 1240-1249.
Braam LA, Ocké MC, Bueno-de-Mesquita HB, Seidell JC: Determinants of obesity-related underreporting of energy intake. Am J Epidemiol. 1998, 147: 1081-1086.
Lafay L, Basdevant A, Charles M, Vray M, Balkau B, Borys JM, Eschwège E, Romon M: Determinants and nature of dietary underreporting in a free-living population:the Fleurbaix Laventie Ville Santé (FLVS) study. Int J Obes Relat Metab Disord. 1997, 21: 567-573. 10.1038/sj.ijo.0800443.
Willett WC, Stampfer MJ: Total energy intake: Implications for epidemiologic analysis. Am J Epidemiol. 1986, 124: 17-27.
Pryer JA, Vrijheid M, Nichols R, Kiggins M, Elliott P: Who are the "Low Energy Reporters" in the Dietary and Nutritional Survey of British Adults?. Int J Epidemiol. 1997, 26: 146-154. 10.1093/ije/26.1.146.
Klesges RC, Eck LH, Ray JW: Who underreports dietary intake in a dietary recall? Evidence from the second National Health and Nutrition Examination Survey. J Consult Clin Psychol. 1995, 63: 438-444. 10.1037//0022-006X.63.3.438.
Stallone DD, Brunner EJ, Bingham S: Dietary Assessment in Whitehall II: The influence of reporting bias on apparent socioeconomic variation in nutrient intakes. Eur J Clin Nutr. 1997, 51: 1-11. 10.1038/sj.ejcn.1600491.
Liu K, Slattery M, Jacobs D, Cutter D, Mcdonald A, Van Horn L, Hilner J, Caan B, Bragg C, Dyer A: A study of the reliability and comparability of the CARDIA dietary history. Ethn Dis. 1994, 4: 15-27.
Wirfält AKE, Pettersson U, Andrén C, Mattisson I: Fat on bread: An evaluation of photographic aids in portionsize estimation. Eur J Clin Nutr. 1998, 52 (Supplement 2): S30-
Helsing E: Trends in fat consumption in Europe and their influence on the Mediterranean diet. Eur J Clin Nutr. 1993, 47: S4-S12.
Senauer B, Asp E, Kinsey J: Food trends and the changing consumer. St. Paul MN, Eagan Press. 1993
Kromhout D, de Lezenne-Coulander C, Obermann de Boer GL, van Kampen Donker M, Goddijn E, Bloemberg BP: Changes in food and nutrient intake in middle-aged men from 1960 to 1985 (the Zutphen Study). Am J Clin Nutr. 1990, 51: 123-129.
Mullenbach V, Kushi LH, Jacobson C, Gomez-Martin O, Prineas RJ, Roth-Yousey L, Sinaiko A: Comparison of 3-day food record and 24-hour recall by telephone for dietary evaluation in adolescencents. J Am Dietet Assoc. 1992, 92: 743-745.
Fanelli MT, Stevenhagen KJ: Consistency of energy and nutrient intakes of older adults: 24-hour recall vs. 1-day food record. J Am Dietet Assoc. 1986, 86: 665-667.
Goldberg GR, Black AE, Jebb SA, Cole TJ, Murgatroyd PR, Coward WA, Prentice A: Critical evaluation of energy intake data using fundamental principles of energy physiology: 1. Derivation of cut-off limits to identify under-recording. Eur J Clin Nutr. 1991, 45: 569-581.
Mertz W, Tsui JC, Judd JT, Reiser S, Hallfrisch J, Morris ER, Steele P, Lashley E: What are people really eating? The relation between energy intake derived from estimated diet records and intake determined to maintain body weight. Am J Clin Nutr. 1991, 54: 291-295.
Wolk A, Bergström R, Adami H-O, Ohlander E-M, Bruce Å, Holmberg L, Bergkvist L: Self-administered food frequency questionnaire:the effect of different designs on food and nutrient intake estimates. Int J Epidemiol. 1994, 23: 570-576.
Mahalko JR, Johnson L-AK, Gallagher SK, Milne DB: Comparison of dietary histories and seven-day food records in a nutritional assessment of older adults. Am J Clin Nutr. 1985, 42: 542-553.
Madden JP, Goodman SJ, Guthrie HA: Validity of the 24-hour recall. Analysis of data obtained from elderly subjects. J Am Dietet Assoc. 1976, 68: 143-147.
Carter RL, Sharbaugh CO, Stapell CA: Reliability and validity of the 24-hour recall. J Am Dietet Assoc. 1981, 79: 542-547.
Gersovitz M, Madden JP, Smiciklas-Wright H: Validity of the 24-hr. dietary recall and seven-day record of group comparisons. J Am Dietet Assoc. 1978, 73: 48-55.
Fries E, Green P, Bowen DJ: What did I eat yesterday? Determinants of accuracy in 24-hour food memories. App Cogn Psychol. 1995, 9: 143-155.
Sempos CT: Invited Commentary: some limitations of semiquantitative food frequency questionnaires. Am J Epidemiol. 1992, 135: 1127-1132.
WCRF/AICR, Potter JD: Food, Nutrition and the Prevention of Cancer:a Global Perspective. Washington DC, World Cancer Research Fund/American Insitute for Cancer Research. 1997
Mullen BJ, Krantzler NJ, Grivetti LE, Schutz HG, Meiselman HL: Validity of a food frequency questionnaire for the determination of individual food intake. Am J Clin Nutr. 1984, 39: 136-143.
Gibson RS: Sources of error and variability in dietary assessment methods: A review. J Can Dietet Assoc. 1987, 48: 150-155.
Livingstone MB: Assessment of food intakes: are we measuring what people eat?. Br J Biomed Sci. 1995, 52: 58-67.
Taylor HL, Jacobs DR, Schucker B, Knudsen J, Leon AS, Debacker G: A questionnaire for the assessment of leisure time physical activities. J Chronic Dis. 1978, 31: 741-755.
Norusis MJ: SPSS for Windows Professional Statistics. Release 6.0. Chicago IL, SPSS Inc. 1993
Montgomery DC: Design and analysis of experiments. New York, John Wiley & Sons. 1991, 3
Riboli E, Elmståhl S, Saracci R, Gullberg B, Lindgärde F: The Malmö Food Study: Validity of Two Dietary Assessment Methods for Measuring Nutrient Intake. Int J Epidemiol. 1997, 26: S161-S173. 10.1093/ije/26.suppl_1.S161.
Boyd NF, Martin M, Noffel GA, Lockwood GA, Trichler DL: A meta-analysis of studies of dietary fat and breast cancer risk. Br J Cancer. 1993, 68: 627-636.
Prentice RL: Measurement Error and Results From analytic Epidemiology: Dietary Fat and Breast Cancer. J Natl Cancer Inst. 1996, 88: 1738-1747.
Beaton GH: Approaches to analysis of dietary data: relationship between planned anlysis and choice of methodology. Am J Clin Nutr. 1994, 59 (suppl): 253S-261S.
Liu K, Stamler J, Dyer A, McKeever J, McKeever P: Statistical Methods to Assess and Minimize the Role of Intra-Individual Variability in Obscuring the Relationship Between Dietary Lipids and Serum Cholesterol. J Chronic Dis. 1978, 31: 399-418.
Beaton GH, Burema J, Ritenbaugh C: Errors in the interpretation of dietary assessments. Am J Clin Nutr. 1997, 65 (suppl): 1100S-1107S.
Freedman LS: Challenges for statistical approaches to dietary assessment. Eur J Clin Nutr. 1998, 52: S6-
Nelson M, Atkinson M, Darbyshire S: Food photography II: Use of food photographs for estimating portion size and the nutrient content of meals. Br J Nutr. 1996, 76: 31-49.
Schwartz N, Bless H, Bohner G, Harlacher U: Response scales as frames of reference:the impact of frequency range on diagnostic judgements. Appl Cogn Psychol. 1991, 5: 37-49.
Fischer RP, Quigley KL: Applying Cognitive Theory in Public Health Investigations: Enhancing Food Recall with the Cognitive Interview. In Questions about questions. Inquiries into the cognitive basis of surveys. Edited by: Tanur JM. 1992, New York, Russel Sage Foundation, 154-169.
Ervin RB, Smiciklas-Wright H: Using encoding and retrieval strategies to improve 24-hour dietary recalls among older adults. J Am Diet Assoc. 1998, 98: 989-994. 10.1016/S0002-8223(98)00227-2.
Boutron MC, Faivre J, Milan C, Lorcerie B, Esteve J: A comparison of two diet history questionnaires that measure usual food intake. Nutr Cancer. 1989, 12: 83-91.
Subar AF, Frey CM, Harlan LC, Kahle L: Differences in Reported Food Frequency by Season of Quetionnaire Administration: The 1987 National Health Interview Survey. Epidemiology. 1994, 5: 226-233.
Johansson L, Solvoll K, Björneboe G-EA, Drevon CA: Under-and overreporting of energy intake related to weight status and lifestyle in a nationwide sample. Am J Clin Nutr. 1998, 68: 266-274.
Kromhout D, Bloemberg BPM: Methods in nutritional epidemiology. In Coronary Heart Disease Epidemiology From Aetiology to Public Health. 1992, Oxford New York Toronto Melbourne, Oxford University Press, 140-151.
Hebert JR, Yunsheng M, Clemow L, Ockene IS, Saperia G, Stanek EJ, Merriman P, Ockene J: Gender differences in social desirability and social approval bias in dietary self-report. Am J Epidemiol. 1997, 146: 1046-1055.
Hartman AM, Brown CC, Palmgren J, Pietinen P, Verkasalo M, Myer D, Virtamo J: Variability in Nutrient and Food Intakes Among Older Middle-Aged Men. Am J Epidemiol. 1990, 132: 999-1012.
The authors wish to acknowledge the funding support received from the Swedish Cancer Society, the Swedish Medical Research Council, the European Commission, and the City of Malmö.
Competing interests: none declared.
Elisabet Wirfält, who is the main author and initiator of this paper, designed and implemented the analyses of Study II; Irene Mattisson and Ulla Johansson who were responsible for the dietary data collection of the Malmö Diet and Cancer study, designed and implemented the dietary method development described in Study I; Bo Gullberg provided analytical support and statistical advice; Peter Wallström provided constructive advice; Göran Berglund is the principal investigator of the Malmö Diet and Cancer Study.
About this article
Cite this article
Wirfält, E., Mattisson, I., Johansson, U. et al. A methodological report from the Malmö Diet and Cancer study: development and evaluation of altered routines in dietary data processing. Nutr J 1, 3 (2002). https://doi.org/10.1186/1475-2891-1-3
- Leisure Time Physical Activity
- Method Version
- Diet History
- Diet History Questionnaire
- Dietary Interview