Journal of Obesity & Metabolic Syndrome

Search

Article

J Obes Metab Syndr 2017; 26(2): 122-129

Published online June 30, 2017 https://doi.org/10.7570/jomes.2017.26.2.122

Copyright © Korean Society for the Study of Obesity.

Development and Cross-Validation of Equation for Estimating Percent Body Fat of Korean Adults According to Body Mass Index

Hoyong Sung 1,*, and Junbae Mun2

1Department of Physical Education, Korea Military Academy, Seoul, Korea;
2Department of Health and Human Performance, Middle Tennessee State University, Tennessee, USA

Correspondence to:
Hoyong Sung Department of Physical Education, Korea Military Academy, 574 Hwarang-ro, Nowon-gu, Seoul 01805, Korea Tel: +82-2-2197-2983 Fax: +82-2-2197-6415 E-mail: hoyongsung86@naver.com

Received: September 27, 2016; Reviewed : November 6, 2016; Accepted: December 21, 2016

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Background:

Using BMI as an independent variable is the easiest way to estimate percent body fat. Thus far, few studies have investigated the development and cross-validation of an equation for estimating the percent body fat of Korean adults according to the BMI. The goals of this study were the development and cross-validation of an equation for estimating the percent fat of representative Korean adults using the BMI.

Methods:

Samples were obtained from the Korea National Health and Nutrition Examination Survey between 2008 and 2011. The samples from 2008-2009 and 2010-2011 were labeled as the validation group (n=10,624) and the cross-validation group (n=8,291), respectively. The percent fat was measured using dual-energy X-ray absorptiometry, and the body mass index, gender, and age were included as independent variables to estimate the measured percent fat. The coefficient of determination (R2), standard error of estimation (SEE), and total error (TE) were calculated to examine the accuracy of the developed equation.

Results:

The cross-validated R2 was 0.731 for Model 1 and 0.735 for Model 2. The SEE was 3.978 for Model 1 and 3.951 for Model 2. The equations developed in this study are more accurate for estimating percent fat of the cross-validation group than those previously published by other researchers.

Conclusion:

The newly developed equations are comparatively accurate for the estimation of the percent fat of Korean adults.

Keywords: BMI, Cross-validation, Percent fat, Estimation

Methods for measuring human body components are divided into direct chemical and physical methods using cadavers and indirect methods involving the measurement of the body density. Although the direct methods are more accurate, they have the disadvantage of ethical issues surrounding the dissection of cadavers. Furthermore, dissection and analysis involve numerous people and require considerable time and cost, placing practical limitations on direct methods.1 Additionally, people cannot be measured when they are alive, which is the biggest problem. Hence, human body components have mostly been estimated using indirect measurement methods.

Representative measurement methods that use the volume of the human body to estimate human body components include underwater weighing and plethysmography using a BOD POD device. While accurate estimation relies on the assumption that the densities of lean and fat components are similar between subjects, there are differences in density between individuals or specific racial groups, which can make measurements cumbersome because appropriate estimation equations should be used depending on the characteristics of the subjects.1 Both of the aforementioned methods cost a substantial amount of money for laboratory testing and require qualified personnel; thus, they are difficult to apply without sufficient financial support. There is a more affordable and easier method for estimation, in which skinfolds of specific body parts or the body circumference are measured and then the values are applied in estimation equations. However, the contact between the measuring personnel and the subject can be limited by a gender difference, and the measuring personnel must have abundant experience in measurements and be capable of accurately distinguishing body parts. Other indirect measurement methods include bioelectrical impedance analysis (BIA), which involves estimating the body fat using the principle that electric current flows through fat-free tissue. However, the devices used for this method have a range of validity, and a single device can under- or overestimate depending on the subject.1 Technologies such as dual-energy X-ray absorptiometry (DXA), computed tomography, and magnetic resonance imaging can measure human body components more accurately, positioning them between the direct measurement methods, which have limited practicality, and the inaccurate indirect methods, which involve measurement difficulties. However, use of these technologies is normally limited owing to price of the instruments and the exposure to radiation.2 Among them, DXA involves measuring the difference in the absorbed radiation between soft tissues and bones and using the measured values to calculate the density. DXA is advantageous because it can estimate human body components with little radiation and in a short time.1,2

Among the human body components, the body fat is an index of obesity that is related to various diseases, including hypertension, type 2 diabetes, coronary heart disease, stroke, gallbladder disease, osteoarthritis, breast cancer, uterine cancer, esophageal cancer, colon cancer, pancreatic cancer, and kidney cancer; thus, it must be measured accurately.3,4 The prevalence of obesity in Koreans at least 19 years old increased from 26.0% in 1998 to 31.7% in 2007 and remained at 31-32% until 2014.5 For physical-education teachers, trainers, professors, and doctors who work in related fields, it is essential to measure the body fat accurately. When it is not feasible to apply the aforementioned measurement methods, an easier method is used, which involves the body mass index (BMI). The BMI equals the weight divided by the square of the height (m2) and is used in various fields to evaluate obesity. A higher BMI indicates risk factors for hypertension, hyperlipidemia, and high cholesterol, whereas an extremely low BMI value may indicate risk of death.4,6 Because of their ease of measurement, weight and height have been applied not only to evaluate the performance of athletes and the obesity of soldiers and civilians but also as physical data that can affect jobs, salaries, and insurance in the general society.7,8 However, because the BMI is affected by the weight and height, it has a limitation for reflecting body components completely; i.e., individuals with the same BMI can have different percent-fat values.9-11 In addition, human body components can differ depending on characteristics such as age, gender, and race, regardless of the BMI.2,7,12,13 For these reasons, efforts have been directed towards developing equations for estimating the percent fat in combination with other variables, including age, gender, and race.7 However, in most studies, validity tests on the developed equations with subjects besides those employed for the development of the estimation equations were not conducted. With 9,934 Americans who were at least 20 years of age, Cui et al.13tested the validity of 26 estimation equations developed in other studies that used not only the BMI but also various other body measurements for the estimation of the body fat. In Korea, there have been only a few studies involving the development of equations for estimating the percent fat using the BMI and validity tests.6-8,10 Although a study was conducted with normal adults as subjects8, the body-fat value measured through BIA was used as a dependent variable, and the subjects were limited to residents in Seoul. Thus, the study should be complemented to ensure the validity of the dependent variable and the representativeness of the samples.

The goals of the present study were to develop equations for estimating the percent fat (validation) by using the percent fat as a dependent variable measured with a high-validity method (gold standard) for a representative adult group and to examine the validity of the equations with another representative adult group (cross-validation). The newly developed estimation equations should allow specialists and non-specialists to estimate the percent fat more accurately using variables that can be obtained without special devices.

The Korean Ministry of Health and Welfare and the Korea Centers for Disease Control and Prevention investigated the states of health and nutrition through the Korea National Health & Nutrition Examination Survey (KNHANES) with representative samples from the entire Korean population.14 The applied sampling methods were three-stage stratified cluster sampling for the years 2008-2009 using data from the “Population and Housing Census” conducted by the National Statistical Office in 2005 and two-stage stratified cluster sampling for the years 2010-2011 using the registered resident population for 2009 and the apartment market research data for 2008.14,15 The KNHANES was conducted after approval from the Institutional Review Board of the Korea Centers for Disease Control and Prevention.

Selection of subjects

The selected subjects were at least 20 years old and participated in the KNHANES from July 2008 to May 2011. The study period was limited to the time during which the percent fat was measured using DXA. The subjects surveyed in 2008-2009 (validation group) were selected for the development of the estimation equations, and those surveyed in 2010-2011 were selected as the cross-validation group to test the validity of the developed estimation equations.

Data processing for analysis

Raw data for 2008-2011 were downloaded from the KNHANES homepage (http://knhanes.cdc.go.kr/) as a basic survey database (DB) (Year HN_ALL) containing body measurement data by year. Data from bone-density and body-fat test DBs (Year HN_DXA) were combined, and then an integrated DB was established to perform an integrated analysis of the combined data by year. For complex sample analysis, among a total of 37,753 people, those whose body fat was measured with only DXA were selected for the group of interest (n=18,915).

Study methods

In the KNHANES, since the second half of 2008, tests were performed in mobile examination vehicles. Body measurements such as height, weight, and percent fat were performed by professional testing teams in independent examination vehicles.

Body measurement

Regarding the tool used for body measurement, Physical Examination, i.e., the guidebook for the 5th term of the KNHANES (2010–2012) was downloaded from the KNHANES homepage and referenced.16 The heights were measured in 0.1 cm units using a Seca225(Seca, Germany), and the weights were measured in 0.1 kg units using a GL-6000-20 (G-tech, Korea). The body fat was measured using a Discovery-W (Hologic Discovery, USA), which is a bone-density testing device that employs DXA. The BMI was calculated by dividing the weight (kg) by the square of the height (m2).

Statistical analysis

Data analysis

The KNHANES samples were extracted by multistage stratified cluster probability sampling, which is a complex sample design method. In accordance with the analysis instructions in the Guidebook for KNHANES Raw Data Analysis (2013), the complex sample analysis method was employed. The weight, stage, and cluster (plot) were considered in order to control the bias in the statistics.17 The statistical program was SPSS (VER 21; IBM Corp., Armonk, NY, USA), and the significance level was set as P=0.05.

Sample characteristics

To identify the sample characteristics, the mean, standard error of the mean (SEM), and rate by age group of the major variables were calculated through descriptive statistics of the complex sample analysis and frequency analysis. In the complex sample analysis, a general linear model was applied to identify the mean differences in the major variables between the validation group and the crossvalidation group.

Development of estimation equation

An equation for estimating the percent fat was developed using the general linear model of the complex sample analysis for the validation group. The percent fat measured via DXA was set as the dependent variable, and the gender, age, and BMI were regarded as basic independent variables. Referring to the various independentvariable types proposed in a preceding study13, estimation equations with a high explanatory power (R2) and low standard error of estimation (SEE) and total error (TE) were selected.

Cross-validation

The newly developed estimation equations were applied to the cross-validation group to evaluate their validity. The validity was tested via analysis of the statistics using the SEE (Sy√1-R2, where Sy is the standard deviation of the measured percent fat, and R is the multiple correlation coefficient) and TE (√Σ(Y-Y’)2/N, where Y is the measured percent fat, and Y’ is the percent fat obtained using the estimation equation), as proposed by Kang.18

Characteristics of sample groups

The mean ages of the males and females were 43.55 and 45.62 years, respectively, for the validation group and 44.22 and 46.48 years, respectively, for the cross-validation group. The mean BMI values for males and females were 23.93 and 23.08 kg/m2, respectively, for the validation group and 24.04 and 23.30 kg/m2, respectively, for the cross-validation group. The mean percent fat (%) was moderately higher for the cross-validation group (22.36% for males and 33.34% for females) than for the validation group (21.13% for males and 32.17% for females). In the analysis of the mean difference between the validation and cross-validation groups, there were no significant differences in the age, height, weight, or BMI (P<0.05), but there was a significant difference in the percent fat (f[1, 529]=28.500, P<0.001) (Table 1).

Table 1 . Characteristics of samples.

CharacteristicsValidation group (N=10,624*) KNHANES (2008-2009)Cross-validation group (N=8,291*) KNHANES (2010-2011)


Male (n=4,554*)Female (n=6,070*)Male (n=3,550*)Female (n=4,741*)
Age (yr)43.55±0.39345.62±0.37644.22±0.48746.48±0.483
 20-39 (%)41.6±1.438.1±1.139.7±1.635.8±1.2
 40-59 (%)40.5±1.239.0±0.941.8±1.440.3±1.0
 ≥60 (%)17.8±0.722.9±0.818.5±1.123.8±1.1
Height (cm)170.53±0.138157.18±0.133170.94±0.190157.16±0.165
Weight (kg)69.71±0.21357.01±0.16270.37±0.31857.48±0.224
BMI (kg/m2)23.93±0.06223.08±0.06724.04±0.08923.30±0.086
Percent fat (%)21.13±0.15132.17±0.14022.36±0.18933.34±0.180

Data are presented as mean±standard error of mean or frequency (%)..

*Unweighted sample size.

KNHANES, Korea national health and nutrition examination survey; BMI, body mass index..


Development of estimation equations and cross-validation

Various estimation models were analyzed by referring to the independent variables used for body-fat estimation in a previous study.13 The models with a high validity are presented in Table 2. The validity was analyzed for each model, and independent variables were included in each model. Two models exhibited high estimation validity, with a high explanatory power and a low estimation error. The first model used the inverse of the BMI, square of the inverse BMI, gender, and age as independent variables. The other model involved groups considering both gender and age. The equations for these two models were selected to estimate the percent fat for cross-validation.

Table 2 . Validation of examined models.

Included variablesMean of estimated percent fatMean of measured percent fatR2SEETE
Total (male and female)
 BMI, gender, age27.66327.6080.7333.9613.973
 BMI, gender, age, BMI227.6560.7383.9223.925
 BMI, gender, age, BMI2, gender×BMI27.6550.7383.9213.925
 BMI, gender, age, gender×BMI27.6620.7333.9603.972
 BMI, 1/BMI, gender, age27.6580.7383.9233.926
 BMI, 1/BMI, gender, age, 1/BMI227.6560.7383.9213.924
 1/BMI, gender, age27.6560.7373.9283.927
*1/BMI, gender, age, 1/BMI227.6570.7383.9223.925
Grouped in gender and age
1/BMI, gender and age, 1/BMI227.6920.7423.8963.892

Models were developed from the validation group..

Age was counted in full..

*The preliminary model (total): (‘54.601’ for male and ‘66.622’ for female)

+0.014×age-982.749×1/BMI+4192.373×1/BMI2.

The preliminary model (grouped by gender and age):

Male 20-39 yr: 54.614-945.604×1/BMI + 3599.326×1/BMI2.

Male 40-59 yr: 54.270-945.604×1/BMI + 3599.326×1/BMI2.

Male ≥60 yr: 56.066-945.604×1/BMI + 3599.326×1/BMI2.

Female 20-39 yr: 67.086-945.604×1/BMI + 3599.326×1/BMI2.

Female 40-59 yr: 66.379-945.604×1/BMI + 3599.326×1/BMI2.

Female ≥60 yr: 67.093-945.604×1/BMI + 3599.326×1/BMI2.

R2 is explanatory power..

BMI, body mass index; SEE, standard error of estimate; TE, total error..


The results of the variance analysis for the two estimation equations are presented in Table 3. The regression model showed that they had a good statistical fit (F=5,770.213, P<0.001 for Model 1, F=3,498.014, P<0.001 for Model 2). The explanatory power of the independent variable in the estimation equation for the percent fat was slightly higher for Model 2 (74.2%) than for Model 1 (73.8%).

Table 3 . Variance analysis of preliminary models.

 Non-standardized coefficient tP

Bstandard error
Model 1
 Constant66.6221.95234.124<0.001
 Gender (male)-12.0210.095-127.136<0.001
 Gender (female)----
 Age0.0140.0043.952<0.001
 1/BMI-982.74984.997-11.562<0.001
 1/BMI24192.373921.8074.548<0.001
R2=0.738 F=5770.213 P<0.001
Model 2
 Constant67.0931.87835.733<0.001
 Male 20-39 yr-12.4790.200 -62.328  <0.001 
 Male 40-59 yr-12.8230.200-64.231<0.001
 Male ≥60 yr-11.0270.173-63.798<0.001
 Female 20-39 yr-0.0070.186-0.0390.969
 Female 40-59 yr-0.7150.180-3.975<0.001
 Female ≥60 yr----
 1/BMI-945.60482.151-11.511<0.001
 1/BMI23599.326888.2484.052<0.001
R2=0.742 F=3498.014 P<0.001

The preliminary model (total) was represented as Model 1, and that grouped by gender and age was represented as Model 2..

Variance analysis for the preliminary equations of Table 1 are presented..

R2 is explanatory power..

BMI, body mass index..


As shown in Table 4, testing the validity of both estimation equations via application to the cross-validation group resulted in accurate estimation of the percent fat measured via DXA. The criteria proposed by Heyward19 for a desirable estimation equation are a high validity of the measurement method for dependent variable, a large sample size with at least 100 people, an R of at least 0.80, and an SEE less than 4.00. Both of the selected estimation equations satisfied all of these criteria.

Table 4 . Cross-validation of preliminary models.

ModelsMean of estimated percent fatMean of measured percent fatR2SEETE
Total
 Model 1: 1/BMI, gender, age, 1/BMI228.73729.2260.7313.9784.114
 Model 2: 1/BMI, gender and age, 1/BMI228.7820.7353.9514.093
*Group by age
 20-39 yr27.98328.2990.7053.9854.137
 40-59 yr28.67529.3360.7443.7774.016
 ≥60 yr29.48729.9000.7504.0524.148

The preliminary model (total) was represented as Model 1, and that grouped by gender and age was represented as Model 2..

*Model 2 was applied for the cross-validation in each age group.

R2 is explanatory power..

BMI, body mass index; SEE, standard error of estimate; TE, total error..


Scatter plots comparing the estimated and measured percent fat obtained using DXA are presented in Figs. 1 and 2. As shown in the cross-validation results of Table 4, the estimated mean percent fat values were smaller than the actual values. This is indicated by the figures, where the case group is slightly biased over the graph with y=0 and slope=1. When the validity of the estimation equations was tested with respect to age after the application of Model 2, the overall R2 was 0.705-0.750. The explanatory power of the estimation equation increased with the age, and the SEE was between 3.777 and 4.052.

Figure 1.

Scatterplots between estimated (x-axis) and measured (y-axis) percent fat (Model 1).


Figure 2.

Scatterplots between estimated (x-axis) and measured (y-axis) percent fat (Model 2).


The accuracy of the two selected estimation equations for estimating the percent fat for the cross-validation group was compared with that of six estimation equations previously developed by other researchers, as shown in Table 5. The two estimation equations had greater explanatory power (R2=0.731 for Model 1 and R2=0.735 for Model 2) and smaller prediction errors (SEE=3.978 for Model 1 and SEE=3.951 for Model 2) than the estimation equations developed by the other researchers. The validity of the estimation equations of the other researchers varied between 0.574 and 0.668 in R2 and between 4.419 and 5.009 in SEE.

Table 5 . Comparisons with previously published models.

ModelsR2SEETE
Deurenberg et al. (1991)20 0.574  5.009  5.128 
Deurenberg et al. (1998)210.6124.7804.887
Gallagher et al. (1996)220.6684.4204.514
Gallagher et al. (2000)230.6684.4194.495
Gómez-Ambrosi et al. (2012)240.6494.5444.639
Cho (2006)80.6554.5094.743
Current study (Model 1)0.7313.9784.114
Current study (Model 2)0.7353.9514.093

R2 is explanatory power..

SEE, standard error of estimate; TE, total error..


An estimation equation was developed by using the entire validation and cross-validation groups (N=18,915) as samples and considering the independent variables in the two models together, as presented in Table 6. The R2 of the new equation was slightly superior to those of the two estimation equations in the cross-validation. In contrast, the SEE levels were similar.

Table 6 . Development of final models.

ModelsR2SEETEFP
Final Model 1
 (Male: 54.811, Female: 66.622)+0.010×age-956.110×1/BMI+ 3864.956×1/BMI20.7323.9914.0737790.765<0.001
Final Model 2
 Male 20-39 yr: 54.4840.7363.9654.0454995.211<0.001
 Male 40-59 yr: 54.015
 Male ≥60 yr: 55.758
 Female 20-39 yr: 66.831
 Female 40-5 9yr: 66.153
 Female ≥60 yr: 66.744
 -909.814×1/BMI+ 3186.205×11/BMI2

Final models were developed from both validation and cross-validation groups..

R2 is explanatory power..

SEE, standard error of estimate; TE, total error..


The developed equations for estimating the percent fat exhibited explanatory powers as high as R2=0.738 for Model 1 and R2=0.742 for Model 2, and application to the cross-validation group resulted in similar explanatory powers of R2=0.731 for Model 1 and R2= 0.735 for Model 2. The 73.1% (Model 1) and 73.5% (Model 2) variance in actual percent fat could be explained by the independent variables. In addition, the error level in the estimation is important. In this study, the SEEs were ±3.978% (Model 1) and ±3.951% (Model 2) for the estimation of the percent fat. Moreover, the two developed estimation equations had greater explanatory power and smaller prediction errors than equations previously proposed by other researchers, as shown in Table 5, confirming the validity of the two estimation equations.

However, the two estimation equations tended to underestimate the percent fat for the cross-validation group. This is attributed to the significantly lower mean percent fat for the validation group—for which the estimation equations were developed—compared with the cross-validation group. Previously, it was suggested that the estimation power is higher in groups having similar characteristics to the validation group.13 The differences in the estimation pattern also occurred when the two estimation equations were applied to groups that had a significantly different percent fat from the validation group. These phenomena are “shrinkage” that occurred because of the difference in the percent fat between the group for which the estimation equations were developed and the other group, for which they were applied, and explain why the explanatory powers of the developed estimation equations decreased from R2=0.738 to R2=0.731 for Model 1 and R2=0.742 to R2=0.735 for Model 2.25 The shrinkage phenomenon can also be caused by racial differences between the groups, as well as differences in the percent fat. As shown in Table 5, the estimation equations developed with non-Koreans as subjects had less explanatory power for the cross-validation group composed of Korean adults. Among the other six models, those proposed by Cho8 and Gallagher et al.22,23 had the greatest explanatory power (R2=0.655-0.668), which appears to be because more than 600 Koreans and Japanese were included in the validation groups in each study.

We conclude that gender had a larger effect on the estimation of the percent fat than the BMI and age, on the basis of a previous study12 and the physiological theory that females have a higher percent fat than males. Therefore, gender must be considered in every equation for estimating the percent fat. Of the two developed estimation equations, Model 1 was formulated to input each of the dummy variables for the gender and age into the estimation equation, and Model 2 was formulated to apply the dummy variable after consideration of the two variables for the gender and age together with the category division. In conclusion, Model 2 had greater explanatory power and a lower estimation error than Model 1 and is thus likely to have higher estimation accuracy. According to the validity of the estimation equations developed from the validation group, the final model was developed using the same independent variables with all of the subjects in the validation and cross-validation groups. However, its validity was not tested through cross-validation; thus, it may be limited in application.

In this study, we developed equations for estimating the percent fat of Korean adults using the BMI as a predictor. The equations were more accurate than those developed by other researchers. However, the BMI is an external measurement depending on only the height and weight, whereas the percent fat is internal measurement indicating a human body component. Therefore, the developed estimation equations may not be the most accurate methods for estimating the percent fat, because the predictors were selected for simple measurement according to the purpose of this study. To develop more accurate equations for estimating the percent fat, estimation equations using predictors such as the waist circumference, BIA, or skinfolds should be further studied, although these might be more difficult to measure than the BMI. In addition, cross-validation with other sample groups should be performed to improve the validity of the equations developed in this study.

The newly developed estimation equations provide easy methods for estimating the percent fat with no cost. Because of inaccuracies caused by error, they have limitations for use in diagnostic settings, such as obesity diagnosis and examination to determine presence or absence of diseases. However, they can be utilized in smartphone healthcare applications, as well as in vast scale epidemiologic studies, and are thus useful for professionals in settings where the percent fat must be estimated without devices.

This study was conducted by a research grant (20160514) provided by the Hwarangdae Research Institute of the Korea Military Academy in 2016.

Hoyong Sunghttp://orcid.org/0000-0002-8325-5206

Fig. 1.

Scatterplots between estimated (x-axis) and measured (y-axis) percent fat (Model 1).


Fig. 2.

Scatterplots between estimated (x-axis) and measured (y-axis) percent fat (Model 2).


Characteristics of samples

CharacteristicsValidation group (N=10,624*) KNHANES (2008-2009)Cross-validation group (N=8,291*) KNHANES (2010-2011)


Male (n=4,554*)Female (n=6,070*)Male (n=3,550*)Female (n=4,741*)
Age (yr)43.55±0.39345.62±0.37644.22±0.48746.48±0.483
 20-39 (%)41.6±1.438.1±1.139.7±1.635.8±1.2
 40-59 (%)40.5±1.239.0±0.941.8±1.440.3±1.0
 ≥60 (%)17.8±0.722.9±0.818.5±1.123.8±1.1
Height (cm)170.53±0.138157.18±0.133170.94±0.190157.16±0.165
Weight (kg)69.71±0.21357.01±0.16270.37±0.31857.48±0.224
BMI (kg/m2)23.93±0.06223.08±0.06724.04±0.08923.30±0.086
Percent fat (%)21.13±0.15132.17±0.14022.36±0.18933.34±0.180

Data are presented as mean±standard error of mean or frequency (%).

*Unweighted sample size.

KNHANES, Korea national health and nutrition examination survey; BMI, body mass index.

Validation of examined models

Included variablesMean of estimated percent fatMean of measured percent fatR2SEETE
Total (male and female)
 BMI, gender, age27.66327.6080.7333.9613.973
 BMI, gender, age, BMI227.6560.7383.9223.925
 BMI, gender, age, BMI2, gender×BMI27.6550.7383.9213.925
 BMI, gender, age, gender×BMI27.6620.7333.9603.972
 BMI, 1/BMI, gender, age27.6580.7383.9233.926
 BMI, 1/BMI, gender, age, 1/BMI227.6560.7383.9213.924
 1/BMI, gender, age27.6560.7373.9283.927
*1/BMI, gender, age, 1/BMI227.6570.7383.9223.925
Grouped in gender and age
1/BMI, gender and age, 1/BMI227.6920.7423.8963.892

Models were developed from the validation group.

Age was counted in full.

*The preliminary model (total): (‘54.601’ for male and ‘66.622’ for female)

+0.014×age-982.749×1/BMI+4192.373×1/BMI2

The preliminary model (grouped by gender and age):

Male 20-39 yr: 54.614-945.604×1/BMI + 3599.326×1/BMI2

Male 40-59 yr: 54.270-945.604×1/BMI + 3599.326×1/BMI2

Male ≥60 yr: 56.066-945.604×1/BMI + 3599.326×1/BMI2

Female 20-39 yr: 67.086-945.604×1/BMI + 3599.326×1/BMI2

Female 40-59 yr: 66.379-945.604×1/BMI + 3599.326×1/BMI2

Female ≥60 yr: 67.093-945.604×1/BMI + 3599.326×1/BMI2

R2 is explanatory power.

BMI, body mass index; SEE, standard error of estimate; TE, total error.

Variance analysis of preliminary models

 Non-standardized coefficient tP

Bstandard error
Model 1
 Constant66.6221.95234.124<0.001
 Gender (male)-12.0210.095-127.136<0.001
 Gender (female)----
 Age0.0140.0043.952<0.001
 1/BMI-982.74984.997-11.562<0.001
 1/BMI24192.373921.8074.548<0.001
R2=0.738 F=5770.213 P<0.001
Model 2
 Constant67.0931.87835.733<0.001
 Male 20-39 yr-12.4790.200 -62.328  <0.001 
 Male 40-59 yr-12.8230.200-64.231<0.001
 Male ≥60 yr-11.0270.173-63.798<0.001
 Female 20-39 yr-0.0070.186-0.0390.969
 Female 40-59 yr-0.7150.180-3.975<0.001
 Female ≥60 yr----
 1/BMI-945.60482.151-11.511<0.001
 1/BMI23599.326888.2484.052<0.001
R2=0.742 F=3498.014 P<0.001

The preliminary model (total) was represented as Model 1, and that grouped by gender and age was represented as Model 2.

Variance analysis for the preliminary equations of Table 1 are presented.

R2 is explanatory power.

BMI, body mass index.

Cross-validation of preliminary models

ModelsMean of estimated percent fatMean of measured percent fatR2SEETE
Total
 Model 1: 1/BMI, gender, age, 1/BMI228.73729.2260.7313.9784.114
 Model 2: 1/BMI, gender and age, 1/BMI228.7820.7353.9514.093
*Group by age
 20-39 yr27.98328.2990.7053.9854.137
 40-59 yr28.67529.3360.7443.7774.016
 ≥60 yr29.48729.9000.7504.0524.148

The preliminary model (total) was represented as Model 1, and that grouped by gender and age was represented as Model 2.

*Model 2 was applied for the cross-validation in each age group.

R2 is explanatory power.

BMI, body mass index; SEE, standard error of estimate; TE, total error.

Comparisons with previously published models

ModelsR2SEETE
Deurenberg et al. (1991)20 0.574  5.009  5.128 
Deurenberg et al. (1998)210.6124.7804.887
Gallagher et al. (1996)220.6684.4204.514
Gallagher et al. (2000)230.6684.4194.495
Gómez-Ambrosi et al. (2012)240.6494.5444.639
Cho (2006)80.6554.5094.743
Current study (Model 1)0.7313.9784.114
Current study (Model 2)0.7353.9514.093

R2 is explanatory power.

SEE, standard error of estimate; TE, total error.

Development of final models

ModelsR2SEETEFP
Final Model 1
 (Male: 54.811, Female: 66.622)+0.010×age-956.110×1/BMI+ 3864.956×1/BMI20.7323.9914.0737790.765<0.001
Final Model 2
 Male 20-39 yr: 54.4840.7363.9654.0454995.211<0.001
 Male 40-59 yr: 54.015
 Male ≥60 yr: 55.758
 Female 20-39 yr: 66.831
 Female 40-5 9yr: 66.153
 Female ≥60 yr: 66.744
 -909.814×1/BMI+ 3186.205×11/BMI2

Final models were developed from both validation and cross-validation groups.

R2 is explanatory power.

SEE, standard error of estimate; TE, total error.

  1. Katch VL, McArdle WD, Katch FI. Essentials of exercise physiology (translated in Korean). Seoul: Lifescience; 2015.
    Pubmed
  2. Norgan NG. Laboratory and field measurements of body composition. Public Health Nutr 2005;8:1108-22.
    Pubmed CrossRef
  3. Anderson RE. Array. Seoul: Daehan Media Co; 2006.
  4. Dishman RK, Heath GW, Lee IM. Physical activity epidemiology. Champaign, IL: Human Kinetics; 2013.
  5. Ministry of Health and Welfare, Korea Centers for Disease Control and Prevention. 2014 Health Statistics for Korean I: KNHANES (VI-2); 2015. [accessed 2016 Sep 22] Available from: URL: https://knhanes.cdc.go.kr/knhanes/index.do
  6. Cho JH. A cross validation study of developing regression model for estimation of %body fat from body mass index. Korean Society of Measurement and Evaluation 2007;9:75-85.
  7. Kim DY, Chun HJ. Development of prediction equation of %body fat using BMI and cross-validation for persons with intellectual disability. Journal of Korean Physical Education Association for Girls and Women 2014;28:123-34.
    CrossRef
  8. Cho JH. Developing regression model for estimation of % body fat from body mass index. The Korean Journal of Physical Education 2006;45:869-78.
  9. Sung SC, Kim HS. Relationship between body mass index and, fat-free mass index and fat-mass index in Korean population. Exerc Sci 2013;22:301-8.
    CrossRef
  10. Heo JG, Lee JO. Evaluation on the regression model for % body fat estimated from BMI in elementary students. Korea Sport Res 2006;17:769-80.
  11. Russell AR, Esco MR, Lizana SN, Williford HN, Olson MS, Kim H. The accuracy of a BMI-based equation in predicting percent body fat in college-age female athletes. J Strength Cond Res 2011;25:5-6.
  12. Jackson AS, Stanforth PR, Gagnon J, Rankinen T, Leon AS, Rao DC, et al. The effect of sex, age and race on estimating percentage body fat from body mass index: The Heritage Family Study. Int J Obes Relat Metab Disord 2002;26:789-96.
    Pubmed
  13. Cui Z, Truesdale KP, Cai J, Stevens J. Evaluation of anthropometric equations to assess body fat in adults: NHANES 19992004. Med Sci Sports Exerc 2013;46:1147-58.
    Pubmed CrossRef
  14. Ministry of Health and Welfare, Korea Centers for Disease Control and Prevention. KNHANES IV Regulation for Using of Raw Data (2007-2009) 2014. Available from: URL: https://knhanes.cdc.go.kr/knhanes/index.do [accessed 2016 Sep 22]
  15. Ministry of Health and Welfare, Korea Centers for Disease Control and Prevention. KNHANES V Regulation for Using of Raw Data (2010-2012) 2014. Available from: URL: https://knhanes.cdc.go.kr/knhanes/index.do [accessed 2016 Sep 22]
  16. Ministry of Health and Welfare, Korea Centers for Disease Control and Prevention. KNHANES V Regulation for Physical Examination (2010-2012) 2014. Available from: URL: https://knhanes.cdc.go.kr/knhanes/index.do [accessed 2016 Sep 22]
  17. Ministry of Health and Welfare, Korea Centers for Disease Control and Prevention. KNHANES Regulation for Examination of Raw Data 2013. Available from: URL: https://knhanes.cdc.go.kr/knhanes/index.do [accessed 2016 Sep 22]
  18. Kang SJ. Statistics of kinesiology. Seoul: 21st Publishing; 2015.
  19. Heyward V. ASEP methods recommendation: body composition assessment. J Exerc Physiol Online 2001;4:1-12.
  20. Deurenberg P, Weststrate JA, Seidell JC. Body mass index as a measure of body fatness: age- and sex-specific prediction formulas. Br J Nutr 1991;65:105-14.
    Pubmed CrossRef
  21. Deurenberg P, Yap M, van Staveren WA. Body mass index and percent body fat: a meta analysis among different ethnic groups. Int J Obes Relat Metab Disord 1998;22:1164-71.
    Pubmed CrossRef
  22. Gallagher D, Visser M, Sepúlveda D, Pierson RN, Harris T, Heymsfield SB. How useful is body mass index for comparison of body fatness across age, sex, ethnic groups?Am J Epidemiol 1996;143:228-39.
    Pubmed CrossRef
  23. Gallagher D, Heymsfield SB, Heo M, Jebb SA, Murgatroyd PR, Sakamoto Y. Healthy percentage body fat ranges: an approach for developing guidelines based on body mass index. Am J Clin Nutr 2000;72:694-701.
    Pubmed
  24. Gómez-Ambrosi J, Silva C, Catalán V, Rodríguez A, Galofré JC, Escalada J, et al. Clinical usefulness of a new equation for estimating body fat. Diabetes Care 2012;35:383-8.
    Pubmed KoreaMed CrossRef
  25. Vincent WJ, Weir JP. Statistics in kinesiology. Champaign, IL: Human Kinetics; 2012.