• Tidak ada hasil yang ditemukan

Directory UMM :Data Elmu:jurnal:N:Nutrition And Food Science:Vol30.Issue4.2000:

N/A
N/A
Protected

Academic year: 2017

Membagikan "Directory UMM :Data Elmu:jurnal:N:Nutrition And Food Science:Vol30.Issue4.2000:"

Copied!
6
0
0

Teks penuh

(1)

science ± VI: correlation

and regression (Part C)

John A. Bower

Parts A and B of ``Correlation and regression'' in the ``Statistics for food science'' series (SFS VIA, B) described the general characteristics, nature and application of these methods. The current section illustrates the techniques with two worked examples. Calculation of the basic coefficients can be done with a scientific calculator but computer software may be required for significance testing. The protocol employed for statistical tests should be laid out as detailed in previous articles in respect of statement of objectives and hypotheses, etc. but these are abbreviated to a degree in the current illustrations. Both examples are contrived but are typical of real data. All analyses and graphs were prepared using the SPSS for Windows v. 9.0 software package (SPSS Inc., Chicago).

Example 1 ± consumer survey data

Consumer opinion was surveyed in a pilot study examining attitude to food additive use as related to other measures. A random sample of 50 consumers were asked for their level of agreement with the use of food additives using a seven-point Likert scale with categories from ``disagree strongly'' to ``agree strongly''. Their level of knowledge of food and nutritional science and food issues was assessed by a series of ten questions, of one mark each. The test scores were summed and the relationship between level of science knowledge and level of agreement with the use of additives was examined by correlation analysis.

Analysis

The data are summarised in Table I, but the analysis commences with a clear statement of what it intends to achieve (objectives).

What are the objectives of the correlation analysis?

The objectives of the correlation analysis are: (1) to ascertain whether or not there is a

relationship between additive use acceptance and the level of science knowledge; and

(2) to establish whether or not the relationship is statistically significant The author

John A. Boweris Lecturer in Food Science, Faculty of Business and Consumer Studies, Queen Margaret University College, Edinburgh, UK.

Keywords

Statistics, Correlation analysis

Abstract

Illustrates correlation and regression in food and consumer science applications, with worked examples of these methods on data from a consumer survey and from a sensory-versus-instrumental study.

Electronic access

The current issue and full text archive of this journal is available at

http://www.emerald-library.com

Nutrition & Food Science

Volume 30 . Number 4 . 2000 . pp. 194±199

(2)

and hence allow some inference to the population and justify future research.

These objectives will be met by carrying out correlation and regression. Some preamble is useful to identify certain components and clarify some assumptions.

What are the components of the analysis and are there any assumptions?

The objects being studied in this experiment are the consumer subjects. Two measures are available on the subjects:

(1) ``level of agreement with use of additives''; and

(2) ``science knowledge score''.

These are assigned arbitrarily as Xand Yfor the purpose of analysis.

General assumptions are that the pairs of measures are independent and that any relationship is linear. Each subject has submitted no more than one set of each measure; thus each set is distinct and does not depend on any other. Subjects should not have been allowed to discuss the issues or influence one another. Linearity is checked by generation of a scatter diagram (Figure 1). This shows a general trend of a positive linear relationship. Note that when plotting integer values in scatterplots many points may be superimposed, and some way of showing point density is useful. Figure 1 uses the ``sunflower'' technique (SPSS) where each petal represents a case. Here the denser points, with more petals, show greater linearity.

Each measurement scale can be assumed to be interval in nature. Thus Pearson's product-moment correlation coefficient is suitable.

Is there a relationship between ``additive agreement'' (X) and ``science

knowledge'' (Y)?

The correlation coefficient is shown on the graph (Figure 1, r= 0.56). A positive relationship exists but it is not particularly marked and would be classified as

``moderate'' in nature according to the criteria specified in SFS VIB. The coefficient of determination,r2, is approximately 31 per cent, which, although low numerically, is high for the relationship, considering the number of possible influences on attitude to food additives.

Is the relationship betweenXandY

simply due to chance?

A significance test is required to answer this question and additional assumptions apply.

XandYare assumed to have been randomly sampled from a bivariate normal population (O'Mahony, 1986). This distribution has been defined mathematically but simple tests for adherence to it are not available. The variables can by checked individually for normality by a ``goodness of fit'' test such as chi-square or the Shapiro-Wilks' test

described previously (SFS VB; Bower, 1998) and detailed in Rees (1995). Rees also gives a simpler alternative: examine the raw data distribution and look for ``symmetry and bunching''. Each measure in Table I shows

Table ISummary of consumer survey data of Example 1

Science knowledge

score Frequency

Additive agreement

rating Frequency

3 2 ± ±

4 4 1 3

5 7 2 4

6 8 3 8

7 11 4 11

8 9 5 11

9 5 6 7

10 4 7 6

Mean 6.78 4.36

SDa 1.84 1.67

Percentage CVb 27.19 38.42

Notes:

aSD, standard deviation bCV, coefficient of variation

(3)

higher frequency near the middle of the scale with tailing off on either side in a rough symmetry. For a sample size of 50 this is an acceptable adherence to a normal

distribution. Equality of variance is also assumed. This can be ascertained by examination of a standardised measure of variance such as the coefficient of variation, or by a more formal test (e.g. the Levene test, SPSS). As seen, variances differ but are roughly comparable (Table I). An adequate approximation to the assumptions is met.

The null hypothesis (H0) being tested is

that there is no significant correlation and the population correlation is zero. Alternatively (H1), correlation exists at a stated significance

level (5 per cent). The question of one or two-tailed significance depends on whether or not it would be possible to predict the result if the null hypothesis were rejected. Assuming that the research team have no previous data relating to this, then the correlation could be positive or negative ifH0is rejected (two-tail).

The test is performed (using software or by consulting tables) and found to be highly significant as seen by the p-value (Figure 1).

Are the results significant enough to justify further investigation?

The study was based on a random sample of a reasonable size. This will have ensured that each measure could attain a wide range. Thus no ``range constriction'', which could falsify the correlation, will have occurred. The combination of a sample size of 50 with a coefficient of 0.56 is of high power; the test's ability to find a true significance (SFS IIIA; Bower, 1995). There is sufficient evidence to warrant a larger study with at least one proviso. The ``science knowledge'' measure was based on a series of questions, all of which examined a basic single state. This gives a more valid ``measuring instrument'' (Black, 1999) than the much simpler single question statement used to gauge ``agreement with additive use''. The researchers would be more confident about pursuing this line of research if a more elaborate measure of this attitude had been included.

Example 2 ± correlation and regression

in sensory science

The product development team in a food company have started a series of development

trials for products in the dessert market. A sensory panel (n= 10) is being trained in conventional profiling of dessert

formulations. As part of the training program and as a way of examining the effect of a specific ingredient on one sensory attribute, six formulations are prepared. These are systematically varied in the level of the ingredient so as to cause a difference in ``mouth-feel firmness''. Ten sub-samples of each of the formulations are scooped out and cut in half. One half is examined by the panel and the other is tested for mechanical firmness using an instrumental texture device. The panellists score the samples for intensity of firmness on a graphic line scale, producing values between 0 and 100. All data are averaged (Table II) and analysed for correlation and regression.

What are the objectives of this analysis? The objectives of this analysis are to establish: (1) whether or not the panel can accurately

score the samples for sensory firmness, at least in rank order, related to the

compositional data (firming agent content) and to the instrumental data; (2) whether or not the relationship between

these measures is statistically significant and hence allows some inference to the population of existing and future products; and

(3) how suitable the instrumental measure would be at predicting the sensory firmness.

This example is obviously different from the one above in several respects.

What are the components and assumptions in this example?

In this experiment, the objects being studied are the product samples. Three measures are available on these objects:

(1) sensory firmness;

(2) concentration of firming agent; and (3) instrumental texture.

One measure (concentration) is fixed by the experimenters and is thus the Xvariate. The sensory measure is dependent on this and is oneYvariate. The instrumental measure is also viewed as aYvariate when related to the concentration, but it is anticipated to be anX

variate for prediction purposes.

(4)

Example 1. The sensory panel is viewed as a measuring instrument and we are not interested at this stage in individual panellists' scores. Thus the sensory measure and any other replicated measures will be averaged for the analysis. Although many measurements were performed for both sensory (ten panel members) and instrumental firmness (ten replicates), only one value goes forward for the analysis. This ensures the independence of the sets ofXandYmeasures.

Only one set of formulations was made and hence the concentration levels (X) were not replicated. These are assumed to be error free. For eachXvalue there is assumed to be a population of Yvalues which are normally distributed. Randomness of sampling for determination of Yis limited to the drawing of sub-samples from each formulation. There are sufficient numbers of replicate sensory and instrumental measures to permit a normality check, but these are on sub-samples from one batch. A normality check of this form was illustrated previously (SFS VB; Bower, 1998), but such procedures can give only an indication. A more suitable

representation would require preparation of several separate batches of each formulation. While this is possible it is unlikely to be done ten or more times, which is a typical

minimum number for normality tests. One solution is to look at previous work with similar data with sufficient numbers to allow for normality testing; otherwise it is not possible to verify this assumption. The same argument applies to equality of variances, although this can be verified with smaller numbers of replicate batches. There are few examples of published papers with such checks on assumptions for correlation studies. Adherence need only be approximate and presumably researchers take this compromise or base decisions on established data. An

assumption will be made that this is the case for the data of Example 2.

The eventual analysis will be based on only six points. This could result in low power even with a high magnitude for the coefficient. Power can be improved by inclusion of more samples, but for this analysis, the most error prone data source (the sensory panel) should be checked for a reasonable degree of panellist concurrence (e.g. by comparing the rank order of the scores for each member with that of the panel). Assuming this has been done and that no extreme outliers were found, the

development team can be confident that the sensory means are valid.

How well does the sensory measure correlate with the compositional data, and is the correlation significant? A scatter diagram is drawn (Figure 2). This shows reasonable linearity with no marked outliers.

Table IIExperimental data summary for Example 2

Formulation

Firming agent concentration (mg/100g)

Logarithm of

concentration Sensory panel

Instrumental texture (g x 10)

Logarithm of instrumental

texture

A 5 (1) 0.7 18 (1) 93 (1) 1.97

B 10 (2) 1.0 58 (3) 203 (2) 2.31

C 20 (3) 1.3 62 (4) 439 (3) 2.64

D 40 (4) 1.6 39 (2) 788 (4) 2.90

E 80 (5) 1.9 69 (5) 976 (5) 2.99

F 160 (6) 2.2 80 (6) 1506 (6) 3.18

Notes: Values are means. Figures in parentheses signify rank order

(5)

Some points are ``off the line'' but this is not unusual in sensory vs instrumental studies when the panel are in the initial stages of training. Assume that no gross errors are detected and that the points are retained for this analysis. The logarithm of the stimulus is used. This gives slightly better linearity than use of the untransformed measure, but its success may depend on the fact that a wide range of stimuli were tested, stepped up in a ratio manner. This range is likely to contain concentrations which would be unrealistic in eventual products, but they serve to calibrate panel perception. A range of concentrations with small differences near the intended range could be used later ± transformation may not be required as this could be adequately linear without transformation. Pearson's coefficient is usual for this type of data, but as ranking is specifically of interest, the Spearman

coefficient will be included also. Correlation analysis is performed and summarized (Table III, Row 1).

The measures are correlated, but the panel scores are more error prone and give lower coefficients, which, although close, do not achieve significance with Pearson's

correlation. The panel were able to rank the samples more accurately than any interval scaling as shown by the significant Spearman coefficient. Note that the correlation may be overestimated because the measures were not performed on the same sub-samples (SFS VIA), but this is countered to a degree because fixed values ofXwere used.

Two-tailed significance testing has been used. It could be argued that the relationship between the amount of added firming agent and any firmness measure would be a direct one, i.e. a positive correlation.

Let us assume that the research team take a conservative view and cannot predict the direction of the correlation if H0is rejected

(i.e. two-tail).

Thus there is no distinct relationship between sensory and compositional data, given the typical expectation. Inspection of the ranked data (Table II) confirms this ± the panel have not been able to achieve a minimum of only one pair of adjacent samples out of sequence, although more training should improve this. Thercoefficients (Table III) are within the usual region for this type of experiment and coefficients of determination are now nearing 60 per cent or more.

How does the sensory measure correlate with the instrumental measure?

The scatter graph for this relationship is similar to that of Figure 2, as the instrumental measure correlates in a similar manner with concentration, as does the panel (Figure 3).

For this test both measures are unfixed, although the assumption is generally that the instrumental one is subject to less error (SFS VIB). The sensory vs instrumental coefficient is numerically the lowest in magnitude, but the rank correlation is the same as that of the sensory vs concentration comparison (Table III, Row 2).

Table IIICorrelation analysis summary for Example 2 (n = 6)

Test Pearson'sr Spearman'srs

1 Panel vs logarithm of concentration 0.77 (p = 0.08ns) 0.83 (p = 0.042*)

2 Panel vs logarithm of instrumental texture 0.75 (p = 0.09ns) 0.83 (p = 0.042*)

3 Instrumental texture vs concentration 0.96 (p = 0.002**) 1.00 (p = 0.00***)

Notes:

p = probability value (two-tail) ns= not significant

* = significant at the 5 per cent level; ** = significant at the 1 per cent level; *** = significant at the 0.01 per cent level

(6)

Table III also includes the analysis for the relationship between instrumental texture and concentration. Pearson's ris higher and significant (p< 0.01). ``Perfect'' rank correlation has been found between quantity of firming agent and the instrumental texture. On this basis the instrumental measure is confirmed as being more accurate and provides a calibration tool for the panel, along with the concentration of the firming agent.

What is the nature of the predictive ability of ``instrumental to sensory''? The regression line has been calculated and the line of best-fit drawn (Figure 3). Linearity is not perfect and because of data transformation there is a negative intercept. The regression analysis has been performed (Table IV) with the instrumental measure as a predictor of sensory. An example of a regression with more predictive power is also included where the firming agent concentration is the predictor.

Can the instrumental measure predict the sensory measure with confidence? The regression for the instrumental-sensory relationship is non-significant. Thus it would give poor prediction and would have wide confidence limits around the true value. The

R2value reflects the low predictive ability of the regression model ± 44 per cent of the variance is unexplained. More panel training would be required. As an example of use, a predicted value can now be calculated using the regression equation (Table IV). The predicted sensory measure for a instrumental value of 78.2g would be 62 on the sensory scale. This is an example of interpolation ± prediction within the boundaries of the original data. Extrapolation outside these limits can be done but this involves more risk as an assumption is made that the relationship still holds. A significant regression is obtained when the concentration is used to predict the

instrumental variable. This second regression model imbues much more confidence, as 93 per cent of the variance in the firming agent concentration is accounted for by the instrumental texture measure.

The power of this type of experiment can be improved considerably by a slight increase in the number of objects (e.g. nine instead of six). This is countered by the increased assessment load on panellists, although it would be acceptable for a single sensory measure.

Conclusions

Correlation provides a simple tool for examination of the relationship between two variables. It constitutes an invaluable initial analysis and a building block for further analysis, in particular regression, in many fields of food studies. These methods apply mainly to interval/ratio data but there are other correlation techniques for nominal data or combinations of interval and ordinal with nominal data (Black, 1999).

References

Black, T.R. (1999),Doing Quantitative Research in the Social Science, Sage Publications, London, pp. 618-58.

Bower, J.A. (1995), ``Statistics for food science ± III: sensory evaluation data (Part A) ± sensory data types and significance testing'',Nutrition & Food Science, No. 6, November/December, pp. 35-42. Bower, J.A. (1998), ``Statistics for food science ± V: Anova

and multiple comparisons (Part B)'',Nutrition & Food Science, No. 1, January/February, pp. 41-8. O'Mahony, M. (1986),Sensory Evaluation of Food ±

Statistical Methods and Procedures, Marcel Dekker Inc., New York, NY, pp. 279-302.

Rees, D.G. (1995),Essential Statistics, 3rd ed., Chapman & Hall, London, pp. 189-200.

Table IVRegression analysis summary for Example 2

Predictor Dependent Equation

Probability value ofFregression R2

Logarithm of instrumental (X) Panel (Y) Y = 36.89*X ± 43.94 0.09ns 0.56

Concentration of firming agent (X) Instrument (Y) Y = 8.62*X + 215.0 0.002*** 0.93

Referensi

Dokumen terkait

Nama Paket : Pembangunan ruang Rawat Neonatal Intensive Care Unit (NICU) (DAK) Kategori Pekerjaan : Pekerjaan Jasa Konstruksi. Metode Pemilihan : e-Pemilihan Langsung 1 (satu)

Menindaklanjuti Hasil Evaluasi Penawaran dan Evaluasi Kualifikasi, Pekerjaan Belanja Bahan Medis Habis Pakai (BMHP) Rumah sakit Umum Daerah Kabupaten Kaur, dengan ini

perilaku tersebut telah menjadi budaya siswa yang lebih demokratis. Peranan dan partisipasi pengurus organisasi kesiswaan SMA Negeri 1a. Lembang ialah; a) sebagai mediator

Metode penelitian yang digunakan adalah metode survey, teknik penentuan informan menggunakan ( nonprobability sampling ) dengan jumlah seluruh sampel penelitian

Digital Repository Universitas Jember Digital Repository Universitas Jember... Digital Repository Universitas Jember Digital Repository

Peraturan Presiden No.70 Tahun 2012 tentang Perubahan Kedua atas Peraturan Presiden Nomor 54 Tahun 2010 tentang Pengadaan Barang dan Jasa Pemerintah pada Pasal 84 ayat

Menghapus node pada AVL Tree sama dengan menghapus binary search tree procedure dengan perbedaan pada penanganan kondisi tidak balance. Penanganan kondisi tidak balance

Lagu dinyanyikan satu kali oleh seluruh jemaat dengan dipandu