elocation-id: e3101
Given the need to make efficient use of the resources allocated to experimental research in sugarcane crops, this research was proposed with the aim of determining the optimal size of the experimental plot in sugarcane crops in the Brunca region of Costa Rica. During the 2018-2019 harvest, an experimental uniformity trial was established, and five methods were compared: maximum curvature method, maximum curvature of the coefficient of variation, linear regression with constant, quadratic regression with constant and maximum distance method. The results indicate that the most efficient estimators were obtained with models that consider all sizes and forms of the uniformity trial (n=63). Segmented regression and linear regression with constant models produced the best estimators of the optimal size of the experimental plot: 72.16 and 93.22 m2, respectively. With the other three methods, considerable and inconsistent differences in the sizes of the experimental plot were obtained. With the methods of maximum curvature and maximum curvature of the coefficient of variation, the results were so small: 14.01 and 12.5 m2, respectively, that they are inadequate to carry out research in sugarcane; on the contrary, with the method of maximum distance, the size obtained was 157.48 m2, statistically and economically inefficient. Therefore, the linear regression with constant and quadratic regression with constant models are appropriate for determining the experimental plot size in sugarcane. It was concluded that the recommended size to be used in the area is 72 m2. This research was completed in December 2021.
estimators, experimental error, soil heterogeneity.
To achieve more efficient results in the design of experiments, Casler (2015) mentioned that repetitions, randomization, block definition, and experimental units should be analyzed; he also pointed out that the size of the experimental plot is the least understood aspect and the one with the least amount of theoretical and empirical results. According to Sripathi et al. (2017), the size of the experimental plot is one of the main causes of high residual variation in field experiments, which affects the statistical efficiency of the plot used as an experimental unit.
According to Smith (1938), there is a negative asymptotic relationship between variance by unit and plot size. For sugarcane, research has been carried out in Guatemala, such as the works of Palencia (1965); Álvarez (1982), in Brazil, that of Igue et al. (1991) and in India, that of Mahalanobis et al. (1939). Nonetheless, the validity of this research is local in nature.
Barrantes-Aguilar et al. (2020) estimated the optimal size of the experimental plot with sugarcane in the Brunca Region of Costa Rica with segmented regression models, but they did not apply other methods to compare their results and select the best one. The present research focused on determining the optimal size of the experimental plot in sugarcane crops of in the Brunca region of Costa Rica by comparing five methods to determine the most statistically efficient and best adapted to the conditions of the crop and the region where the study was carried out.
The definition of the size of the experimental unit should be aimed at minimizing experimental error. In order to calculate the optimal size, uniformity trials have been used more frequently, in which a single plot of the same variety is sown, treated uniformly in terms of practices and crop management, the only factor that varies is the soil and its heterogeneity is one of the factors that most affect experimental error and efficiency (Sripathi et al., 2017).
The uniformity trial is divided into basic experimental units (BEUs), which are then grouped into different sizes and forms. Smith (1938) showed that there is a negative asymptotic relationship between variation and plot size, which he expressed as: Vx=V1/xb 1) Where:Vx = corresponds to the variance of yield or any other variable of interest per unit area between plots of size x, V1= is the variance between plots of a basic unit; x= the number of BEUs that make up the plot; and b is the soil heterogeneity index, where:0< b < 1.
With this coefficient and a cost ratio, Smith (1938) set out to find the optimal plot size as the area with which the greatest amount of information per unit of cost is obtained; that is,x0=bC1/((1-b)C2) 2). Where:C1= is the portion of the total cost that is proportional to the number of plots per treatment; andC2= the portion of the total cost that is proportional to the total area per treatment. Some methods have been proposed from the work of (Smith, 1938). The five methods that were evaluated in this research are explained below.
Lessman and Atkins (1963) explain the relationship between the coefficients of variation (CVx) and the size of the plot with the following expression:CVx= a/xb 3). Where:
are obtained using the Gauss-Newton least squares method for nonlinear models; and x corresponds to the size of the plot.
From the first and second derivatives of (3), a curvature function is constructed; and from the first derivative of this function, the critical point (x0) is obtained, which corresponds to the plot size with the highest rate of change of the coefficient of variation (Meier and Lessman, 1971):
4).
Where: and
are estimators of and , respectively.
This method has the advantage that it does not require grouping the basic units or adjusting a model, as in other cases (Sari and Lúcio, 2018). The method was proposed by Paranaíba et al. (2009a) and is based on the estimation of the first-order spatial autocorrelation coefficient between the basic experimental units:
5).
Where: εi is the experimental error associated with each observation; Zi; εi-1 is the error of the previous observation; and is the average of the experimental error of all observations.
From this coefficient, the mean ( ) and variance (s2) of the variable of interest measured in basic experimental units, the coefficient of variation for each size of plots (x) is obtained by using the following formula:
6).
From the first and second derivatives of equation (6), the curvature function is obtained:
7).
The point where the curvature function is maximum is the optimal size:
8).
This method has been considered suitable for obtaining the optimal plot size in rice, wheat, and cassava.
This model describes the relationship between the coefficient of variation and the plot size in two segments. The first is a line with a negative slope that decreases to a certain value and then behaves as a constant line. This model was proposed by Paranaíba et al. (2009a) and is mathematically represented as:
9).
Where: CVx = is the coefficient of variation between the totals for plots with x i basic units; CVP = is the coefficient of variation at the point where the two segments meet; and εi = corresponds to the error associated with CVx, εi N (0,σ2). Due to the condition of continuity of the two segments that are equal at the point , we can write:
x0 = ((CVP-β0))/β1
10).
This model is represented by two segments: the first, described by a second-degree polynomial equation, and the second, by a constant line. As in the previous case, the optimal plot size is defined by the meeting point between the two segments. According to Moreira et al. (2016), the QRP model can be represented as:
11).
The function must be continuous and smooth, which means that the first derivatives with respect to x in both segments must be equal at the point x0. Under this condition, the optimal size of the experimental plot is:
x0 = β1/2β2
12).
This method, proposed by Lorentz et al. (2012), seeks to solve the subjectivity of some methods, such as visual inspection, based on the geometry formed by: yc=a/xb 13) and a line secant to the function (13), defined as: yR=cx+d 14). In (13), and can be obtained from the model of Smith (1938) or from Lessman and Atkins (1963). This method tries to find the point where the curve yC is as far as possible from the line yR, or in other words, it tries to find a line that maximizes that distance.
The data used were from a uniformity trial that was established on May 29, 2018, at the El Porvenir farm, owned by CoopeAgri RL, which is located in La Fortuna de San Pedro in the canton of Pérez Zeledón, which belongs to the Brunca region, Costa Rica. Sugarcane of the RB 99-381 variety was sown in 40 rows 84 m long with a separation of 1.5 m.
In total there was 5 040 m2 of experimental area, which was treated homogeneously in all the practices and tasks of the crop, once the edge was discounted there was 4 800 m2 of useful plot. The basic experimental unit (BEU) was defined as 2 m long by 1.5 m wide (3 m2), resulting in a total of 1 600 BEUs. The harvest and obtaining of weights of the trial were carried out manually on March 6 and 7, 2019. The variable of field yield was evaluated and the weight of each plot measured in kg was quantified, for which an electronic scale previously calibrated under the metric system was used.
For most of the methods, it was necessary to group adjacent BEUs, maximum curvature method (MCM), linear regression with constant (LRP), quadratic regression with constant (QRP) and maximum distance method (MDM), in different forms and sizes, considering that the entire experimental area should always be used. When tests of homogeneity of variances were necessary, an F-test was applied in the case of only two forms, and a Bartlett’s test (Bartlett, 1937) was applied when three or more forms of grouping were involved.
In both cases, the null hypothesis of homogeneity of variances is tested. If the test was significant, the variance with the lowest value was taken to associate it with the corresponding size, if not, the variances were averaged. The statistical analysis of the information was carried out with the following packages: Microsoft Office Excel 2016, R version 4.2.1 (R Core Team, 2022), SAS version 9.3 (SAS Institute, 2011) and Python version 3 (Van-Rossum and Drake, 2009).
We group data from 63 different forms, equivalent to 20 plot sizes, and considered only combinations that resulted in plot sizes that exactly fit the total area. For each form, the mean, sample variance and the coefficient of variation were calculated. As expected, the rate of decrease in the coefficient of variation decreases rapidly in the small plot segment but decreases more slowly for larger plots (Figure 1).
To apply this method, it was necessary: first, to group the BEUs into different sizes and forms and second, to fit two models, one based on tests of homogeneity of variances and another that considers the cases in which, for the same size, there is more than one possible form of grouping (for example, the forms: 1x2 and 2x1represent 6 m2 plots).
In the tests of homogeneity of variance, the smallest sizes always showed significant results (p< 0.05), so the variances obtained from clusters of the same size, but in different forms, were statistically different. From 96 m2 onwards, only in some cases was the null hypothesis of equality of variances rejected. Once the variance was adjusted according to the results of these tests, the coefficient of variation associated with each size (CVx) was obtained.
This information was used to fit model 1 (n=20). For model 2, we considered data from all forms and sizes, in this case n=63 (Table 1). Both models are based on equation (3) and their coefficients were significant (p< 0.01), with coefficients of determination greater than 90%. Using model one, the optimal size was 3.98 BEUs, equivalent to 11.94 m2, and with model two the result was 4.67 BEUs, which means 14.01 m2 plots. The smallest standard errors were obtained with model one.
The uniformity trial data were used to estimate the mean ( ), sample variance (s2) and first-order spatial autocorrelation coefficient ( ). For the calculation of , the data were ordered in such a way that xi+1 was always the immediate plot closest to x1, starting from north to south and then advancing from east to west, it is understood that the closest plants tend to influence each other more.
The coefficient of variation for each plot size (x) is:
15).
With the first and second derivatives of (15), it is possible to obtain the values of the curvature function for each plot size (x). These values and the estimated coefficients of variation are substituted in equation (8). From the analysis of these results with the MCCV method, it was concluded that the optimal size of the experimental plot is 4.17 BEUs, equivalent to a 12.5 m2 plot.
Segmented regression models were adjusted for the scenario where variance homogeneity tests were applied (n=20) and for the case where all forms and sizes of the uniformity trial were used (n=63). The results are shown in Table 2 and the fit of the models in Figure 2. The coefficients were significant (p< 0.01) as was the F-statistic; the coefficients of determination ranged from 62 to 96%.
Plot size estimates were found to be between 10.85 BEUs ( ) and 31.07 BEUs ( ). Greater results were obtained with the application of the QRP model. Peixoto et al. (2011) used these segmented regression models in the estimation of plot size in in vitro preservation experiments of passion fruit (Passiflora edulis) and also found higher values when applying the QRP method than when calculating the LRP method, they attributed it to the curvature of the model. With both LRP and QRP, we obtained coefficients with lower standard error than when the 63 trial data were used.
For the application of the MDM, the scenario in which the homogeneity of variance tests was applied (n=20) and in which they were not applied (n=63) were considered. For the values of and , those estimated by means of the MCM were used. As noted in Table 3, the plot size results obtained by this method differ greatly from the previous cases, with a size of approximately 56.21 and 52.49 BEUs estimated for models 7 and 8. These sizes represent plots of 168.63 and 157.48 m2, respectively.
Model | n | † | † | CV associated with a | Size ( ) |
---|---|---|---|---|---|
7 | 20 | 17.8908 | 0.492 | 2.4645 | 56.2109 |
8 | 63 | 22.1630 | 0.525 | 2.7433 | 52.4943 |
The summary of estimates for the different methods is shown in Table 4. For the MCM, the estimators were more efficient when the homogeneity of variance tests were performed (n=20); however, the difference with the other scenario was only 2 m2, approximately.
Note that the MCCV method does not require grouping BEUs, so all 1 600 data points were used. The estimates by this method were very similar to those obtained by the MCM. The largest differences were found with MDM, which differ significantly from all others. And as intermediate results are the estimates by the LRP and QRP methods.
The maximum curvature method (MCM) of Lessman and Atkins (1963) is one of the most widely applied methods (Paranaíba et al., 2009b), although it tends to underestimate the size of the experimental unit. Silva et al. (2003) applied this and two other methods with eucalypt clones, they concluded that this method for the algebraic determination of the point of maximum curvature uses this and the apex of the curve, but not the point of stabilization of the values of the experimental coefficient of variation, which means that increases in the size of the experimental unit above the critical point ( ) still bring significant gain to the experimental accuracy.
On the other hand, Henriques-Neto (2003), by using the MCM method, found small plot values that do not represent appropriate sizes for wheat research. Silva (2014) found differences in his estimates of up to 60% when comparing the MCM and the MDM, concluding that the estimation of the optimal plot size varies with the method used and with the characteristic evaluated. Henriques-Neto et al. (2009) came to a similar conclusion when evaluating yield characteristics of wheat grains. Cyprian et al. (2012) applied the MCM and evaluated eleven growth characteristics in coffee, they found that the plot size shows a different behavior according to the evaluated characteristic and the differences are up to 75%.
Leite et al. (2005) found plot sizes that were also not considered adequate, their estimates by MCM yielded results of less than one plant per plot for the estimation of genetic parameters in sugarcane families. Paranaíba et al. (2009b) compared four methods, including MCM, using experiments on wheat (four varieties) and cassava (two varieties and two traits). For wheat varieties, the smallest plot size was always the one estimated by the MCM, with differences of up to 83%. With cassava, the MCCV and MCM methods produced similar estimates, with differences between 1 and 3%.
MCM is not the only one with underestimation problems. In general, the results may vary depending on the method applied and it will be the responsibility of the researcher to select the one that best suits their situation. On the other hand, Paranaíba et al. (2009b) state that the value of the abscissa of maximum curvature should be interpreted as the minimum limit of plot size and not as the optimal size.
The minimum size of the experimental plot with sugarcane in the region is 72 m2. This is the optimal size because, among all experimental plot sizes, it is the one that minimizes the coefficient of variation and because it has the smallest experimental cost among the other plots that have a coefficient of variation close to that of the LRP method with n= 63, but with a larger plot size and, consequently, with a higher experimental cost.
The results obtained by applying the five methods for estimating the optimal size of the experimental agricultural unit in sugarcane present very marked differences, of 145 m2; the Brunca region uses plots with an approximate size of 80 m2. The lack of feasibility in the application of the results of the MCM, MCCV and MDM methods indicates that they should be discarded. With the segmented regression models, values more in line with the conditions and characteristics of sugarcane crops in the region were obtained.
Those that yielded the most efficient estimators were those that considered all sizes and forms of the uniformity trial (n=63). It was concluded that the LRP and QRP models are appropriate for the determination of the size of the experimental plot in sugarcane in soils with conditions similar to those of the Brunca region of Costa Rica. The recommended size to use in the region is 72 m2.
Álvarez, V. M. 1982. Determinación del tamaño óptimo de parcela experimental en caña de azúcar (Saccharum officinarum L.) bajo condiciones de la finca Bulbuxya. Tesis de licenciatura. Universidad de San Carlos de Guatemala. 59 p. http://biblioteca.usac.edu.gt/tesis/01/01-0639.pdf.
Barrantes-Aguilar, L. E.; González-Estrada, A.; Martínez, M. Á. y Valdivia, R. A. 2020. Modelos de regresión segmentada para estimar el tamaño óptimo de la parcela experimental con caña de azúcar. Revista Mexicana de Ciencias Agrícolas. 11(8):1763-1773. https://doi.org/10.29312/remexca.v11i8.1934.
Bartlett, M. S. 1937. Properties of sufficiency and statistical tests. Proceedings of the royal society of London. Series mathematical and physical sciences. 160(901):268-282. https://doi.org/10.1098/rspa.1937.0109.
Casler, M. D. 2015. Fundamentals of experimental design: guidelines for designing successful experiments. Agronomy Journal. 107(2):692-705. https://doi.org/10.2134/agronj2013.0114.
Cipriano, P. E.; Cogo, F. D.; Campos, K. A.; Almeida, S. L. S. y Morais, A. R. 2012. Suficiência amostral para mudas de cafeeiro cv. Rubi. Revista agrogeoambiental. 4(1):61-66. https://doi.org/10.18406/2316-1817v4n12012375.
Henriques-Neto, D. 2003. Estimativas de tamanho e forma de parcelas experimentais para avaliação do rendimento de grãos em trigo tesis doctoral, Universidad Federal de Viçosa. 151 p. https://locus.ufv.br//handle/123456789/10261.
Igue, T.; Espironelo, A.; Cantarella, H. and Nelli, E. J. 1991. Tamanho e forma de parcela experimental para cana de açúcar. Bragantia. 50(1):163-180. https://doi.org/10.1590/S0006-87051991000100016.
Leite, M. S.; Peternelli, L. A. and Barbosa, M. H. 2006. Effects of plot size on the estimation of genetic parameters in sugarcane families. Cropp Breeding and Applied Biotechnology. 6(1):40-46. https://doi.org/10.12702/1984-7033.v06n01a06.
Lessman, K. J. and Atkins, R. E. 1963. Optimum plot size and relative efficiency of lattice designs for grain sorghum yield tests. Crop Science. 3(6):477-481. https://doi.org/10.2135/cropsci1963.0011183X000300060006x.
Lorentz, L. H.; Erichsen, R. y Lúcio, A. D. 2012. Proposta de método para estimação de tamanho de parcela para culturas agrícolas. Revista Ceres. 59(6):772-780. https://doi.org/10.1590/S0034-737X2012000600006.
Meier, V. D. and Lessman, K. J. 1971. Estimation of optimum field plot shape and size for testing yield in Crambe abyssinica Hochst. Crop Science . 11(5):648-650. https://doi.org/10.2135/cropsci1971.0011183X001100050013x.
Moreira, J. M.; Melo, A. F.; Oliveira, J. M.; Ataides, D. S.; Ribeiro, M. C. y Bortolini, J. 2016. Parcela ótima para a cultura do cafeeiro obtido por simulação de dados com variâncias conhecidas. Pubvet. 10(9):636-642. https://doi.org/10.22256/pubvet.v10n9.636-642.
Palencia, J. A. 1965. Determinación del tamaño óptimo de parcela experimental para estudios experimentales en caña de azúcar (Saccharum officinarum L.), bajo las condiciones de la estación experimental agrícola “Sabana Grande” Tesis de Licenciatura, Universidad de San Carlos de Guatemala. 42 p. http://biblioteca.usac.edu.gt/tesis/01/01-0184.pdf.
Peixoto, A. P. B.; Faria, G. A. and Morais, A. R. 2011. Modelos de regressão com platô na estimativa do tamanho de parcelas em experimento de conservação in vitro de maracujazeiro. Ciência Rural. 41(11):1907-1913. https://doi.org/10.1590/s0103-84782011001100010.
R Core Team. 2019. R: A language and environment for statistical computing (3.6.1). R Foundation for Statistical Computing. https://www.R-project.org/.
Sari, B. G. and Lúcio, A. D. 2018. Uniformity trials size for estimating cherry tomato plot size. Revista Ciência Agronômica. 49(4):653-662. https://doi.org/10.5935/1806-6690.20180074.
Silva, R. L.; Xavier, A.; Leite, H. G. and Pires I. E. 2003. Determinação do tamanho ótimo da parcela experimental pelos métodos da máxima curvatura modificado, do coeficiente de correlação intraclasse e da análise visual em testes clonais de eucalipto. Revista Árvore. 27(5):669-676. https://doi.org/10.1590/S0100-67622003000500009.
Silva, W. C. 2014. Estimativas de tamanho ótimo de parcelas experimentais para a cultura do taro (Colocasia esculenta) Tesis doctoral, Universidad Federal de Viçosa. 59 p. https://locus.ufv.br//handle/123456789/1236.
Smith, H. F. 1938. An empirical law describing heterogeneity in the yields of agricultural crops. Journal of Agricultural Science. 28(1):1-23. https://doi.org/10.1017/S0021859600050516.
Sripathi, R.; Conaghan, P.; Grogan, D. and Casler, M. D. 2017. Field design factors affecting the precision of ryegrass forage yield estimation. Agronomy Journal , 109(3):858-869. https://doi.org/10.2134/agronj2016.07.0397.