elocation-id: e3611
In 2021, Mexico ranked sixth worldwide in sugarcane production with 2.9% and internally, this grass is grown in 16 states. In the 2021-2022 harvest, the sugarcane agribusiness employed 182 512 producers and in 2022, it generated 5% of the value of agricultural production. The work aimed to estimate the relative efficiency under constant and variable returns to scale of the 47 sugar mills that operated during the 2021-2022 harvest in the 16 states where sugarcane is produced in order to identify the most efficient mills in the sugarcane agribusiness. The methodology used was the super-efficiency slack-based model of input-oriented data envelopment analysis. In the results, 29 mills were identified as the most efficient. The five best performing mills were Zapoapita, San Cristóbal, Tres Valles, Benito Juárez, and Calipam and the three with the lowest performance were San Miguelito (0.937), Bellavista (0.919) and Constancia (0.905). It was concluded that the method used made it possible to identify the most efficient decision-making units and to support the design of programs that promote the best productive practices and management of those units with lower performance.
agricultural policy, data envelopment analysis, linear programming, slacks, super-efficiency model.
In Mexico’s primary sector, sugarcane cultivation is carried out in 16 states and 267 municipalities, occupying an area of 912 968 ha industrialized in 49 mills (CONADESUCA, 2022). In terms of employment, if we consider the sugarcane suppliers of the 49 sugar mills that operated in the 2021-2022 harvest, 188 512 direct jobs were created among the producers themselves, 25 thousand jobs among permanent, temporary, and casual employees in the sugar mills themselves in addition to 72.9 thousand cane cutters (CNIAA, 2022).
In this research, the approach used to determine the relative efficiency of decision-making units, in this case sugar mills, is a non-parametric approach and therefore does not require the specification of a functional form or assumptions about the normality of the variables to be used; it is possible to use the methodology of data envelopment analysis in its variant of the super-efficiency model in order to determine which are the most efficient sugar mills among the 47 selected for the study (Simar and Wilson, 2000; Hollingsworth and Smith, 2003; CONADESUCA, 2024).
The approach of data envelopment analysis (DEA) has been widely used to analyze technical efficiency, economies and returns to scale, productivity, and technological change over time in a wide variety of areas since its introduction by Charnes et al. (1978); however, its use in agriculture has been very scarce in the vast majority of countries, including Mexico.
In a study by Escobedo and Valdivia (2019), using the conventional DEA and a dataset of 50 observations for the 2015-2016 harvest, they identified that all the mills that are efficient, the respective indicator was equal to one; nevertheless, among the efficient mills, there will be some that are more efficient than others in practice. Santiago et al. (2021) also used the DEA to calculate the Malmquist productivity index and estimate the total factor productivity, technical efficiency, and technological change of the sugarcane mills in Mexico over time for the 2006-2007-2015-2016 harvest period. In their results, they found that the El Dorado sugar mill in Sinaloa during the entire study period registered an increase of 4.4% in its technical efficiency, of 21.7% in its technical progress and innovation, and of 27% in total factor productivity.
In this framework, the objective of this research is to estimate the relative efficiency indicators under constant and variable returns to scale of the sugar mills that operated during the 2021-2022 harvest through the super-efficiency model of the data envelopment analysis to effectively identify the most efficient mills in the sugarcane agribusiness in Mexico.
The material used for the estimation of the data envelopment analysis model was secondary statistical information on sugarcane from the 2021-2022 harvest obtained from UNC-CNPR (2022). The software used to perform the analysis was Cran-R Benchmarking. The method used to estimate the relative efficiency of the mills is data envelopment analysis (DEA) (Rincón et al., 2016; Bogetoft and Otto, 2024).
Nonetheless, in the case of the two conventional models of the DEA, that is the DEA with constant returns to scale and the DEA with variable returns to scale, when estimating the relative efficiencies of the decision-making units (DMUs), they do not discriminate between all those units that are completely efficient, so the relative efficiency indicator in all cases will be equal to one (Tone, 2002).
In this way, the method used to identify the most efficient mills in the sugarcane agribusiness during the 2021-2922 harvest was the super-efficiency model, which is a variant of the two conventional models of data envelopment analysis, which estimates efficiency indicators greater than one for those decision-making units that, in conventional models, can only take a value equal to one if they are completely efficient; therefore, if there are a significant number of mills whose efficiency is 100%, it is not possible to identify among them which are effectively the most efficient, a situation that does not occur in the super-efficiency model, which does allow it to be done.
The theoretical model estimated using the available sugarcane database is the input-oriented non-radial super-efficiency slack-based (SSB-I) theoretical model of the DEA. The linear programming expressions of the super-efficiency model can be found in Tone (2002). The variables used in the research are shown in (Table 1).
[i] Based on UNC-CNPR (2022); Debernardi et al. (2018).
With regard to the variables used, it is necessary to point out the following singularities. The total amount of diesel for transporting cane from the field to the mill was imputed based on Debernardi et al. (2018), who estimated this parameter to be 14 mills. For the remaining 33 mills, the arithmetic average of the aforementioned study was used, which had a value of 2.45 L t-1 of cane transported. In this way, the multiplication of this constant by the raw milled cane produced the estimate of the total diesel for the transport and lifting of sugarcane in the 2021-2022 harvest.
Table 2 presents the descriptive statistics of the variables used in the empirically applied methodology. The definition of the variables corresponds to the one presented in Table 1.
When applying the data envelopment analysis with constant returns to scale with input orientation, it is found that the relative efficiency of a significant proportion of such decision-making units is equal to the unit. This situation is observed in Table 3.
Conventional DEA model | Num. of mills with relative efficiency | Total mills | |
---|---|---|---|
Less than one | Equal to one | ||
With constant returns to scale | 26 | 21 | 47 |
With variable returns to scale | 18 | 29 | 47 |
In the case of the DEA model with constant returns to scale, the average relative technical efficiency was 97.6% and the lowest was 90.1%, which corresponded to the Constancia mill. In the case of the model with variable returns to scale, the average relative technical efficiency was 98.6% and the lowest was 90.47%, which also corresponded to the Constancia mill. As has been pointed out, a disadvantage of the two conventional models of the DEA is that, if a significant proportion of the decision-making units show that their efficiency is 100%, this fact did not provide criteria on the design of differentiated support policies for those mills whose relative efficiency is low or to know which mills are the ones that carry out the best production practices, which can be the reference for the design of public policies to support the income of suppliers of sugarcane material or a support policy according to the size and scale of operation of the sugar mills.
However, as Tone (2002) points out, empirical research has shown that, in industries that produce a homogeneous good, in this case standard sugar, even when the decision-making units operate at the same scale, they will do so at different levels of relative technical efficiency. Within the framework of the methodology of data envelopment analysis itself, one of its variants that allows identifying which mills are in practice the most efficient is the super-efficiency slack-based model (SBM) (Petersen, 1993; Tone, 2002; Chen, 2004). This model made it possible to identify and classify the most productive sugar mills since the technical efficiency indicator of the most efficient mills will be greater than the unit.
The results of the super-efficiency model obtained with the runs of the Cran-R Benchmarking package are those presented in Table 4.
As can be seen in Table 4, the super-efficiency slack-based model of DEA does allow the identification of the mills that in practice are the best performing in the sugarcane agribusiness, unlike the results of the two conventional DEA models where, as can be seen in Table 3, only two groups of decision-making units are identified: the group of mills with efficiency equal to or less than one, both in the case of constant returns to scale and in the case of variable returns to scale.
In cases where the relative technical efficiency indicator is equal to the unit (21 and 29 mills, respectively) it is impossible to differentiate which are in practice the really efficient. This situation can be observed in Table 3, where it was found that, for the study harvest, the Benito Juárez, El Higo, Santa Clara, and El Mante mills were the most efficient and those with the poorest performance were Bellavista, San Miguelito and Constancia.
Analogous to the previous case, Table 5 shows the results of the super-efficiency model with variable returns to scale.
Table 5 shows the two groups of mills identified once the assumption of constant returns to scale has been changed to that of variable returns to scale. The first corresponds to those mills whose efficiency indicator is greater than one and the second in which this indicator is less than one. However, as explained above, in the first group, it is possible to obtain a hierarchy of the mills that present the greatest technical efficiency.
The result indicates that, under variable returns to scale, the San Cristóbal, Tres Valles, Zapoapita, Benito Juárez, and Calipam mills are the best performers, that is, those that had the best relative technical efficiency in the 2021-2022 harvest for having had the best production practices and management of their production factors. In this scenario, the three mills with the lowest efficiency were San Miguelito, Bellavista, and Constancia, with an indicator of 0.927, 0.919, and 0.905, respectively.
The results of this research can be discussed in the light of other works that used the efficient production frontier methodology. Escobedo and Valdivia (2019) were the first to evaluate the relative technical efficiency of the sugarcane agribusiness for the 2015-2016 harvest under the assumption of constant returns to scale and input orientation. In their results, these authors found that the average relative technical efficiency of the agribusiness in the study harvest was 89%, without considering the mechanization of the sugarcane harvest, and it rose to 96.5% when considering the mechanization of the cutting and lifting of sugarcane. The research concluded that there was room for improvement in technical efficiency in agribusiness.
In their study, Santiago et al. (2021) applied the Malmquist productivity index for the 2006-2007-2015-2016 harvest period to a balanced panel dataset and a total of 50 mills. In their results, they found that, in the study period, in the El Dorado mill, the cumulative percentage change in its technical efficiency was 4.4%, in technical progress and innovation 21.7%, and a total productivity of the factors of 27%, whereas the opposite situation occurred for the San Miguel del Naranjo mill, whose cumulative percentage change in its technical efficiency was -10.2%, its technological progress and innovation grew by -0.3%, while total factor productivity was -10.4%. Valdivia et al. (2022) analyzed the relative efficiency and economies of scale of sugar mills for the 2010-2011 harvest. Their work indicates that, between 1996 and 2018, 10 sugar mills closed or stopped operating; that is, in a 19-year period, 16.4% of the milling infrastructure of the Mexican sugarcane agribusiness closed, which was due to inefficiencies in its productivity parameters.
In their results, Valdivia et al. (2022) found that, out of 51 mills, 23 operated at an optimal scale, a second group of 9 mills operated with total efficiency, but with pure inefficiency and scale inefficiency, and a third group of 19 mills that operated with total inefficiency, pure inefficiency, and scale inefficiency. Regarding the nature of the yields, they identified 23 mills that operated with constant returns to scale, 29 with increasing returns to scale, and only two mills with decreasing returns to scale.
The main conclusion of the study is that the 29 mills with increasing returns to scale could improve their performance, if they expanded their size, production and factor productivity would increase more than proportionally. A particularity of Valdivia et al. (2022) results is that they also concluded that the Zapoapita mill (Pánuco) was the only one that was identified with both inefficiency of scale and decreasing returns to scale.
As can be seen in Escobedo and Valdivia (2019); Santiago et al. (2021); Valdivia et al. (2022), those mills that are relatively the most efficient in technical terms have been identified. Nevertheless, none of these studies makes a classification (ascending or descending) of which mills are the most productive in practice in the respective harvest or harvest period of the respective study; however, the information they provide is valuable because it helps to understand the performance of these complex production units in terms of their production practices and management of the factors of production at their disposal. Finally, according to Valdivia et al. (2022), during the 2010-2011 harvest season, this mill was inefficient and with decreasing returns to scale, which implied that, according to this research, it could have closed and stopped operating. Nonetheless, according to its indicators for the 2021-2022 harvest obtained in this study, this mill turned out to be one of the three with the highest performance, the other two being San Cristóbal and Tres Valles. The Zapoapita sugar mill was acquired by the Pantaleón Group of Guatemala in 2011.
Zapoapita, San Cristóbal, Tres Valles, Benito Juárez, and Calipam sugarcane mills were identified as the most efficient the 2021-2022 harvest, and those with the lowest performance were San Miguelito, Bellavista, and Constancia (whose average efficiency was 0.92). In this way, the evaluation of the relative technical efficiency of the sugarcane agribusiness mills in Mexico through the variant of the data envelopment analysis method, called the super-efficiency slack-based model, oriented to the input and with variable returns to scale, made it possible to identify which sugarcane mills were effectively the most efficient in the 2021-2022 harvest and thus overcome the disadvantage of the two conventional models of the data envelopment analysis methodology, which, under the assumptions of constant returns to scale and with an orientation to the input, identified 21 sugar mills with relative efficiency equal to one and 29 mills in the case of variable returns to scale, which did not allow, in practice, to distinguish which mills are effectively the most efficient. Under the same assumptions as above, it was found that a group of 16 had a scale efficiency greater than one, so it can be said that they can further improve their performance by identifying better productive and management practices.
Bogetoft, P. and Otto, L. 2024. Benchmarking with DEA and SFA. R package version 0.32. https://cran.r-project.org/web/packages/Benchmarking/citation.html. 15-18 pp.
CONADESUCA. 2024. Comité Nacional para el Desarrollo Sustentable de la Caña de Azúcar. Sistema de información de costos de producción de caña de azúcar. Estructuras de costos. https://www.siiba.conadesuca.gob.mx/sicostossustentabilidad/consultapublica/ConsultaPublica.aspx?app=costos.
CONADESUCA. 2022. Comité Nacional para el Desarrollo Sustentable de la Caña de Azúcar. Noveno informe estadístico del sector agroindustrial de la caña de azúcar zafras. https://www.siiba.conadesuca.gob.mx/siiaca/docext/9no-informe-estadistico.pdf.
Debernardi-Vequia, H.; Ortiz-Laurel, H. y Rosas-Calleja, D. 2018. Eficiencia en el uso del combustible para la producción de la caña de azúcar (Saccharum spp.) en la Región Golfo de México. Agroproductividad. 10(11):81-86. https://www.revista-agroproductividad.org/index.php/agroproductividad/article/view/60/55.
Escobedo, C. O. A. y Valdivia, A. R. 2019. Análisis de la eficiencia técnica de la agroindustria de la caña de azúcar en México con el análisis envolvente de datos. Revista Mexicana de Economía Agrícola y de los Recursos Naturales. 12(1):77-91. https://dicea.chapingo.mx/wp-content/uploads/2019/07/enero-junio-2019.pdf.
Rincón, S. I. B.; Arango, B. L. J. y Torres, Y. O. J. 2016. Metodología de análisis envolvente de datos (DEA), procesos administrativos y operacionales de las políticas gubernamentales en los países latinoamericanos. Tlatemoani. 7(22):63-89. https://www.eumed.net/rev/tlatemoani/22/tlatemoani22.pdf.
Santiago-Zarate, I. M.; Martínez-Damián, M. A.; Cuevas-Alvarado, C. M; Valdivia-Alcalá, R.; García-Hernández, M. I. y Hernández-Toscano, J. 2021. Productividad y cambio tecnológico en la agroindustria de la caña de azúcar en México. Revista Mexicana de Ciencias Agrícolas. 12(6):1005-1017. Doi: 10.29312/remexca.v12i6.2692.
Valdivia-Alcalá, R.; Cuevas-Alvarado, C. M.; Vázquez-Muñoz, J. S. García-Hernández, M. I.; Santiago-Zarate, I. M. y Garay-Jácome, A. S. 2022. Análisis de la eficiencia y retornos de escala de los ingenios azucareros de México. Revista Mexicana de Ciencias Agrícolas. 13(1):141-153. Doi: 10.29312/remexca.v13i1.2639.