Skip to main content

A model for the location and scheduling of the operation of second-generation ethanol biorefineries


This document presents an economic optimization model which identifies the location, the nominal plant capacity and the operation scheduling for set of biorefineries of second-generation ethanol using the biomass obtained as waste in the sugarcane industry. The model also determines the gasoline volumes that will be mixed with ethanol in order to produce a mixed fuel. Given a planning horizon of the operation of the system, the model obtains its optimal parameters at fixed time intervals (annual) so the global optimum is obtained by minimizing the mathematical expectation of the stochastic process generated when the product demand is assumed random with known density. Partial optimization of the process is achieved using a mixed integer linear programming model. Real information obtained from the Secretariat of Energy for the management of biorefineries in the state of Veracruz of the Mexican Republic is included and numerical results are reported.


During the last century the world average annual temperature of the planet increased by 0.7°C, and it is expected to increase further, in the range of 0.8 to 2.6°C by the year 2050. As a consequence, there has been a decrease in the glaciers and sea ice. Today, hurricanes are of greater magnitude and frequency and previously desert-like areas receive large amounts of rain while traditionally humid areas are subject to desertification. These effects are attributable to greenhouse gas emissions [15]. Of these gases, CO2 represents more than 80% [6], Mexico is one of the top 20 countries, with the highest production volume of CO2 emissions [14].

Worldwide, two man-made activities, the generation of energy (electrical, thermal) and transport, produced almost two thirds of global CO2 emissions. While the generation of energy comes from many sources, the transport sector depends almost exclusively on fossil fuels by 97% [13]. In Mexico gasoline imports are growing to 25 billion liters annually. For both economic and environmental reasons, it is especially important to substitute fossil fuels with biofuels [26].

Biofuels are classified as first generation and second generation or cellulose. The difference between them are the raw materials used for their manufacture. In the production of ethanol the first-generation biofuels are made from corn (Zea mays), sugar cane (Saccharum officinarum L.), sugar beet (Beta vulgaris), and grain sorghum (Sorghum spp.). In the case of biodiesel, palm oil (Elaeis guineensis), jatropha (Jatropha curcas), soybean (Glycine max), rapeseed (Brassica napus), safflower (Carthamus tinctorius) and sunflower (Helianthus annuus) can be used. The first-generation biofuels are produced using conventional technology such as fermentation (for sugars and carbohydrates), transesterification (for oils and fats), and anaerobic digestion (for organic waste). Second-generation biofuels use mainly agricultural, agroindustry and forestry wastes, and require pre-treatment before using conventional methods to obtain them [17].

First-generation ethanol is produced on a commercial scale. In the year 2000, 13 billion liters were produced and by 2016 a figure of 99 billion liters was reached, an increase of more than 750%. The main producers of ethanol worldwide are the United States and Brazil [20]. In Mexico, commercial scale production of ethanol is non-existent. However, in 2015 Petróleos Mexicanos (PEMEX) entered 10-year agreement with several Mexican companies to produce ethanol beginning in 2017. This production will be mixed 5.8% with gasoline to produce a mixed fuel. The ethanol will come from sugarcane juice, produced in San Luis Potosí and Veracruz; as well as sorghum grain, produced in Tamaulipas [18]. In addition, the Mexican government established the mandatory standard, that ethanol can be mixed up to 10% with gasoline [5].

The first-generation biofuels present some important drawbacks, such as the possibility of shortage of some crops dedicated to food, to an increase in prices and to an acceleration of deforestation. In this context, the use of cellulosic materials to produce ethanol is an alternative, which counteracts certain disadvantages of first-generation biofuels [17]. Second generation biofuels already have a promising future. In the year 2000 their production was minimal, and for the year 2014 a production of 16 billion liters was reported, led by the United States and Brazil [27]. In Mexico, several studies have been done on the annual potential of agricultural waste for the production of cellulosic ethanol. Those studies, have identified several potential sources of agricultural waste with an energy vocation, including stubble of agricultural crops: corn, sugarcane, rice, barley, sorghum and wheat; as well as some agro-industrial by products: sugarcane bagasse, corn cobbles, rice husks, wheat, coffee and sunflower. An annual potential of cellulosic ethanol in Mexico was estimated from the waste of several crops as: 3, 405 Mlt of sugarcane, 383 Mlt of sweet sorghum, 151 Mlt of coffee, 99 Mlt of wheat and 17 Mlt of rice.

In Mexico, sugarcane is the most produced crop in volume and the country is an exporter of sugar; the 2015–2016 harvest produced, more than 55 million tons of sugarcane. For its harvest, in general, the system of burned cane, manual cutting and mechanized moose is used. It involves burning the ripe cane to facilitate the harvest and weeks after the first burning, a second burn eliminates the tips of cane that were left in the soil. This practice prevents the reincorporation of organic matter into the soil and degrades the physical and chemical properties. This practice is not sustainable in the long term [24]. Additionally, it does not take advantage of an important biomass volume that could serve as raw material to produce second-generation ethanol, as has already been done in Brazil [12].


Facility location models have a long history dating back to the year 1600 with the jurist, mathematician and physicist Pierre de Fermat who is credited with creating the method of the spatial median. Today, there is a vast literature on the methods, models and results applied to the problem of location of facilities the most used being multicriteria optimization. Here, the optimality criteria are: minimizing the total setup cost, minimizing the longest distance from the existing facilities, minimizing fixed cost, minimizing total annual operating cost, maximizing service, minimizing average time/distance traveled, minimizing maximum time/distance traveled, minimizing the number of located facilities, maximizing responsiveness [9].

The variety of applications covers from the location of banks [3, 7, 10, 11, 22, 23] industrial facilities [2, 4, 8, 19, 25]. A review on location of facilities and supply chain can be found in [16]. A special contribution on the design of the bio fuels supply chain can found in [1].

This proposal addresses a real problem of planning and managing the operation of a set of refineries in a state of the Mexican Republic under optimal conditions of selection of its location, the periods of operation of these up to T years, the distribution network of the ethanol produced and the determination of the quantities of gasoline and ethanol to be purchased externally.

In this document, we develop a mathematical model that addresses the problem of the production of ethanol mixed with gasoline and the amounts to be imported from various external sources to meet the demand for biofuel required in the country. Likewise, it is determined using a model of stochastic networks, the optimal trajectory of transport routes from sugarcane bagasse producing centers to the consuming centers of the final product. A clustering model initiates the proposal indicating the optimal grouping of biorefineries around the producing centers (cane mills and sugar producing fields). Subsequently, the programming of the operation of the storage centers for biofuels is modeled. For this, binary variables were used in order to establish the annual plan of operation of the whole in the area. An integer linear (binary) stochastic programming model is the model inserted in each stage of the stochastic process that shapes the decision for the global planning of the operation to T years. Next our proposal is developed.

Description of the problem and its proposed mathematical model

In the construction of this model we assume that the biomass comes from the bagasse of the sugar cane and it is constituted by the residue of matter that remains after its juice is extracted. It is the residue (tips and leaves) that remain in the field after the harvest or when processing the sugarcane to obtain the product. The goal is to use the material to obtain ethanol. The following aspects will be considered, Figure 1:

  1. 1

    The biomass can be located in two different origins: (a) the sugarcane field, (b) the sugar mills

  2. 2.

    The two sources of biomass produce raw material for the operation of biorefineries

  3. 3.

    The biorefineries send ethanol as a finished product to the storage centers

  4. 4.

    The storage centers also receive gasoline and ethanol imported through the corresponding customers

  5. 5.

    The storage centers are used as temporary stores (buffers) and as areas for mixing ethanol with gasoline

  6. 6.

    The total costs involved in the process (production, storage and transport) are known and deterministic

  7. 7.

    The demand D for ethanol is a random variable with cumulative distribution function \(F_{D} (d)\) with finite mathematical expectation \(E ( D ) < \infty\), and known variance \(\sigma_{D}\)

  8. 8.

    The efficiency of production η in each biorefiner is known

  9. 9.

    In the decision to operate a biorefinery or not, two types of costs are included: (a) fixed costs \(c_{f}\), and (b) variable costs \(c_{v}\) in such a way that the total cost is given by \(c_{t} = c_{f} + c_{v}\)

  10. 10.

    In the decision to operate or adapt a warehouse, two types of costs are included: (a) fixed cost \(\check{c}_{f}\) and (b) variable costs \(\check{c}_{v}\), Thus, the total cost would be given by \(\check{c}_{t} = \check{c}_{f} + \check{c}_{v}\)

  11. 11.

    The model includes 5 different types of transport for the raw material, 4 for ethanol and 3 for the transportation of gasoline.

Figure 1

Graphical representation of the system network

In this model we are interested in obtaining an optimal product distribution program in the network as a function of the estimated demand from the latter year. In the set of decisions, it must be specified which refineries and warehouses should be operating in each year of project execution as well as the route of traffic of the end product.

Description of the model

Our proposal considers the design of an ethanol supply chain in the state of Veracruz, Mexico, using as base the biomass that is generated in the main sugar cane areas of the zone. The storage centers should be placed according to the geographical position of the suppliers of cane bagasse and local customers for the purchase of ethanol and gasoline (clustering process).

In the second stage, the programming of the operation of the biorefineries and the import requirements of gasoline and ethanol to cover the total fuel demand are determined. The development of both models follows.

The clustering and location of biorefineries model

For the construction of this model a distance matrix \(M= m_{ij}\) was built. The matrix contains the distances by land, from the origin i to the destination j and it involves the biomass producing centers (mills and sugarcane fields) and local customers. With the above information, the clustering process is now done. Here, the raw material sources are grouped and the optimal places to locate the refineries are determined (first stage). Now one assigns the clusters to the customers importing ethanol and gasoline (second stage). Finally, the optimal position of the finished product storage centers is determined (third stage), Fig. 2. For the construction of the mathematical model, the following indicator sets are defined.

  1. 1.

    For biomass producing fields: \(\mathcal{I}=\{ i:i=1, \dots, I\}\)

  2. 2.

    For the sugar mills: \(\mathcal{R}=\{ r:r=1, \dots, R\}\)

  3. 3.

    For the biorefineries of ethanol: \(\mathcal{J} =\{ j: j =1, \dots, J \}\)

  4. 4.

    For customers receiving ethanol: \(\mathcal{S}=\{ s:s=1, \dots , S\}\)

  5. 5.

    For customers receiving gasoline: \(\mathcal{G}=\{ g:g=1, \dots,G\}\)

  6. 6.

    For ethanol and gasoline storage centers: \(\mathcal{L}=\{ l:l=1, \dots,L\}\)

  7. 7.

    For the type of transport used: \(\mathcal{V}=\{ v:v=1, \dots,V\}\)

  8. 8.

    For the time: \(\mathcal{T}=\{ t:t=1, \dots,T\}\)

  9. 9.

    For fuel delivery centers (customers): \(\mathcal{U}=\{ u:u=1, \dots,U\}\).

Figure 2

The three stages of construction of the model

Once the geographic positions of the biomass producing centers are known, the first stage consists of grouping them into representative clusters of the whole. Then, each cluster is assigned to a biorefinery according to the installed production capacity of the same. The process becomes iterative until it reaches the convergence. For the design of the clusters the method mentioned in [19] is used. Here, the use of radial distribution was used due to the transportation costs of the product and the characteristics of it. In this process, the geographical location of the import customers is considered fixed and, the distances from these to the set of clusters are amounts set automatically by the algorithm. The following algorithm was used to locate the biorefineries according to the providers (first stage).

Clustering algorithm of the biomass producing centers (mills and cane fields).

  1. 1.

    Randomly select a number of ω biomass producing centers closest to each other in a given region and locate them in a Cartesian coordinate system. Let Ω be the set of selected suppliers.

  2. 2.

    Determine the coordinates (\(x,y\)) of the center of gravity of each cluster by solving the next non-linear program:

    $$ \operatorname{Min} z= \sum_{r=1}^{\omega} Q_{r} d_{r} \mid d_{r} = \bigl[ (x_{r} -x)^{2} + ( y_{r} -y)^{2} \bigr]^{1/2}, $$

    where \(Q_{r}\) is the average annual biomass volume from the supplier r that will be moved from this point to the refinery, and (\(x_{r}, y_{r} \)) are the (known) coordinates of the biomass suppliers.

    1. (a)

      If the coordinates are feasible (meanings that the place found is economically and technologically feasible to locate the refinery here), go to step 3.

    2. (b)

      If the coordinates are infeasible (for example, places that are not accessible, such as lakes, rivers, buildings, or more), add or remove elements of \(\mathcal{S}\) from the nearby clusters and apply Equation (1) while the infeasibility remains. Go to step 3.

  3. 3.

    Repeat the process including or eliminating suppliers in Ω and successively apply Equation (1) in every attempt. Let \(z_{1} ( x_{1}, y_{1} ), \dots, z_{l} ( x_{l}, y_{l} )\) be the sequence of optimal values found in each iteration. Choose the coordinates (\(x_{\alpha}^{*}, y_{\alpha}^{*} \)) of the optimum point that will represent the centroid of gravity of the polygon using the following criteria.

    $$ z_{\alpha}^{*} \bigl( x_{\alpha}^{*}, y_{\alpha}^{*} \bigr) = \min\bigl\{ z_{1} ( x_{1}, y_{1} ), \dots z_{l} ( x_{l}, y_{l} ) \bigr\} . $$
  4. 4.

    The point \(( x_{\alpha}^{*}, y_{\alpha}^{*} )\) is an estimator of the center of gravity of the polygon formed by the suppliers and is a candidate to locate a refinery.

  5. 5.

    Apply the process while there are clusters without refinery assignment

The clustering and location of biorefineries model

The process of assignment of customers for the purchase of ethanol is as follows (second stage). Let \(D_{js}\) be the distances by land from the biorefinery j to the customers s. Then for a given biorefinery j the customers s is assigned if

$$ D_{js}^{*} = \min \{ D_{js} \},\quad \forall s\in \mathcal{S}. $$

The process of assignment of customers for gasoline purchase is similar to the previous one.

The allocation of finished product storehouse model

For the construction of the model for the location of the finished product storehouse and product flow calculation the following notation is proposed (stage 3).

\(q_{it}\equiv\) :

Volume of biomass collected from the cane field i in the year t (in m3)

\(\check{q}_{rt}\equiv\) :

Volume of biomass collected in the sugar mill r in the year t (in m3)

\(c_{it}\equiv\) :

Cost of lifting biomass in the sugarcane field i (include loading the truck and packaging) in the year t (in USD/m3)

\(\check{c}_{rt}\equiv\) :

Cost of collecting biomass in the sugar mill r in the year t (in USD/m3)

\(\varsigma_{st}\equiv\) :

Cost of importing ethanol for customers s in the year t (in USD/m3)

\(\vartheta_{gt}\equiv\) :

Cost of importing gasoline for customers g in the year t (in USD/m3)

\(\tilde{c}_{jt}\equiv\) :

Fixed cost of biorefinery operation j in the year t (in USD/year)

\(\gamma_{jt}\equiv\) :

Variable cost of biorefinery operation j in the year t (in USD/m3-year)

\(\psi_{ijtv}\equiv\) :

Costs of transport of the biomass generated in the field i to the, biorefinery j in the year t using the transport v (in USD/m3)

\(\check{\psi}_{rjtv}\equiv\) :

Transportation costs of the biomass generated in the mill r to the, biorefinery j in the year t using the transport v (in USD/m3)

\(x_{ijtv}\equiv\) :

Volume of biomass transported from the field i to the biorefinery j in the year t using the transport v (in m3)

\(\check{x}_{rjtv}\equiv\) :

Volume of biomass transported from the mill r to the biorefinery j in the year t using the transport v (in m3)

\(\mathcal{E}_{jltv}\equiv\) :

Transportation costs of ethanol generated in the biorefinery j towards the plant of storage and mixing l in the year t using the transport v (in USD/m3)

\(\check{\mathcal{E}}_{sltv}\equiv\) :

Transportation costs of ethanol generated by customers s to the plant of storage and mixing l in the year t using the transport v (in USD/m3)

\(y_{sltv}\equiv\) :

Volume of ethanol transported from the biorefinery j to the plant of storage and mixing l in the year t using the transport v (in m3)

\(\check{y}_{sltv}\equiv\) :

Volume of ethanol transported (imported) from customers s to the plant of storage and mixing l in the year t using the transport v (in m3)

\(\tilde{{z}}_{jt}\equiv\) :

Volume of ethanol produced in the biorefinery j during the year t (in m3)

\(\check{{z}}_{st}\equiv\) :

Volume of ethanol purchased (imported) by customers s during the year t (in m3)


Unit cost of mixing ethanol with gasoline and its storage in the store l in the year t (in USD/m3)

\(\kappa_{jt}\equiv\) :

Installed production capacity of the biorefinery j in the year t (in m3/year)

\(I_{lt}\equiv\) :

Inventory of final product (ethanol mixed with gasoline) that must be in the warehouse l at the end of year t (in m3)

\(\theta_{lt}\equiv\) :

Cost of storing a cubic meter of final product in the warehouse l during the year t (in USD/m3-year)

\(\delta_{lt}\equiv\) :

Warehouse storage capacity l in the year t (in m3)

\(E_{i m_{t}}\equiv\) :

Volume of ethanol to import in the year t (in m3)

\(G_{i m_{t}}\equiv\) :

Volume of gas to import in the year t (in m3).

The strategy to optimize this model consists in obtaining the minimum mathematical expectation of the process in each year of operation of the system. So, for each \(t \in\mathcal{T}\) we have, the objective function and the model is constructed as follows:

  1. 1.

    Elements of the objective function

    1. a.

      The (fixed) costs of lifting the biomass in the sugar cane i field during the year t, plus the costs of acquiring the biomass in the mill r during the year t, are given by the function

      $$ Q ( q_{i}, \check{q}_{r} ) = \sum _{t\in\mathcal{T}} \sum_{i \in \mathcal{I}} c_{it} q_{it} + \sum_{t\in\mathcal{T}} \sum _{r \in\mathcal{ R}} \check{c}_{rt} \check{q}_{rt} = \sum _{t\in\mathcal{T}} \biggl[ \sum_{i\in\mathcal{ I}} c_{it} q_{it} + \sum_{r\in \mathcal{R}} \check{c}_{rt} \check{q}_{rt} \biggr] $$
    2. b.

      The costs of transporting biomass from the sugar cane field i towards the biorefinery j in the year t, using the transport v; plus, the costs of transporting the biomass from the mill r towards the biorefinery j in the year t, using the transport v

      $$\begin{aligned} &\sum_{i\in \mathcal{I}} \sum_{j\in\mathcal{J}} \sum_{t\in\mathcal{T}} \sum_{v\in\mathcal{V}} \psi_{ijtv} x_{ijtv} + \sum_{r\in \mathcal{R}} \sum _{j\in\mathcal{J}} \sum_{t\in\mathcal{T}} \sum _{v\in\mathcal{V}} \check{\psi}_{rjtv} \check{x}_{rjtv} \\ &\quad = \sum_{j\in\mathcal{J}} \sum _{t\in \mathcal{T}} \sum_{v\in\mathcal{V}} \biggl[ \sum _{i \epsilon\mathcal{ I}} \psi_{ijtv} x_{ijtv} + \sum _{r\in \mathcal{R}} \check{\psi}_{rjtv} \check{x}_{rjtv} \biggr] \end{aligned}$$
    3. c.

      The fixed operating costs of the biorefinery j in the year t plus the variable costs of ethanol production in that year

      $$ \sum_{j\in\mathcal{J}} \sum_{t\in\mathcal{T}} [ \tilde{c}_{jt} \alpha_{jt} + \gamma_{jt} \tilde{ {z}}_{jt} ] $$
    4. d.

      The operation costs of mixing ethanol with gasoline and its corresponding stock are given by

      $$ \sum_{l\in \mathcal{L}} \sum_{t\in\mathcal{T}} \sum_{v\in\mathcal{V}} \varrho \biggl[ \sum _{j\in\mathcal{J}} y_{jltv} + \sum_{s\in\mathcal{S}} \check{y}_{sltv} \biggr] $$
    5. e.

      The costs of acquiring ethanol through customers s in the year t

      $$ \sum_{s\in\mathcal{S}} \sum_{t\in\mathcal{T}} \varsigma_{st} \check{{z}}_{st} $$
    6. f.

      The costs of transporting ethanol from biorefineries j towards the storage centers l, using the transportation v in the year t plus the costs of transporting ethanol from customers s to the l storage centers using the transportation v in the year t plus the costs of transporting gas from customers g to the l storage centers using the transportation v in the year t

      $$\begin{aligned} &\sum_{j\in\mathcal{J}} \sum_{l\in \mathcal{ L}} \sum_{t\in\mathcal{T}} \sum_{v\in\mathcal{V}} \xi_{jltv} y_{jltv} + \sum_{s\in\mathcal{S}} \sum _{l\in \mathcal{L}} \sum_{t\in\mathcal{T}} \sum _{v\in\mathcal{V}} \check{\xi}_{sltv} \check{y}_{sltv} + \sum_{g\in\mathcal{G}} \sum _{l \in \mathcal{L}} \sum_{t\in\mathcal{T}} \sum _{v\in\mathcal{V}} \check{\xi}_{gltv} \tilde{y}_{gltv} \\ &\quad = \sum_{l\in\mathcal{ L}} \sum_{t\in \mathcal{T}} \sum _{v\in\mathcal{V}} \biggl[ \sum_{j\in\mathcal{J}} \xi_{jltv} y_{jltv} + \sum_{s\in\mathcal{S}} \check{\xi}_{sltv} \check{y}_{sltv} + \sum _{g\in\mathcal{G}} \tilde{\xi}_{gltv} \tilde{ y}_{gltv} \biggr] \end{aligned}$$
    7. g.

      The costs of transporting the final product (ethanol mixed with gasoline) from storage centers l to the consumer centers u using the transportation v in the year t

      $$ \sum_{l\in \mathcal{L}} \sum_{u\in\mathcal{U}} \sum_{t\in\mathcal{T}} \sum_{v\in\mathcal{V}} \phi_{lutv} {z}_{lutv}, $$
    8. h.

      Cost of storing the final product in the warehouse l during the year t (in USD/m3-year)

      $$ \sum_{l\in\mathcal{ L}} \sum_{t\in\mathcal{T}} \theta_{lt} \mathcal{I}_{lt} $$
  2. 2.

    Constitutive elements of the constraints (restrictive set)

    1. a.

      Product offer equations

      $$\begin{aligned} &\sum_{j\in\mathcal{J}} \sum_{v\in\mathcal{V}} x_{ijtv} \leq q_{it},\quad \forall i \in \mathcal{ I}, t \in \mathcal{T} \end{aligned}$$
      $$\begin{aligned} &\sum_{j\in\mathcal{J}} \sum_{v\in\mathcal{V}} \check{x}_{rjtv} \leq \check{q}_{rt},\quad \forall r \in \mathcal{R}, t \in\mathcal{T} \end{aligned}$$
    2. b.

      Let \(k_{1} =0.35\) y \(k_{2} =0.120\) be, the standardization constants defined as the cubic meters of ethanol produced by each cubic meter of biomass in the producing centers 1 (sugarcane field) y 2 (sugar mill). Then, the flow balance equations from the biomass supply areas to the biorefineries considering an efficiency \(\eta =0.9\) of the process are

      $$ \eta \biggl[ \sum_{i\in \mathcal{I}} \sum _{j\in\mathcal{J}} \sum_{v\in \mathcal{V}} k_{1} x_{ijtv} + \sum_{r\in \mathcal{R}} \sum _{j\in \mathcal{J}} \sum_{v\in\mathcal{V}} k_{2} \check{x}_{rjtv} \biggr] = \sum_{j\in\mathcal{J}} \sum _{l\in\mathcal{J}} \sum_{v\in\mathcal{V}} y_{jltv},\quad \forall t\in\mathcal{T} $$
    3. c.

      The flow balance equations from the biorefineries to the storage and mixing zones considering the inventory I that there must be of ethanol at the end of the year t

      $$ \sum_{j\in\mathcal{J}} \sum_{v\in\mathcal{V}} y_{jltv} + \sum_{s\in \mathcal{S}} \sum _{v\in\mathcal{V}} \check{y}_{sltv} - \sum _{u\in \mathcal{U}} \sum_{v\in\mathcal{V}} {z}_{lutv} =0, \quad\forall l \in\mathcal{ L}, \forall t\in\mathcal{T} $$
    4. d.

      Ethanol demand equations in consumer centers is

      $$ \sum_{v\in\mathcal{V}} \sum_{l \in \mathcal{ L}} {z}_{lutv} \geq D_{ut},\quad \forall u\in\mathcal{U}, t\in \mathcal{T}, $$

      \(D_{ut}\) is a random variable with known density \(f_{D} ( d )\), for \(D>0, \forall t\in \mathcal{T}\).

    5. e.

      Decision on the biorefineries that must operate in the year t

      $$ \sum_{l \in \mathcal{ L}} y_{jltv} \leq \alpha_{jt} k_{jt},\quad \forall j\in \mathcal{J}, t\in\mathcal{T}, v\in\mathcal{V} $$


      $$ \alpha_{lt} = \textstyle\begin{cases} 1& \mbox{if the warehouse } l \mbox{ works in the year } t,\\ 0& \mbox{elsewhere}. \end{cases} $$
    6. f.

      Decision on the stores that must operate in the year t

      $$ \sum_{v\in\mathcal{V}} \biggl[ \sum _{j\in\mathcal{J}} y_{jltv} + \sum_{s\in\mathcal{S}} \check{y}_{sltv} \biggr] \leq\beta_{lt} \delta_{lt},\quad \forall l \in \mathcal{ L} , \forall t\in\mathcal{T}, $$


      $$ \beta_{lt} = \textstyle\begin{cases} 1& \mbox{if the warehouse } l \mbox{ works in the year } t,\\ 0 &\mbox{elsewhere}. \end{cases} $$
    7. g.

      The optimum number of biorefineries that will operate during the planning horizon is given by

      $$ \mu= \sum_{j\in\mathcal{J}} \sum_{t\in\mathcal{T}} \alpha_{jt}, $$
    8. h.

      The total amount of warehouses required to fulfill the customer’s demand

      $$ \tilde{\mu} = \sum_{l \in \mathcal{ L}} \sum _{t\in\mathcal{T}} \beta_{lt}, $$
    9. i.

      The amount of ethanol to be imported is given by

      $$ E_{i m_{t}} = \sum_{s\in\mathcal{S}} \sum _{v\in\mathcal{V}} \check{y}_{slvt},\quad \forall l \in \mathcal{ L} ,t \in\mathcal{T}, $$
    10. j.

      The amount of gas to be imported is

      $$ G_{i m_{t}} = \sum_{g\in\mathcal{G}} \sum _{v\in\mathcal{V}} \check{y}_{glvt}, \quad\forall l \in \mathcal{ L}, t \in\mathcal{T}. $$
    11. k.

      The equation of continuity of inventories

      $$\begin{aligned} \mathcal{I}_{lt} &= \mathcal{I}_{l,t-1} + \sum _{j\in\mathcal{J}} \sum_{v\in \mathcal{V}} y_{jltv} + \sum_{s\in\mathcal{S}} \sum_{v\in\mathcal{V}} \check{y}_{slvt} - \sum_{l \in \mathcal{ L}} \sum _{v\in\mathcal{V}} {z}_{lutv} \\ & \leq\delta_{lt},\quad \forall l \in \mathcal{ L}, t\in \mathcal{T}. \end{aligned}$$
    12. l.

      Capacity constraints for inventories

      $$ \mathcal{I}_{lt} \leq\delta_{lt}, \quad\forall l \in \mathcal{ L}, t\in \mathcal{T} $$
    13. m.

      Capacity constraints for biorefineries

      $$ \sum_{i \in \mathcal{ I}} \sum_{j\in\mathcal{J}} \sum_{v\in\mathcal{V}} \eta k_{1} x_{ijtv} + \sum_{r \in \mathcal{ R}} \sum_{j\in\mathcal{J}} \sum _{v\in\mathcal{V}} \eta k_{2} \check{x}_{rjtv} - \sum_{j\in \mathcal{J}} \sum_{l \in \mathcal{ L}} \sum_{v\in\mathcal{V}} y_{jltv} \leq k_{j}\quad \forall t\in\mathcal{T}. $$

For each \(t \in\mathcal{T}\), let \(D'_{t} = \sum_{u\in U} [\delta D_{u} + ( 1- \delta ) \tilde{D}_{u} ]\) be the total demand for mixed fuel (ethanol and gasoline), with \(\delta \in(0, 1)\) in the centers of consumption in the year t. Note that, for each random variable \(D'_{t}\) we have a set of values \(v_{t}=( {x}, \check{{x}}, y, \check{y}, {z}, \check{{z}}, I, \mu, \check{\mu} )_{t}\). That, we are interested in to obtain.

$$ \sum_{t\in\mathcal{T}} \bigl\{ \min\mathbf{E} \bigl[ g ( v_{t} ) \bigr] \mid D'_{t} \bigr\} = g^{*} \bigl( v^{*} \bigr), $$

where the components of ν satisfy completely the restrictive set, and g represents the objective function, E is the mathematical expectation operator, and by virtue of Equation (28), \(g( \theta_{t} )\) depends on \(g( \theta_{t-1} )\).

Numerical example

Clusterization and biorefineries location yields a total of 5. Geographical locations provide the following information (Table 1):

Table 1 Geographical locations of biorefineries

In the construction of this instance, 4 cane fields and 3 sugar mills are considered, and its production covers the years 2014 to 2016 (3 years). The average quantities of bagasse collected during the three years were (Table 2).

Table 2 Amounts of bagasse collected per year of planning

Here, the following values will be used \(c_{it} =7\) USD/m3 and \(\check{c}_{rt} =3.5\) USD/m3. \(\forall i\in I, r\in R, t\in T\) Equation (5) is completely defined with the following values \(\psi_{ijt1} =3.5\), \(\check{\psi}_{ijt2} =4.5\), \(\check{\psi}_{ijt3} =5\).

For Equation (6), the following costs were considered (Table 3)

Table 3 Fixed and variable costs used in this instance

Analogously, for Equation (7) we use \(\varrho=30\) USD/m3 and for Equation (8) we used \(\zeta_{1t} =690\) and \(\zeta_{2t} =950\) for \(t=1, 2, 3\). In this model, 3 storage centers (\(S_{c} \)) were evaluated (prefixed in advance). The shipping costs used are as follows (Table 4), Equation (9). The value used for \(\xi_{gltv}\) were \(\check{\xi}_{1ltv} =200\), \(\check{\xi}_{2ltv} =250\) for all \(l, t, v\).

Table 4 The shipping costs \(\xi_{jltv}\) and \(\check{\xi}_{jltv}\), for all \(t\in T\)

The distribution of the product is done from the storage centers to 4 consumption centers (\(C_{C} \)). Table 5 shows the transportation costs, Equation (10).

Table 5 Costs of transporting the final product to the consumption centers for all \(t\in T\)

The storage costs of the final product are given by \(\theta_{1t} =1500\), \(\theta_{2t} =1800\), \(\theta_{3t} =1450\), \(\theta_{4t} =1550\). Finally, the demand for the final product was sampled from the Box–Muller Method [21]. The values used were, Table 6.

Table 6 Values used for demand sampling

Where \(N \sim ( \mu, \sigma) \) means normally distributed with mean μ and standard deviation σ.

Results obtained

In the analysis of this instance for obtaining results the following initial values were used. \(I_{10} =0\), \(I_{20} =0\), \(I_{30} =0\). The capacities of biorefineries were \(k_{11} =90{,}000\), \(k_{21} =80{,}000\), \(k_{31} =60{,}000\), \(k_{41} =70{,}000\), \(k_{51} =73{,}000\). The storage capacities of the storage of the final product were \(\delta_{11} =2{,}500{,}000\), \(\delta_{21} =2{,}800{,}000\), \(\delta_{31} =5{,}400{,}000\). The volumes of harvested bagasse were \(q_{11} =8741\), \(q_{21} =24{,}142\), \(q_{31} =3450\), \(q_{41} =19{,}166\), \(\check{q}_{11} =2182\), \(\check{q}_{21} =9182\), \(\check{q}_{31} =9783\).

The results obtained for year 1 were: \(x_{4311} =19{,}166\), \(x_{4412} =477{,}603.2\), \(y_{3112} =6228.95\), \(y_{4113} =138{,}971\), \(D_{11} = z_{1111} =5696\), \(D_{21} = z_{1212} =8770\), \(D_{31} = z_{1311} =2850\), \(D_{41} = {z}_{1411} =11{,}724\), \(I_{11} =116{,}160 \alpha_{31} =1, \alpha_{41} =1, \beta_{11} =1, {u} =1, \tilde{{u}} =1\). The value of the objective function was 201,657,400 USD.

The initial conditions of year 2 were \(I_{11} =116{,}160\), \(I_{21} =0\), \(I_{31} =0\). The optimal solution for year 2 is as follows: \({x}_{2412} =2435\), \({x}_{4412} =557.308\), \(\alpha_{41} =1\), \(y_{4111} =2435\), \(\tilde{y}_{1111} =116{,}160\), \(D_{12} = z_{1121} =7864\), \(D_{22} = z_{1221} =62{,}994\), \(D_{32} = {z}_{1321} =12{,}005\), \(D_{42} = z_{1421} =9578\), \(I_{12} =26{,}154\), \(G_{i m_{2}} =116{,}160\), \(\beta_{11} =1\). The value of the objective function was 92,839,550 USD.

The previous result constitutes the entry conditions of year 3 through inventory in warehouses, i.e., \(I_{12} =26{,}154\), \(I_{21} =0\), \(I_{31} =0\). The optimal solution for year 3 were as follows: \({x}_{4311} =19{,}166\), \(\alpha_{31} = \alpha_{41} =1\), \(y_{3112} =6228.950\), \(\check{y}_{1113} =128{,}905.1\), \(y_{4112} =22{,}750\), \(\tilde{y}_{1111} =26{,}154\), \(D_{13} = z_{1131} =10{,}500\), \(D_{23} = z_{1231} =16{,}900\), \(D_{33} = z_{1331} =11{,}300\), \(D_{43} = z_{1431} =13{,}800\). In this year, the objective function had a value of 245,450,500 USD.


In this document, a mixed integer linear programming model (MILP) has been developed to determine the levels of operation of a group of biorefineries that process ethanol. The objective of the global model is to determine which plants should operate and at what levels of their capacity. The same situation is presented for storage and mixing warehouses. The model also allows obtaining the quantities of biogas and gasoline that must be imposed on the system so that it works correctly. The first part of this model required using a clustering model to conveniently group the biorefineries that must operate around the producing centers (sugar mills and sugarcane fields).

Although originally the problem is posed as a stochastic model, the solution strategy was to obtain 500 runs of the three years (one after the other) and evaluate the inventory parameters resulting from year 1 and 2 to feed the model in years 2 and 3 respectively. From this, the minimum value of the sequence of the three years is obtained.

Special mention should be made for the result of year 2. Here, there was no need to operate the global system because the inventory for year 1 was sufficient to supply the demand for year 2 with the reserves saved in year 1. Note that the model requires production and operation of the system again because reserves are depleted in year 2, requiring their activity again for year 3.

The stochastic nature of the original model deserves to be approached from another perspective using heuristic methods. The size of the original instance was reduced to evaluate the efficiency of the model. However, this can easily be scaled to the desired size. Future extensions to the proposed model may consider the transport capabilities involved, the levels of customer service and even other, more complicated, network topologies. In any case, the resulting instance grows disproportionately as a function of the number of variables involved, as well as its number of constraints.

The exercise presented here shows a wide utility for modeling georeferenced localization systems as well as for the design of the distribution network of its products. Special attention is given to the design of networks for the distribution of perishable products.



PEMEX: Petróleos Mexicanos


Mlt: Millions of liters


EEA: European Enviroment Agency


IEA: International Energy Agency


SENER: Secretaría de Energía


REN21: (Renewable Energy Policy Network for the 21st Century)


DOF: Official Gazette of the Federation


CRE: Energy Regulatory Commission


  1. 1.

    An H, Wilhem WE, Searcy SW. A mathematical model to design a lignocellulosic biofuel supply chain system with a case study based on a region in Central Texas. Bioresour Technol. 2011;102(17):7860–70.

    Article  Google Scholar 

  2. 2.

    Athawale VM, Chakraborty S. Facility location selection using PROMETHEE II method. In: Proceedings of the 2010 international conference on industrial engineering and operations management. Bangladesh: Dhaka; 2010.

    Google Scholar 

  3. 3.

    Basar A, Kabak O, Ilker T. A decision support methodology for locating bank branches: a case study in Turkey. Int J Inf Technol Decis Mak. 2017;16(01):59–86.

    Article  Google Scholar 

  4. 4.

    Boloori AA, Farahani RZ. Facility location dynamics: an overview of classifications and applications. Comput Ind Eng. 2012;62(1):408–20.

    Article  Google Scholar 

  5. 5.

    DOF (Official Gazette of the Federation). 2017. Modification to the Official Mexican Standard NOM 016 CRE-2016. CRE (Energy Regulatory Commission), EEA (European Environment Agency). 2008. Annual European Community Greenhouse, (2008),, [accessed October 17, 2017].

  6. 6.

    EEA (European Environment Agency). 2008. Annual European Community Greenhouse Gas Inventory 1990-2006 and inventory report, Brussels, Belgium, (2008),, [accessed January 12, 2018].

  7. 7.

    Ehsani AH, Danaei A, Hemmati M. A mathematical model for facility location in banking industry. Manag Scie Lett. 2014.

  8. 8.

    Farahani RZ, Hekmatfar M, Boloori AA, Nikbakhsh E. Hub location problems: a review of models, classification, solution techniques, and applications. Comput Ind Eng. 2013;64(4):1096–109.

    Article  Google Scholar 

  9. 9.

    Farahani RZ, SteadieSeifi S, Asgari N. Multiple criteria facility location problems: a survey. Appl Math Model. 2010;34(7):1689–709.

    MathSciNet  Article  MATH  Google Scholar 

  10. 10.

    García CJ. A decision model for bank branch site selection: define branch success and do not deviate. Socio-Econ Plan Sci. 2017.

    Article  Google Scholar 

  11. 11.

    Golabi M, Nedjati A, Izbirak G. Multi-objective ATM location problem in a Metropolitan City. In: New challenges in banking and finance. 2017. p. 39–46.

    Google Scholar 

  12. 12.

    IEA (International Energy Agency). Sustainable Production of Second-Generation Biofuels, (2010), Paris, France., [accessed march 12, 2017].

  13. 13.

    IEA (International Energy Agency), CO2 emissions from fuels combustion highlights. Paris, France, (2014),, (2014), [accessed December 12, 2017].

  14. 14.

    IEA Atlas of Energy. 2016, CO2 Emissions from fuel Combustion,!/tellmap/1378539487, [accessed January 12, 2018].

  15. 15.

    Leduc S. Development of an optimization model for the location of biofuel production plants. Doctoral Thesis. Division of Energy Engineering Department of Applied Physics and Mechanical Engineering, (2009), Lulea, University of Technology.

  16. 16.

    Melo MT, Nickel S, Saldanha-da-Gama F. Facility location and supply chain management—a review. Eur J Oper Res. 2009;196(2):401–12.

    MathSciNet  Article  MATH  Google Scholar 

  17. 17.

    Mohrn A, Raman S. Lessons from first generation biofuels and implications for the sustainability appraisal of second-generation biofuels. Energy Policy. 2013;63(100):114–22.

    Article  Google Scholar 

  18. 18.

    PEMEX (Petróleos Mexicanos), Bidding results for the acquisition of anhydrous ethanol. Press Release no. 24. 03/19/2015, (2015),, [accessed October 5, 2017].

  19. 19.

    Pérez-Lechuga G. Optimal logistics strategy to distribute medicines in clinics and hospitals. J Math Ind. 2018;8:2.

    MathSciNet  Article  Google Scholar 

  20. 20.

    REN21. Renewables 2017 Global Status Report. Paris: REN21 Secretariat. Paris. France, (2017),, [accessed February 19, 2018].

  21. 21.

    Rubinstein RY, Kroese DP. Simulation and the Monte Carlo method. Wiley series in probability and statistics. Neq York: Wiley; 2008.

    Google Scholar 

  22. 22.

    Ruíz-Hernández D, Delgado-Gómez D. The stochastic capacitated branch restructuring problem. Ann Oper Res 2016;246(1–2):77–100.

    MathSciNet  Article  Google Scholar 

  23. 23.

    Ruíz-Hernández D, Delgado-Gómez D, Joaquín López-Pascual J. Restructuring bank networks after mergers and acquisitions: a capacitated de location model for closing and resizing branches. Comput Oper Res. 2015;62:316–24.

    MathSciNet  Article  MATH  Google Scholar 

  24. 24.

    Salgado S, Aranda E, Castelán M, Ortiz H, Palma D, Córdova S. Qué hacer con la paja de la cosecha mecanizada de la caña de azúcar. Agro Productividad 2014;7(2):3–8.

    Google Scholar 

  25. 25.

    Samanlioglu F. A multi-objective mathematical model for the industrial hazardous waste location-routing problem. Eur J Oper Res. 2013;226(2):332–40.

    MathSciNet  Article  MATH  Google Scholar 

  26. 26.

    SENER (Secretary of Energy). Analysis and proposal for the introduction of anhydrous ethanol in the gasolines sold by PEMEX. Mexico DF, (2014),, [accessed December 19, 2017].

  27. 27.

    WBA. Global Bioenergy Statistics 2017. World Bioenergy Association Stockholm, Sweden (2017),, [accessed March 12, 2017].

Download references


Thanks to the University Autonomous of the Hidalgo State for the funding of this paper. Special recognition to the anonymous referees for the review and adaptation of this document, and to Anne-Marie Armstrong from the Direction of Academic Overcoming of the University Autonomous of the State of Hidalgo, Mexico for the revision of style and the adaptation to the English language. We gratefully acknowledge financial support granted by the sectoral fund National Council of Science and Technology (CONACyT)-Secretariat of Energy-Energetic Sustainability Program.

Availability of data and materials

The authors states that for the construction of this document he used real information. The confidentiality of the same is supported and available in electronic format for review by the referees (data and results of computer runs) of the journal as long as the reliability of the journal is not violated.


The author states that we have the necessary funds for the payment of the publication of this document if the Editorial Committee deems it feasible for that purpose. Such funds come in part from the University Autonomous of the Hidalgo State, Mexico.

Author information




The authors of this proposal declare that all reported results come from them, and, in the necessary cases, the original authors of said proposals are given credit. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Gilberto Pérez-Lechuga.

Ethics declarations

Competing interests

The authors states that he has no conflict of interest with other authors or with the organization for which he served as a consultant in the solution of this case.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Pérez-Lechuga, G., Aguilar-Velázquez, S.L., Cisneros-López, M.A. et al. A model for the location and scheduling of the operation of second-generation ethanol biorefineries. J.Math.Industry 9, 3 (2019).

Download citation


  • Biorefineries location
  • Operations scheduling
  • Linear integer programming
  • Stochastic processes
  • Supply chain