Machine learning-based simpli ﬁ ed methods using shorter wind measuring masts for the time ahead wind forecasting at higher altitude for wind energy applications

. Since wind is a ﬂ uctuating resource, the integration of wind energy into the electricity grid necessitates precise wind speed forecasting to maintain grid stability and power quality. Machine learning models built on different algorithms are widely used for wind forecasting. This requires a vast quantity of past wind speed data collectedatthehublevelsofthewindelectricmachinesemployed.Tallmetmastsposeavarietyofpracticalissuesin termsofinstallationandlong-termmaintenance,whichwillgrowmorechallengingasnext-generationwind turbines come with larger capacities and higher hub heights. In this paper, we propose four non-conventional methods for the time ahead forecasting of wind speed at a higher height by utilizing the wind speed data collected with relatively shorterwind measuring masts. Weemploymachine learning-based models and rely onthe principle of interrelation between wind speeds at differentaltitudes inourinvestigations. Wind speed forecasts generated by the newmethods at analtitudeof80mabovethe ground level using wind speed data measured atloweraltitudes of 50m and20m are of industrially acceptable accuracy. Thesimpli ﬁ ed physicalrequirementssuch methods demand far outweigh the marginal fall in prediction accuracy observed with these methods.


Introduction
As a clean and renewable energy source, the wind continues to play a significant role in addressing the global energy crisis.The global wind power installations reached 743 GW in 2020 with the addition of 90 GW in 2020 alone, showing the phenomenal progression of this energy segment [1].Going by the trends in wind energy development and utilization, the wind is expected to provide 20% of the global electricity production by 2030 [2].Since wind is an intermittent resource, precise wind forecasting is essential for the proper integration of the varying wind energy into the electricity grid for ensuring grid stability and power quality.Accurate wind forecasts on different lead time scales assist wind farms in real-time grid control, efficient load dispatch planning, deciding on reserve requirements, market trading, maintenance scheduling and the like.Effective wind forecasting and planning can result in better exploitation of useful wind energy [3].As the energy produced from the wind scales with the cube of wind velocity, even small errors in the assessment of wind speed can contribute to larger errors in the energy production estimate [4,5].Because of these concerns, wind forecasting continues to be a field of great research interest to the wind energy sector.
Extensive research has been conducted in the past in the area of short-term wind prediction, giving attention to forecasting methods, types of models as well as meteorology [6].Diverse investigations carried out in wind forecasting have yielded appreciable results for wind energy applications [7,8].Based on the adopted methodology, wind forecast models are primarily classified into physical, statistical, data learning, and hybrid models.The physical models employ various atmospheric parameters and are useful to identify recurring trends and to make long-term predictions.Statistical models assume that the variations in wind speed are stochastic.Recent studies have however shown that the complex stochastic-like fluctuations in wind conditions arose from the chaotic dynamics of the underlying system [9][10][11].One study has validated a method of nonlinear autoregressive network with exogenous input (NARX) for the hourly forecasting of wind speed [12].Short-term wind power forecasting using linear and non-linear data mining algorithms has also been demonstrated with successful field trials [13].Various artificial neural network-based data learning techniques are also gaining prominence in the recent literature for wind forecasting.A layer recurrent neural network (LRNN) method has been demonstrated for the prediction of wind speed up to 5 days ahead in 5 min steps [14].Another study designed an adaptive neuro-fuzzy inference system (ANFIS) based intelligent machine for utility operators to fulfill the forecasting obligations as per Indian electricity grid code (IEGC) 2010 regulations [15].Random Forest model has been shown in one particular study to yield better wind speed prediction accuracy over the classical neural network prediction method [16].Hybrid models developed by combining various methods like physical, statistical and machine learning methods are found to improve predictive accuracy [17][18][19][20].Simple auto regressive (AR) wind speed forecast models have been found to improve prediction accuracy when combined with the wavelet decomposition [21].Short-term wind forecasting by combining different variants of support vector regression (SVR) with wavelet decomposition has also been reported [22].Another study has attempted wind speed data analysis by combining wavelet transform, support vector machines and genetic algorithms [23], wherein wind speed data has been analyzed by decomposing it into several components and integrating it with a genetic algorithm parameter optimized support vector machine.A method of correcting the wind speed forecast errors by building a correction topography through statistical means from measured data and then applying the same to the wind forecasts generated by a conventional model has also been reported [24].Another study has demonstrated very short-term wind forecasting by using a vector autoregressive model integrated with large-scale meteorological information [25].A new enhanced ensemble method has shown improved performance in the short-term probabilistic forecasting of wind speed [26].Hybrid models involving a combination of time series models with pressure, temperature and precipitation as inputs to predict monthly average wind speed values have also been investigated [27].A hybrid model that uses ensemble empirical mode decomposition for noise reduction and a modified wind-driven optimized backpropagation neural network has been employed for predicting 10-min-ahead and 30-min-ahead wind speed [28].One study has analyzed wind speed data with three different machine learning algorithms namely radial basis support vector machine, multilayer feed-forward neural network and adaptive neuro-fuzzy inference system and demonstrated the supremacy of support vector machine (SVM) variant over the neural network and neuro-fuzzy techniques [29].Another study has shown Improved performance of machine learning-based wind forecast models with the inclusion of atmospheric stability parameters in them [30].It has been shown that the gradient boost machine learning regressor is more accurate compared to any of the existing analytical models for the runtime performance prediction whenever the range of the prediction follows that of the training.However, the two analytical models namely the 2D-plate model and the fully-connected model outperform the machine learning regressor when using the extrapolation method [31].Our recent study [32] has demonstrated the spatio-temporal independent applicability of one time trained machine learning wind forecast models within a wind regime, which has been applied in this research.
Even as different methods are available for the effective forecasting of wind speed at an altitude of interest, a considerable quantity of historical wind speed data at the same location and altitude is necessary for the modeling and prediction, which is usually accomplished by establishing and retaining tall wind masts at the location of interest.The wind masts are tubular or lattice-type steel towers as shown schematically in Figure 1.Tubular towers are made of galvanized steel or stainless steel tubes with customizable heights.Lattice masts are made of steel lattice structures, usually connected one above the other to reach the required height.Tubular masts are normally preferred for 30 m to 85 m altitudes while the lattice masts are chosen for altitudes above 100 m.The installation and prolonged maintenance of high-altitude wind measuring masts give rise to several practical difficulties from an engineering standpoint.The tendency to lose the structural stability of mast in heavy wind load, the need for vast guy wire support for wind mast, the requirement of a larger area for the high mast together with the supporting guy wires, the propensity of component failures due to various reasons like lightning strikes, high cost associated with the de-erection and restoration of wind masts for rectifying the issues with malfunctioning anemometers and sensors, etc. are a few of such practical challenges usually to be addressed.Cost escalation with the increase in mast height is yet another disadvantage.All these practical difficulties get more complicated when wind forecasting is required to be done at increased heights as the new age wind turbines come with enhanced capacities and increased hub heights since the increased energy production by taller wind turbines far offsets the increased tower cost [33].Time ahead wind speed forecasting at higher altitudes by using relatively shorter and less challenging wind measuring masts in place of the taller masts has not been reported in the literature so far.In this paper, we investigate the possibility of predicting future wind speeds at a higher altitude using machine learning predictive models by processing the present and past wind speed data measured at relatively lower heights.In this respect, four non-traditional methods are investigated in this study.
In the usual practice, the machine learning models are trained with a substantial quantum of historical wind data at an altitude and the trained model is subsequently employed for generating the values ahead at the same altitude.In contrast, we, in one of our methods, train the model by inputting each set of past data on the wind speed time series at a lower altitude against the succeeding value on the wind speed time series at a higher altitude.This has been attempted, keeping in view the fact that each wind speed value at a higher altitude is directly dependent on the corresponding value at a lower altitude at the same geographical position.In yet another approach, we attempt to incorporate the power law of wind shear for wind speed forecasting at a higher altitude by utilizing wind speed data measured at two lower heights.The wind speed values are first predicted at two lower altitudes using the machine learning model and the corresponding future value at the higher altitude is then calculated using the power law.We also investigate the combined application of the above two approaches towards achieving further simplification of the wind measuring physical setup for the forecast of wind speeds at higher altitudes.We further examine the applicability of the machine learning model trained with data from one location for making forecasts at a different location within the geographical area of the wind regime, with a view to simplifying the wind measuring mast systems required to be set up in the field.The study establishes the prospect of employing low-height wind measuring masts in the field for reliable wind forecasting at higher altitudes, wherein the practical advantages offered by such simplified physical systems more than compensate for the marginal falls in forecast accuracies brought about by these new methods.

Machine learning
Machine learning is an area of artificial intelligence that works by identifying patterns from past data and predicting upcoming information.It is a method of improving the performance of a computational software program by itself with experience.According to a widely accepted definition [34], 'A computer program is said to learn from experience E with respect to some task T and some performance measure P, if its performance on T, as measured by P improves with experience E'.In machine learning, past experience is fed to the machine as input, and it gives the output as a typical model capable of solving future problems of the same nature.Here, past experience is collected for the purpose of imparting training.An abstract target function is determined that well describes the relationship between existing input and desired output.Subsequently, a suitable machine learning model is selected to approximate the target function.In the end, a suitable algorithm is used to build the model from the training examples.
Numerous learning algorithms are available to select from for this purpose and new ones are also being put forth by researchers.Based on the different approaches of learning methods, machine learning can be grouped into four categories; supervised, unsupervised, semi-supervised and reinforcement.Supervised learning can be further divided into classification and regression methods.In unsupervised learning, either clustering or reducing the dimensionality of the input information is followed.The different categories of machine learning are illustrated in Figure 2.
In supervised learning, the algorithm determines the optimal function that describes the relationship between the input and output target variables through an iterative process of predictions and adjustment until the prediction and the targeted value reach a maximal efficiency.Supervised learning can be grouped into two categories of classification and regression methods.The classification problem aims to predict labels or classes into which the new data will fall.Classification is better suited for problems that can be broken into categorical decisions (e.g.clinical trials).It can be applied to both structured and unstructured data.The regression method aims at determining a score on a continuous scale based on the value of one or more predictor variables.The output in a regression model has a quantitative value that can be ranked (e.g., energy market modeling).
Unlike in supervised learning, there is no target value in an unsupervised learning environment.Instead, it attempts to detect underlying structures in the data.Unsupervised learning can be either through a clustering approach or a dimensional reduction approach.The clustering approach of unsupervised learning is an analytical technique to develop meaningful subgroups from enormous samples based on the similarities amongst them (e.g., grid-mapping used in image recognition).When the number of features significantly outnumbers the number of observations, a dimensional reduction method is suitable to reduce the visual complexity.In this method, the number of random features under consideration is reduced by replacing it with a principal set (e.g., shape descriptors in text recognition problems).
Semi-supervised learning is used when target variables are not available for the entire data.In such cases, semisupervised learning allows the model to integrate the available unlabelled features for supervised learning.This method is employed in predicting difficult-to-measure quality variables (e.g., melt index and octane number in determining the quality of ammonia extraction).
Reinforcement learning enables an agent to learn from the interactions in its environment by trial and error using feedback from its own actions and experiences (e.g., autonomous control of vehicles).While in supervised learning, the agent gets the correct set of actions for doing a task, reinforced learning uses a sequence of rewards and penalties as signals depending on whether the decision is towards the required final goal.

Power law of vertical wind shear
The availability and speed of wind resources are highly influenced by the geography and climate of the area.The wind speed is found to increase with altitude owing to the thermal gradient formation.At far away elevations of the order of one kilometer, the impact of land cover on wind speed is almost negligible [35].The surface friction characteristic, on the other hand, has a substantial impact on wind speed in the air layers close to the land.The terrain's surface roughness creates wind shear, which causes vertical wind speed variations.In the assessment of wind shear, accurate measurement of surface roughness is a major challenge [35].To characterize wind shear in the lowest layers of the atmosphere, some analytical models based on long-standing experience in wind engineering are commonly used.
Extrapolating wind speed from a relatively lower height to a required higher height is a typical strategy used in the wind energy sector when dealing with wind turbines with high hub heights [36].Different methods named power law, logarithmic law and log-linear law are commonly used for vertical wind speed extrapolation [37][38][39][40][41][42].These methods are based on combined hypothetical and experiential approaches and are therefore vulnerable to significant errors due to varying features such as ambient temperature, atmospheric pressure, humidity, terrain type, surface roughness, elevation, time of the day, seasons of the year, atmospheric stability, mean wind speed, wind direction and the like [43][44][45][46][47].There has been a lot of work done in the past to improve these methods in order to get more reliable results.A data-driven method using symbolic regression has been proposed recently for improved vertical wind speed extrapolation [48].Among these different methods, the power law of vertical wind shear is widely accepted in the wind energy industry across geographies as the simplest method for vertical wind speed extrapolation.
The power law is expressed as: where v 1 and v 2 are the wind speeds at a lower reference height h 1 and a higher reference height h 2 respectively and a is the coefficient of wind shear [49].The coefficient of wind shear is also known as the Hellmann exponent [50] and this empirical constant is usually derived over the range of heights of practical interest.The coefficient of wind shear a is derived from equation (1) as: The wind shear coefficient a essentially combines all of the influential components into a single feature.It is normally assumed as constant for a given set of height levels, even though this is not exactly the true scenario [49].As a thumb rule, a is often assumed as 1/7, or 0.143, and the power law, in that case, is called the 1/7th power law.As per recommendations of the American Society of Civil Engineers, the value of a is taken as 1/9 for flat, unobstructed areas and water surface, 1/6.5 for open terrain and 1/4 for urban and suburban areas [51].The value of a for a given site has been found to vary significantly with the variations in height levels, ambient temperature, land features, time of the day, the month of the year, etc. [52][53][54].When no significant obstacles are present on the ground, wind shear coefficient a obtained from equation ( 2) by inputting the wind speeds at the two heights can be taken as the most reliable value at that instant, particularly in the altitudes ranging from 10 up to 100-150 meters [35].The wind speed v 3 at that instant at a higher altitude h 3 is estimated by substituting the computed wind shear coefficient a in the equation: 4 Study area and application strategy In the initial phase of the study by employing the SVR modeling, the first part of this phase is concentrated at location S1.The wind speed data at the three altitudes in the first year (2013) have been used to train and validate the SVR models to make sure successful learning of the  Apart from determining the RMSE over the whole prediction timeline, we also follow a new approach of breaking the prediction timeline into segments and assessing the percentage of such segments qualifying against pre-set threshold RMSE values.This approach is adopted to minimize the misleading effect of outlier data on the RMSE values calculated and to enable better interpretation of the model performance.In the next part of the study, all the above-stated investigations have been repeated for the second location S2 to validate the location-independent functioning of the proposed predictive models.In the final phases of the study, KNN and GBM regression models are also employed for the forecasting in place of the SVR models, and the results are compared.

Methods of forecasting
The five methods involving different wind mast configurations investigated in this study are described here.All these methods are first analyzed extensively using SVRbased machine learning forecast models as detiled below.

Method 1
In this method, which is the conventional method of wind speed forecasting using machine learning models, the historical data in the year 2013 at the altitude of 80 m have been used for training the model, which is then used to predict at the same location at the same altitude in the year 2014.In a finite time series, the state of the system at a particular point of time always depends on some timelagged events.The extent of dependence of one value on the previous values in a given time series can be estimated by obtaining the mutual information between delayed time series [57].Based on this, it is assumed that the number of past wind speed values that each occurrence depends on can be determined from the time delayed mutual information function.The mutual information comes down with the increase in time delay and after a certain limit, it  starts to level-off as illustrated in the typical plot in Figure 6.It is taken in the present analysis that this leveling-off starts with 72 data points and hence it is assumed that every data point on the wind speed time series is a function of its previous 72 data points.Accordingly, the training of models with the past data has been done in such a way by inputting the preceding 72 continuous values against each wind speed value ahead.Each of the one-step ahead predictions over the testing period (the year 2014) is generated by inputting the preceding 72 continuous data points into the trained model.The predicted values are appended in sequence and compared with the actual values for the initial 1000 data points in Figure 7a.

Method 2
In this method, we rely on the fact that each wind speed value at a higher altitude is directly dependent on the corresponding value at a lower altitude.Consequently, instead of training the model with a definite set of past data against each data ahead along the wind speed time series at the height of 80 m, we use the past data on the wind speed time series at the lower altitude of 50 m against the succeeding value at the height level of 80 m.Predictions at 80 m altitude over the test period are generated in a similar fashion by inputting values from the time series at 50 m altitude into the trained model.Training of the model and generating the forecasts are illustrated using block diagrams in Figure 11.The closeness of predictions in this method is shown for the initial 1000 data points in Figure 7b.

Method 3
In this method, the wind speeds are predicted separately along the wind speed time series at 50 m and 20 m altitudes as per procedures followed for 80 m altitude in Method 1, and then the corresponding value at the height of 80 m is determined using the power law .The profile made with one step ahead predictions in this method is shown in relation to the actual values for the initial 1000 data points in Figure 7c.

Method 4
This method combines the procedures followed in Methods 1, 2 and 3 above in such a way that one model is used to predict one step ahead predictions at the height of 20 m using past data at the same height and another model is employed to predict along the 50 m altitude using past data at the 20 m altitude.The values thus predicted at the altitudes of 50 m and 20 m are then extrapolated to 80 m altitude by using the power law method.The block diagram in Figure 12 illustrates the forecasting process in this method.Figure 7d compares the predicted values at the height of 80 m in this method against the actual values for the initial 1000 data points.

Method 5
This method follows the same principle relied upon in Method 2, with the difference that the training of the model is carried out using the data from one location, termed the reference location, and that the trained model is used to generate forecasts at another location at a distance from the reference location.Whereas steps (a) and (b) in Figure 11 are performed for the same location in Method 2, in the present method, these two steps are carried out at two different locations.The data at location S2 is used here to train the model before applying the same to generate forecasts at location S1.The predicted values in this method are shown in relation to the actual values for the initial 1000 data points in Figure 7e.

Results and discussion
In the analyses of the forecast results, the forecast profile made of one step ahead predictions appended over the one-    3.This clearly shows the time-independent characteristic of the trained machine learning models in wind speed forecasting, as the models one time trained with data for a definite period can be employed for forecasting tasks along the respective wind speed time series near as well as far away in time from the training data.
Method 1 is the conventional route of machine learningbased forecasting along a given wind speed time series.Intelligent training and testing of the model have been carried out in this process by determining the optimum data size that has an influence on the succeeding wind speed value and designing the training and testing of the model in such a way to fully absorb this interdependency.This approach has yielded better model performance, obtaining an RMSE value of 0.93 m/s for the one-step (one hour) ahead predictions mapped along the one-year forecast period.It is further seen in this case in Figure 8b that when a limiting RMSE of 1 m/s is taken, RMSE values of 70.18% of quarter-a-day prediction segments over the one year test period fall below this mark.When limiting RMSEs of 1.5 m/s and 2 m/s are taken, the percentage of such prediction segments to qualify increases to 95.03% and 99.39% respectively.Though this conventional method is the most accurate one among all the five methods investigated, it is essential in this method to construct and maintain a wind measuring mast system of 80 m height for the model training period as well as during the entire forecast timeline.
In Method 2, the RMSE of one step ahead predictions along the one-year period is found to be 0.97 m/s, which is slightly more than that in Method 1.When a limiting RMSE of 1 m/s is benchmarked, as shown in Figure 9b, 65.96% of quarter-a-day prediction segments along the prediction period of the year 2014 are found to possess RMSE values below this limiting value.Though the model error in this method is marginally higher compared to Method 1, this method, on the other hand, provides considerable relief in the field preparedness as a wind mast of 80 m height is required during the initial one year training period only, after which a wind mast of 50 m height only needs to be retained for the subsequent forecast periods.
The complexity of the wind measuring mast system is further simplified in Method 3 as this method requires a wind mast of 50 m height only to be used throughout the training and subsequent testing periods with anemometers fitted at 50 m and 20 m height levels.This relaxation, however, is accompanied by a fall in model accuracy compared to those achieved in the above two methods.The RMSE of one step ahead predictions along the one-year period is obtained in this method as 1.17 m/s and as illustrated in Figure 9d, 53.34% of the quarter-a-day prediction segments only qualify when a limiting RMSE of 1 m/s is insisted.
Method 4 requires the most simplified wind mast set up to be established and retained as a mast of 50 m with anemometers fitted at 50 m and 20 m height levels is required in this method to cover the training period and the same can be replaced with a mast system of 20 m height for generating forecasts during the subsequent test periods.This simplified physical arrangement can be pursued if relaxed prediction accuracy is acceptable as the RMSE of one step ahead predictions over the one-year period obtained in this method is 1.67 m/s, the highest among the corresponding values obtained in all the methods.Further, as provided in Figure 9f, only 52% of the quartera-day prediction segments along the one-year prediction timeline is found to have RMSE values below 1 m/s in this case.
Method 5, which is a reorganized version of Method 2, is slightly less accurate compared to Method 2 as the RMSE of one step ahead predictions over the one-year test period    4, with the corresponding RMSE values specified thereon.The advantageous aspects from the practical standpoint in setting up and maintaining the physical structures of wind measuring mast systems for these methods in relation to the levels of model accuracies are briefly described in this table.
The five methods yield results of similar nature and trend when employed for identical analyses at location S2 as well, as illustrated in Figure 14b and d.The location S1 is taken here as the reference location for model training in Method 5 in this case.Based on both the yardsticks of (i) RMSE over the whole forecast period and (ii) RMSE values of segmented prediction periods, as in the case of location S1, the Methods 1, 2, 5, 3 and 4 emerge as options in the increasing order of RMSE values in the case of location S2 also, which implies that the research findings are locationindependent.
After the analyses using SVR models as described above, models based on KNN and GBM algorithms are also employed in place of the SVR models.The RMSE values of predictions for the two locations using the five different methods, each based on the three machine learning models over the one year test period are compared in Figure 15.All the models show comparable forecast performance, with the GBM regression models showing better results compared to the other two in all prediction scenarios.Except in Method 1, SVR models come second in prediction accuracy in all instances, while in Method 1, the KNN model slightly outperforms the SVR model at both the test locations.It is also observed that except in the instance of using the KNN model at location S1, Methods 1, 2, 5, 3 and 4 continue to emerge as methods in the increasing order of RMSE values as in the case of results  obtained with the SVR models.Methods 3 and 5 interchange the positions in this order at location S1 when the KNN model is employed.

Conclusion
Wind speed forecasting at the hub heights of wind turbines is an important prerequisite for the proper planning and effective grid integration of wind electricity.This is usually achieved by employing computational predictive models and utilizing a vast amount of wind speed data measured by installing and retaining high-altitude wind measuring masts in the field.Tall wind mast structures reaching the hub heights of huge wind machines are difficult and expensive to erect and upkeep for long periods.This study explored alternative methods to obtain time ahead wind forecasts at a high altitude with accuracy levels acceptable to the wind energy industry, by employing low-height wind measuring mast structures that are easier to install and maintain.In the first phase, five distinct methods requiring different wind measuring physical arrangements in the field  were investigated using the SVR-based machine learning wind forecast models.While the conventional Method 1, trained and tested with size optimized data sets, yielded an RMSE of 0.93 m/s for one step (one hour) ahead predictions mapped along the one year forecast period, that value increased to 0.97 m/s, 1.17 m/s, 1.67 m/s and 1.06 m/s for the Methods 2, 3, 4 and 5 respectively, when employed at the location S1.Furthermore, the RMSE values of predictions over quarter-a-day time segments along the one-year forecast duration analyzed for the location S1 using Methods 1 to 5 revealed that 70.18%, 65.96%, 53.34%, 52% and 59.04%, respectively of such predictions had individual RMSE values less than 1 m/s.Analyses with data from the location S2 also produced similar results in terms of comparable accuracy levels and order of error variation in respect of the different methods, implying generality in the application.The different methods show comparable performance when employed with other machine learning models of KNN regression and GBM regression also.Though, in general, the individual Methods 2, 5, 3 and 4 show diminishing forecast accuracies in that order compared to the conventional approach in Method 1, the practical advantages gained in the installation and maintenance of the relatively simpler engineering structures of wind measuring mast systems in the field, as outlined in Table 4, make each of these newly proposed alternative methods useful and acceptable to the wind energy industry.This study is constrained within three fixed altitudes of 80 m, 50 m and 20 m based on data availability.Investigations using different combinations of altitudes are suggested to understand the relative performance of the different methods in such cases.In Method 5, more studies by varying the distance between the reference and test locations are also suggested for optimizing the crosslocation process.In addition to the presently employed machine learning models, other models based on betterperforming algorithms, including hybrid models also are likely to improve the forecast results.
The hourly wind speed time series at the altitudes of 80 m, 50 m and 20 m at two different windy locations S1 and S2 in the state of Kerala in India, assessed over two years (2013 and 2014) by employing tubular type wind measuring masts have been exhaustively analyzed in this research.These locations, distanced by 136 km, are detailed with corresponding geographic coordinates and physical conditions in Table1and shown for better visualization on a map in Figure3.Depicting the massive nature of a typical wind measuring mast structure, Figure4shows the 80 m high, tubular-type wind measuring mast used for wind data collection at the location S1, with the inset picture showing a closer view of the wind anemometer fitted at 50 m height.The location S1 is a hilly terrain whereas the location S2 is almost a plain terrain.The wind flow patterns as measured at the height of 80 m at these locations from June 2013 to May 2014 are illustrated in Figure5.The relevant particulars of the instruments used for wind data collection are given in Table2.The five methods involving varying wind mast arrangemernts are investigated for the wind forecasting at the highest height of 80 m in this study by using machine learning modeling and the prediction accuracies are compared.Support Vector Regression (SVR), K-Nearest Neighbours (KNN) regression and Gradient Boosting Machine (GBM) regression models are commonly used in machine learning-based wind forecasting[32].The Support Vector Machine (SVM) algorithm attempts to find a hyperplane in an n-dimensional space that clearly classifies the data points.In SVM-based regression (called SVR), the straight line required to fit the data is referred to as the hyperplane.KNN regression is a simple, non-parametric machine learning regression technique.KNN method predicts new data point values using the concept of "feature similarity".The new point's value depends on how closely it matches the training data.The training dataset identifies a new data point's k nearest neighbours.The projected data point is the average of k-neighbours.KNN calculates the distance between training and testing datasets.Next, the k-nearest neighbours with the k-smallest training data set distances are picked.The final estimate uses a weighted average.Boosting algorithms in classification problems combine several simple models ("weak learners") iteratively, resulting in a "strong learner" with improved prediction accuracy.Mostly, the boosting algorithm consists of weak learners, an additive model and a loss function.The GBM regression algorithm tries to find an additive model capable of minimizing the loss function, thereby improving the regression accuracy.The SVR model has been used in the first phase of this study.The e1071 package in R is used in this work to develop, train and test the models [55].In the subsequent phases of the study, the KNN model and the GBM model are also employed to corroborate the research findings.

Fig. 4 .
Fig. 4. Tubular, 80 m high wind measuring mast used for wind data collection at one of the locations S1; inset picture gives a closer view of the wind anemometer fitted at 50 m height.

Fig. 5 .
Fig. 5.The wind flow patterns at the locations S1 and S2 depicting the mean monthly wind speeds at the height of 80 m from June 2013 to May 2014.

Fig. 6 .Fig. 7 .
Fig. 6.The mutual information of hourly wind speed time series as function of delay.

8 V
. P. et al.: Renew.Energy Environ.Sustain.7,24 (2022)       year prediction horizon in each method is considered.The RMSE values of these forecast profiles over quarter-a-day time segments along the one year test period are plotted against time in respect of the Methods 1 to 5 in Figures8a, 9a, c, e and 10a respectively.Additionally, the percentage-wise distribution of these prediction segments with RMSE values below different limiting thresholds for these methods are plotted in Figures8b, 9b, d, f and 10b.These plots are combined for easy comparison in Figure14a.Apart from these assessments, the RMSE values over the year-long forecast period in the five methods are also determined, which are compared in Figure14c.The straight lines given in Figures8a, 9a, c, e and 10a show the lines of best fit for the scatter plot of RMSE values in the respective cases, which are almost parallel to the time axis in all instances, with near-zero slopes as given in Table

Fig. 8 .
Fig. 8. Distribution of RMSEs of segment (quarter-a-day) wise SVR predictions along the one year test period (on the left side) and percentage-wise distribution of the same coming below different limiting RMSE values (on the right side) for the location S1 using the Method 1.

Fig. 9 .
Fig. 9. Illustrations as in Figure 8, for the location S1 using the Methods 2, 3 and 4 based on SVR predictions.

Fig. 10 .
Fig.10.Illustrations as in Figure8, for the location S1 using the Method 5 based on SVR predictions.

Fig. 11 .
Fig. 11.Block diagram showing (a) training of the model and (b) testing of predictions in Method 2.

Fig. 13 .
Fig. 13.Comparison of performance of the different methods in generating predictions of quarter-a-day segments in the one year test period with RMSE values less than two benchmarks of 1 m/s and 1.5 m/s based on SVR models.

Fig. 14 .
Fig. 14.Percentage of quarter-a-day predictions in the one year test period with RMSE values below different benchmarks in respect of the various methods for (a) Location S1 and (b) Location S2 and RMSE values of predictions with the different methods over the one year test period for (c) locations S1 and (d) location S2 based on SVR models.

Fig. 15 .
Fig. 15.Comparison of performance of the different methods when employing different machine learning models.

Table 1 .
Geographic coordinates of the wind mast locations.

Table 2 .
Details of instrumentation used for wind data collection.

Table 3 .
Slope of best fitting lines of RMSEs of segment (quarter-a-day) wise predictions over the one year test period generated by different methods based on SVR models.

Table 4 .
Listing of the various methods in the increasing order of model error, with brief descriptions of the practical advantages associated with such methods in the erection and upkeep of the physical systems for data collection for wind speed forecasting at the height of 80 m.The Establishment of an 80 m high wind mast at the location of interest is required during the training period of the first one year.Wind speed data are taken at 80 m and 50 m heights during this period for the training of the model.During the subsequent testing periods, a wind mast of 50 m height only needs to be retained.3 Method 5 1.06 Erection and upkeep of an 80 m wind mast at one reference location within the geographical area of the wind regime are required only for the training period of an initial one year.Wind speed data are taken from this wind mast at 80 m and 50 m heights during this period for training the model.This trained model is employed at any other location of interest, where a wind mast of 50 m height only is needed to be erected and maintained.4 Method 3 1.17A wind mast of 50 m height only is required to be established and retained at a location of interest and wind data from anemometers fitted to it at 50 m and 20 m heights are collected, using which the time ahead wind speeds at the higher altitude are predicted.5 Method 4 1.67 A wind mast of 50 m height is to be established and retained for the training period of the first one-year at the location of interest and wind data at 50 m and 20 m heights are to be collected from it during this period.In the subsequent testing periods, a shorter wind mast of 20 m height only needs to be retained.