Yingchun
Niu‡
,
Ali
Heydari‡
,
Wei
Qiu
,
Chao
Guo
,
Yinping
Liu
,
Chunming
Xu
,
Tianhang
Zhou
* and
Quan
Xu
*
State Key Laboratory of Heavy Oil Processing; China University of Petroleum (Beijing), Beijing 102249, China. E-mail: zhouth@cup.edu.cn; xuquan@cup.edu.cn
First published on 8th February 2024
Iron–chromium flow batteries (ICRFBs) are regarded as one of the most promising large-scale energy storage devices with broad application prospects in recent years. However, transitioning from laboratory-scale development to industrial-scale deployment can be a time-consuming process due to the multitude of complex factors that impact ICRFB stack performance. Herein, a data-driven optimization methodology applying active learning, informed by an extensive survey of the literature encompassing diverse experimental conditions, is proposed to enable exceptional precision in predicting ICRFB system performance considering both operation conditions and key materials selection. Specifically, multitask ML models are trained on experimental data with a high prediction accuracy (R2 > 0.92) to link ICRFB properties to energy efficiency, coulombic efficiency, and capacity. We also interpret the ML models based on Shapley additive explanations and extract valuable insights into the importance of descriptors. It is noted that the operation conditions (current density and cycle number) and the electrode type are the most critical descriptors affecting the voltage efficiency and coulombic efficiency while the electrode size strongly affects the capacity. Moreover, active learning is used to explore the most optimized cases considering the highest energy efficiency and capacity. The versatility and robustness of the approach are demonstrated by the successful validation between ML prediction and our experiments of energy efficiency (±0.15%) and capacity (±0.8%). This work not only affords fruitful data-driven insight into the property–performance relationship, but also unveils the explainability of critical properties on the performance of ICRFBs, which accelerates the rational design of next-generation ICRFBs.
The iron–chromium redox flow battery (ICRFB) is recognized as the first true redox flow battery.10,11 It utilizes abundant and low-cost redox-active materials such as ferrous chloride (FeCl2) and chromium chloride (CrCl3), making it a cost-effective energy storage system.12 ICRFBs are highly scalable due to their unique design that separates the energy storage capacity from power output, have a long cycle life compared to many other battery technologies, possess relatively high energy densities among flow battery systems, offer enhanced safety features and have quick response times. Researchers have made significant efforts to enhance the efficiency and optimize the performance of ICRFBs. For example, studies have been conducted by Wang et al. on various electrolyte compositions containing FeCl2, CrCl3, and HCl with different molar masses to understand their electrochemical performance.13 They concluded that with the increase of the concentration of Fe/Cr in the electrolyte, the redox reaction peak current increases. The effect of chelation has also been explored by Waters et al. using Fe diethylenetriaminepentaacetic acid (FeDTPA) and Cr 1,3-diaminopropanetetraacetic acid (CrPDTA) as additives in a KBi electrolyte.14 Investigations into indium chloride (InCl3) addition to improve battery performance have been conducted by Wang and co-workers as well.15 The results showed that In3+ effectively inhibits the HER and accelerates the kinetics of Fe2+/Fe3+ and Cr3+/Cr2+ to a certain extent. Membrane thickness has also been studied, with researchers finding that commercial Nafion membranes of certain thicknesses are more suitable for high current density operation in ICRFBs.16,17 Zhang and co-workers further researched the electrochemical properties of PGF and CF with added BiCl3 catalyst.18 Additionally, studies by Li et al. have focused on improving the electrocatalytic performance of graphite felt electrodes through treatments such as boric acid thermal etching to enhance hydrophilicity.19 After H3BO3 thermal etching, the hydrophilicity of the graphite felt was obviously enhanced.
The performance of an iron–chromium redox flow battery (ICRFB) stack is influenced by factors such as materials, architecture, and operational conditions. Achieving optimal performance requires significant effort and time through experimental optimization of these components. To expedite the research and development (R&D) process and facilitate commercialization, it is crucial to explore innovative methods for accurately predicting the performance of ICRFB stacks and overall systems. Currently, R&D efforts in ICRFB technology rely on a combination of simulation, design, and experimentation to develop efficient stack configurations with suitable materials. However, this conventional approach can be resource-intensive and costly. Furthermore, it can be challenging to precisely determine how different factors impact system performance and cost. To address these challenges specific to ICRFBs, researchers are actively exploring novel techniques that enable accurate prediction of stack performance without relying solely on extensive experimentation. These methods involve computational modeling approaches that simulate various scenarios while optimizing materials selection and stack designs. By employing predictive models tailored for ICRFBs, researchers aim to streamline the R&D process while gaining valuable insights into how different factors influence the system performance and associated costs. This approach holds great potential for accelerating advancements in ICRFB technology towards successful commercialization.
In recent years, machine learning (ML) techniques have emerged as powerful tools for accelerating various aspects of material development,20 chemical synthesis planning,21 catalyst activity tuning,22 and system optimization.23 These ML approaches have demonstrated their ability to efficiently predict and optimize the performance of different battery types.24,25 Despite these advancements in other fields, there has been limited research focusing on applying ML techniques for prediction and optimization specifically in iron–chromium redox flow battery (ICRFB) systems. The prediction of battery efficiency remains challenging due to limited databases and the inherent bias in feature selection. Recently, a promising approach called active learning has emerged as an alternative method for designing and optimizing systems in various domains. Active learning, a subfield of machine learning, involves the iterative selection of unseen data points by surrogate models to improve the predictive capability of these models. This iterative process involves using the previous model, trained based on the observed results, to guide the selection of the next set of experiments. As a result, the gathered data points are utilized iteratively to update and improve the model. Active learning holds great potential for reducing computational costs associated with battery design, while also incorporating and guiding experimental data and procedures.
This article aims at using a data-driven, active-learning model to accurately predict and optimize the performance of ICRFB systems. We use active ML to rapidly design battery characteristics and operation characteristics to improve the performance of ICRFB systems (Fig. 1). The surrogate mode can be used as a guide to evaluate the most and least effective factors in the design and evaluation of ICRFBs. Post hoc analysis of the data and ML models reveals important relationships between specific battery characteristics and battery performance, providing mechanistic insight into how different features can affect the energy efficiency, coulombic efficiency, and capacity of ICRFBs. Overall, this framework will automate and accelerate the optimization of ICRFBs for the transition from laboratory scale development to industrial scale deployment.
Input parameters | Range/category | Units (if any) |
---|---|---|
Flow rate | 12.5–200 | mL min−1 |
Electrode type | CC,26 CF,27 CP,28,29 GF,13,15 SCC,30 TCC,26 TGF,16 TiN-3D GF31 | — |
Electrode size | 4–50 | cm2 |
Membrane type | IEM,13,15 N212,16,26,29–31 N115,16 N117![]() |
— |
Reactant (FeCl2, CrCl3) concentration | 0.5–1.5 | M |
HCl concentration | 1–4 | M |
Catalyst – Bi3+ concentration | 0–5 | mM |
Catalyst – In3+ concentration | 0–15 | mM |
Current density | 40–320 | mA cm−2 |
Cycle number | Measured every 5 steps | — |
The data processing and ML modeling for this study were conducted using Python 3.7, utilizing popular packages such as NumPy, pandas, and sklearn. Initially, the dataset was collected from previous literature. After defining the features and output variables, the dataset was split into train and test subsets. Next, hyperparameters were tested and corrected. To handle the multidimensional nature of the dataset, four different models, namely linear regression, random forest, extra trees regressor and gradient boosting were utilized to select the most accurate model based on the prediction accuracy. Machine learning (ML) is a broad concept that relies on statistical models and data analysis. Upon providing the computer with a rule discovery algorithm and sufficient relevant data, it can uncover previously unknown connections between input and output variables, a process commonly referred to as “training.” Once trained, the model can be utilized to forecast the extent of correlation between various input features and target output variables. After ensuring the accuracy of our models, the gradient boosting model was selected as the most accurate model to calculate the efficiency parameters for each unique combination of features.
To assess the accuracy of our model, we divided the dataset into a training set comprising 75% of the data and a test set containing the remaining 25%. This division allowed us to evaluate how well our model predicted various performance metrics. The evaluation results are presented in Table 2. The coefficient of determination (R2), mean absolute error (MAE) and mean squared error (MSE) for different models were calculated for the test sets as shown in Table 2. These results demonstrate that our ML methodology yields accurate predictions for key performance metrics of ICRFB systems across the datasets. The comparatively higher accuracy values of the gradient boosting model indicate that our model captures and analyzes relationships between system characteristics and performance outcomes more effectively than other models.
Model | R 2 (VE) | MSE (VE) | MAE (VE) | R 2 (CE) | MSE (CE) | MAE (CE) | R 2 (capacity) | MSE (capacity) | MAE (capacity) |
---|---|---|---|---|---|---|---|---|---|
Linear regression | 0.8843 | 2.9735 | 1.3319 | 0.4061 | 1.1773 | 0.8998 | 0.8520 | 0.0321 | 0.1495 |
Extra trees | 0.9764 | 0.5179 | 0.5450 | 0.7989 | 0.4382 | 0.5060 | 0.9890 | 0.0023 | 0.0318 |
Random forest | 0.9608 | 0.8015 | 0.6477 | 0.8642 | 0.2691 | 0.4275 | 0.9854 | 0.0026 | 0.0342 |
Gradient boosting | 0.9859 | 0.3608 | 0.4309 | 0.9212 | 0.1522 | 0.2924 | 0.9940 | 0.0011 | 0.0235 |
Thus, gradient boosting is chosen as the best model for further predictions and optimizations. Fig. 2a illustrates the distribution of collected experimental datasets from the literature resources. The residual plots for gradient boosting model predictions are given in Fig. 2b–d and the training and testing of the models are done based on the same data points. We evaluate the performance of the models based on their coefficient of determination (R2), mean absolute error (MAE) and mean squared error (MSE). The test scores for voltage efficiency, coulombic efficiency and capacity show high accuracy, as indicated in Fig. 2b–d and Table 2. It is also worth noting that the predictions for coulombic efficiency, although satisfying, have lower accuracy in all four utilized models compared to the prediction accuracies for voltage efficiency and capacity.
In our dataset, we determined the mean voltage efficiency, coulombic efficiency, and capacity. We collected training data points that were centered around these mean values. For our first case, we selected training data points that were within 5% of the mean output values, while the remaining points were used for testing. We repeated this process for six more cases, gradually expanding the range of training datasets to 10%, 20%, and 30% of the calculated mean output value. Fig. 2e–g illustrates the absolute error in the model's prediction for both the training and test datasets for the model with 30% training size. Other results are summarized in ESI Fig. S2.† We observed that all three models performed reasonably well in predicting the data points within the training range and the nearby region. However, the error in prediction increased as we moved further away from the training data range. Additionally, we found that the capacity and voltage efficiency prediction models had higher R2 values compared to the coulombic efficiency prediction model, especially for smaller ranges of training data. The R2 values for the coulombic efficiency prediction model became comparable to those of the voltage efficiency or capacity models when the training data size accounted for approximately 25% of the overall dataset (Fig. S3†).
![]() | ||
Fig. 3 (a) Absolute average values of feature SHAP of all the prediction models. The SHAP scores for the features of the gradient boosting model for (b) voltage efficiency, (c) coulombic efficiency and (d) capacity. As depicted in Fig. 3(a–d), current density, cycle number and electrode type have the highest impact on the battery performance. Absolute average values of feature SHAP values of (e) the electrode and (f) membrane types versus voltage efficiency, coulombic efficiency, and capacity, respectively. Accordingly, CC and TCC electrodes and N117 and IEM have the most determining role in battery efficiency. A: current density, B: cycle number, C: electrode type, D: catalyst – Bi3+, E: electrolyte – H+, F: catalyst – In3+, G: membrane, H: electrolyte – Cr3+, I: flow rate, J: electrolyte – Fe2+, and K: electrode size. |
Fig. S5† shows the factors that have a direct or inverse relationship with battery efficiency based on the SHAP values. Accordingly, an increase in electrolyte and catalyst concentration will generally result in enhancement of battery efficiency and performance. This is why an increase in current density will result in an overall decrease of all the performance variables of the battery. The effect of utilizing each electrode is also depicted in Fig. S7.† While TCC can be considered as an electrode that simultaneously enhances all the performance variables, TiN-3D GF has a negative influence on all the performance parameters, leading to less efficiency. The rest of the electrode types do not possess universal influence on battery performance and their function must be separately investigated for each performance parameter.
Determining the appropriate stopping criteria for an iterative process, such as active learning, is crucial in optimization algorithms. Typically, the process is terminated when the desired outcome of the target property has been achieved or when the computational budget has been exhausted.34 In this study, the number of iterations was set to 10, and the desired efficiencies were successfully obtained within this limited number of iterations. Given our theoretical understanding of the problem, it is known that the maximum voltage and coulombic efficiency cannot exceed 100 percent. Therefore, the desired solution was considered to be slightly less than this maximum value. By setting a predefined number of iterations and considering the theoretical limits of the efficiency parameters, the active learning process was effectively terminated once the desired efficiencies were obtained within the specified constraints. Detailed information can be found in the ESI.† Based on the analysis of Fig. 4a, after 10 iterations, the optimum values were observed to be approximately 88% for voltage efficiency, approximately 98% for coulombic efficiency, and approximately 1.75 A h for capacity. These values indicate the desired performance levels achieved by the model after the active learning process. Notably, voltage efficiency and capacity require around 4 or 5 iterations to achieve noticeable optimization. On the other hand, coulombic efficiency demonstrates a distinct behavior, necessitating at least 8 iterations to exhibit a substantial increase that can be considered as an optimized efficiency value. This disparity in convergence rates among the efficiency parameters highlights the differing optimization requirements and sensitivities of the battery system.
In order to assess the reliability of the proposed model, TCC and SCC electrodes from the ML mode (Fig. 4b, indicated by pink and blue star symbols, respectively) were selected for validation due to accessibility in our lab. The prepared two electrodes were analyzed by XPS. As shown in Fig. 4(c) and (e), the C 1s spectrum is resolved into five peaks at 284.7, 285.3, 286.5, and 289 eV, representing graphite carbon, defect carbon, C–O, and O–CO, respectively. As shown in Fig. 4(d) and (f), the O 1s spectra of the two samples consist of three peaks attributed to C
O (531.8 eV), C–OH (532.3 eV), and H–OH (533.2 eV). In order to analyze the pore size distribution of the two electrode materials, BET (Brunauer–Emmett–Teller) testing was conducted to confirm the defect structures of TCC and SCC electrodes. The adsorption/desorption isotherms of TCC and SCC electrodes shown in Fig. 4(g) are typical type IV isotherms, indicating that the surfaces of the two electrode materials have high adsorption capacity. At the same time, the BET surface area of the TCC electrode was also found to be 0.6637 m2 g−1, and the BET surface area of the SCC electrode was 0.7052 m2 g−1. Its sufficient specific surface area provides abundant reaction sites for the electrode.
In the liquid flow battery testing system used in this study, the concentrations of iron ions, chromium ions, and acid are 1.2 M, 1.4 M, and 2.5 M, respectively. Additional 0.001 M bismuth ions will be added as a catalyst. Carbon cloth electrodes measuring 5 cm long and 2 cm wide are used as the positive and negative electrodes of the battery. Nafion 212 membrane was used as the separator between the positive and negative electrolytes. The electrolyte volume in both the positive and negative slots is maintained at 80 mL. During battery testing, the circulating flow rates of the positive and negative electrolytes were set to 20 mL min−1. The performance of ICRFB was measured using a potentiostat/galvanostat. The battery operates at a current density of 140 mA cm−2 in constant current mode. To prevent overcharging and over-discharging during the testing process, the upper and lower voltage limits are set to 1.2 V and 0.7 V, respectively. Fifty charging and discharging cycles were performed. Keeping the above conditions constant, experiments were conducted using electrodes modified with TCC and SCC, respectively. The test results are shown in Fig. 4(h) and (i). For TCC electrodes, the optimization results from the ML model are an energy efficiency of 82.69% (experiment: 82.73%) and a capacity of 1.769 A h (experiment: 1.755 A h). For SCC electrodes, the optimization results from the ML model are an energy efficiency of 83.31% (experiment: 83.15%) and a capacity of 1.785 A h (experiment: 1.776 A h). The experimental results demonstrate astonishing agreement with the predicted efficiency values which further validates the developed model to be used for efficiency prediction and optimization of iron–chromium redox flow batteries in the future. However, due to limitations in experimental equipment, we failed to test the most optimized cases explored by the active learning approach. Therefore, those cases are supposed to be tested in future research investigations.
This work paves the way for extensive future studies in which the methodology can be extrapolated to a range of other variables. This extension could include the addition of state of charge parameters, additional electrochemical behaviors and even cost analysis to provide a more rounded view of the potential for ICRFB deployment. Furthermore, the flexibility of the framework suggests that it could be adapted to investigate other battery chemistries, different electrode and membrane configurations, or a wider range of operating conditions. The application of ML not only enhances the ability to identify complex patterns within rich datasets, but also sets the stage for molecular and structural-level simulations that could elucidate the mechanisms underlying battery stability and longevity in the future. Encouragingly, this research suggests a potential shift away from purely empirical paradigms towards one that integrates predictive modelling as a core component of battery development. A more targeted experimental compass, informed by the predictive power of ML models, could dramatically streamline the path to discovering high-performance materials, saving both time and resources in the pursuit of advanced energy storage solutions.
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d3nr06578b |
‡ These authors contributed equally to this work. |
This journal is © The Royal Society of Chemistry 2024 |