基于小样本数据集的煤层顶板突水溃砂危险性预测

Predicting the risk of water-sand inrushes from coal seam roofs based on a small-size sample dataset

  • 摘要:
    目的 我国华东、华北地区松散层厚度大、基岩薄,突水溃砂事故频发,实现顶板突水溃砂危险性精准预测对保障煤矿安全生产意义重大。但突水溃砂致灾机理极为复杂,涉及多因素耦合作用。现场实测面临高风险、高成本等问题,导致数据获取困难,样本量严重不足,制约了传统预测模型的精度与性能,探索适用于小样本场景的有效预测方法迫在眉睫。
    方法 梳理分析近松散层工作面现场实测数据与历史案例,确定底部含水层厚度、基岩厚度等11个影响因素,构建原始样本数据集。运用斯皮尔曼相关性分析揭示各因素的内在联系及相关性;基于条件表格生成对抗网络(CTGAN)、探测粒子群优化算法(DPSO)、随机森林算法(RF)构建突水溃砂危险性预测模型(CTGAN−DPSO−RF),探讨CTGAN合成数据的质量,并与DPSO−SVM、DPSO−XGBoost模型进行对比,最后结合工程实例验证模型有效性。
    结果和结论 11个突水溃砂影响因素中,垮落带高度与采高相关性最大,相关系数为0.93;松散层底部含水层水压与导水裂隙带发育高度相关性最小。CTGAN合成数据与原始数据高度相似,综合质量分数达85.03%;DPSO寻优后最优适应度为0.9265,优于PSO算法;CTGAN−DPSO−RF模型测试集ACPWRWF1W均达到1,全面优于对比模型,工作面预测结果与实际开采情况一致,该模型通过合成高质量数据扩充样本集、优化超参数,有效解决小样本下传统模型精度低、性能差的问题,为厚松散层薄基岩条件下煤层顶板突水溃砂危险性预测提供了新方法。

     

    Abstract:
    Objective The eastern and northern regions of China exhibit thick unconsolidated layers and thin bedrocks, with water-sand inrush disasters occurring frequently. Therefore, accurately predicting the risk of water-sand inrushes from coal seam roofs holds great significance for safe coal mining. However, the water-sand inrushes in these regions exhibit complex disaster-causing mechanisms, which involve the coupling effects of multiple factors. Accordingly, the real-time prediction of the water-sand inrush risk in the field poses challenges including high risk and cost. These issues lead to difficult data acquisition and severely insufficient samples, limiting the accuracy and performance of traditional prediction models. Therefore, there is an urgent need to explore effective prediction methods suitable for a small sample size.
    Methods Based on a review and analysis of the measured field data and historical cases of mining faces adjacent to unconsolidated layers, this study determined 11 factors influencing water-sand inrushes (e.g., the thickness of aquifers at the unconsolidated-layer bottom and bedrock thickness) and constructed the original sample dataset. Subsequently, the intrinsic relationships and correlations among various influencing factors were discovered using Spearman correlation. A risk prediction model for water-sand inrushes, termed CTGAN-DPSO-RF, was developed based on conditional tabular generative adversarial networks (CTGAN), detecting particle swarm optimization (DPSO) algorithm, and random forest (RF) algorithm. The quality of data synthesized using CTGAN was explored. Furthermore, the effectiveness of the CTGAN-DPSO-RF model was validated through comparison with the DPSO-SVM and DPSO-XGBoost models, along with two engineering cases.
    Results and Conclusions Among the 11 influencing factors, the caving zone height exhibited the strongest correlation with mining height (correlation coefficient: 0.93), while the water pressure of aquifers at the unconsolidated-layer bottom presented the weakest correlation with the height of the hydraulically conductive fracture zone. The data synthesized using CTGAN highly resembled the original data, with a comprehensive quality score reaching up to 85.03%. The DPSO algorithm yielded an optimal fitness of 0.9265 after the hyperparameter tuning, outperforming the particle swarm optimization (PSO) algorithm. The CTGAN-DPSO-RF model yielded accuracy (Ac), weighted precision (Pw), weighted recall (Rw), and weighted F1-score (F1w) consistently reaching 1.0 on the test set, outperforming its counterparts. The risk prediction results of two mining faces derived using the proposed model were consistent with actual mining conditions. By synthesizing high-quality data to expand the sample dataset and optimizing hyperparameters, the proposed model effectively overcomes the low accuracy and poor performance of traditional models under a small sample size, providing a new method for predicting the risk of water-sand inrushes from coal seam roofs under conditions of thick unconsolidated layers and thin bedrocks.

     

/

返回文章
返回