水力发电学报
            首 页   |   期刊介绍   |   编委会   |   投稿须知   |   下载中心   |   联系我们   |   学术规范   |   编辑部公告   |   English

水力发电学报

• •    

水工混凝土材料不可编辑文本智能解译方法研究

  

  • 出版日期:2024-04-24 发布日期:2024-04-24

Intelligent interpretation method for non-editable text of hydraulic concrete materials

  • Online:2024-04-24 Published:2024-04-24

摘要: 在水电工程建设过程中,产生了大量不可编辑的水工混凝土材料文档,采用人工解译的方法获取文本费时费力且精度不可控,难以满足材料数据信息化管理的需求。为此,本文提出了面向水工混凝土材料不可编辑文本的智能解译方法。首先,构建了基于像素级分割的文本检测模型HC-PSENet,融合PP-HGNet主干网络实现文本行的精确检测。进一步,基于领域知识创建专业语料库以获取字符的准确映射,以检测文本框和专业语料库为输入,建立了水工混凝土材料文本识别模型HC-CRNN,采用ResNet主干网络和改进损失函数C-CTC Loss提高字符分类准确性。最后,以自制数据集为例,引入迁移学习策略训练模型,通过消融、对比实验验证了方法的有效性和优越性。结果表明,本文提出的方法检测文本区域的调和平均数为0.985,识别文本的准确率达到90.62%,综合性能均优于经典方法,以期为混凝土材料不可编辑资源的自动化再利用提供新的技术手段。

Abstract: In the process of hydropower engineering construction, a large number of non-editable documents for hydraulic concrete materials are generated. Using manual interpretation methods to obtain texts is time-consuming, laborious, and the uncontrollable in accuracy, making it difficult to meet the demand for information management of material data. To address this issue, this paper proposes an intelligent interpretation method for non-editable texts of hydraulic concrete materials. Firstly, a text detection model called HC-PSENet based pixel level segmentation is constructed, which integrates the backbone network of PP-HGNet to achieve accurate detection of text lines. Furthermore, a professional corpus based on domain knowledge is created to obtain accurate character mapping. The text recognition model named HC-CRNN for hydraulic concrete materials is established using detection text boxes and professional corpus as inputs. The backbone network of ResNet and improved loss function C-CTC Loss are used to improve the accuracy of character classification. Finally, taking the self-made dataset as an example, the transfer learning strategy is introduced to train the model. The effectiveness and superiority of the proposed method are verified through ablation and comparative experiments. The results show that the proposed method has a harmonic mean of 0.985 for detecting text regions and the accuracy of text recognition reaches 90.62%. Its overall performance is superior to classical methods, aiming to provide new technical means for the automated reuse of non-editable resources in concrete materials.

京ICP备13015787号-3
版权所有 © 2013《水力发电学报》编辑部
编辑部地址:中国北京清华大学水电工程系 邮政编码:100084 电话:010-62783813
本系统由北京玛格泰克科技发展有限公司设计开发  技术支持:support@magtech.com.cn