ass日本风韵熟妇pics男人扒开女人屁屁桶到爽|扒开胸露出奶头亲吻视频|邻居少妇的诱惑|人人妻在线播放|日日摸夜夜摸狠狠摸婷婷|制服 丝袜 人妻|激情熟妇中文字幕|看黄色欧美特一级|日本av人妻系列|高潮对白av,丰满岳妇乱熟妇之荡,日本丰满熟妇乱又伦,日韩欧美一区二区三区在线

基于文本數(shù)據(jù)增強的中文水稻育種問句命名實體識別
CSTR:
作者:
作者單位:

作者簡介:

通訊作者:

中圖分類號:

基金項目:

國家自然科學基金項目(62303472)


Named Entity Recognition in Chinese Rice Breeding Questions Based on Text Data Augmentation
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 圖/表
  • |
  • 訪問統(tǒng)計
  • |
  • 參考文獻
  • |
  • 相似文獻
  • |
  • 引證文獻
  • |
  • 資源附件
  • |
  • 文章評論
    摘要:

    針對現(xiàn)有水稻育種問答系統(tǒng)存在數(shù)據(jù)管理水平低,、知識粒度大,水稻育種領(lǐng)域缺乏用于命名實體識別的標注數(shù)據(jù),、人工標注成本高等問題,,提出了一種基于文本數(shù)據(jù)增強的方法來識別水稻育種問句的命名實體,通過構(gòu)建水稻育種知識圖譜,,對水稻育種問句中的大類命名實體進行分類,,從而增強實體邊界,降低知識粒度,。針對水稻育種數(shù)據(jù)標注成本高導致命名實體識別性能不佳的難點,,通過在BERT-BILSTM-CRF模型中引入數(shù)據(jù)增強層,提出了DA-BERT-BILSTM-CRF模型,。實驗以標注的水稻育種問句為訓練數(shù)據(jù),,將所提出的模型與其他基線模型進行比較。結(jié)果表明,,本文方法在水稻育種問句中命名實體識別的單類別識別任務和整體識別任務上均優(yōu)于其他方法,,其中單類別識別精確率達到94.26%,F(xiàn)1值達到93.32%,;整體識別精確率達到93.86%,,F(xiàn)1值達到93.34%。

    Abstract:

    Issues of low-level data management and high knowledge granularity exist in current rice breeding question answering systems. In addition, there is a lack of publicly available labeled data for named entity recognition in rice breeding, and manual annotation can be costly. To address these issues, an approach based on text data augmentation to the named entity recognition was proposed for rice breeding questions. The rice breeding knowledge graph was created to assist in subdividing larger named entity categories in rice breeding, such as rice characteristics entities, into smaller subcategories, such as resistance to abiotic stress and eating quality. It helped to enhance entity boundaries and reduce knowledge granularity. Responding to the challenge of high annotation costs for rice breeding data that results in suboptimal performance in named entity recognition, the DA-BERT-BILSTM-CRF model was presented by introducing a data augmentation layer into the BERT-BILSTM-CRF model. Using manually labeled rice breeding questions as training data, the proposed model was compared with three other baseline models. In the overall named entity recognition experiment under the small class entity division, the model achieved a precision of 93.86%, a recall of 92.82%, and an F1 score of 93.34%. Compared with the best-performing BERT-BILSTM-CRF model among the three baseline models, the model outperformed by 4.98, 5.3 and 5.15 percentages points, respectively. Meanwhile, it also performed better in the single-entity recognition metric, achieving a precision of 94.26% and an F1 score of 93.32%. The experiments showed that the proposed approach performed better in both overall named entity recognition and single-class named entity recognition tasks in rice breeding questions.

    參考文獻
    相似文獻
    引證文獻
引用本文

牛培宇,侯琛.基于文本數(shù)據(jù)增強的中文水稻育種問句命名實體識別[J].農(nóng)業(yè)機械學報,2024,55(8):333-343. NIU Peiyu, HOU Chen. Named Entity Recognition in Chinese Rice Breeding Questions Based on Text Data Augmentation[J]. Transactions of the Chinese Society for Agricultural Machinery,2024,55(8):333-343.

復制
分享
文章指標
  • 點擊次數(shù):
  • 下載次數(shù):
  • HTML閱讀次數(shù):
  • 引用次數(shù):
歷史
  • 收稿日期:2023-12-07
  • 最后修改日期:
  • 錄用日期:
  • 在線發(fā)布日期: 2024-08-10
  • 出版日期:
文章二維碼