论文标题
朝着社区问题搜索抵押贷款官员
Towards Semantic Search for Community Question Answering for Mortgage Officers
论文作者
论文摘要
社区问题回答(CQA)在许多领域中的普及越来越多。抵押是一个复杂而充满活力的行业,灵活而有效的CQA平台可以显着提高抵押贷款官员的服务质量。我们已经根据最新的自然语言处理(NLP)技术建立了一个动态的CQA平台,它具有最先进的语义搜索引擎,以动态,集体地捕获和转移对经验丰富的劳动力的成熟和部落知识。搜索引擎允许关键字和自然语言查询,并基于通过TF-IDF Vectorizer线性组成的微调域调整句子编码器,并与BM25矢量器融合在一起。域的适应和微调基于公开可用的抵押贷款公司。使用标准信息检索指标(例如归一化折扣累积增益(NDCG),n处的精度/召回率,平均相互等级和平均平均精度(MAP)等标准信息检索指标,在内部注释的数据集上进行评估。结果表明,与传统搜索技术相比,我们的混合,微调,适应域的搜索引擎是一种更有效的方法来响应抵押贷款官员的信息需求。我们的目标是在不久的将来发布内部注销的评估和培训数据集。
Community Question Answering (CQA) has gained increasing popularity in many domains. Mortgage is a complex and dynamic industry, and a flexible and efficient CQA platform can potentially enhance the quality of service for mortgage officers significantly. We have built a dynamic CQA platform with a state of the art semantic search engine based on recent Natural Language Processing (NLP) techniques to dynamically and collectively capture and transfer the maturity and tribal knowledge of the more experienced workforce to less experienced ones. The search engine allows for both keyword and natural language queries and is based on a fine-tuned domain-adapted Sentence-BERT encoder linearly composed with a TF-IDF vectorizer, and reciprocal-rank fused with a BM25 vectorizer. Domain adaptation and fine-tuning is based on publicly available mortgage corpora. Evaluation is performed on an internally annotated dataset using standard information retrieval metrics such as normalized discounted cumulative gain (nDCG), precision/recall at n, mean reciprocal rank, and mean average precision (MAP). The results indicate that our hybrid, fine-tuned, domain-adapted search engine is a more effective approach in responding to the information needs of our mortgage officers compared to traditional search techniques. We aim to publish the internally-annotated evaluation and training datasets in the near future.