Automatic Summarization of Technical Documents in the Oil and Gas Industry

Joao Marcos Correia Marques, Fabio Gagliardi Cozman, Ismael Humberto Ferreira Dos Santos

Research output: Contribution to conference (Conference Paper)

2 Scopus citations

Abstract

© 2019 IEEE. We address extractive summarization of technical documents in the oil and gas industry, a major and urgent task due to the large volume of critical reports in that industry. We examine five distinct state-of-the-art extractive algorithms; to assess performance, a new open dataset was created using the open access Journal of Petroleum Exploration and Production Technology (JPEPT). Abstracts for papers in this journal were used as ground truths for summarization. Algorithms were refined to work with these documents in the best possible way. Our most effective algorithm achieved a state-of-the-art ROUGE-2 score of 0.123, taking 83 minutes to summarize the entire JPEPT dataset.
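The ROUGE-2 figure quoted in the abstract measures bigram overlap between a generated summary and its reference (here, the paper's abstract). The following is a minimal sketch of that metric, not the authors' evaluation code. It assumes lowercased whitespace tokenization with no stemming or stopword removal, and the function names `bigrams` and `rouge_2` are hypothetical. The abstract does not say whether the reported 0.123 is recall, precision, or F1, so the sketch returns all three.

```python
from collections import Counter


def bigrams(text):
    """Count adjacent token pairs, using lowercased whitespace tokens (an assumption)."""
    tokens = text.lower().split()
    return Counter(zip(tokens, tokens[1:]))


def rouge_2(candidate, reference):
    """Return ROUGE-2 precision, recall and F1 of a candidate summary against one reference."""
    cand = bigrams(candidate)
    ref = bigrams(reference)
    # Clipped overlap: each bigram counts at most as often as it occurs in both texts.
    overlap = sum((cand & ref).values())
    precision = overlap / sum(cand.values()) if cand else 0.0
    recall = overlap / sum(ref.values()) if ref else 0.0
    f1 = 2 * precision * recall / (precision + recall) if overlap else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


if __name__ == "__main__":
    # Toy sentences for illustration only; they are not drawn from the JPEPT dataset.
    summary = "the well was drilled in deep water"
    abstract = "the well was drilled offshore in deep water"
    print(rouge_2(summary, abstract))
```

In the toy example, 5 of the 7 reference bigrams appear in the candidate, so recall is 5/7 and precision is 5/6. Published ROUGE implementations typically add stemming and other preprocessing options, so scores from this sketch are not directly comparable to the reported value.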
Original language: American English
Pages: 431-436
Number of pages: 6
State: Published - 1 Oct 2019
Externally published: Yes
Event: Proceedings - 2019 Brazilian Conference on Intelligent Systems, BRACIS 2019
Duration: 1 Oct 2019 → …

Conference

Conference: Proceedings - 2019 Brazilian Conference on Intelligent Systems, BRACIS 2019
Period: 1/10/19 → …
