Proceedings of the 37th AAAI Conference on Artificial Intelligence (AAAI-23), Main Track
Abstractive Summarization of Biomedical Papers via Semantic Parsing Graphs and Consistency Rewards
CogitoErgoSumm
Abstractive Summarization of Biomedical Papers via Semantic Parsing Graphs and Consistency Rewards
Giacomo Frisoni, Paolo Italiani, Stefano Salvatori, Gianluca Moro
Proceedings of the 37th AAAI Conference on Artificial Intelligence (AAAI-23), Main Track
Description
The automatic synthesis of biomedical publications catalyzes a profound research interest elicited by literature congestion. Current sequence-to-sequence models mainly rely on the lexical surface and seldom consider the deep semantic interconnections between the entities mentioned in the source document. Such superficiality translates into fabricated, poorly informative, redundant, and near-extractive summaries that severely restrict their real-world application in biomedicine, where the specialized jargon and the convoluted facts further emphasize task complexity. To fill this gap, we argue that the summarizer should acquire semantic interpretation over input, exploiting structured and unambiguous representations to capture and conserve the most relevant parts of the text content. This paper presents CogitoErgoSumm, the first framework for biomedical abstractive summarization equipping large pre-trained language models with rich semantic graphs. Precisely, we infuse graphs from two complementary semantic parsing techniques with different goals and granularities—Event Extraction and Abstract Meaning Representation, also designing a reward signal to maximize information content preservation through reinforcement learning. Extensive quantitative and qualitative evaluations on the CDSR dataset show that our solution achieves competitive performance according to multiple metrics, despite using 2.5x fewer parameters. Results and ablation studies indicate that our joint text-graph model generates more enlightening, readable, and consistent summaries.
Keywords: abstractive summarization, semantic parsing, event extraction, abstract meaning representation, biomedical text mining, knowledge injection.
If you use CogitoErgoSumm
in your research, please cite CogitoErgoSumm: Abstractive Summarization of Biomedical Papers via Semantic Parsing Graphs and Consistency Rewards.
@inproceedings{frisoni-etal-2023-cogitoergosumm,
title = {Cogito Ergo Summ: Abstractive Summarization of Biomedical Papers via Semantic Parsing Graphs and Consistency Rewards},
author = "Frisoni, Giacomo and
Italiani, Paolo and
Salvatori, Stefano and
Moro, Gianluca",
booktitle = {Thirty-Seventh {AAAI} Conference on Artificial Intelligence, {AAAI}
2023, Washington, DC, USA, February 7-14,
2023},
month = feb,
year = "2023",
address = "Washington, DC, USA",
publisher = {AAAI} Press,
abstract = "The automatic synthesis of biomedical publications catalyzes a profound research interest elicited by literature congestion. Current sequence-to-sequence models mainly rely on the lexical surface and seldom consider the deep semantic interconnections between the entities mentioned in the source document. Such superficiality translates into fabricated, poorly informative, redundant, and near-extractive summaries that severely restrict their real-world application in biomedicine, where the specialized jargon and the convoluted facts further emphasize task complexity. To fill this gap, we argue that the summarizer should acquire semantic interpretation over input, exploiting structured and unambiguous representations to capture and conserve the most relevant parts of the text content. This paper presents CogitoErgoSumm, the first framework for biomedical abstractive summarization equipping large pre-trained language models with rich semantic graphs. Precisely, we infuse graphs from two complementary semantic parsing techniques with different goals and granularities—Event Extraction and Abstract Meaning Representation, also designing a reward signal to maximize information content preservation through reinforcement learning. Extensive quantitative and qualitative evaluations on the CDSR dataset show that our solution achieves competitive performance according to multiple metrics, despite using 2.5x fewer parameters. Results and ablation studies indicate that our joint text-graph model generates more enlightening, readable, and consistent summaries."
}