Cross-lingual Science Journalism: Select, Simplify and Rewrite Summaries for Non-expert Readers

Abstract:

Automating Cross-lingual Science Journalism (CSJ) aims to generate popular science summaries from English scientific texts for non-expert readers in their local language. We introduce CSJ as a downstream task of text simplification and cross-lingual scientific summarization to facilitate science journalists’ work. We analyze the performance of possible existing solutions as baselines for the CSJ task. Based on these findings, we propose to combine the three components - SELECT, SIMPLIFY and REWRITE (SSR) to produce cross-lingual simplified science summaries for non-expert readers. Our empirical evaluation on the WIKIPEDIA dataset shows that SSR significantly outperforms the baselines for the CSJ task and can serve as a strong baseline for future work. We also perform an ablation study investigating the impact of individual components of SSR. Further, we analyze the performance of SSR on a high-quality, real-world CSJ dataset with human evaluation and in-depth analysis, demonstrating the superior performance of SSR for CSJ.

SEEK ID: https://publications.h-its.org/publications/1678

Research Groups: Natural Language Processing

Publication type: InProceedings

Citation: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics, Toronto, Ontario, Canada, July 2023, pp. 1843-1861

Date Published: 8th Jul 2023

URL: https://aclanthology.org/2023.acl-long.103.pdf

Registered Mode: manually

Authors: Mehwish Fatima, Michael Strube

help Submitter
Activity

Views: 1302

Created: 31st May 2023 at 10:45

Last updated: 5th Mar 2024 at 21:25

help Tags

This item has not yet been tagged.

help Attributions

None

Powered by
(v.1.14.2)
Copyright © 2008 - 2023 The University of Manchester and HITS gGmbH