Automating Cross-lingual Science Journalism (CSJ) aims to generate popular science summaries from English scientific texts for non-expert readers in their local language. We introduce CSJ as a downstream task of text simplification and cross-lingual scientific summarization to facilitate science journalists’ work. We analyze the performance of possible existing solutions as baselines for the CSJ task. Based on these findings, we propose to combine the three components - SELECT, SIMPLIFY and REWRITE (SSR) to produce cross-lingual simplified science summaries for non-expert readers. Our empirical evaluation on the WIKIPEDIA dataset shows that SSR significantly outperforms the baselines for the CSJ task and can serve as a strong baseline for future work. We also perform an ablation study investigating the impact of individual components of SSR. Further, we analyze the performance of SSR on a high-quality, real-world CSJ dataset with human evaluation and in-depth analysis, demonstrating the superior performance of SSR for CSJ.
SEEK ID: https://publications.h-its.org/publications/1678
Research Groups: Natural Language Processing
Publication type: InProceedings
Citation: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics, Toronto, Ontario, Canada, July 2023, pp. 1843-1861
Date Published: 8th Jul 2023
URL: https://aclanthology.org/2023.acl-long.103.pdf
Registered Mode: manually
Views: 2431
Created: 31st May 2023 at 10:45
Last updated: 5th Mar 2024 at 21:25
This item has not yet been tagged.
None