Publications

What is a Publication?

1702 Publications visible to you, out of a total of 1702

Jet formation in stellar interactions

Physics of Stellar Objects

Abstract

Not specified

Author: Friedrich Röpke

Date Published: 1st Jul 2024

Publication Type: InProceedings

Citation: In EAS2024, European Astronomical Society Annual Meeting, p. 2451

Created: 18th Feb 2025 at 14:16

Computational Approaches for Integrating out Subjectivity in Cognate Synonym Selection

Computational Molecular Evolution

Abstract (Expand)

Working with cognate data involves handling synonyms, that is, multiple words that describe the same concept in a language. In the early days of language phylogenetics it was recommended to select one …

Authors: Luise Häuser, Gerhard Jäger, Alexandros Stamatakis

Date Published: 28th Jun 2024

Publication Type: Proceedings

Citation: Luise Häuser, Gerhard Jäger, and Alexandros Stamatakis. 2024. Computational Approaches for Integrating out Subjectivity in Cognate Synonym Selection. In Proceedings of the Society for Computation in Linguistics 2024, pages 162–172, Irvine, CA. Association for Computational Linguistics.

Created: 9th Jan 2025 at 10:35, Last updated: 9th Jan 2025 at 10:35

Structural bioinformatics approach to exploring transient binding pockets in proteins, (Bachelor's Thesis), Biochemistry, Faculty of Biosciences and Faculty of Chemistry and Earth Sciences, Ruprecht-Karls University, Heidelberg

Molecular and Cellular Modeling

Abstract

Not specified

Editor:

Date Published: 21st Jun 2024

Publication Type: Bachelor's Thesis

Citation:

Created: 15th Oct 2024 at 13:28

Brief Announcement: (Near) Zero-Overhead C++ Bindings for MPI

Computational Molecular Evolution

Abstract (Expand)

The Message-Passing Interface (MPI) and C++ form the backbone of high-performance computing and algorithmic research in the field of distributed-memory computing, but MPI only provides C and Fortran …

Authors: Kunal Agrawal, Erez Petrank, Demian Hespe, Lukas Hübner, Florian Kurpicz, Peter Sanders, Matthias Schimek, Daniel Seemaier, Tim Niklas Uhl

Date Published: 17th Jun 2024

Publication Type: Proceedings

DOI: 10.1145/3626183.3660260

Citation: Proceedings of the 36th ACM Symposium on Parallelism in Algorithms and Architectures,pp.289-291,ACM

Created: 9th Jan 2025 at 10:19, Last updated: 25th Feb 2025 at 09:20

What causes the failure of explicit to implicit discourse relation recognition?

Natural Language Processing

Abstract

Not specified

Authors: Wei Liu, Stephen Wan, Michael Strube

Date Published: 16th Jun 2024

Publication Type: InProceedings

Citation: Proceedings of the 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Mexico City, Mexico, June 16-21, 2024, pp.2738-2753

Created: 17th Apr 2024 at 15:26, Last updated: 17th Feb 2025 at 12:48

Increasing Detection Rate for Imbalanced Malicious Traffic using Generative Adversarial Networks

Data Mining and Uncertainty Quantification

Abstract

Not specified

Authors: Pascal Memmesheimer, Stefan Machmeier, Vincent Heuveline

Date Published: 5th Jun 2024

Publication Type: Journal

DOI: 10.1145/3655693.3655703

Citation: European Interdisciplinary Cybersecurity Conference,pp.74-81,ACM

Created: 30th Jan 2025 at 15:13

Faster Sorting of Aligned DNA-Read Files

Computational Molecular Evolution

Abstract (Expand)

In the analysis of DNA sequencing data for finding disease causing mutations, to understand evolutionary relationships between species, and to find variants, DNA-Reads are compared to a reference genome. … A reference genome is a representative example for a set of genes of a species. Sorting these aligned DNA-Reads by their position within the reference sequence is a crucial step in many of these downstream analyses. SAMtools sort, a widely used tool, performs external memory sorting of aligned DNA-Reads stored in the BAM format (Binary Alignment Map). This format allows for compressed storage of alignment data. SAMtools sort provides the most comprehensive set of features while exhibiting demonstrably faster execution times than its open source alternatives. In this work, we analyze SAMtools sort for sorting BAM files and propose methods to reduce its runtime. We divide the analysis into three parts: management of temporary files, compression, and input/output (IO). For the management of temporary files, we find that the maximum number of temporary files SAMtools sort can open concurrently is lower than the maximum number of open files permitted by the operating system. This results in an unnecessarily high number of merges of temporary files into larger temporary files, introducing overhead as SAMtools sort performs extra write and compression operations. To overcome this, we propose a dynamic limit for the number of temporary files, adapting to the operating system’s soft limit for open files. For compression, we test seven different libraries for compatible compression and a range of compression levels, identifying options that offer faster compression and result in a speedup of up to five times in single-threaded execution of SAMtools sort. For IO, we demonstrate that a minimal level of compression avoids IO overhead, thereby reducing the runtime of SAMtools sort compared to uncompressed output. However, we also show that uncompressed output can be used in the pipelining of SAMtools commands to reduce the runtime of subsequent SAMtools commands. Our proposed modifications to SAMtools sort and user behavior have the potential to achieve speedups of up to 6. This represents an important contribution to the field of bioinformatics, considering the widespread adoption of SAMtools sort evidenced by its over 5,000 citations and over 5.1 million downloads through Bioconda.

Authors: Dominik Siebelt, Lukas Hübner, Alexandros Stamatakis

Date Published: 3rd Jun 2024

Publication Type: Bachelor's Thesis

Citation:

Created: 9th Jan 2025 at 13:05

Publications

Filters ×

Filters