https://socictopen.socict.org/files/original/12445b84d563e2566607d9e86d156580.pdf e8090fabe0ae58038f416bb772fb7928 Dublin Core The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/. Title A name given to the resource Coronavirus Description An account of the resource Dominio científico: Coronavirus Text A resource consisting primarily of words for reading. Examples include books, letters, dissertations, poems, newspapers, articles, archives of mailing lists. Note that facsimiles or images of texts are still of the genre Text. Dublin Core The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/. Title A name given to the resource Ribosome signatures aid bacterial translation initiation site identification Creator An entity primarily responsible for making the resource Adam Giess, Veronique Jonckheere, Elvis Ndah, Katarzyna ChyŻyńska, Petra Van Damme, Eivind Valen Description An account of the resource Abstract Background While methods for annotation of genes are increasingly reliable, the exact identification of translation initiation sites remains a challenging problem. Since the N-termini of proteins often contain regulatory and targeting information, developing a robust method for start site identification is crucial. Ribosome profiling reads show distinct patterns of read length distributions around translation initiation sites. These patterns are typically lost in standard ribosome profiling analysis pipelines, when reads from footprints are adjusted to determine the specific codon being translated. Results Utilising these signatures in combination with nucleotide sequence information, we build a model capable of predicting translation initiation sites and demonstrate its high accuracy using N-terminal proteomics. Applying this to prokaryotic translatomes, we re-annotate translation initiation sites and provide evidence of N-terminal truncations and extensions of previously annotated coding sequences. These re-annotations are supported by the presence of structural and sequence-based features next to N-terminal peptide evidence. Finally, our model identifies 61 novel genes previously undiscovered in the Salmonella enterica genome. Conclusions Signatures within ribosome profiling read length distributions can be used in combination with nucleotide sequence information to provide accurate genome-wide identification of translation initiation sites. Date A point or period of time associated with an event in the lifecycle of the resource 2017 Subject The topic of the resource ribosome profiling, bacterial translation initiation, machine learning, N-terminal proteomics, proteo-genomics Identifier An unambiguous reference to the resource within a given context DOI: 10.1186/s12915-017-0416-0 Source A related resource from which the described resource is derived BMC Biology Publisher An entity responsible for making the resource available BMC Coverage The spatial or temporal topic of the resource, the spatial applicability of the resource, or the jurisdiction under which the resource is relevant Biology (General) Language A language of the resource EN