{"id":415,"date":"2025-02-14T23:32:19","date_gmt":"2025-02-14T23:32:19","guid":{"rendered":"https:\/\/lab.vanderbilt.edu\/mitchell-lab\/?page_id=415"},"modified":"2026-04-08T08:55:06","modified_gmt":"2026-04-08T14:55:06","slug":"using-bioinformatics-to-aid-discovery-rodeo","status":"publish","type":"page","link":"https:\/\/lab.vanderbilt.edu\/mitchell-lab\/using-bioinformatics-to-aid-discovery-rodeo\/","title":{"rendered":"Using Bioinformatics to Aid Discovery: RODEO"},"content":{"rendered":"<p style=\"text-align: justify\"><b>Natural products, or specialized molecules produced by living things,<\/b> have historically been an important source of antibiotics and other drugs. To accelerate the discovery of new natural products, researchers have turned to bioinformatics methods for predicting whether a specific organism will make an undiscovered molecule. <img loading=\"lazy\" decoding=\"async\" class=\"alignright wp-image-498\" src=\"https:\/\/cdn.vanderbilt.edu\/t2-main\/lab-prd\/wp-content\/uploads\/sites\/224\/2025\/02\/RODEO_1-300x131.jpg\" alt=\"Diagram of a bioinformatics workflow for RiPP discovery using RODEO. Genomic database sequences are input and analyzed through RODEO, which performs open reading frame (ORF) annotation and module scoring using approaches such as genomic data, heuristic scoring, and machine learning. The outputs inform downstream biochemical validation and bioinformatic characterization, including spectral analysis and gene cluster organization.\" width=\"592\" height=\"259\" srcset=\"https:\/\/cdn.vanderbilt.edu\/t2-main\/lab-prd\/wp-content\/uploads\/sites\/224\/2025\/02\/RODEO_1-300x131.jpg 300w, https:\/\/cdn.vanderbilt.edu\/t2-main\/lab-prd\/wp-content\/uploads\/sites\/224\/2025\/02\/RODEO_1.jpg 624w\" sizes=\"auto, (max-width: 592px) 100vw, 592px\" \/> This has led to an increase in \u201cgenome mining\u201d for natural product biosynthetic pathways. While genome mining can access diverse natural products, the process can be time consuming and the number of organisms with sequenced genomes continues to explode. As a result, automation of genome mining and big data analysis pipelines is crucial for the prioritization and discovery of new natural products.<\/p>\n<p style=\"text-align: justify\"><b>To lead these efforts, the Mitchell Lab has developed an algorithm called <a href=\"https:\/\/webtool.ripp.rodeo\/\" target=\"_blank\" rel=\"noopener\">RODEO<\/a>,<\/b> which aids genome miners by automating the annotation and visualization of genomic data. While RODEO is a multi-purpose algorithm, it specifically includes modules designed to analyze certain types of RiPPs (ribosomally synthesized and post-translationally modified peptides) using motif analysis, heuristic scoring, and supervised machine learning. RiPPs offer an exciting opportunity for natural product discovery, as peptides and other biologics are increasingly sought-after therapeutic modalities. By studying RiPP biosynthesis, we hope to access new bioactive molecules as pharmaceutical leads and new enzymes as tools for biotechnological or pharmaceutical development. The Mitchell lab maintains RODEO as a <a href=\"https:\/\/webtool.ripp.rodeo\/login.php\">public webtool<\/a>, and RODEO is under active development to further aid researchers studying RiPPs.<\/p>\n<p style=\"text-align: center\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-505\" src=\"https:\/\/cdn.vanderbilt.edu\/t2-main\/lab-prd\/wp-content\/uploads\/sites\/224\/2025\/02\/RODEO_2-300x44.png\" alt=\"Workflow diagram of the RODEO pipeline for RiPP discovery. Starting with a GenBank ID input, annotated genes are retrieved and organized into gene clusters. These clusters undergo HMM-based functional prediction, followed by optional six-frame translation to identify hypothetical precursor peptides. Supervised machine learning is then used to score candidate RiPP precursors, resulting in an output of prioritized sequences for accelerated RiPP discovery.\" width=\"2000\" height=\"293\" srcset=\"https:\/\/cdn.vanderbilt.edu\/t2-main\/lab-prd\/wp-content\/uploads\/sites\/224\/2025\/02\/RODEO_2-300x44.png 300w, https:\/\/cdn.vanderbilt.edu\/t2-main\/lab-prd\/wp-content\/uploads\/sites\/224\/2025\/02\/RODEO_2-768x112.png 768w, https:\/\/cdn.vanderbilt.edu\/t2-main\/lab-prd\/wp-content\/uploads\/sites\/224\/2025\/02\/RODEO_2.png 936w\" sizes=\"auto, (max-width: 2000px) 100vw, 2000px\" \/><\/p>\n<p style=\"text-align: justify\"><b>Beyond bioinformatic methods, we also validate and characterize<\/b> new natural products predicted by RODEO. Following dataset generation and analysis, we select cases with the potential for discovering new natural products and\/or enzymatic transformations. Beginning with our original report of RODEO in 2017, we have expanded molecular diversity in many RiPP classes including <a href=\"https:\/\/doi.org\/10.1038\/nchembio.2319\" target=\"_blank\" rel=\"noopener\">lasso peptides<\/a>, <a href=\"https:\/\/doi.org\/10.1021\/acschembio.1c00672\" target=\"_blank\" rel=\"noopener\">graspetides<\/a>, <a href=\"https:\/\/doi.org\/10.1021\/jacs.8b03896\" target=\"_blank\" rel=\"noopener\">thiopeptides<\/a>, <a href=\"https:\/\/doi.org\/10.1186\/s12864-020-06785-7\" target=\"_blank\" rel=\"noopener\">lanthipeptides<\/a>, <a href=\"https:\/\/doi.org\/10.1021\/acschembio.0c00620\" target=\"_blank\" rel=\"noopener\">linaridins<\/a>, and <a href=\"https:\/\/doi.org\/10.1021\/acschembio.4c00066\" target=\"_blank\" rel=\"noopener\">borosins<\/a>, and discovered new RiPP classes defined by new biosynthetic reactions such as the <a href=\"https:\/\/doi.org\/10.1021\/jacs.9b01519\" target=\"_blank\" rel=\"noopener\">ranthipeptides<\/a>, <a href=\"https:\/\/doi.org\/10.1038\/s41467-023-37287-1\" target=\"_blank\" rel=\"noopener\">daptides<\/a>, and <a href=\"https:\/\/doi.org\/10.1021\/acscentsci.4c00088\" target=\"_blank\" rel=\"noopener\">aminopyruvatide<\/a>. RODEO also allows us to explore the larger evolutionary contexts of RiPP biosynthesis, from both computational and biochemical perspectives. At the moment, our efforts with RODEO are focused on pushing the boundaries of hypothesis generation using bioinformatics. Collectively, these approaches are enabling multiple lines of investigation for the discovery of new RiPPs, study of their biological functions, and the engineering of RiPP biosynthetic systems for biotechnological applications.<\/p>\n<p style=\"text-align: center\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-514\" src=\"https:\/\/cdn.vanderbilt.edu\/t2-main\/lab-prd\/wp-content\/uploads\/sites\/224\/2025\/02\/RODEO_3-300x208.png\" alt=\"Detailed schematic of a RiPP peptide structure and its post-translational modifications. On the left, the chemical structure highlights multiple modified residues with color-coded functional groups (e.g., heterocycles, sulfur-containing linkages, and hydroxyl groups). On the right, the corresponding peptide sequence is shown as a circular or lollipop-style diagram with labeled amino acids (e.g., W, G, E, T, A), indicating positions of modifications. The bottom portion shows the linear precursor peptide sequence with specific residues highlighted to indicate sites of enzymatic modification and crosslink formation.\" width=\"598\" height=\"414\" srcset=\"https:\/\/cdn.vanderbilt.edu\/t2-main\/lab-prd\/wp-content\/uploads\/sites\/224\/2025\/02\/RODEO_3-300x208.png 300w, https:\/\/cdn.vanderbilt.edu\/t2-main\/lab-prd\/wp-content\/uploads\/sites\/224\/2025\/02\/RODEO_3.png 624w\" sizes=\"auto, (max-width: 598px) 100vw, 598px\" \/><\/p>\n<p>&nbsp;<\/p>\n<p style=\"text-align: justify\"><b>Key Publications:<\/b><\/p>\n<p style=\"text-align: justify\">1. <a href=\"https:\/\/www.nature.com\/articles\/nchembio.2319\" target=\"_blank\" rel=\"noopener\">A new genome mining tool redefines the lasso peptide biosynthetic landscape. <i>Nat. Chem. Biol.<\/i>, <b>13<\/b>: 470-478 (2017).<\/a><\/p>\n<p style=\"text-align: justify\">2. <a href=\"https:\/\/pubs.acs.org\/doi\/10.1021\/jacs.9b01519\" target=\"_blank\" rel=\"noopener\">Bioinformatic mapping of radical S-adenosylmethionine-dependent ribosomally synthesized and post-translationally modified peptides identifies new C\u03b1, C\u03b2, and C\u03b3-linked thioether-containing peptides. <i>J. Am. Chem. Soc.<\/i>, <b>141<\/b>: 8228-8238 (2019).<\/a><\/p>\n<p style=\"text-align: justify\">3. <a href=\"https:\/\/doi.org\/10.1038\/s41467-022-33890-w\" target=\"_blank\" rel=\"noopener\">A scalable platform to discover antimicrobials of ribosomal origin. <i>Nat. Commun.<\/i>, <b>13<\/b>: 6135 (2022).<\/a><\/p>\n<p style=\"text-align: justify\">4. <a href=\"https:\/\/doi.org\/10.1128\/msystems.00267-20\" target=\"_blank\" rel=\"noopener\">Genome mining unveils a class of ribosomal peptides with two amino termini. <i>Nat. Commun.<\/i>, <b>14<\/b>: 1624 (2023).<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Natural products, or specialized molecules produced by living things, have historically been an important source of antibiotics and other drugs. To accelerate the discovery of new natural products, researchers have turned to bioinformatics methods for predicting whether a specific organism will make an undiscovered molecule. This has led to an increase in \u201cgenome mining\u201d for&#8230;<\/p>\n","protected":false},"author":577,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"page_onecolumn.php","meta":{"_acf_changed":false,"footnotes":"","_links_to":"","_links_to_target":""},"tags":[],"class_list":["post-415","page","type-page","status-publish","hentry"],"acf":[],"_links":{"self":[{"href":"https:\/\/lab.vanderbilt.edu\/mitchell-lab\/wp-json\/wp\/v2\/pages\/415","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/lab.vanderbilt.edu\/mitchell-lab\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/lab.vanderbilt.edu\/mitchell-lab\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/lab.vanderbilt.edu\/mitchell-lab\/wp-json\/wp\/v2\/users\/577"}],"replies":[{"embeddable":true,"href":"https:\/\/lab.vanderbilt.edu\/mitchell-lab\/wp-json\/wp\/v2\/comments?post=415"}],"version-history":[{"count":56,"href":"https:\/\/lab.vanderbilt.edu\/mitchell-lab\/wp-json\/wp\/v2\/pages\/415\/revisions"}],"predecessor-version":[{"id":922,"href":"https:\/\/lab.vanderbilt.edu\/mitchell-lab\/wp-json\/wp\/v2\/pages\/415\/revisions\/922"}],"wp:attachment":[{"href":"https:\/\/lab.vanderbilt.edu\/mitchell-lab\/wp-json\/wp\/v2\/media?parent=415"}],"wp:term":[{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/lab.vanderbilt.edu\/mitchell-lab\/wp-json\/wp\/v2\/tags?post=415"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}