• Tidak ada hasil yang ditemukan

Biosynthetic Gene Cluster Profiling of Virgibacillus salarius from North Java Sea

N/A
N/A
utube

Academic year: 2024

Membagikan " Biosynthetic Gene Cluster Profiling of Virgibacillus salarius from North Java Sea"

Copied!
15
0
0

Teks penuh

(1)

Biosynthetic gene cluster profiling from North Java Sea Virgibacillus salarius reveals hidden potential metabolites

Ocky Karna Radjasa

1*

, Ray Steven

2

, Zalfa Humaira

2

, Fenny Martha Dwivany

2*

, Husna Nugrahapraja

2

, Joko Pebrianto Trinugroho

2

, Tati Kristianti

3

, Agus Chahyadi

4

, Yosua Natanael

2

, Neil Priharto

2

, Kamarisima

2

, Farisa Amalia P. B. Sembiring

2

, Ari Dwijayanti

5

, Lia Kusmita

6

, Maelita R. Moeis

7

& V. Sri Harjati Suhardi

2

Virgibacillus salarius 19.PP.SC1.6 is a coral symbiont isolated from Indonesia’s North Java Sea; it has the ability to produce secondary metabolites that provide survival advantages and biological functions, such as ectoine, which is synthesized by an ectoine gene cluster. Apart from being an osmoprotectant for bacteria, ectoine is also known as a chemical chaperone with numerous biological activities such as maintaining protein stability, which makes ectoine in high demand in the market industry and makes it beneficial to investigate V. salarius ectoine. However, there has been no research on genome-based secondary metabolite and ectoine gene cluster characterization from Indonesian marine V. salarius. In this study, we performed a genomic analysis and ectoine identification of V. salarius. A high-quality draft genome with total size of 4.45 Mb and 4426 coding sequence (CDS) was characterized and then mapped into the Cluster of Orthologous Groups (COG) category. The genus Virgibacillus has an "open" pangenome type with total of 18 genomic islands inside the V. salarius 19.PP.SC1.6 genome. There were seven clusters of secondary metabolite- producing genes found, with a total of 80 genes classified as NRPS, PKS (type III), terpenes, and ectoine biosynthetic related genes. The ectoine gene cluster forms one operon consists of ectABC gene with 2190 bp gene cluster length, and is successfully characterized. The presence of ectoine in V.

salarius was confirmed using UPLC-MS/MS operated in Multiple Reaction Monitoring (MRM) mode, which indicates that V. salarius has an intact ectoine gene clusters and is capable of producing ectoine as compatible solutes.

The sea constitutes a dynamic and intricate ecosystem. Physical factors such as atmospheric pressure and tem- perature, as well as chemical factors like pH and salt concentration, exhibit variations at various ocean depths.

These abiotic fluctuations significantly impact marine microorganisms, compelling them to adapt and evolve to thrive in these challenging surroundings. Among the adaptation mechanisms observed in marine microorgan- isms is the synthesis of a wide range of secondary metabolites1, 2.

Secondary metabolites exhibit a wide range of biochemical activities, encompassing antibiotics, antioxidants, and antitumor properties3, 4. These compounds are typically generated by Secondary Metabolite Biosynthetic Gene Clusters (smBGCs), which are sets of genes that govern specific metabolic pathways responsible for sec- ondary metabolite production5. In recent times, secondary metabolites possessing antimicrobial properties have found application in antibiotic production within the pharmaceutical industry, garnering approval from the United States Food and Drug Administration (FDA) agencies6. Secondary metabolites sourced from marine

OPEN

1Research Center for Deep Sea, The Earth Sciences and Maritime Research Organization, National Research and Innovation Agency, Jakarta 14430, Indonesia. 2Institut Teknologi Bandung, School of Life Sciences and Technology, Bandung, West Java 40132, Indonesia. 3Institut Pendidikan Indonesia, Garut, West Java 44151, Indonesia. 4University Center of Excellence for Nutraceuticals, Bioscience and Biotechnology Research Center, Bandung Institute of Technology, Bandung, West Java, Indonesia. 5CNRS@CREATE Ltd., 1 Create Way, #08-01 Create Tower, Singapore 138602, Singapore. 6STIFAR Yayasan Pharmasi Semarang, Semarang, Central Java 50124, Indonesia. 7Department of Biotechnology, Faculty of Science and Technology, Universitas Muhammadiyah Bandung, Bandung, West Java 40262, Indonesia.*email: [email protected]; [email protected]

(2)

environments boast distinctive and diverse chemical structures, piquing the pharmaceutical industry’s interest, particularly for the potential discovery of novel antibiotic compounds7. Marine-derived secondary metabolites have found utility across various sectors, including agriculture as biopesticides8, the food industry as functional additives9, and the cosmetics and skincare sector as ingredients for beauty products10.

Genome mining has emerged as an indispensable tool for tapping into the vast potential of Indonesia’s deep sea and its valuable biological compounds. Indonesia, renowned as the world’s largest archipelagic nation and one of the most biodiverse, boasts an expansive oceanic territory that covers a remarkable 77% of its total land area, with sea depths exceeding 200 meters11. This phenomenon is particularly prominent in Eastern Indonesia, encompassing regions like the Flores Sea, Banda Sea, and Makassar Strait, as well as the open waters of the Indian and Pacific Oceans12. As a result, Indonesia offers an extraordinary opportunity for the discovery of valuable biological compounds.

Conventional biochemical methods used for the screening and isolation of secondary metabolites from bac- terial species have inherent limitations. These methods demand extensive labor and entail the manipulation of various cultivation parameters, including light, pH, aeration, temperature, and nutrient composition13. Further- more, a significant challenge arises from the fact that numerous biosynthetic gene clusters (BGCs) responsible for secondary metabolite production remain silent or cryptic under standard laboratory conditions. These gene clusters are predominantly activated in response to environmental stressors, driving bacterial adaptations14.

To surmount these limitations, genome mining emerges as a promising alternative. This method harnesses the power of genome sequencing, computational analysis, and compound characterization, enabling the discovery of novel natural bioproducts15, 16. Genome mining has demonstrated its efficacy in uncovering the potential of marine microbial compounds that have eluded detection using traditional characterization methods, including cultivation and isolation within laboratory settings5, 17, 18.

One of the secondary metabolites produced by certain marine bacteria to serve as an osmoprotectant is ectoine. Ectoine facilitates microbial adaptation to fluctuations in salinity within the sea, employing a "salt-out"

strategy19. Ectoine, chemically known as 1,4,5,6-tetrahydro-2-methyl-4-pyrimidine carboxylic acid, is an amino acid derivative originating from aspartate, possessing a pronounced affinity for water20. The synthesis of ectoine involves three principal enzymes encoded by three genes: ectA, ectB, and ectC, which collectively form a gene cluster. Previous research has demonstrated the conservation of ectoine biosynthetic gene clusters across numer- ous bacterial species21. Traditionally, the production of ectoine has relied on the use of Halomonas elongata, a halophilic bacterium. The process involves stimulating these bacteria within a high-salinity medium, followed by inducing an osmotic shock to release ectoine into the surrounding medium22.

Ectoine, often referred to as a chemical chaperone, plays a crucial role in stabilizing proteins, shielding against UV radiation, and safeguarding molecules or cells from high temperatures. Its versatile properties make it a potential natural protectant with applications spanning agriculture, biotechnology, healthcare, food processing, and cosmetics22. In the realm of skincare, ectoine finds extensive use as an ingredient, contributing significantly to sun protection and anti-aging products. It also serves as an enhancer of lipase catalytic efficiency in biodiesel production and is utilized in the healthcare industry to develop ectoine-based medical devices for allergy treat- ment, including nasal sprays and eye drops22–24. The biosynthesis of ectoine commences with the presence of a precursor called L-aspartate-semialdehyde, which undergoes conversion to L-2,4-diaminobutyrate through the action of L-2,4-diaminobutyrate-2-oxoglutarate transaminase (ectB). Subsequently, diaminobutyric acid acetyltransferase (ectA) transforms it into N-γ-acetyl-L-2,4-diaminobutyrate, ultimately leading to the produc- tion of ectoine, facilitated by ectoine synthase (ectC)21. In the presence of the ectD gene, which encodes ectoine hydroxylase, certain bacteria, such as V. halodenitrificans and V. salarius, can produce ectoine derivatives in the form of 5-hydroxyectoine23 (Fig. 1).

Considering the relatively uncharted territory of Indonesia’s marine bioresources, there is a pressing need for further exploration through a genomics-driven strategy. This approach holds the promise of unearthing

Figure 1. Ectoine and 5-hydroxyectoine biosynthesis pathway. This pathway consists of ectA, ectB, and ectC as main genes. Not every ectoine-producing bacteria has the ectD gene, whose main activity is to produce ectoine derivative 5-hydroxyectoine.

(3)

previously unknown and potentially valuable bioproducts, including bacteria such as V. salarius. Industries are actively seeking cost-effective methods for a variety of bioproducts, and the discovery of a bacterial strain with unique capabilities would be of great interest. While V. salarius 19.PP.SC1.6, isolated from coral in the North Java Sea, has shown potential in the synthesis of various industrially valuable compounds25–28, its complete genome sequence and specific capabilities remain uncharted territory. Conducting comprehensive research to obtain genomic data on Indonesia’s marine microbes, including V. salarius 19.PP.SC1.6, has the potential to unlock the vast capabilities of these natural resources and their application across various industries. This knowledge will significantly contribute to the development of sustainable and economically viable processes for the production of valuable compounds. Further investigation is essential to delineate the specific bioproduct capabilities of V.

salarius 19.PP.SC1.6 and assess its potential as a high-yield ectoine producer.

Material and methods

Sample collection and cultivation

The bacterial samples utilized in this study were sourced from our prior studies28. Specifically, soft coral Sinularia sp. specimens were collected at a depth of 2 m in the waters of Pulau Panjang, Jepara, Central Java (coordinates:

6°34′37.35ʺLS 110°37′52.01ʺE and 6°34′41.38ʺLS 110°37′54.12ʺE). These coral specimens measured 3 to 5 cm in size and were placed in a plastic container filled with sterile seawater before being temporarily stored in a coolbox. The symbiotic microorganisms were isolated following the method outlined by Kusmita et al.28.

Isolation and sequencing of genomic DNA

Bacterial DNA isolation was carried out using the PowerWater® DNA Isolation Kit from MOBIO, while Whole Genome Amplification (WGA) was performed using the REPLI-G Mini Kit from QIAGEN (Venlo, Netherlands).

Subsequently, the DNA sample underwent whole genome sequencing, which was conducted by Genetika Science Indonesia (Jakarta, Indonesia), utilizing the GridION sequencing platform from Oxford Nanopore Technologies (Oxford, U.K).

Identification and taxonomic analysis

The Mash algorithm (MinHash) was employed on the web PATRIC 3.6.1229 platform with a p-value of 0 to search for related species. GenBank files generated from whole genome sequencing were parsed using a Python script (available at https:// github. com/ rayst even/ gbk_ parser), which was executed using Biopython 1.79. ANI (Aver- age Nucleotide Identity) values were computed using FastANI version 0.1.330, while dDDH (digital DNA-DNA hybridization) values were calculated using GGDC (Genome-to-Genome Distance Calculator) version 3.031. To construct a phylogenomic tree from the dataset, OrthoFinder version 2.5.432 was utilized, and the resulting tree was visualized with iTOL version 6.5.233. The genome assembly of V. salarius 19.PP.SC1.6 has been deposited in the NCBI GenBank database under the accession number GCF_027941815.1.

Estimation of genomic characteristics

The total genome length (in base pairs, bp), coding region length (in bp), total G+C content (%), and the number of genes (including tRNA, rRNA, tmRNA, and CDS) were computed using the Python script mentioned (https://

github. com/ rayst even/ gbk_ parser). Subsequently, the obtained results were visualized as a circular genome map using the Proksee web server’s CGView-based draw engine34.

Pan genome, genomic islands, and secondary metabolites analysis

For pan genome analysis, GET HOMOLOGUES35 was utilized to acquire occupancy statistics, estimate the size of the pangenome and core genome, and perform related analyses. To predict the presence of genomic islands, the IslandViewer web server version 4 was employed36. The antiSMASH web server version 6.0.1 was used to predict the existence of biosynthetic gene clusters responsible for secondary metabolite production37. Visuali- zation of reaction pathways, based on KO (KEGG Orthology) annotations, was conducted using the KEGG Reconstructor38.

In silico nonribosomal peptides structure similarity search and its antibacterial activity prediction

The putative non-ribosomal peptides (NRP) chemical structures were extracted from the antiSMASH results in SMILES (simplified molecular-input line-entry system) format. These NRP SMILES were then employed as search queries in the NRP database, using NORINE (https:// bioin fo. lifl. fr/ norine/), to identify potential structural similarities with known NRP compounds existing in the database39. Additionally, the NRP SMILES were utilized to predict the in silico antibacterial effects through QSAR (quantitative-structure activity relationship) methods using the antiBac-Pred web application (http:// www. way2d rug. com/ antib ac/) from Way2Drug40.

Ectoine biosynthetic gene cluster characterization and comparison

Comparison of V. salarius 19.PP.SC1.6 ectoine gene sequences and clusters with ectoine gene cluster sequences obtained from the NCBI database, which have accession numbers AY935521 (Virgibacillus salexigens DSM 11483), KU510274 (Virgibacillus halodenitrificans PDB-F2), AY585263 (Virgibacillus pantothenticus DSM 26), DQ471210 (Bacillus alcalophilus DTY1), MH020162 (Bacillus clausii NIOT-DSB04), U66614 (Marinococcus halo- philus DSM 20408), and D88359 (Halomonas elongata OUT30018), was carried out through multiple sequence alignment (MSA). A Neighbor Joining (NJ) phylogenetic tree was constructed using MEGA-X software41 for fur- ther analysis of identity and sequence similarity using SIAS42. The results were visualized using IBS43. Structural

(4)

features of the ectoine gene cluster were predicted using various tools, including the Promoter Prediction tool (http:// www. fruit fly. org/ seq_ tools/ promo ter. html)44, BPROM45, and FindTerm45. These tools were employed to identify promoter regions and termination sites within the ectoine gene cluster of V. salarius 19.PP.SC1.6.)

Ectoine production and identification

V. salarius 19.PP.SC1.6 was cultured in 100 mL of Zobell Marine Medium (Himedia M384-500G) containing a 10% NaCl concentration, with agitation at 150 rpm for 30, 45, and 48 h. To collect cell pellets, the culture was subjected to centrifugation at 6000 rpm for 10 min. Following this, 100 mL of 1.5% NaCl was added for the bacterial milking process. Bacterial milking was performed at 200 rpm and 25 °C for a duration of 30 min. The resulting cell suspension was then centrifuged at 8000 rpm for 15 min. The supernatants obtained from the bacterial milking process were subsequently freeze-dried for further analysis.

Ectoine analysis using liquid chromatography-mass spectrometry (LC–MS)

The authentic standard of ectoine obtained from Sigma Aldrich (Missouri, USA) was freshly prepared in dis- tilled water and sonicated for 2 min. A stock solution of 1 mg/ml concentration was prepared and then filtered through a 0.22-µm PTFE filter for analysis. The analysis was conducted using an Acquity UPLC H-Class system equipped with a binary solvent manager, sample manager, and column heater, all from Waters (Michigan, USA).

The Masslynx analysis software was employed to operate the instrument and carry out data analyses. A separation column, ACQUITY UPLC BEH Shield RP18 100 × 2.1 mm with 1.7-μm particle size, was utilized. The mobile phase consisted of two solvents: solvent A, which was composed of water with 0.1% formic acid, and solvent B, which contained acetonitrile (AcN) with 0.1% formic acid. The elution was performed in isocratic mode using 5% B for 10 min at a flow rate of 0.3 mL/min. The column was maintained at a temperature of 40 °C, and the injection volume for both the sample and standard was 3 µL.

Detection was carried out using multiple reaction monitoring (MRM), a highly specific and sensitive mass spectrometry (MS) technique that can selectively quantify compounds within complex mixtures by monitoring multiple daughter ions of the compound of interest. Electrospray ionization (ESI) was operated in positive mode (ESI +) with the following source parameters: a spray voltage of 26 V using nitrogen as the source gas, a source gas flow rate of 900 L/h for desolvation, and a source temperature of 450 °C for desolvation. The collision energies were set to 15 V for the quantifier and 25 V for the qualifier. Capillary voltage and cone voltage were maintained at 2.5 kV and 26 V, respectively. Under these conditions, product ions (m/z) for ectoine were observed as follows:

143.02 for the parent ion, 97.21 for the quantifier, and 68.21 for the qualifier.

Ectoine genes isolation

V. salarius 19.PP.SC1.6 underwent a cultivation process, initially on solid Zobell Marine Broth Medium, followed by subculturing in liquid Zobell Marine Medium, with overnight incubation at 37 °C in a shaker set at 200 rpm.

DNA isolation was performed using the ZymoBIOMICS DNA Miniprep Kit from Zymo Research (Irvine, Cali- fornia), in accordance with the provided protocol. The ectABC biosynthetic gene was amplified through PCR, with denaturation temperatures at 95 °C, annealing temperatures at 53 °C, and elongation temperatures at 72 °C.

Specific primers were utilized for this purpose: ectA primer (5′-atgcctacaaaagatgatggggcag-3′ F and 3′-ttattcat- tatttcccttttgaaaagggccaatctt-5′ R), ectB primer (5′-atgaaaacctttgaagaattggaatcatcggt-3′ F and 3′-ttacttcaatacttgtt- taattgcttcttcaagaa-5′ R), and ectC (5′-atgatcgtaaaatcacttgaagatattattggaac-3′ F and 3′-ttattctgttaatagaggataatatc- cttcttttg-5′ R). Subsequently, the amplified gene was purified using the Wizard® SV Gel and PCR Clean-Up System kit.

For gene cloning, the ectA, ectB, and ectC genes were ligated into the pGEM®-T Easy Vector plasmid obtained from Promega (Wisconsin, USA) following established protocols. Transformation of E. coli DH5α with plasmids containing these genes was accomplished using the heat shock method46, and the transformed bacteria were cultured on ampicillin LB medium for selection. The cloning vector containing the ectA, ectB, and ectC genes was isolated from the recombinant E. coli DH5α and verified through PCR using gene-specific primers. Finally, the recombinant plasmid was confirmed through plasmid sequencing to ensure the successful integration of the ectA, ectB, and ectC genes.

Results and discussion Environmental conditions

V. salarius 19.PP.SC1.6 was isolated from Panjang Island in the North Java Sea, where it was found to inhabit an environment with a salinity level of approximately 2.6%, as indicated in Table 1. This bacterium belongs to the category of moderate halophilic bacteria, which means it can thrive in a wide range of environmental conditions. It is capable of living in salt concentrations ranging from 0.5% to 25%, accommodating pH levels between 5.5 and 10, and tolerating temperatures within the range of 10 °C to 50 °C46. The successful isolation of this bacterium from its natural habitat aligns with its ecological profile, which falls well within its documented growth parameters46.

Identification and taxonomic analysis

The Overall Genome Relatedness Index (OGRI), which encompasses ANI (%) and digital DNA-DNA Hybridi- zation (dDDH) (%), serves as a tool for identifying bacterial samples collected, as depicted in Fig. 2a. The cal- culated OGRI values for V. salarius 19.PP.SC1.6 indicate ANI (99.42%) and dDDH (96.8%) when compared to V. salarius 720a. Notably, the ANI threshold value is set at 95%47, and the dDDH48 threshold value is set at 70%.

These set threshold values are used to determine whether two genomes belong to the same species. These results strongly suggest that V. salarius 19.PP.SC1.6 and V. salarius 720a belong to the same species. To further bolster

(5)

this identification, a phylogenomic tree was constructed (Fig. 2b), which solidifies V. salarius 19.PP.SC1.6 as the closest relative to V. salarius 720a within this species tree. These findings corroborate the conclusions drawn from the previous 16S rRNA analysis, reaffirming the bacterial identity as V. salarius28.

Genomic characteristics

Table 2 presents the general characteristics of the V. salarius 19.PP.SC1.6 genome. The genome size of V. salarius 19.PP.SC1.6 is approximately 4.45 Mbp. It’s worth noting that bacteria with genome sizes larger than 3 Mbp often exhibit a higher G+C content. This phenomenon is attributed to multiple horizontal gene transfer (HGT) events involving various genomic elements, including genomic islands, which occur in response to environmental pres- sures, ultimately enhancing their survival strategies49. The differences observed among the numbers of COG, KO, COG-KO-annotated, and unannotated CDS indicate that there are many CDSs that remain insufficiently characterized, underscoring the need for further research.

The functional mapping of COG provides more insight into the analysis of CDSs (Fig. 3). V. salarius 19.PP.

SC1.6 is found to possess 21 out of the 25 COG categories, with the absence of four specific categories: Y (related to nuclear structure), Z (related to the cytoskeleton), A (associated with RNA processing and modification), and R (associated with the prediction of general function only). The annotated V. salarius 19.PP.SC1.6 genome includes 266 pathways and 75 KEGG modules, as determined by the KEGG Reconstructor results.

Pan genome analysis

The 14 Virgibacillus genomes (Supplementary Table S2) collectively exhibit an estimated pan genome size of 11,345 gene clusters. Within this genomic repertoire, 1136 genes constitute the core genome, 1596 genes make up the soft-core genome, 2295 genes form the shell genome, and a substantial 7463 genes constitute the cloud genome. Remarkably, the Virgibacillus core genome represents only 10.01% (1136 out of 11,345) of the entire pangenome. This relatively low proportion of the core genome suggests that Virgibacillus exhibits characteristics of an "open pangenome"50. Figure 4 illustrates the estimation curve for core genome size, which closely aligns with the size of the Virgibacillus genus pangenome, consistent with the concept of an open pan genome previ- ously proposed by researchers51, 52.

Moreover, the determination of the type of pangenome is guided by calculating the value on the pangenome size curve derived from computational results with GET HOMOLOGUES (Fig. 4a). In the case of an open pangenome, a significant portion of its genes is unique and exclusive to specific species. A genus characterized by an open pan genome implies that the species within it possess adaptability to diverse ecological niches53, 54. This evidence lends support to the hypothesis that V. salarius 19.PP.SC1.6 may potentially harbor novel genes or produce unique secondary metabolites as an adaptation to its niche in the North Java region. Consequently, there is a compelling incentive to delve further into the exploration of the V. salarius 19.PP.SC1.6 genome.

Prediction of genomic islands

The prediction of genomic islands using IslandViewer version 4.0 revealed the presence of a total of 18 genomic islands within the genome of V. salarius 19.PP.SC1.6 (Fig. 5). These genomic islands exhibit varying lengths, ranging from 4.385 to 36.931 kb, and each island contains between 4 and 41 genes. Notably, some of the genomic islands in V. salarius 19.PP.SC1.6 have relatively smaller sizes, ranging from 4 to 8 kb, in contrast to the average genomic island size, which typically falls between 10 and 200 kb and is commonly observed in other bacteria55.

Within the genomic island of V. salarius 19.PP.SC1.6, a total of 355 genes were identified. Significantly, 44.51%

of these genes (158 out of 355) were not found in the COG database, indicating the presence of novel genes that have not been previously characterized. Additionally, 15.21% of the genes (54 out of 355) were classified as having an unknown function (category S). Furthermore, a substantial portion, approximately 59.71% of the total genomic island genes (212 out of 355), remain functionally uncharacterized. Among the remaining genes detected by the COG database, the dominant category observed was category L, which pertains to replication, recombination, and repair processes55.

The functions of other genes within the genomic island remain unknown, providing an opportunity for fur- ther research to uncover the specific types of genomic islands present in V. salarius 19.PP.SC1.6. These islands could potentially encompass various categories, including pathogenicity islands, resistance islands, metabolic islands, and symbiosis islands, among others, warranting further investigation.

Table 1. Descriptive information on the environmental characteristics of Panjang island (sampling location)28.

Parameter Panjang island Salinity (‰) 26

pH 7.08

Temperature (℃) 28.8 Visibility (m) 1

DO (mg/L) 3.45

NH3 (mg/L) 2.5 S2− (mg/L) 0.000 NO3 (mg/L) 2.6

(6)

Prediction of secondary metabolite BGCs

We conducted a secondary metabolite prediction analysis using antiSMASH 6.0 to identify potential secondary metabolite gene clusters in V. salarius 19.PP.SC1.6. Table 3 provides information on the secondary metabolite Figure 2. (a) Calculation results of ANI and dDDH of 14 Virgibacillus genomes for sample identification. (b).

Phylogenomic analysis of the V. salarius 19.PP.SC1.6 genome compared to 43 other Virgibacillus genomes in total. Phylogenomic tree was constructed using OrthoFinder and Visualized with iTOL software.

(7)

gene clusters identified in the genome, with a total of seven clusters containing 3 to 30 genes. In aggregate, these seven gene clusters encompass a total of 80 genes.

The first secondary metabolite gene cluster falls under the polyketide synthase (PKS) type III category and comprises two main genes: naringenin-chalcone synthase (NCS) and hydroxymethylglutaryl-CoA synthase (HMGCS). These enzymes play pivotal roles in the biosynthesis of naringenin chalcone and (S)-3-hydroxy- 3-methylglutaryl-CoA (HMG-CoA)56, 57. Within V. salarius 19.PP.SC1.6, the terpene gene cluster includes the enzymes farnesyl-diphosphate farnesyltransferase (FDFT1) and squalene-hopene cyclase (SHC). These enzymes are responsible for generating sesquiterpenoid and triterpenoid derivatives58, 59.

Non-ribosomal peptides (NRPs) represent secondary metabolites known for their antibacterial activity. NRP 1 in the V. salarius 19.PP.SC1.6 genome exhibits a predicted similarity of 0.542 to bacillibactin, with an antibac- terial confidence of 0.8122 against Klebsiella oxytoca. Meanwhile, NRP 2 is anticipated to have the capacity to produce new compounds akin to lolicillomycin and 51W pyoverdine. Furthermore, the ectoine gene cluster in V. salarius 19.PP.SC1.6 comprises a complete set of genes, consisting of ectABC, which will be further examined for its ability to produce intact ectoine products.

Table 2. General characteristics of the V. salarius 19.PP.SC1.6 genome from whole genome sequencing (WGS) result. *Total is from genome size (bp) or from total genes. COG Clusters of Orthologous Groups, CDS Coding Sequence, KO KEGG Orthology.

Attribute Score % from total*

Genome size (bp) 4,497,283 100.00 DNA coding region (bp) 3,575,955 79.51 DNA G+C content (bp) 1,650,503 36.70

Total genes 4979 100.00

rRNA 17 0.34

tRNA 64 1.29

tmRNA 1 0.02

CDS 4897 98.35

CDS annotated with COG 4426 88.89 CDS annotated with KO 4297 87.75 CDS annotated with COG & KO 4227 84.90

Figure 3. Circular analysis and visualization of V. salarius 19.PP.SC1.6 genome. From the outside to the inside, the first and second circle represent genes with COG annotation. Circles 3 (green and purple) and 4 (black) show GC skew and GC content as the deviation from the average for the complete genome.

(8)

Characterization and comparison of ectoine biosynthetic gene cluster

According to antiSMASH results, three genes in the ectoine gene cluster are located on the V. salarius 19.PP.SC1.6 genome at bases 3,905,306–3,907,562 with a total length of 2190 bp. These three genes code l-ectoine synthase (ectC), diaminobutyrate-2-oxoglutarate transaminase (ectB), and L-2,4-diaminobutyric acid acetyltransferase (ectA).

Figure 4. Pan- and core-genome estimation curves. (a) The size of the pan-genome increases for every included genome indicating an open pan-genome. (b) Core-genome size decreases with more genomes included in analysis.

Figure 5. Visualization of Genomic Island detected in the V. salarius 19.PP.SC1.6 Genome using IslandViewer 4.0. The genome of V. salarius 19.PP.SC1.6 contains a total of 18 genomic islands.

(9)

The complete sequence of ectoine-producing bacteria gene clusters, consisting of non-coding and coding regions, was predicted for the number of operons, number of genes, and gene length; the structure of the gene cluster was also visualized (Fig. 6A). Overall, ectoine gene clusters in all species have the same gene order, namely ectA, ectB, and ectC, with more or less the same gene size (ectA 479–557 bp; ectB 1265–1286 bp; ectC 332–398 bp).

The ectoine biosynthetic gene cluster in V. salexigens, V. panthotenthicus, B. alcalophilus, B. clausii, M. halophilus, and H. elongata consists of one operon. However, there is a gap between ectB and ectC in the annotation results of the V. salarius 19.PP.SC1.6 gene cluster, resulting in the formation of two distinct operons. H. elongata had the largest gene cluster size, with a total of 2431 bp. Meanwhile, V. pantothenticus has the smallest ectoine gene cluster size, with a total of 2186 bp.

A total of 2190 bp of V. salarius 19.PP.SC1.6 ectoine gene cluster sequences were taken from the non-coding ectA upstream region to the non-coding ectC downstream region. Predictions using BPROM and FindTerm showed that the V. salarius 19.PP.SC1.6 ectoine gene cluster contains two promoters and one terminator; a visu- alization of the gene structure is shown in Fig. 6B. The first promoter is on the first operon, specifically the 239th base, with -10 boxes on the 224th base and -35 boxes on the 200th base (Fig. 6C) with predicted transcription factors marR, gcvA, and lrp. The other promoter was predicted at base 145 in the second operon, specifically in the non-coding downstream ectB region with − 10 boxes at positions 130 and − 35 boxes at positions 111 with crp as predicted transcription factor.

Bacterial transcription terminators control gene expression in two ways: intrinsic terminators that separate transcription complexes without the help of other factors (Rho-independent) and terminators that rely on RNA helicases called Rho (Rho-dependent). It is predicted that one putative Rho-independent terminator will be formed from the two operons formed in the V. salarius 19.PP.SC1.6 ectoine gene cluster, namely in the down- stream non-coding region of ectC, starting from base 22 and until base 69.

The amino acid sequences of ectABC V. salarius 19.PP.SC1.6 and other ectoine-producing bacteria were aligned using MSA and compared for sequence identity and similarity. According to the findings, the ectA protein of V. salarius 19.PP.SC1.6 has 93.08% identity and 96.22% similarity to V. salexigens, with the highest predicted global similarity (BLOSUM62) of 0.93. The ectB gene from V. salarius 19.PP.SC1.6, like its ectA gene, has the highest identity and similarity to the ectB gene from V. salexigens, with similarity rate of 89.22% and 92.81%, respectively. Furthermore, BLOSUM 0.88. ectC V. salarius 19.PP.SC1.6 had the highest identity, similarity, and BLOSUM when aligned with ectC V. salexigens, with values of 97.65%, 99.21%, and 0.98, respectively.

After determining the sequences’ similarity and identity, a Neighbor-Joining (NJ) phylogenetic tree was constructed (Fig. 7). ectA, ectB, and ectC of V. salarius 19.PP.SC1.6 were found to be the most closely related to ectABC of V. salexigens based on phylogenetic analysis. This is supported by the results of the previous sequences’

similarity and identity, with ectABC V. salarius-V. salexigens having the highest percentage of similarity and identity. It was also known that the Virgibacillus clade and the Marinococcus-Alkalihalobacillus clade had formed, indicating that ectABC from M. halophilus had the highest similarity to the ectA genus Alkalihalobacillus, as well as the presence of outgroup H. elongata.

Ectoine gene cluster isolation

The ectABC gene amplification results indicated that the ectA gene had a size of 480 bp, the ectB gene was 1245 bp, and the ectC gene was 387 bp (Supplementary Fig. S1). These findings align with the in silico annotation results, confirming the successful isolation of the ectABC gene cluster of V. salarius 19.PP.SC1.6. To obtain the full-length sequence of these three genes, we performed pairwise alignment between the whole genome sequencing (WGS) sequence of V. salarius 19.PP.SC1.6, which had been annotated in silico, and the plasmid sequencing containing the ectA, ectB, and ectC sequences from V. salarius 19.PP.SC1.6 using the EMBOSS-Water tool.

The pairwise sequence alignment (PSA) between the plasmid sequencing and WGS ectA sequences revealed 100% similarity and sequence identity, spanning 480 bp of the ectA sequences. PSA conducted between the ectB WGS sequences and the ectB plasmid sequence exhibited 99.8% similarity and identity with a 0.1% sequence gap. Notably, two nucleotide base differences and one base substitution were observed in the ectB WGS sequence, leading to a gap in the sequence alignment results. These discrepancies between the WGS results and the plasmid sequence caused a frameshift in the ectB sequence, resulting in a stop codon occurring in the middle of the ectB gene. PSA performed on the ectC plasmid sequence against the V. salarius 19.PP.SC1.6 ectC WGS result revealed Table 3. V. salarius 19.PP.SC1.6 secondary metabolite gene clusters from antiSMASH analysis results showed the presence of seven gene clusters with cluster lengths ranging from 2190 to 84,764 bp with a total of 80 genes.

*KCBLAST (KnownCluster BLAST) is a part of the antiSMASH pipeline that aligns the query with clusters from the MiBIG database.

Cluster Start Stop Length (bp) KCBLAST* result Similarity (%) Number of genes

Terpene 671,094 691,138 20,044 6

Terpene 2,533,865 2,555,214 21,349 7

NRPS 2,789,711 2,833,924 44,213 Bacillibactin 46 4

PKS (type III) 3,235,646 3,275,709 40,063 15

Ectoine 3,905,372 3,907,562 2,190 Ectoine 75 3

PKS (type III) 3,915,988 3,956,238 40,250 12

NRPS 4,367,900 4,452,664 84,764 Locillomycin 64 30

(10)

99.7% identity and similarity with no sequence gaps. There was one nucleotide-based difference between the ectC plasmid sequence and the ectC WGS sequences, leading to a distinct amino acid sequence. In the ectC WGS sequence, Proline (P) was found, while Leucine (L) was present in the ectC plasmid sequence. These variations in sequence between the WGS and plasmid results may be attributed to differences in sample specificity during sequencing. The WGS approach employed samples from the entire V. salarius 19.PP.SC1.6 genome, resulting in a much larger number of genes and nucleotide bases being read. In contrast, plasmid sequencing utilized specific primers (T7 and SP6 promoters), resulting in a smaller and more specific set of bases being read, as only the ectABC gene was amplified.

To further confirm the ectB and ectC sequences, we conducted pairwise sequence alignments between V.

salariusectB and ectC sequences and those of closely related species, specifically V. salexigens ectB and ectC (Supplementary Figs. S2–S3). These results were then employed for visualizing the ectoine gene cluster between the plasmid and WGS sequences. Initially, in silico analysis suggested that the V. salarius 19.PP.SC1.6 ectoine gene cluster consisted of two operons, each with two promoters and one terminator (Fig. 8A). However, after isolating and sequencing ectA, ectB, and ectC, it was revealed that the V. salarius 19.PP.SC1.6 ectoine gene cluster comprised only one operon, with one promoter and one terminator (Fig. 8B).

Figure 6. Ectoine gene cluster. (A) Comparative schematic structure of ectoine biosynthetic gene organization in Virgibacillus salarius and other ectoine producing bacteria. l-2,4-diaminobutyric acid N-γ-acetyltransferase genes (ectA) are red, diaminobutyric acid transaminase genes (ectB) are yellow, and ectoine synthase (ectC) are blue. (B) Ectoine gene cluster in Virgibacillus salarius from WGS result showed two operons comprised of ectABC genes with two promoter (red box) and one terminator (green box). (C) Putative structural features of V.

bacillus ectoine biosynthetic gene cluster, consist of − 35 and − 10 boxes, with transcription start site (+ 1).

(11)

Figure 7. Phylogenetic analysis of Virgibacillus salarius’ (A) ectA, (B) ectB, and (C) ectC and other ectoine- producing bacteria deposited in the NCBI Database. Sequences were aligned using MEGA 10 and phylogenetic trees were built using the neighbor-joining (NJ) method within the MEGA 10 software. V. salarius’ ectABC amino acid sequences were closely related with V. salexigens.

(12)

Ectoine production

The results of the whole genome sequencing (WGS) and the characterization of the V. salarius 19.PP.SC1.6 ectoine biosynthetic gene cluster were further investigated to determine if the gene cluster functions as a whole to synthesize ectoine. The presence of ectoine in V. salarius 19.PP.SC1.6 incubated in Marine Zobell medium with 15% NaCl was confirmed using UPLC-MS/MS operated in MRM mode, as shown in the chromatograms (Fig. 9). This indicates that V. salarius 19.PP.SC1.6 possesses intact ectoine gene clusters and is capable of produc- ing ectoine as compatible solutes, which act as osmotic protective agents for bacterial cells. Further quantification of ectoine using our validated analytical method (Supplementary Table S1) showed that cells incubated for 30, 45, and 48 h in 10% NaCl produced 0.23, 0.46, and 0.68 mg of ectoine per liter of medium culture, respectively.

This preliminary study revealed that the ectoine yield from V. salarius 19.PP.SC1.6 is lower compared to several previously reported or commercial ectoine-producing bacteria (1170–14,860 mg/L)60–62. However, previously reported yields are the result of engineering and optimization process and cannot be compared directly. Efforts such as growth optimization and other molecular approaches, including metabolic pathway optimization, might be employed to increase the yield of ectoine from V. salarius 19.PP.SC1.6.

Conclusion

Genome characteristics such as gene annotation, number of genomic islands, pan genome types, and number of secondary metabolite gene clusters from V. salarius 19.PP.SC1.6 have been obtained. Terpene groups, NRPS, PKS (type III), and ectoine are four of the seven clusters of secondary metabolite-producing genes found. The V.

salarius 19.PP.SC1.6 ectoine biosynthetic gene cluster, which includes the ectABC gene, has been characterized, Figure 8. Comparative schematic representation of the V. salarius ectoine gene cluster structure based on (A) WGS and (B) plasmid sequencing results. Plasmid sequencing revealed that ectoine gene cluster forms one operon consisting of ectABC gene with the promoter shown in the green box and terminator shown in the red triangle.

Figure 9. Ectoine biosynthesis analysis of Virgibacillus salarius. UPLC-MRM-MS/MS map (a) The UPLC map shows spectra of intracellular ectoine extracted from V. salarius. (b) The UPLC map of the authentic standard of ectoine. The number shown in the map indicates that intracellular ectoine from V. salarius has high similarity in structure as compared to pure ectoine used as control.

(13)

and V. salarius ’ ability to produce ectoine has been identified. Overall, our study has uncovered several potential metabolites from V. salarius 19.PP.SC1.6, and this opens the opportunity for further utilization.

Data availability

The genome sequence of the V. salarius 19.PP.SC1.6 has been deposited in GenBank/EMBL/DDBJ under the accession number GCF_027941815.1.

Received: 18 February 2023; Accepted: 10 October 2023

References

1. Srinivasan, R., Kannappan, A., Shi, C. & Lin, X. Marine bacterial secondary metabolites: A treasure house for structurally unique and effective antimicrobial compounds. Mar. Drugs 19, 530 (2021).

2. Wang, Y.-N., Meng, L.-H. & Wang, B.-G. Progress in Research on bioactive secondary metabolites from deep-sea derived micro- organisms. Mar. Drugs 18, 614 (2020).

3. Yang, L. J. et al. Antimicrobial and Antioxidant Polyketides from a Deep-Sea-Derived Fungus Aspergillus versicolor SH0105. Mar Drugs 18, (2020).

4. Ruiz, B. et al. Production of microbial secondary metabolites: Regulation by the carbon source. Crit Rev Microbiol 36, 146–167 (2010).

5. Steven, R. et al. Marine microbial-derived resource exploration: Uncovering the hidden potential of marine carotenoids. Mar.

Drugs 20, 636 (2022).

6. Song, C. et al. Marine natural products: The important resource of biological insecticide. Chem. Biodivers. 18, 15 (2021).

7. Zhao, Y. & Chen, P. Natural products with health benefits from marine biological resources. Curr. Org. Chem. 18, 777–792 (2014).

8. Martins, A., Vieira, H., Gaspar, H. & Santos, S. Marketed marine natural products in the pharmaceutical and cosmeceutical industries: Tips for success. Mar. Drugs 12, 1066–1101 (2014).

9. Puttaswamygowda, G. H., Olakkaran, S., Antony, A. & Kizhakke Purayil, A. Present status and future perspectives of marine actinobacterial metabolites. in Recent Developments in Applied Microbiology and Biochemistry, 307–319 (Elsevier, 2019). https://

doi. org/ 10. 1016/ B978-0- 12- 816328- 3. 00022-2.

10. Lee, N. et al. Mini review: Genome mining approaches for the identification of secondary metabolite biosynthetic gene clusters in Streptomyces. Comput. Struct. Biotechnol. J. 18, 1548–1556 (2020).

11. Handayani, I. et al. Mining indonesian microbial biodiversity for novel natural compounds by a combined genome mining and molecular networking approach. Mar. Drugs 19, 1–10 (2021).

12. Suyadi, S. W. N., Syahailatua, A. & Arifin, Z. Banda deep-sea research: History, mission and strategic plan. IOP Conf. Ser. Earth Environ. Sci. 184, 012001 (2018).

13. Webster, G., Jones, C., Mullins, A. J. & Mahenthiralingam, E. A rapid screening method for the detection of specialised metabolites from bacteria: Induction and suppression of metabolites from Burkholderia species. J. Microbiol. Methods 178, 106057 (2020).

14. Liu, Z., Zhao, Y., Huang, C. & Luo, Y. Recent advances in silent gene cluster activation in streptomyces. Front. Bioeng. Biotechnol.

9, 632230 (2021).

15. Albarano, L., Esposito, R., Ruocco, N. & Costantini, M. Genome mining as new challenge in natural products discovery. Mar.

Drugs 18, 199 (2020).

16. Belknap, K. C., Park, C. J., Barth, B. M. & Andam, C. P. Genome mining of biosynthetic and chemotherapeutic gene clusters in Streptomyces bacteria. Sci. Rep. 10, 58904 (2020).

17. Ren, H., Shi, C. & Zhao, H. Computational tools for discovering and engineering natural product biosynthetic pathways. Science 23, 100795 (2020).

18. Paoli, L. et al. Biosynthetic potential of the global ocean microbiome. Nature 607, 111–118 (2022).

19. Yang, Z., He, J., Wei, X., Ju, J. & Ma, J. Exploration and genome mining of natural products from marine Streptomyces. Appl.

Microbiol. Biotechnol. 104, 67–76 (2020).

20. Graf, R., Anzali, S., Buenger, J., Pfluecker, F. & Driller, H. The multifunctional role of ectoine as a natural cell protectant. Clin.

Dermatol. 26, 326–333 (2008).

21. Czech, L. et al. Role of the extremolytes ectoine and hydroxyectoine as stress protectants and nutrients: Genetics, phylogenomics, biochemistry, and structural analysis. Genes 9, 177 (2018).

22. Kunte, H., Lentzen, G. & Galinski, E. Industrial production of the cell protectant ectoine: Protection mechanisms, processes, and products. Curr. Biotechnol. 3, 10–25 (2014).

23. García-Estepa, R. et al. The ectD gene, which is involved in the synthesis of the compatible solute hydroxyectoine, is essential for thermoprotection of the halophilic bacterium Chromohalobacter salexigens. J. Bacteriol. 188, 3774–3784 (2006).

24. Werkhäuser, N., Bilstein, A. & Sonnemann, U. Treatment of allergic rhinitis with ectoine containing nasal spray and eye drops in comparison with azelastine containing nasal spray and eye drops or with cromoglycic acid containing nasal spray. J. Allergy 2014, 1–13 (2014).

25. Ayuningrum, D., Muchlissin, S. I., Trianto, A., Radjasa, O. K. & Sabdono, A. Crude extract from a hardcoral-associated bacterium Virgibacillus salarius PHC-44–04 inhibiting growth of multidrug-resistant Enterobacter aerogenes human pathogen. Biofarmasi J. Nat. Product Biochem. 18, 205 (2020).

26. Insafitri, H. N. & Nugraha, W. A. Bacteria associated with healthy and white syndrome diseased Acropora collected from Gili Labak Island Sumenep and their antibacterial activity. J. Allergy https:// doi. org/ 10. 1063/5. 00027 41 (2020).

27. Kristiana, R. et al. Bioactivity of compounds secreted by symbiont bacteria of Nudibranchs from Indonesia. PeerJ 2020, 1–10 (2020).

28. Kusmita, L. et al. Bioactivity of carotenoid produced by soft coral symbiotic microorganisms from panjang and karimunjawa island, Centra Java, Indonesia. Biodiversitas 22, 732–740 (2021).

29. Ondov, B. D. et al. Mash: Fast genome and metagenome distance estimation using MinHash. Genome Biol. 17, 997 (2016).

30. Jain, C., Rodriguez-R, L. M., Phillippy, A. M., Konstantinidis, K. T. & Aluru, S. High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries. Nat. Commun. 9, 5114 (2018).

31. Meier-Kolthoff, J. P., Carbasse, J. S., Peinado-Olarte, R. L. & Göker, M. TYGS and LPSN: A database tandem for fast and reliable genome-based classification and nomenclature of prokaryotes. Nucleic Acids Res. 50, D801–D807 (2022).

32. Emms, D. M. & Kelly, S. OrthoFinder: Phylogenetic orthology inference for comparative genomics. Genome Biol. 20, 1832 (2019).

33. Letunic, I. & Bork, P. Interactive tree of life (iTOL) v5: An online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 49, W293–W296 (2021).

34. Stothard, P. & Wishart, D. S. Circular genome visualization and exploration using CGView. Bioinformatics 21, 537–539 (2005).

35. Contreras-Moreira, B. & Vinuesa, P. GET_HOMOLOGUES, a versatile software package for scalable and robust microbial pange- nome analysis. Appl. Environ. Microbiol. 79, 7696–7701 (2013).

(14)

36. Bertelli, C. et al. IslandViewer 4: Expanded prediction of genomic islands for larger-scale datasets. Nucleic Acids Res. 45, W30–W35 (2017).

37. Blin, K. et al. AntiSMASH 6.0: Improving cluster detection and comparison capabilities. Nucleic Acids Res. 49, W29–W35 (2021).

38. Kanehisa, M., Furumichi, M., Tanabe, M., Sato, Y. & Morishima, K. KEGG: New perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res. 45, D353–D361 (2017).

39. Flissi, A. et al. Norine: Update of the nonribosomal peptide resource. Nucleic Acids Res. 48, D465–D469 (2020).

40. Pogodin, P. V. et al. AntiBac-Pred: A web application for predicting antibacterial activity of chemical compounds. J. Chem. Inf.

Model 59, 4513–4518 (2019).

41. Tamura, K., Stecher, G. & Kumar, S. MEGA11: Molecular evolutionary genetics analysis version 11. Mol. Biol. Evol. 38, 3022–3027 (2021).

42. Reche, P. Sequence Identities and Similarities. http:// imed. med. ucm. es/ Tools/ sias. html (2013).

43. Liu, W. et al. IBS: An illustrator for the presentation and visualization of biological sequences. Bioinformatics 31, 3359–3361 (2015).

44. Reese, M. G. Application of a time-delay neural network to promoter annotation in the Drosophila melanogaster genome. Comput.

Chem. 26, 51–56 (2001).

45. Solovyev, V. Automatic annotation of microbial genomes and metagenomic sequences. in Metagenomics and its Applications in Agriculture, Biomedicine and Environmental Studies, 61–78 (Nova Science Publishers, 2011).

46. Hua, N.-P., Hamza-Chaffai, A., Vreeland, R. H., Isoda, H. & Naganuma, T. Virgibacillus salarius sp. nov., a halophilic bacterium isolated from a Saharan salt lake. Int. J. Syst. Evol. Microbiol. 58, 2409–2414 (2008).

47. Kim, M., Oh, H.-S., Park, S.-C. & Chun, J. Towards a taxonomic coherence between average nucleotide identity and 16S rRNA gene sequence similarity for species demarcation of prokaryotes. Int. J. Syst. Evol. Microbiol. 64, 346–351 (2014).

48. Meier-Kolthoff, J. P., Auch, A. F., Klenk, H.-P. & Göker, M. Genome sequence-based species delimitation with confidence intervals and improved distance functions. BMC Bioinform. 14, 60 (2013).

49. Mann, S. & Chen, Y.-P.P. Bacterial genomic G+C composition-eliciting environmental adaptation. Genomics 95, 7–15 (2010).

50. Costa, S. S., Guimarães, L. C., Silva, A., Soares, S. C. & Baraúna, R. A. First steps in the analysis of prokaryotic pan-genomes.

Bioinform. Biol. Insights 14, 117793222093806 (2020).

51. Tettelin, H. et al. Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: Implications for the microbial “pan- genome”. Proc. Natl. Acad. Sci. 102, 13950–13955 (2005).

52. Willenbrock, H., Hallin, P. F., Wassenaar, T. M. & Ussery, D. W. Characterization of probiotic Escherichia coli isolates with a novel pan-genome microarray. Genome Biol. 8, R267 (2007).

53. Brockhurst, M. A. et al. The ecology and evolution of pangenomes. Curr. Biol. 29, R1094–R1103 (2019).

54. Vernikos, G., Medini, D., Riley, D. R. & Tettelin, H. T. Ten years of pan-genome analyses. Curr. Opin. Microbiol. 23, 148–154 (2015).

55. Langille, M. G. I., Hsiao, W. W. L. & Brinkman, F. S. L. Detecting genomic islands using bioinformatics approaches. Nat. Rev.

Microbiol. 8, 373–382 (2010).

56. Austin, M. B. & Noel, J. P. The chalcone synthase superfamily of type III polyketide synthases. Nat. Prod. Rep. 20, 79–110 (2003).

57. Strauss, E. Coenzyme A biosynthesis and enzymology. in Comprehensive Natural Products II, 351–410 (Elsevier, 2010). https://

doi. org/ 10. 1016/ B978- 00804 5382-8. 00141-6.

58. Reddy, L. H. & Couvreur, P. Squalene: A natural triterpene for use in disease management and therapy. Adv. Drug Deliv. Rev. 61, 1412–1426 (2009).

59. Pearson, A. & Rusch, D. B. Distribution of microbial terpenoid lipid cyclases in the global ocean metagenome. ISME J. 3, 352–363 (2009).

60. Putu Parwata, I., Wahyuningrum, D., Suhandono, S. & Hertadi, R. Production of ectoine by Halomonas elongata BK-AG25 using osmotic shock technique. IOP Conf. Ser. Earth Environ. Sci. 209, 012017 (2018).

61. Lang, Y., Bai, L., Ren, Y., Zhang, L. & Nagata, S. Production of ectoine through a combined process that uses both growing and resting cells of Halomonas salina DSM 5928T. Extremophiles 15, 303–310 (2011).

62. Onraedt, A. E., Walcarius, B. A., Soetaert, W. K. & Vandamme, E. J. Optimization of ectoine synthesis through fed-batch fermenta- tion of Brevibacterium epidermis. Biotechnol. Prog. 21, 1206–1212 (2005).

Acknowledgements

We thank the 2022 Institut Teknologi Bandung Priority Research (Contract No. 223/IT1.B07.1/TA.00/2022) for Fenny M. Dwivany et al. for supporting this research.

Author contributions

O.K.R. and F.M.D. concepted and designed the research. H.N., R.S., and Z.H., performed identification and taxonomic analysis, genome analysis, and gene cluster characterization. L.K. isolated the bacteria. T.K. and Z.H.

isolated ectoine gene cluster. N.P.K., and F.A.P.B.S. extracted ectoine from V. salarius. A.C. performed ectoine identification. O.K.R., F.M.D., J.P.T., A.D., M.R.M., and V.S.H.S. supervised, reviewed, and revised the manuscript.

Z.H., R.S., and Y.N. drew the figures in the manuscript. All authors have read and agreed to the final version of the manuscript.

Competing interests

The authors declare no competing interests.

Additional information

Supplementary Information The online version contains supplementary material available at https:// doi. org/

10. 1038/ s41598- 023- 44603-8.

Correspondence and requests for materials should be addressed to O.K.R. or F.M.D.

Reprints and permissions information is available at www.nature.com/reprints.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

(15)

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http:// creat iveco mmons. org/ licen ses/ by/4. 0/.

© The Author(s) 2023

Referensi

Dokumen terkait