2023, Vol. 13, No. 1, 81 – 92
http://dx.doi.org/10.11594/jtls.13.01.09
How to cite:
Wiprayoga IPP, Meitha K, Dwivany FM (2023) In Silico Characterization of Lycopene Beta Cyclase (LCYB) and Lycopene Research Article
In Silico Characterization of Lycopene Beta Cyclase (LCYB) and Lycopene Epsilon Cyclase (LCYE) Genes from DH-Pahang (Musa acuminata, A Genome) and DH- PKW (Musa balbisiana, B Genome)
I Putu Prakasa Wiprayoga, Karlia Meitha, Fenny M. Dwivany *
School of Life Sciences and Technology, Institut Teknologi Bandung, Bandung 40132, Indonesia
Article history:
Submission February 2022 Revised February 2022 Accepted September 2022
ABSTRACT
Alpha (α) and beta (β)-carotene are among the nutritious contents of bananas, with the unique feature of a high ratio of α-/β-carotene. Research on the gene and protein of the key enzymes determining the ratio of α-/β-carotene in bananas, namely lycopene beta cyclase (LCYB) and lycopene epsilon cyclase (LCYE), is currently not well de- fined. Hence, this study aimed to compare the characters of the LCYB and LCYE genes and their putative proteins from Musa acuminata 'DH-Pahang' and Musa balbisi- ana 'DH-PKW'. The corresponding nucleotide sequences from both species were aligned to detect similarities in the gene structure. Their protein products were char- acterized at the primary and tertiary levels. The phylogenetic tree was constructed based on nucleotide and protein sequences. The result showed that the gene structure between these two species is similar in LCYB in chromosome 9 but different in LCYB in chromosome 7 and LCYE. The presence of cis-acting regulatory elements in response to light dominated the 2000 nucleotide region of the 5'UTR of LCYB and LCYE genes in both species. Based on protein alignment and domain analysis, the NADB_Rossmann superfamily domain was detected in both LCYB and LCYE.
Alignment of the three-dimensional protein structure showed a significant difference between MaLCYB.c07 and MbLCYB.c07 only. The phylogenetic tree based on pro- tein sequences indicated the distant relationship of MaLCYB.c07 and MbLCYB.c07 with other LCYB ingroup OTUs. The results of this study could provide a molecular basis related to the exploration of bananas as a promising functional food to meet the needs of provitamin A.
Keywords: LCYB, LCYE, Musa acuminata, Musa balbisiana, provitamin A
*Corresponding author:
E-mail: [email protected]
Introduction
Vitamin A deficiency (VAD) has been causing serious health problems, especially in poor and de- veloping countries. Data on the prevalence of VAD in the period of 1995-2005 showed a mod- erate-severe problem status (>10%) in preschool children in 122 countries [1]. Moreover, VAD in Asia and Africa also increases mortality rates in children and pregnant women [2, 3].
The discovery of banana cultivars with high provitamin A carotenoids (pVACs) piqued the in- terest to promote bananas as a functional food in treating VAD cases [4]. Various commonly eaten banana cultivars were derived from Musa acu-
minata (A genome) and Musa balbisiana (B ge- nome). Research in 2009 showed that consump- tion of 7 banana cultivars imported from the Pa- cific Islands, with a pure composition of A ge- nome (AA and AAA) and a combination of A and B genomes (AAB and ABB), successfully im- proved vitamin A intake of vulnerable population in Africa [3]. Hence, the distinctive content of pVACs in bananas with pure A or B genomes and their combination becomes an interesting basis for studying the genes involved in the biosynthesis of carotenoids in each genome.
Alpha (α) and beta (β)-carotene are important compounds in the pVACs group. In the biosyn- thetic pathway (figure 1), these two compounds are synthesized from the cyclization reaction, an essential pathway that generates carotenoid diver- sity with different cyclic end groups. LCYE and LCYB compete on lycopene substrates to form α- carotene or b-carotene. At the pathway catalyzed only by LCYB, molecules with two beta rings are formed, namely b-carotene. In another branching pathway, a reaction catalyzed simultaneously by LCYB and LCYE resulted in the formation of beta and epsilon ring molecules, namely α -carotene [5]. Thus, the relative activity of these two en- zymes will determine the ratio of α/β-carotene [6].
It is also known that the conversion efficiency of β-carotene into vitamin A is higher than α-caro- tene [7].
D'Hont et al. [8] and Wang et al. [9] have car- ried out whole genome sequencing of M. acu- minata 'DH-Pahang' and M. balbisiana 'DH- PKW', respectively. However, the characteriza- tion of LCYB and LCYE genes and proteins in these two genomes have not been well defined.
Therefore, this study compares the characters of LCYB and LCYE genes and their putative proteins from M. acuminata 'DH-Pahang' and M. balbisi-
ana 'DH-PKW'. The results suggested key regula- tory areas and domains of the enzymes for future improvement of pVACs content in banana.
Material and Methods
Sequence retrieving and gene identification Nucleotide sequences annotated by KEGG (www.genome.jp/kegg/) [10] database were used as the reference for determining LCYB and LCYE genes. Each coding sequence (CDS) was retrieved from KEGG and served as the query for the Basic Local Alignment Search Tool (BLAST) feature on the Banana Genome Hub database (https://banana- genome-hub.southgreen.fr/) [11]. BLASTN was conducted on the whole genome of M. acuminata 'DH-Pahang' (v4.3) for DH-Pahang, and whole ge- nome annotation of M. balbisiana 'DH-PKW' (v1.1) for DH-PKW. The sequences with identity percentage higher than 90% was then identified as LCYB or LCYE gene sequences.
Gene structure prediction
Gene structure prediction was performed with FGENESH+(www.softberry.com/berry.phtml?to- pic=fgenesh&group=help&subgroup=gfind) thro- ugh ab-initio principle [12]. Furthermore, Scipio was used to confirm the exon-intron junctions [13]. Gene structure prediction in the form of the length of nucleotide sequence of genes from the transcription start site (TSS) to the poly-A tail, the number of exons and introns, and the position of each exon and intron, were then visualized with Il- lustrator for Biological Sequences (IBS) software (ibs.biocuckoo.org/) [14]. To find the percentage of identity and similarity between two sequences, a global pairwise sequence alignment was per- formed using the Emboss Needle in the region of open reading frame (ORF) [15].
Analysis of Cis-Acting Regulatory Element (CAREs)
The presence of cis-acting regulatory elements (CAREs) in the upstream of 2000 nucleotides (5') relatives to the start codon (ATG) of the first exon of each gene was detected using the PlantCARE tool (https://bio.tools/plantcare) [16]. The identi- fied CAREs motifs were then grouped according to their function related to hormonal regulations and abiotic stresses. The frequency of occurrence of CAREs in DH-Pahang and DH-PKW was cal- culated for each gene.
Figure 1. Carotene biosynthesis pathway. GGDP:
geranylgeranyl diphosphate; PSY: phy- toene synthase; PDS: phytoene desaturase;
ZDS: z-carotene desaturase; CrtISO: ca- rotenoid isomerase; LCYE: lycopene - cyclase; LCYB: lycopene -cyclase) (Adapted from Badejo [5].
Analysis of protein motif and domain
The predictive protein sequences obtained from FGENESH+ were analyzed to find motifs and domains [12]. Domains were discovered using the NCBI Conserved Domain Search tool (www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi) [17]. The identified domains were then visualized with Illustrator for Biological Sequences (IBS) software.
The search for motifs was carried out based on common expression principle, with reference to the literature. Initially, protein sequences from four other plants were obtained by performing BLASTP using queries from DH-PKW in the GenBank database (blast.ncbi.nlm.nih.gov/
Blast.cgi) [18]. Next, multiple sequence alignment (MSA) was performed with the muscle algorithm.
Alignment results with the marked motifs were visualized using GeneDoc software [19].
Prediction of Three-Dimensional Protein Struc- ture
The protein sequences were submitted on the i-TASSER webpage (https://zhanglab.dcmb.med.
umich.edu/I-TASSER/) to find the predictive three-dimensional (tertiary) structure of the pro- tein [20]. In this study, we used multiple threading alignment and iterative structural assembly ap- proach. Table 1 summarizes templates for struc- tural modelling. The selected three dimensional structure is the model with the highest C-score. To make comparisons between three-dimensional structures, the root means square deviation (RMSD) value displayed by PyMol (pymol.org/2/) was used, with an RMSD value
<3Å indicating that the two aligned proteins are similar [21].
Table 1. Template for each protein.
Organism Protein Template (PDB)
DH- Pahang
MaLCYB.c09 3ATQ MaLCYB.c07
MaLCYE
7CTQ 3ATQ
DH-PKW
MbLCYB.c09 3ATQ MbLCYB.c07
MbLCYE
3ATQ 3ATQ
Construction of phylogenetic tree
Phylogenetic trees were constructed based on CDS and protein sequences to examine the rela- tionship between LCYB and LCYE in DH-Pahang
and DH-PKW, relative to related genes in selected plants. CDS from other plants were obtained by performing BLASTP using queries from DH-
PKW in the GenBank database
(blast.ncbi.nlm.nih.gov/Blast.cgi) [18]. Ten se- quences with the highest max score were selected as operational taxonomic units (OTUs). The CDS of these ten species were then used to retrieve pro- tein sequences to build a second type of phyloge- netic tree.
The phylogenetic tree construction was carried out in MEGA X software [22]. Initially, multiple sequence alignment (MSA) was performed with the muscle algorithm, and sequence trimming was performed as necessary. The phylogenetic tree was constructed using the maximum likelihood (ML) algorithm, with a bootstrap of 1000.
Results and Discussion
Determination of LCYB and LCYE genes in DH- Pahang and DH-PKW
A total of six LCYB and LCYE loci were found in DH-Pahang and DH-PKW. In DH-Pahang, two LCYB loci were identified in chromosome 9 (MaL- CYB.c09) and chromosome 7 (MaLCYB.c07), and one LCYE locus in chromosome 1 (MaLCYE). In DH-PKW, two LCYB loci were found, which were also in chromosome 9 (MbLCYB.c09) and 7 (MbLCYB.c07), as well as one LCYE locus in chromosome 1 (MbLCYE).
Structure Prediction of LCYB and LCYE genes In DH-Pahang, the sequence length of MaL- CYB.c09 and MaLCYB.c07 gene were 2514 and 2262 base pairs, respectively. In DH-PKW, the MbLCYB.c09 and MbLCYB.c07 gene had se- quence length of 2639 and 3141 base pairs, respec- tively. Comparison of the LCYB gene structure in DH-Pahang and DH-PKW is displayed in figure 2 (a).
The LCYB genes in chromosome 9, MaL- CYB.c09 and MbLCYB.c09, each consists of one exon. After conducting pairwise sequence align- ment, the identity percentage value was 99% be- tween MaLCYB.c09 and MbLCYB.c09. This very high identity percentage indicates conserved nu- cleotide sequences.
In DH-Pahang, the MaLCYB.c07 gene was de- tected to have two exons and one intron, whereas in DH-PKW, the MbLCYB.c07 gene had three ex- ons and two introns. Confirmation by Scipio re- vealed that there was no confirmed splicing site between exon-intron junctions in the MaL- CYB.c07 gene. Fu et al. [6] found the presence of a premature stop codon at the end of the first exon
of the MaLCYB.c07 gene, which caused this tran- script produces a nonfunctional protein. Hence, the CDS and protein sequence of the MaLCYB.c07 gene were only derived from the first exon (marked with the green box in figure 3). Pairwise sequence alignment between MaLCYB.c07 and MbLCYB.c07 showed the identity percentage value of 94.8%, indicating highly conserved nu- cleotide sequences.
(a) (b)
Figure 2. Comparison of exons (green box) and intron (gray line) of LCYB (a) and LCYE (b). Blue triangle is TSS, red rhombus is poly-A tail. Yellow box in MaLCYB.c07 is an unconfirmed exon. There is a scale difference in gene length between figure (a) dan (b).
Figure 3. Further analysis of the MaLCYB.c07 ORF. In the inset, the green box indicates the first exon, the yellow box indicates the second exon.
For the LCYE gene, one locus was found in DH-Pahang and DH-PKW. The sequence length of MaLCYE and MbLCYE were 9730 and 9381 base pairs, respectively. Comparison of the struc- ture of the MaLCYE and MbLCYE genes is dis- played in Figure 2 (b). Prediction of the LCYE gene between DH-Pahang and DH-PKW showed difference in the number of exons-introns. In the MaLCYE gene, there were eleven exons and ten introns. In the MbLCYE gene, there were twelve exons and eleven introns. The comparison of sim- ilarities between the MaLCYE and MbLCYE genes showed an identity percentage of 80.2%.
Cis-Acting Regulatory Elements (CAREs) of LCYB dan LCYE
Differences in the frequency of occurrence of CAREs were observed on LCYB DH-Pahang and DH-PKW in chromosome 9 and chromosome 7, as shown in Figure 4 (a). Furthermore, specific CAREs motifs are shown in Table 2. The predom- inance of the CAREs motifs that responsible for the light response was observed in all LCYB of both species. The highest abundance of specific CAREs motifs for light response in LCYB in chro- mosome 9 of both species was Box-4, whereas in LCYB in chromosome 7 was G-box.
(a)
(b)
Figure 4. Frequency of CAREs in LCYB (a) and LCYE (b) related to hormonal and stress responses. Chromo- some 9 DH-Pahang and DH-PKW refers to MaLCYB.c09 and MbLCYB.c09, respectively; chromo- some 7 DH-Pahang and DH-PKW refers to MaLCYB.c07 and MbLCYB.c07, respectively. DH-Pahang and DH-PKW refers to MaLCYE and MbLCYE, respectively.
For the LCYE gene, difference in the fre- quency of occurrence of the CAREs were also ob- served between DH-Pahang and DH-PKW, as can be seen in Figure 4 (b), and the specific CAREs motifs in Table 2. The appearance of the CAREs motifs responsible for the light response also pre- dominates in the LCYE of both species. The high- est abundance of specific CAREs motif on MaL- CYE for light response was the Box-4, whereas in MbLCYE was the G-box.
The presence of CAREs that respond to abiotic stress indicates the role of environmental factors in the regulation of LCYB and LCYE gene expres- sion. The predominance of light-responding CAREs in LCYB and LCYE of both species indi- cates an important role of light for increasing gene expression in carotenoid metabolism related to the function of carotenoids in photo-inhibition and photo-oxidation prevention [23]. Plants that are subjected to anaerobic stress will produce com- pounds protecting their tissues from damage caused by abnormalities in oxygen-dependent re- actions. Drought, like other types of stress, can
lead to accumulation of reactive oxygen species (ROS), and controlling of LCYB and LCYE gene expression in drought stress can regulate certain carotenoid levels for protection against these ROS [24].
Auxin, gibberellin, abscisic acid, and MeJa are hormones that play important roles in the expres- sion of various genes in carotenoid metabolism.
Auxins have been shown to interact in signaling pathways involving light in the regulation of ca- rotenoid levels [25]. In the study of Moreno et al.
[26], it was known that LCYB gene expression in Daucus carota induces positive feedback on gib- berellin-associated genes. Abscisic acid is a caro- tene-derived compound that acts as an important hormone in plant stress responses involving carot- enoids [27]. MeJa administration is known to moderately increase LCYB gene expression in Co- riandrum sativum [28].
Domain and motif of LCYB and lcye putative proteins
Table 2. Abundance of specific CAREs motifs in LCYB and LCYE Respond to Motif
Abundance MaLCYB
.c07
MaLCYB .c09
MbLCYB .c07
MbLCYB
.c09 MaLCYE MbLCYE
Auxin TGA-box 1 0 0 0 1 1
TGA-element 1 1 1 2 0 0
Gibberellin P-box 0 0 1 0 1 1
TATC-box 0 1 0 0 0 0
Absicic Acid ABRE 12 1 11 1 3 1
Methyl Jasmonate
CGTCA-motif 2 1 1 0 2 0
TGACG-motif 2 1 1 0 2 0
Light
AE-box 0 0 0 0 1 0
3-AF1 1 0 0 0 0 0
ATCT-motif 0 1 0 0 0 0
Box 4 0 3 0 2 5 0
chs-CMA2a 0 1 0 1 0 1
GA-motif 1 0 1 0 0 0
GATA-motif 2 0 2 0 0 1
GATT-motif 0 1 0 1 0 0
G-box 11 0 10 0 4 3
GT1-motif 0 1 0 0 2 2
I-box 1 0 1 0 0 0
LAMP-element 0 1 0 1 0 0
MRE 1 0 1 0 0 0
Sp1 3 0 3 0 0 0
GTGGC-motif 0 0 0 0 0 1
TCCC-motif 1 1 1 1 0 0
TCT-motif 1 0 1 0 0 2
Anaerobic ARE 2 1 3 2 2 4
GC-motif 1 0 1 0 1 3
Drought MBS 0 1 0 1 0 1
Pairwise sequence alignment between the pu- tative protein sequences of MaLCYB.c09 and MbLCYB.c09 showed the percentages of identity and similarity of 99.4% and 99.8%, respectively.
Furthermore, the percentages of identity and simi- larity between MaLCYB.c07 and MbLCYB.c07
were 19.2% and 21.0%, respectively, indicating the sequences are not conserved. The indication of Figure 5. Domain protein structure of LCYB (a) and LCYE (b). Domain is shown with blue box.
Figure 6. MSA for finding motifs in LCYB. Black-highlighted amino acids indicate highly conserved sequence.
The lighter the highlight, the more variable the amino acids are.
Figure 7. MSA for finding motifs in LCYE. Black-highlighted amino acids indicate highly conserved sequence.
The lighter the highlight, the more variable the amino acids are.
high percentage of gaps (75.2%) was due to the truncated protein of MaLCYB.c07 missing the C- terminal region. The structure of the LCYB puta-
tive protein domain can be seen in Figure 5 (a).
The presence of a domain from the NADB_Rossmann superfamily was indicated in all LCYB protein sequences. The protein members of the NADB_Rossmann superfamily have a Rossmann-fold NADPH/NADP (+) binding (NADB) domain in their three-dimensional struc- ture [29]. A specific hit with the accession code PLN02463 was detected on MaLCYB.c09 and MbLCYB.c09 proteins, whereas no specific hit to MaLCYB.c07 and MbLCYB.c07. The accession code PLN02463 refers to lycopene beta cyclase
protein. The absence of specific hits for the MaL- CYB.c07 and MbLCYB.c07 necessitated further confirmation at the motif level.
Pairwise sequence alignment between the pu- tative protein sequences of MaLCYE and MbLCYE showed the percentage of identity and similarity of 93.7% and 94.4%, respectively, indi- cating a conserved protein’s primary structure.
The structure of the LCYE protein domains can be seen in Figure 5 (b).
As with LCYB proteins, there was the domain from the NADB_Rossmann superfamily in both LCYE proteins. However, different hit, with the accession code PLN02697, was detected in these two LCYE protein sequences, which belong to ly- copene epsylon cyclase protein.
(a) (b) (c)
Figure 8. The alignment of three-dimensional structure of LCYB from chromosome 9 (a), LCYB from chro- mosome 7 (b), and LCYE (c).
Figure 9. Phylogenetic tree based on CDS (a) and protein sequences (b) of LCYB. Red dashed box indicates clade of DH-Pahang and DH-PKW CDS or protein.
(a) (b)
Motifs analysis was carried out with reference to Cunningham et al. [30]. For LCYB proteins, 5 motifs were tried to be identified, namely plant conserved region, di-nucleotide binding site, cyclase motif I (CMI), cyclase motif II (CMII),
and charged region. Plant conserved region is a unique motif that is only found in plant LCYB pro- teins, but not in cyanobacteria. The presence of this motif at the N-terminal of LCYB indicates the importance of this motif in LCYB localization in plastids [30]. The di-nucleotide binding site motif with identity of GXGXXG, is important in binding to the NADP, NAD, ADP, and FAD cofactors.
These cofactors are important in stabilizing the carbonium intermediates formed during the lyco- pene cyclization process [30]. Lastly, the CMI mo- tif, CMII, and the charged region are important in catalytic function and substrate binding. Align- ment along with four LCYB protein sequences from other species to find these motifs is shown in Figure 6.
MaLCYB.c09 and MbLCYB.c09 own all mo- tifs. This predicts that these two LCYB proteins from chromosome 9 are fully functional. On an- other hand, the absence of the CMI, CMII, and charged region motifs in the MaLCYB.c07 puta-
tive protein will greatly affect the catalytic func- tion of this protein. Furthermore, the absence of the plant conserved region and di-nucleotide bind- ing site motif in the MbLCYB.c07 putative protein may indicate differences in the structure or locali-
zation of this protein compared to functional LCYB proteins.
Moreover, we investigated the presence of five motifs in LCYE proteins, namely dinucleotide binding site, cyclase motif I (CMI), cyclase motif II (CMII), charged region, and ring switch [30, 31]. As shown in Figure 7, both LCYE proteins were indicated to have these five motifs. Similar to LCYB proteins, the dinucleotide-binding site motif is important in stabilizing the structure of LCYE, whereas the CMI, CMII, and charged re- gion motifs are important for catalytic function and substrate binding. The ring switch motif is a region that plays a role in determining the number of rings added in the cyclization of lycopene into certain types of carotenoids [31]. Ring switch mo- tif with identity of XLXXXX indicates the cy- clization carried out by MaLCYE and MbLCYE will form α-carotene, which has one β-ring and one ε-ring.
Figure 10. Phylogenetic tree based on CDS (a) and protein sequences (b) of LCYE. Red dashed box indicates clade of DH-Pahang dan DH-PKW CDS or protein.
(a) (b)
Three-Dimensional structure of LCYB and LCYE putative proteins
To date, the crystal structure of any LCYB protein of any species has not been reported [32].
Through i-TASSER, three-dimensional structure prediction can be done by multiple threading alignment and iterative structural assembly based on their similarity to other better known proteins in database [20]. A comparative analysis by per- forming a three-dimensional structural alignment between the paralogous LCYB proteins is shown in Figure 8 (a) and 8 (b).
The RMSD value for MaLCYB.c09 and MbLCYB.c09 protein comparison was 0.596, in- dicating these two paralog proteins are structurally similar. The main dissimilarity was observed in the loop structure. Conversely, the RMSD value for MaLCYB.c07 and MbLCYB.c07 protein com- parison was 17.203, indicating these two proteins are not structurally similar.
As with LCYB, the crystal structure of the LCYE protein of any species has not been re- ported. The predictive three-dimensional struc- tural alignment for LCYE proteins in DH-Pahang and DH-PKW can be seen in Figure 8 (c).
Comparative analysis of the three-dimensional structure between MaLCYE and MbLCYE pro- teins was shown with an RMSD value of 1.025, indicating these two proteins are similar. The main difference lies in the loop structure.
Phylogenetic trees of LCYB and LCYE based on CDS and protein sequences
The presence of all LCYB CDS of DH-Pahang and DH-PKW in an ingroup along with the CDS of the other species, as shown in Figure 9 (a), confirmed that these CDS were correctly belong to a group of LCYB genes. CDS of MaLCYB.c09 is placed in the same clade with MbLCYB.c09; as well as for CDS of MaLCYB.c07 and MbLCYB.c07. These results indicate that the LCYB gene duplication event most likely oc- curred in the common ancestor of DH-Pahang and DH-PKW, before the two species started to sepa- rate. The long branch length that separated the MaLCYB.c07 and MbLCYB.c07 clade from other clades indicates a distant divergence of this clade.
The analysis was continued by constructing a phylogenetic tree based on LCYB protein se- quences, as shown in figure 9 (b). LCYB proteins were also grouped according to their chromosomal location. MaLCYB.c09 and MbLCYB.c09 protein are in a monophyletic group along with other
LCYB from Musa troglodytarum and Ensete ven- tricosum. The distantly located clade of MaL- CYB.c07 and MbLCYB.c07 is caused by the ab- sence of an important motif in the primary struc- ture of MaLCYB.c07 and MbLCYB.c07 proteins.
The presence of MaLCYE and MbLCYE clade in an ingroup along with CDS of other species, as shown in figure 10 (a), confirms that these CDS were correctly belong to a group of LCYE genes.
This tree also concluded that the LCYE from Musa have common ancestor close to the clade from Elaeis guineensis, Phoenix dactylifera, and Ananas comosus.
The presence of MaLCYE and MbLCYE pro- teins in an ingroup along with LCYE proteins of other species, as shown in Figure 10 (b), confirms that structurally, these proteins are similar to LCYE proteins of other species. This tree also concluded that the LCYE proteins from Musa have recent common ancestor with Dendrobium catenatum, Apostasia shenzhenica, and Dioscorea cayenensis.
Conclusion
The gene structure between DH-Pahang (M.
acuminata, A Genome) and DH-PKW (M. balbi- siana, B Genome) was observed to be similar in LCYB in chromosome 9, but different in LCYB in chromosome 7 and LCYE. CAREs that respond to light dominate in all LCYB and LCYE in both spe- cies. LCYB and LCYE proteins of the two species have similarities in the appearance of the NADB_Rossmann superfamily, but different in motifs. Three-dimensional structure showed sig- nificant difference only detected between MaL- CYB.c07 and MbLCYB.c07. For future research, determining the effect of light treatment to the ex- pression of LCYB and LCYE and content of provit- amin A carotenoids in both A and B bananas ge- nome could be an interest of future research.
Moreover, the research to enhance the production of β-carotene in these both genomes might be started from overexpressing the functional LCYB proteins or downregulating the expression of LCYE proteins.
Acknowledgment
This study was funded by Institut Teknologi Bandung Priority Research KK A 2021 for Fenny M. Dwivany.
References
1. World Health Organization (WHO) (2009) Global Preva- lence of Vitamin A Deficiency in Populations at Risk 1995-2005: WHO Global Database on Vitamin A Defi- ciency. Geneva, WHO Press.
2. von Lintig J (2012) Provitamin a metabolism and func- tions in mammalian biology. The American Journal of Clinical Nutrition 96 (5): 1234–1244. doi:
10.3945/ajcn.112.034629.
3. Fongar A, Nabuuma D, Ekesa B (2020) Promoting (Pro) Vitamin A-Rich Bananas: A Chronology. Kampala (Uganda), The Alliance of Bioversity and CIAT.
4. Englberger L, Schierle J, Aalbersberg W et al. (2006) Ca- rotenoid and vitamin content of Karat and other Microne- sian banana cultivars. International Journal of Food Sci- ences and Nutrition 57 (5–6): 399–418. doi:
10.1080/09637480600872010.
5. Badejo AA (2018) Elevated carotenoids in staple crops:
The biosynthesis, challenges and measures for target de- livery. Journal of Genetic Engineering and Biotechnology 16 (2): 553–562. doi: 10.1016/j.jgeb.2018.02.010.
6. Fu X, Cheng S, Feng C et al. (2019) Lycopene cyclases determine high α-/β-carotene ratio and increased carote- noids in bananas ripening at high temperatures. Food Chemistry 283 131–140. doi: 10.1016/j.food- chem.2018.12.121.
7. Nagarajan J, Ramanan RN, Raghunandan ME et al.
(2017) Carotenoids. In: Galanakis CM, ed. Nutraceutical and functional food components. Academic Press. 259–
296.
8. D’Hont A, Denoeud F, Aury J-M et al. (2012) The banana (Musa acuminata) genome and the evolution of monocot- yledonous plants. Nature 488 (7410): 213–217. doi:
10.1038/nature11241.
9. Wang Z, Miao H, Liu J et al. (2019) Musa balbisiana ge- nome reveals subgenome evolution and functional diver- gence. Nature Plants 5 (8): 810–821. doi:
10.1038/s41477-019-0452-6.
10. Kanehisa M, Goto S (2000) KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Research 28 (1):
27–30.
11. Droc G, Larivière D, Guignon V et al. (2013) The banana genome hub. Database 2013 bat035. doi: 10.1093/data- base/bat035.
12. Solovyev V, Kosarev P, Seledsov I, Vorobyev D (2006) Automatic annotation of eukaryotic genes, pseudogenes and promoters. Genome Biology 7 (1): S10. doi:
10.1186/gb-2006-7-s1-s10.
13. Keller O, Odronitz F, Stanke M et al. (2008) Scipio: Us- ing protein sequences to determine the precise exon/in- tron structures of genes and their orthologs in closely re- lated species. BMC bioinformatics 9: 278. doi:
10.1186/1471-2105-9-278.
14. Liu W, Xie Y, Ma J et al. (2015) IBS: an illustrator for the presentation and visualization of biological se- quences. Bioinformatics 31 (20): 3359–3361. doi:
10.1093/bioinformatics/btv362.
15. Madeira F, Park Y mi, Lee J et al. (2019) The EMBL-EBI search and sequence analysis tools APIs in 2019. Nucleic Acids Research 47 (W1): W636–W641. doi:
10.1093/nar/gkz268.
16. Lescot M, Déhais P, Thijs G et al. (2002) PlantCARE, a database of plant cis-acting regulatory elements and a
portal to tools for in silico analysis of promoter se- quences. Nucleic Acids Research 30 (1): 325–327. doi:
10.1093/nar/30.1.325.
17. Lu S, Wang J, Chitsaz F et al. (2020) CDD/SPARCLE:
The conserved domain database in 2020. Nucleic Acids Research 48 (D1): D265–D268. doi: 10.1093/nar/gkz991.
18. Altschul SF, Gish W, Miller W et al. (1990) Basic local alignment search tool. Journal of Molecular Biology 215 (3): 403–410. doi: 10.1016/S0022-2836(05)80360-2.
19. Nicholas KB (1997) GeneDoc: Analysis and visualization of genetic variation. Embnew news 4 1–14.
20. Roy A, Kucukural A, Zhang Y (2010) I-TASSER: A uni- fied platform for automated protein structure and function prediction. Nature Protocols 5 (4): 725–738. doi:
10.1038/nprot.2010.5.
21. Reva BA, Finkelstein AV, Skolnick J (1998) What is the probability of a chance prediction of a protein structure with an rmsd of 6 å? Folding and Design 3 (2): 141–147.
doi: 10.1016/S1359-0278(98)00019-4.
22. Kumar S, Stecher G, Li M et al. (2018) MEGA X: Mo- lecular evolutionary genetics analysis across computing platforms. Molecular Biology and Evolution 35 (6):
1547–1549. doi: 10.1093/molbev/msy096.
23. Demmig-Adams B, Adams WW (1993) The xanthophyll cycle, protein turnover, and the high light tolerance of sun-acclimated leaves. Plant Physiology 103 (4): 1413–
1420.
24. Kim SH, Kim Y-H, Ahn YO et al. (2013) Downregulation of the lycopene ε-cyclase gene increases carotenoid syn- thesis via the β-branch-specific pathway and enhances salt-stress tolerance in sweetpotato transgenic calli. Phys- iologia Plantarum 147 (4): 432–442. doi: 10.1111/j.1399- 3054.2012.01688.x.
25. Zhang J, Hu Z, Yao Q et al. (2018) A tomato MADS-box protein, SlCMB1, regulates ethylene biosynthesis and ca- rotenoid accumulation during fruit ripening. Scientific Reports 8 (1): 3413. doi: 10.1038/s41598-018-21672-8.
26. Moreno JC, Cerda A, Simpson K et al. (2016) Increased Nicotiana tabacum fitness through positive regulation of carotenoid, gibberellin and chlorophyll pathways pro- moted by Daucus carota lycopene β-cyclase (Dclcyb1) expression. Journal of Experimental Botany 67 (8): 2325–
2338. doi: 10.1093/jxb/erw037.
27. Choi H, Hong J, Ha J et al. (2000) ABFs, a family of ABA-responsive element binding factors. The Journal of Biological Chemistry 275 (3): 1723–1730. doi:
10.1074/jbc.275.3.1723.
28. Divya P, Puthusseri B, Savanur MA et al. (2018) Effects of methyl jasmonate and carotenogenic inhibitors on gene expression and carotenoid accumulation in coriander (Co- riandrum sativum L.) foliage. Food Research Interna- tional 111: 11–19. doi: 10.1016/j.foodres.2018.04.040.
29. Bhattacharyya M, Upadhyay R, Vishveshwara S (2012) Interaction signatures stabilizing the NAD(P)-binding Rossmann fold: A structure network approach. PloS One 7 (12): e51676. doi: 10.1371/journal.pone.0051676.
30. Cunningham FX, Pogson B, Sun Z et al. (1996) Func- tional analysis of the beta and epsilon lycopene cyclase enzymes of Arabidopsis reveals a mechanism for control of cyclic carotenoid formation. The Plant Cell 8 (9):
1613–1626. doi: 10.1105/tpc.8.9.1613.
31. Cunningham FX, Gantt E (2001) One ring or two? Deter- mination of ring number in carotenoids by lycopene ɛ-
cyclases. Proceedings of the National Academy of Sci- ences of the United States of America 98 (5): 2905–2910.
doi: 10.1073/pnas.051618398.
32. Zhao Z, Liu Z, Mao X (2020) Biotechnological advances
in lycopene β-cyclases. Journal of Agricultural and Food Chemistry 68 (43): 11895–11907. doi:
10.1021/acs.jafc.0c04814.