Skip to main navigation Skip to main content
  • KSBS
  • E-Submission

Plant Breed. Biotech. : Plant Breeding and Biotechnology

OPEN ACCESS
ABOUT
BROWSE ARTICLES
EDITORIAL POLICIES
FOR CONTRIBUTORS

Articles

Research Article

Genome-Wide Characterization and Expression Profiling of GeBP Gene Family in Brassica rapa

Plant Breeding and Biotechnology 2025;13:53-70.
Published online: March 19, 2025

1Department of Genetics and Plant Breeding, Bangladesh Agricultural University, Mymensingh-2202, Bangladesh

*Corresponding to Arif Hasan Khan Robin, E-mail. gpb21bau@bau.edu.bd
• Received: September 26, 2024   • Revised: January 7, 2025   • Accepted: February 10, 2025

Copyright © 2025 by the Korean Society of Breeding Science

This is an open-access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

  • 12 Views
  • 1 Download
prev next
  • The GeBP gene family, known for its GLABROUS1 enhancer-binding proteins, represents a standard transcription factor characterized by an unconventional leucine-zipper motif. The constituents of this family possess a core DNA-affinity domain. This protein is crucial for controlling plant growth and development. GeBP genes have been analyzed and defined only in some Gramineae crops, Arabidopsis, rice and soybean. However, a comprehensive examination of the role of the GeBP genes in Brassica rapa has yet to be conducted. This study used genome-wide characterization to identify twenty BrGeBP genes and accessible RNA sequencing data to evaluate each gene's expression. Based on phylogenetic research, we grouped 114 GeBP proteins from five different plant species into six groups. Generally, groupings of BrGeBP genes with comparable motifs and exon-intron distribution were conserved. The twenty BrGeBP genes were found on six of the ten B. rapa chromosomes, and one paralog showed segmental duplication. The varied protein motifs and gene structures showed that the GeBP genes had various roles in plants. The functions of BrGeBP genes are numerous and include expressions responsive to specific tissues, stress, and hormones according to cis-elements analysis. Furthermore, expression analysis using RNA sequencing revealed that the genes BrGeBP3, BrGeBP9, and BrGeBP20 showed substantially increased expression in different organs. These findings will aid in elucidating the possible roles and traits of BrGeBP genes in the B. rapa biological cycle and related species.
Transcription factors (TFs) are key in controlling the stress response and developmental processes. Under stress, TFs in plants frequently connect the cis-acting component to alter gene expression, eventually preventing or minimizing plant injury. A focused study on TFs has identified their function in stress (abiotic and biotic stress), metabolism (primary and secondary metabolism), and plant development (from seed germination to fruit development). For instance, NF-Y, AP2, TCP, MYB, WRKY, GRF, and SPL TFs play significant roles in the ability of plants to withstand stress. In contrast, NAC, GRF, and SPL TFs are critical for root development, flower development, seed maturation, and plant transformation (Samad et al. 2017). One of the newest plant-specific transcription factor families, the glabrous-enhancer-binding protein (GeBP) was discovered and extracted from Arabidopsis for the first time in 2003 (Curaba et al. 2003).
Plant-specific gene regulators with a central DNA-affinity domain are the GeBP gene family (Curaba et al. 2003; Ma et al. 2021). The GeBP can control the presence of epidermal projections by altering the expression profile of the early gene GLABROUS1 (GL1), which was reported to govern the commencement of epidermal hair formation in Arabidopsis (Perazza et al. 1998). According to a study of conserved motifs, the Arabidopsis GeBP sequencing showed a leucine zipper region and a basic amino acid region. This evidence indicated that GeBP might be a part of the bZIP gene regulatory family. However, it was discovered that there are more than nine residues between these two conserved domains, which does not fit the typical criteria of a bZIP protein (Huang et al. 2021; Jakoby et al. 2002). A typical bZIP conserved domain's basic structure is N-x7-R/ K-x9-L-x6-L-x6-L, which separates the Leu zipper domain from the alkaline domain by nine residues. Researchers have concentrated on the parallels between the GeBP/GeBP-LIKE development of a distinct clade in Arabidopsis and an unexplained extra-endian invariant area (Curaba et al. 2003). Plants can adjust to complicated environmental changes because their root systems are extremely pliable while developing (Khare et al. 2017). Abiotic stresses such as heavy metal pollution at sub-lethal levels cause plants to demonstrate avoidance behavior in the form of altered root architecture (Clemens 2006). Arabidopsis thaliana responds to cadmium (Cd) by rapidly inducing the GeBP-LIKE 4 (GPL4) transcription factor, which has been recognized as a root development inhibitor. When plants are subjected to hazardous metals, GPL4 reduces the concentrations of reactive oxygen species (ROS), slows down root growth. This condition settled the roots in the uncontaminated areas (Khare et al. 2017). Moreover, GPL4 root avoidance is necessary for Arabidopsis to respond to excessive concentrations of zinc (Zn) and copper (Cu), and to respond to Cd stress.
A putative membrane protein is encoded by the Arabidopsis gene known as CONSTITUTIVE EXPRESSOR OF PATHOGENESIS-RELATED GENES5 (CPR5), which also has pleiotropic roles, notably in pathogen reactions, cell expansion, cell proliferation, and cellular death (Bowling et al. 1994; Brininstool et al. 2008). In 2011, Gilles Vachon et al. discovered a connection between CPR5 and the GeBP gene family. In the CPR5 pathway, GeBP/GPLs controlled several genes (Perazza et al. 2011). The transcriptome profile of the gebp gpl 1, 2, 3 triple mutant and the transgenic overexpression line of GPL2 has allowed researchers to determine the role of the GeBP/GPLs genes in cell wall metabolism and defense responses. CPR5 partially controls a portion of the GeBP members. GeBP/GPLs are PR gene repressors because the mutant has elevated gene expression levels for the pathogen-responsive signaling genes PR1 and PR5. According to a recent study, the Arabidopsis GeBP-like transcription factor VFP4 interacted with the Agrobacterium F-box effector VirF and its plant operative equivalent VBF (García-Cano et al. 2018).
Brassica crops serve as sources of human nutrition and offer chances to study genome evolution. Brassica vegetables primarily include Brassica rapa and Brassica oleracea. B. rapa consists of several subspecies, such as Chinese cabbage (B. rapa ssp. pekinensis), non-heading Chinese cabbage (B. rapa ssp. chinensis), and turnip (B. rapa ssp. rapifera). It is a diploid species (AA, 2n = 20) and serves as one of the progenitors of key allotetraploid oilseed crops, including B. juncea (AABB, 2n = 36) and B. napus (AACC, 2n = 38) (Nagaharu et al. 1935). B. rapa varieties are valuable for enhancing the genetic diversity of these domesticated species. With a short life cycle, minimal growth requirements, and sensitivity to environmental factors, B. rapa (like Wisconsin Fast Plants) and other rapid-cycling Brassicas are widely used in education and research. One of the rare subspecies of Brassica rapa is Chinese cabbage (Brassica rapa ssp. pekinensis) (Song et al. 2014). Its variation Chiifu-401-42 genome has undergone sequencing (Wang XiaoWu et al. 2011). While more than 40,000 proteins have been described, the sequences were organized based upon their significant commercial importance and tight ties to Arabidopsis (Wang XiaoWu et al. 2011). The haploid genome of Brassica rapa is approximately 529 Mb, making it much larger than the 125 Mb haploid genome of Arabidopsis (Johnston et al. 2005). Arabidopsis and Brassica share a common ancestor, having diverged around 14.5-20.4 million years ago (Bowers et al. 2003). Comparative genetic mapping has shown collinear chromosome segments within the Brassicaceae family and conserved linkage patterns between Arabidopsis and Brassica (Acarkan et al. 2000a; Acarkan et al. 2000b; Bohuon et al. 1998; Lukens et al. 2003). This makes B. rapa a valuable model for studying the genomics and evolutionary history of Brassica species.
Consequently, a genome-wide investigation was carried out to identify the B. rapa GeBP gene family. The 20 GeBP genes that were discovered underwent additional examination for their protein sequences, gene characterization, structural organization, conserved motif analysis, chromosomal localization, phylogeny, promoter cis-elements, and the investigation of gene expression using transcriptomic data. These bioinformatic investigations will offer valuable insights into the role of BrGeBP genes in response to abiotic stresses and provide essential genetic resources to support biological breeding efforts.
GeBP genes in Brassica rapa: genome-wide identification
To do a comprehensive genomic analysis of the GeBP gene family, we extracted 20 GeBP genes of B. rapa. An HMM test was performed to verify their identities using the HMMER web server (Potter et al. 2018). All sequences, including coding DNA (CDS), protein, and genomic sequences of B. rapa, were downloaded from the BRAD database (Cheng et al. 2011). iTAK database (Zheng et al. 2016), and Ensembl Plants database (Kersey et al. 2016) were used to confirm the sequences found. To find out about molecular characteristics including length, molecular weight (Mw), iso-electric point (pI), and grand average of hydropathicity (GRAVY) of GeBP proteins, ProtParam tool (Gasteiger et al. 2005) was employed. Ensembl Plants database (Kersey et al. 2016) was utilized to discover the Open Reading Frame (ORF) of GeBP genes. The subcellular localization of the identified proteins was anticipated using Cell-PLoc 2.0 (Chou et al. 2010). The SMART conserved domain search tool (Letunic et al. 2018) was used to anticipate the site of the GeBP domain in twenty BrGeBP genes. The protein sequences of Arabidopsis (20), B. oleracea (18), B. napus (36), and Radish (20) (Supplementary Data File S1) were collected from iTAK databases (Zheng et al. 2016) (Table 1).
Establishment of evolutionary relationship through phylogenetic tree and synteny analysis
To investigate the evolutionary connections of GeBP family proteins among the five-plant species, phylogenetic analysis of GeBP proteins of Arabidopsis thaliana, B. rapa, B. oleracea, Raphanus sativus, and B. napus was performed. The phylogenetic tree was assembled employing MEGA7.0 by the Neighbor-Joining (NJ) method (Saitou et al. 1987) considering the parameter of pair-wise deletion and 1000 bootstrap replications. A synteny map was generated using the Circoletto database (http://tools.bat.infspire.org/circoletto) to display the similarities among the protein sequences of Brassica rapa, Brassica oleracea, Brassica napus, and Arabidopsis thaliana.
Study of motif analysis and exon-intron distribution
The BrGeBP protein motifs were analyzed using MEME suite 5.1.0 (Bailey et al. 2009). The pre-defined analysis settings were a minimum width of 6 and a maximum width of 50, with a maximum of 10 motifs each. The exon-intron distribution of the CDSs of GeBP genes was analyzed using the gene structure display server (GSDS) web tool (Hu et al. 2015).
Chromosomal mapping and gene duplication assessment
The chromosomal positions, including start-end coordinates and chromosome numbers, of the 20 BrGeBP genes were compiled from the Ensembl Plants database (Kersey et al. 2016) and the MapGene2Chrom web v2 tool (Chao et al. 2015) was employed to plot their locations on all 10 chromosomes of B. rapa. To determine the gene duplication of BrGeBP genes, an NCBI BLAST search (Zhang et al. 1997) was carried out based on the proportion of query cover percentage among 20 BrGeBP genes.
Identification of cis-acting elements
The PlantCARE web-based tool (Lescot et al. 2002) was used to come up with a list of possible cis-regulatory elements of 20 BrGeBP genes. Each cis-element was categorized based on its association with stress response, hormonal regulation, growth and development. The frequency of each element was recorded for each BrGeBP gene, and the data were systematically organized in Microsoft Excel. To improve visualization, conditional formatting was applied to highlight values, providing a clear and intuitive representation of the variations in cis-element distribution across genes. Cis-regulatory elements were found in the area between each gene's start codon and the 2000 bp upstream sequence (Wu et al. 2012). Such upstream regions that modulate target genes contain cis-elements coupled to transcription factors (Fang et al. 2008).
Expression profiling of BrGeBP genes in diverse tissues
BrGeBP RNA-sequencing data was retrieved from the Expression Atlas database (https://www.ebi.ac.uk/gxa/ home) (Petryszak et al. 2016). Tong et al. (2013) had previously gathered and processed that data. Six types of tissues such as leaf, stem, flower, silique, root, and callus were examined (Tong et al. 2013). The transcript quantity is measured in fragments per kilobase of exon model per million mapped reads (FPKM) values. Microsoft Office Excel created a heat map for the BrGeBP genes using these values.
GeBP Genes in Brassica rapa: genome-wide identification
A total of 20 GeBP genes were identified and according to their gene id, genes were named as BrGeBP1 to BrGeBP20 where 'B' for Brassica and 'r' for rapa (Table 2). To establish the veracity of the conserved domains, the SMART conserved domain search tool (Letunic et al. 2018) was used to verify their presence. DUF573 domains were found in 17 out of 20 genes (Table 2). Three candidate genes (BrGeBP16, BrGeBP19, and BrGeBP20) were eliminated due to their incomplete domain structure. The protein length varied from 299aa, found in both BrGeBP11 and BrGeBP12 (shortest protein) to 640aa found in BrGeBP10 (longest protein) (Table 2). These proteins have molecular weights that range from 33.63 kda (BrGeBP11) to 72.81 kda (BrGeBP10). The theoretical pI values ranged from 4.56 (BrGeBP4) to 8.95 (BrGeBP2). Five proteins (BrGeBP2, BrGeBP3, BrGeBP6, BrGeBP7, and BrGeBP10) have pI values above 7, demonstrating that they were basic, while the proteins generated by other GeBP genes were acidic (pI values <7) (Table 2). The CDS of GeBP genes had a wide range of lengths, ranging from 735 bp (BrGeBP14) to 1923 bp (BrGeBP10) (Table 1). In addition, the GRAVY values differed from -0.263 to -1.087 (Table 2). The GRAVY scores for all of these proteins were negative, illustrating that they were all hydrophilic in nature. In the subcellular location, 19 proteins were expected to be nuclear and one protein (BrGeBP2) was predicted to be endoplasmic reticulum (Table 2). It can be deduced that GeBP proteins have a wide range of physiochemical characteristics.
Establishment of evolutionary relationship through phylogenetic tree and synteny analysis
A phylogenetic tree was built to gain insight into the evolutionary associations among GeBP family proteins from 114 amino acid sequences of B. rapa (20), B. oleracea (18), B. napus (36), Arabidopsis thaliana (20), and Raphanus sativus (20) (Fig. 1). While majority of the nodes had more believable bootstrap values, certain nodes' bootstrap values were low due to low species identity. The GeBP proteins from different species were clustered into six groups, marked as A, B, C, D, E, and F (Fig. 1). Among these, group A comprised 31 members which is 27% of the total GeBP genes, group B comprised 23 members which is 20% of the total GeBP genes, group C contained 12 members which is 10% of the total GeBP genes, succeeded by group D (16 members), group E (15 members), and group F (17 members) which are 13% of the total GeBP genes (Fig. 1). Moreover, the GeBP genes from the dicot were closely grouped in the phylogenetic tree.
The maximum number of BrGeBP was found in group A, which had six members, and the lowest count was found in group F with two members (Fig. 1). In the tree, orthologous pairs represented the majority of BrGeBP proteins (i.e. BrGeBP20/BnGeBP21, BrGeBP19/ BnGeBP4, BrGeBP16/RsGeBP1, BrGeBP1/BnGeBP33, BrGeBP5/ BnGeBP11, BrGeBP3/ BnGeBP35, BrGeBP4/ BoGeBP8, BrGeBP10/ BnGeBP29, BrGeBP12/ BnGeBP28, BrGeBP6/ BnGeBP23, BrGeBP17/ BnGeBP6, BrGeBP8/ BnGeBP9, BrGeBP7/ BnGeBP13, and BrGeBP9/ BoGeBP18) (Fig. 1). On the other hand, only one paralogous pair of BrGeBP proteins was detected in the tree (i.e., BrGeBP14/ BrGeBP15) (Fig. 1). Furthermore, the synteny analysis explored the close physical proximity of genes on a chromosome and the syntenic relationships between BrGeBP proteins and those of related species. The green ribbon connecting BrGeBP11/BoGeBP6 and BrGeBP16/BnGeBP26 indicates a sequence similarity of ≤75% (Fig. 2). In contrast, the remaining orthologous pairs, linked by orange ribbons, represent higher sequence similarities of up to ≤99.99%. Some examples include BrGeBP1/BnGeBP34, BrGeBP2/BnGeBP14, BrGeBP3/BnGeBP35, and BrGeBP4/BoGeBP8 (Fig. 2). This demonstrates that Brassica rapa shares a close syntenic connection with other Brassica species.
Study of motif analysis and exon-intron distribution
To more thoroughly explore the variation of the BrGeBP genes, we conducted the motif analysis of the BrGeBP proteins. There was diversity in the motif distribution among the 20 GeBP genes. Ten conserved motifs were discovered through the MEME online server-based motif analysis. From Fig. 3, we noticed that motif 1 and motif 3 were present in 19 proteins which were the most common motifs among 10. On the other hand, motifs 5 and 6 were observed on 7 proteins whereas motifs 2 and 4 were absent on 3 proteins (Fig. 3). Besides that, BrGeBP2 and BrGeBP18 contained only 3 motifs among 10 motifs (Fig. 3). Members of the evolutionary tree that shared the same group have similar motif compositions (Figs. 1, 3). Group A contained 8 motifs including motifs 1, 3, 4, 5, 6, 7, 8, and 10 whereas motif 4 was absent in group B and motif 10 was absent in group F (Figs. 1, 3). The motif distribution verified that the BrGeBP proteins had not changed over time.
It is well-accepted that the evolution of multigene families is driven by structural variation within genes. One of the crucial structural elements that play a role in the evolution of a given gene is the distribution of exons and introns. Comparable exon-intron distributions suggest functional commonality. Additionally, exonization, insertion/deletion, and addition or removal of exon/intron are three major types of processes that contribute in diverse ways to structural differences in the exon-intron architecture. The exon-intron distribution of each BrGeBP gene was inferred to obtain a clearer perspective of the structural variation of the GeBP genes in B. rapa (Fig. 4). There were only seven BrGeBP genes that comprised introns in their coding regions which ranged from one to seven. BrGeBP4 contained the highest seven introns whereas BrGeBP11 contained only one intron (Fig. 4). Most individuals of a group that were linked to one another had nearly identical exon-intron arrangements, either in terms of intron counts or exon dimensions. Group D, E, and F contained only exons which was the equivalent distribution of exons with varying lengths (Fig. 4). However, compared to other members of group A, BrGeBP16 and BrGeBP19 have a varied number of exons and introns (Fig. 4).
Chromosomal mapping and gene duplication assessment
The physical location of each gene was identified after the 20 BrGeBP genes were mapped onto the 10 chromosomes of B. rapa (Fig. 5). On six of the ten chromosomes, BrGeBP genes were disproportionally distributed. The most BrGeBP genes were found on chromosome A04, which had six of them followed by chromosomes A01 (four genes) and A06 (three genes). Chromosome A09 contained two genes, while chromosomes A03 and A05 contained only one gene (Fig. 5). On the other hand, there were no genes found on the rest of chromosomes A02, A07, A08, and A10 (Fig. 5). Besides that, the chromosomal locations of BrGeBP15, BrGeBP17, and BrGeBP20 were scaffolded which was not identified specifically (unknown).
The sequence identity among the 20 GeBP proteins of B. rapa ranged from 22.14% (between BrGeBP7 and BrGeBP19) to 93.37% (between BrGeBP14 and BrGeBP15) (Supplementary Table S1). If two genes were found in an area of 100 kb of a chromosome and spaced by only four or fewer other genes, they were considered as tandemly duplicated genes (Wang et al. 2015). Genes were considered to be segmentally duplicated if their percentage of query cover and identity was ≥ 80 (Kong et al. 2013). Depending on the degree of identity and query coverage (more than 80%) of the BrGeBP genes, a single pair of paralogous segmentally duplicated BrGeBP genes was discovered (Supplementary Table S1). One gene was found on chromosome A04 and another gene was scaffolded (Fig. 5). According to the evolutionary relationships depicted in Fig. 1, the duplicated gene pairs are members of the same group. None of the genes had any tandem duplication.
Identification of cis-acting elements
During all stages of a plant's life cycle, cis-elements (CEs) are crucial for the transcriptional control of genes. CEs were discovered in the promoter areas of the 20 GeBP genes in B. rapa to better comprehend the transcriptional control mechanisms. Plant growth and development, phytohormone responsiveness, and stress responsiveness were the three kinds of responses among 20 CEs (Supplementary Table S2). Here, we found four CEs in response to growth and development (Fig. 6). CAT-box (for meristem expression) was found on two genes, MSA-like (for cell cycle regulation) was found on two genes, O2-site (for controlling zein metabolism) was present in nine genes, and GCN4-motif (for expressing endosperm) was present in four genes (Fig. 6). Five cis-elements responded to stress: MBS (MYB binding site involved in drought-inducibility) was present in four different BrGeBP genes, LTR (low-temperature responsiveness) was present in seven genes, TC-rich repeats were present in four genes, GC-motif was present in two BrGeBP genes, and ARE (anaerobic induction) was present in 15 BrGeBP genes that responded to both defense and stress (Fig. 6). Additionally, we discovered different hormone-sensitive cis-elements: salicylic acid (TCA-element) in eight genes, auxin (TGA-element) in eight genes, abscisic acid (ABRE) in 12 genes, and gibberellin (GARE-motif) in eight genes (Fig. 6). Furthermore, the majority of the BrGeBP genes in the same phylogenetic group have comparable cis-acting components in their promoter zones, indicating a potential role for this gene family in growth and resilience to stress (Figs. 1, 6).
Expression profiling of BrGeBP genes in diverse tissues
A gene family's tissue-specific expression profile can reveal information about its putative developmental functions. To figure out which GeBP genes are involved in B. rapa's tissue or organ development regulation, the expression levels of the BrGeBP in six tissues (leaf, stem, flower, silique, root, and callus) were investigated based on FPKM values derived from RNA-sequencing data in B. rapa that Tong et al. (2013) had previously examined. According to the findings, six genes (BrGeBP1, BrGeBP3, BrGeBP5, BrGeBP9, BrGeBP13, and BrGeBP20) were expressed at varying levels in all the tissues (Fig. 7). Mostly BrGeBP3, BrGeBP9, and BrGeBP20 showed notably higher expression levels among the twenty genes. Some genes have distinct expression patterns in different tissues to some extent, for example- BrGeBP1, BrGeBP3, BrGeBP5, BrGeBP9, BrGeBP13, BrGeBP18, and BrGeBP20 prominently expressed during the reproductive stage, particularly in flower tissue (Fig. 7). The transcript levels of seven genes, namely- BrGeBP1, BrGeBP3, BrGeBP5, BrGeBP9, BrGeBP13, and BrGeBP20 were prominently expressed in leaves, eight genes (BrGeBP1, BrGeBP3, BrGeBP5, BrGeBP9, BrGeBP13, BrGeBP19, and BrGeBP20) in stems, seven genes (BrGeBP1, BrGeBP3, BrGeBP5, BrGeBP9, BrGeBP13, BrGeBP18, and BrGeBP20) in siliqua, Six genes in roots (BrGeBP1, BrGeBP3, BrGeBP5, BrGeBP9, BrGeBP13, and BrGeBP20), and six genes (BrGeBP1, BrGeBP3, BrGeBP5, BrGeBP9, BrGeBP13, and BrGeBP20) in callus compared to other genes (Fig. 7). In contrast, two genes (BrGeBP2 and BrGeBP11) were not expressed in any organs out of six (Fig. 7). Besides that, BrGeBP10 was very weakly expressed in the different organs examined (Fig. 7). In B. rapa tissues, we discovered that the majority of the individuals belonging to the same phylogenetic grouping showed a comparable expression profile (Figs. 1, 7). In particular, the genes from group A exhibited greater expression, and group C showed lower expression compared to other groups (Figs. 1, 7).
The opportunity to customize the mechanisms and functions of genes associated with plant growth, development, and stress tolerance is expanding as genome sequencing initiatives advance more rapidly. A useful method for examining gene function, structure, and evolution is now the study of the gene family. The ensuing findings imply that GeBP will support the hormone cascade as a TF. First, the GeBP protein connects to the cis-acting region of the GLABROUS1 gene, a MYB gene involved in determining epidermal cells, which is controlled by the cytokinin hormone GA (Gan et al. 2007; Oppenheimer et al. 1991; Perazza et al. 1998). Second, the KNOTTED1 homeodomain (KNOX) family gene BREVIPEDICELLUS (BP) (Curaba et al. 2004), which positively controls the apical meristem of the shoot contains the cytokinin pathway, positively influences the transcript levels of GeBP (SAM) (Jasinski et al. 2005; Yanai et al. 2005). Arabidopsis, rice, apple, moso bamboo and soybean have all undergone a preliminary investigation of the GeBP gene family. Therefore, it is feasible and essential to characterize the GeBP genes in B. rapa at the genome level.
The number of GeBP genes in B. rapa was slightly lesser than in rapeseed (36) but equivalent to that of Arabidopsis (20) and Radish (20), which was higher than that of cabbage (18). In addition, the genome size of B. rapa (283.8 Mb) (Wang XiaoWu et al. 2011), and Arabidopsis (125 Mb) (Hu et al. 2011), but the genome size of cabbage is about 530 Mb (Liu et al. 2014) which is higher than B. rapa. So, the size of the crop genome is unrelated to the quantity of GeBP genes.
From the phylogenetic analysis, it was revealed that the evolutionary ties between the BrGeBP and BnGeBP proteins were found to be closer in contrast to other crop GeBP proteins (Fig. 1). This result implied that B. rapa and B. napus might have shared a functional progenitor and evolved from them. In the tree, several paralogous and orthologous relationships were found. The term "paralog" refers to identical genes that have evolved within a single species and were due to gene duplication, which causes the creation of new genes with novel functions (Gabaldón et al. 2013). Instead, following a speciation event, orthologs diverged yet still perform similarly to the ancestor gene's function (Gabaldón et al. 2013). As one paralog gene pair of BrGeBP proteins was identified, indicating highly functional similarity (Fig. 1). The evolutionary tree of BrGeBP proteins and the similarity in gene structure and motifs among each category provided more evidence for the links between the proteins.
Besides the function in gene family evolution, we examined the conserved motifs. Surprisingly, out of 20 GeBP proteins, 19 of them had motif 1 and motif 3 demonstrating the potential importance of these motifs in regulating target gene expression by transcripts during evolution (Fig. 3). Since the bulk of the BrGeBP proteins sharing the same group contained comparable patterns, gene expansion was likely the cause of their creation. The diverse patterns of motif distribution among the various groups revealed the factors of differentiating functions in GeBP genes throughout their evolutionary background. In addition, the gene structure divergence offers additional proof for analyzing phylogenetic connections. As a result, each member of the B. rapa GeBP family's exon/intron structure and evolutionary connections were examined (Figs. 1, 4). The consistent exon-intron distribution across various groups revealed that these genes had undergone substantial evolutionary conservation. Moreover, the functional divergence of BrGeBP may be explained by variations in intron number.
Genetic material is multiplied throughout selection and alterations, mostly by instances of gene duplication and divergence (Pickett et al. 1995). Tandem and segmental duplication, together with transpositional events, are the primary reasons for the gene family growth in plants (Cannon et al. 2004; Li et al. 2014). These include, segmental duplication is more prevalent among them because the majority of plants are polyploids with duplicate chromosomes and are diploidized (Cannon et al. 2004). In this investigation, no pair of genes' tandem duplications have been discovered, demonstrating that the growth of the GeBP gene family in B. rapa has been mostly driven by segmental duplication as opposed to tandem duplication (Supplementary Table S1). The phylogenetic tree revealed that overlapping genes belonged to the same groupings, demonstrating the occurrences of group-specific gene duplications during evolution.
It may be possible to learn more about the roles of BrGeBP genes at various plant developmental stages by studying the promoters in the upstream area of genes owing to their significant functions in regulating gene expression throughout development or in the environment (Zhu et al. 2014). They are crucial switches at the molecular level that contribute to the regulation of the transcription of genes and operate a vast network of genes engaged in a range of biological functions, such as the effects of stress on development (Zhang et al. 2005). There are numerous cis-acting factors in plants that are connected to stress (Saeediazar et al. 2014). In this study, every BrGeBP gene contains a single or many phytohormones cis-elements listed below, which increases the likelihood that each of the BrGeBP genes has functions related to stress tolerance (Fig. 6). These elements include gibberellin, auxin, salicylic acid, abscisic acid, and methyl jasmonate (Fig. 6). To respond to stressors, phytohormones assemble many regulatory cascades as intracellular transmitters (Wolters et al. 2009). The gene promoter regions contained cis-elements that respond to stress hence describing their purpose in environmental adaptation and abiotic stress.
It has been proposed that transcription factors are important regulators in controlling how genes express themselves in response to different environmental stressors (Chen et al. 2021). Twenty BrGeBP gene expression profiles from various tissues were examined to determine the potential roles of these genes (Fig. 7). The RNA-seq analysis revealed that three different expression patterns were found among the twenty BrGeBP genes, comprising tissue-specific expression patterns, constitutive expression, and no expression (Fig. 7). Two BrGeBP genes (BrGeBP2 and BrGeBP11) were not expressed in any of the tissues analyzed, posing the possibility that they are pseudogenes or that they are only produced during particular developmental stages or under particular circumstances that were not examined in our research. BrGeBP3 is predominantly expressed in stem, siliqua, and callus tissues, indicating involvement in the growth and development of stem and siliqua, and also engaged in the formation of roots and/or the intake of nutrients and water (Fig. 7). The BrGeBP3 gene, identified as being associated with drought tolerance (Wang et al. 2023), can be targeted to enhance nutrient uptake and improve resilience to drought conditions. Similar results obtained from BrGeBP9 and BrGeBP20 show that these genes can be harnessed to develop cabbage and related crops with enhanced adaptability, increased yield, and greater stress tolerance using advanced breeding techniques such as gene editing. Additionally, BrGeBP genes can beleveraged in biotechnological applications, where their targeted modification could enhance oilseed yield, nutritional quality, and stress resilience, benefiting industries reliant on Brassica crops. As the six genes (BrGeBP1, BrGeBP3, BrGeBP5, BrGeBP9, BrGeBP13, and BrGeBP20) showed preferential expression in all the tissues (leaf, stem, flower, silique, root, and callus), they could play a variety of regulatory roles in B. rapa's growth and development (Fig. 7). In addition, GeBP genes can control how plant epidermal cells grow (Jing et al. 2002). In rice, OsGeBP genes can react to various stimuli or hormones to control the downstream target genes (Huang et al. 2021). In soybeans, GmGeBP4 may be involved in the control of CPR5-dependent mechanisms that result in the development of trichomes (Liu et al. 2022). Overexpression of MdGeBP3 demonstrated increased sensitivity to cytokinin in apples, while its ectopic expression in Arabidopsis thaliana reduced drought resistance (Liu et al. 2023). In moso bamboo, 12 of 16 PeGeBP's showed higher expression in tissues with trichomes compared to tissues lacking them, suggesting these genes help control trichome formation (Shan et al. 2020). Silencing CsGeBP4 using a new virus-induced gene silencing method in tea plants stopped trichome formation, suggesting that CsGeBP4 is necessary for this process (Zhou et al. 2023). Future applications of these findings include the implementation of marker-assisted selection, CRISPR/Cas9-based genome editing, and transgenic approaches to develop stress-resilient and high-yielding crop varieties.
The identification, characterization, and expression profiling of the GeBP gene family in B. rapa across the entire genome are described in this research. In this investigation, we discovered a GeBP gene family in B. rapa with twenty BrGeBP genes spread over six of the ten chromosomes. We performed a thorough genome-wide study of the GeBP gene family to investigate their possible involvement in anatomical formation and stress adaptation. The BrGeBP genes were also evaluated for their expected gene arrangements, preserved regions, locations on chromosomes, instances of duplication, evolutionary separation, and syntenic connections. We then categorized them using phylogenetic analysis. Additionally, we found various types of development-related and hormone-sensitive cis-acting elements in the putative BrGeBP gene promoters, demonstrating the gene family's involvement in various biological processes. Finally, based on the expression pattern of the BrGeBP genes, we hypothesized their roles. These findings lay the foundation for additional research into the biological roles of each GeBP gene in plants.
Fig. 1
Phylogenetic analysis of GeBP proteins from B. rapa, B. oleracea, B. napus, Arabidopsis thaliana, and Raphanus sativus. With 1000 bootstrap replicates, the neighbor joining method was used in MEGA-7 to create the phylogenetic tree.
pbb-13-53-f1.jpg
Fig. 2
The syntenic relationship among Brassica rapa, Brassica oleracea, Brassica napus, and Arabidopsis thaliana is depicted using green and orange colors to represent the highest sequence similarity between the genes of these crops.
pbb-13-53-f2.jpg
Fig. 3
10 conserved motifs were found with the aid of the MEME-suite tool. The different colored boxes indicate different types of motifs.
pbb-13-53-f3.jpg
Fig. 4
Exon-intron arrangement of GeBP genes was forecasted from GSDS 2.0. This structural arrangement was generated using gene and CDS sequences.
pbb-13-53-f4.jpg
Fig. 5
Localization of the GeBP gene family on the chromosome of B. rapa. The relative length of a chromosome serves as a measure of its size. Each chromosome has a number at the top that corresponds to its chromosome. Megabases (Mb) is the scale on the left side of the image that was used to estimate the sizes of the chromosomes and the locations of the genes.
pbb-13-53-f5.jpg
Fig. 6
Cis-regulatory elements were detected in the promoter region of BrGeBP genes. Numerical values indicate the number of cis-regulatory elements in each gene. These elements are non-coding DNA regions that control the neighboring genes' transcription.
pbb-13-53-f6.jpg
Fig. 7
Expression of the BrGeBP genes in specific organs. Leaf, stem, flower, siliqua, root, and callus were present. The heat map was generated using the conditional formatting option in Microsoft Excel. The colors red, white, and green indicate high, moderate, and low levels of gene expression, respectively.
pbb-13-53-f7.jpg
Table 1
A comprehensive summary table consolidating data sources
Table 1
Data Source
CDS, protein, and genomic sequences of B. rapa BRAD database (http://brassicadb.cn/#/)
Molecular characteristics: length, Mw, GRAVY, pI ProtParam tool (https://web.expasy.org/protparam/)
Open Reading Frame (ORF) Ensembl Plants database (https://plants.ensembl.org/index.html)
Subcellular localization Cell-PLoc 2.0 (http://www.csbio.sjtu.edu.cn/bioinf/Cell-PLoc-2/)
Domain SMART conserved domain search tool (http://smart.embl-heidelberg.de/)
Protein sequences of Arabidopsis (20), B. oleracea (18), B. napus (36), and Radish (20) iTAk database (http://itak.feilab.net/cgi-bin/itak/index.cgi)
Cis-acting elements PlantCARE web-based tool
(https://bioinformatics.psb.ugent.be/webtools/plantcare/html/)
RNA-sequencing data Expression Atlas database (https://www.ebi.ac.uk/gxa/home)
Table 2
Comprehensive details on the Brassica rapa BrGeBP genes and associated proteins
Table 2
Gene name Gene ID Chromosome location Protein
length
Mol. wt.
(kDa)
pI GRAVY ORF
(bp)
Subcellular
localization
Domain
DUF573

Number Start End Length
BrGeBP1 Bra009986 A06 19087069 19088313 26271742 414 44.96 4.87 -0.587 1245 Nuclear 123-219
BrGeBP2 Bra013842 A01 8042900 8046434 28605136 609 69.62 8.95 -0.263 1830 ER 494-581
BrGeBP3 Bra013873 A01 8185027 8186103 28605136 358 39.31 7.53 -0.785 1077 Nuclear 138-235
BrGeBP4 Bra017244 A04 15879572 15881487 18965343 377 43.16 4.56 -1.020 1134 Nuclear 56-150
BrGeBP5 Bra019183 A03 25857483 25858583 31715688 366 40.52 5.72 -0.924 1101 Nuclear 146-243
BrGeBP6 Bra019847 A06 4034905 4035975 26271742 356 39.16 7.66 -0.945 1071 Nuclear 108-198
BrGeBP7 Bra021317 A01 21515533 92151645 28605136 308 34.58 8.89 -1.001 927 Nuclear 106-193
BrGeBP8 Bra025445 A04 9053857 9054780 18965343 307 34.64 5.61 -0.892 924 Nuclear 100-195
BrGeBP9 Bra027078 A09 8683856 8684998 18965343 380 42.10 5.15 -1.017 1143 Nuclear 156-243
BrGeBP10 Bra028105 A04 6123535 6126692 18965343 640 72.81 8.98 -1.087 1923 Nuclear 89-177,
419-508
BrGeBP11 Bra028106 A04 6127732 6128706 18965343 299 33.63 6.17 -0.957 900 Nuclear 85-162
BrGeBP12 Bra028108 A04 6131824 6132723 18965343 299 33.91 5.91 -0.945 900 Nuclear 87-176
BrGeBP13 Bra031402 A01 17604367 17605365 28605136 332 36.94 5.09 -1.055 999 Nuclear 127-214
BrGeBP14 Bra032717 A04 5586257 5587255 18965343 332 36.61 5.31 -0.683 735 Nuclear 97-187
BrGeBP15 Bra034488 Scaffold 356960 357928 0 322 35.63 5.35 -0.668 969 Nuclear 97-187
BrGeBP16 Bra036116 A09 2403139 2404306 18965343 358 38.40 4.69 -0.570 1077 Nuclear -
BrGeBP17 Bra038583 Scaffold 250721 251980 0 419 46.35 5.08 -1.013 1260 Nuclear 229-324
BrGeBP18 Bra039500 A05 8899379 8900517 18965343 300 34.16 5.71 -0.870 903 Nuclear 36-130
BrGeBP19 Bra040116 A01 25918538 25919720 28605136 371 40.83 5.17 -0.700 1116 Nuclear -
BrGeBP20 Bra040416 Scaffold 97200 98477 0 425 46.79 4.84 -0.638 1278 Nuclear -

Mol. wt. - Molecular weight, pI- Iso-electric point, GRAVY- Grand average of hydropathy, ORF- Open reading frame, bp- base pair, kDa= Kilo Dalton, ER= Endoplasmic Reticulum

  • Acarkan A, Roßberg M, Koch M, Schmidt R. 2000. Comparative genome analysis reveals extensive conservation of genome organisation for Arabidopsis thaliana and Capsella rubella. Plant. J. 23: 55-62.
  • Bailey TL, Boden M, Buske FA, Frith M, Grant CE, Clementi L, et al. 2009. MEME suite: tools for motif discovery and searching. Nucleic Acids Res. 37(Suppl_2): W202-W208.
  • Bohuon E, Ramsay L, Craft J, Arthur A, Marshall D, Lydiate D, et al. 1998. The association of flowering time quantitative trait loci with duplicated regions and candidate loci in Brassica oleracea. Genetics. 150: 393-401.
  • Bowers JE, Chapman BA, Rong J, Paterson AHJN. 2003. Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events. Nature. 422: 433-438.
  • Bowling SA, Guo A, Cao H, Gordon AS, Klessig DF, Dong X. 1994. A mutation in Arabidopsis that leads to constitutive expression of systemic acquired resistance. The Plant Cell. 6: 1845-1857.
  • Brininstool G, Kasili R, Simmons LA, Kirik V, Hülskamp M, Larkin JC. 2008. Constitutive Expressor Of Pathogenesis-related Genes5 affects cell wall biogenesis and trichome development. BMC Plant Biol. 8: 1-11.
  • Cannon SB, Mitra A, Baumgarten A, Young ND, May G. 2004. The roles of segmental and tandem gene duplication in the evolution of large gene families in Arabidopsis thaliana. BMC Plant Biol. 4: 1-21.
  • Chao JT, Kong YZ, Wang Q, Sun YH, Gong DP, Lv J, et al. 2015. MapGene2Chrom, a tool to draw gene physical map based on Perl and SVG languages. Yi chuan= Hereditas. 37: 91-97.
  • Chen Y, Shen J, Zhang L, Qi H, Yang L, Wang H, et al. 2021. Nuclear translocation of OsMFT1 that is impeded by OsFTIP1 promotes drought tolerance in rice. Mol Plant. 14: 1297-1311.
  • Cheng F, Liu S, Wu J, Fang L, Sun S, Liu B, et al. 2011. BRAD, the genetics and genomics database for Brassica plants. BMC Plant Biol. 11: 1-6.
  • Chou KC, Shen HB. 2010. Cell-PLoc 2.0: an improved package of web-servers for predicting subcellular localization of proteins in various organisms. Nat. Sci. 2: 1090
  • Clemens S. 2006. Toxic metal accumulation, responses to exposure and mechanisms of tolerance in plants. Biochimie. 88: 1707-1719.
  • Curaba J, Herzog M, Vachon G. 2003. GeBP, the first member of a new gene family in Arabidopsis, encodes a nuclear protein with DNA-binding activity and is regulated by KNAT1. Plant J. 33: 305-317.
  • Curaba J, Moritz T, Blervaque R, Parcy F, Raz V, Herzog M, et al. 2004. AtGA3ox2, a key gene responsible for bioactive gibberellin biosynthesis, is regulated during embryogenesis by LEAFY COTYLEDON2 and FUSCA3 in Arabidopsis. Plant Physiol. 136: 3660-3669.
  • Fang Y, You J, Xie K, Xie W, Xiong L. 2008. Systematic sequence analysis and identification of tissue-specific or stress-responsive genes of NAC transcription factor family in rice. Mol. Genet. Genomics. 280: 547-563.
  • Gabaldón T, Koonin EV. 2013. Functional and evolutionary implications of gene orthology. Nat. Rev. Genet. 14: 360-366.
  • Gan Y, Liu C, Yu H, Broun P. 2007. Integration of cytokinin and gibberellin signalling by Arabidopsis transcription factors GIS, ZFP8 and GIS2 in the regulation of epidermal cell fate. Development. 134: 2073-2081.
  • García-Cano E, Hak H, Magori S, Lazarowitz SG, Citovsky V. 2018. The Agrobacterium F-box protein effector VirF destabilizes the Arabidopsis GLABROUS1 enhancer/binding protein-like transcription factor VFP4, a transcriptional activator of defense response genes. MPMI. 31: 576-586.
  • Gasteiger E, Hoogland C, Gattiker A, Duvaud Se, Wilkins MR, Appel RD, et al. 2005. Protein identification and analysis tools on the ExPASy server. The proteomics protocols handbook. 571-607..
  • Hu B, Jin J, Guo AY, Zhang H, Luo J, Gao G. 2015. GSDS 2.0: An upgraded gene feature visualization server. Bioinformatics. 31: 1296-1297.
  • Hu TT, Pattyn P, Bakker EG, Cao J, Cheng J-F, Clark RM, et al. 2011. The Arabidopsis lyrata genome sequence and the basis of rapid genome size change. Nat. Genet. 43: 476-481.
  • Huang J, Zhang Q, He Y, Liu W, Xu Y, Liu K, et al. 2021. Genome-wide identification, expansion mechanism and expression profiling analysis of GLABROUS1 enhancer-binding protein (GeBP) gene family in Gramineae crops. Int. J. Mol. Sci. 22: 8758
  • Jakoby M, Weisshaar B, Dröge-Laser W, Vicente-Carbajosa J, Tiedemann J, Kroj T, et al. 2002. bZIP transcription factors in Arabidopsis. Trends Plant Sci. 7: 106-111.
  • Jasinski S, Piazza P, Craft J, Hay A, Woolley L, Rieu I, et al. 2005. KNOX action in Arabidopsis is mediated by coordinate regulation of cytokinin and gibberellin activities. Curr. Bioly. 15: 1560-1565.
  • Jing HC, Sturre MJ, Hille J, Dijkwel PP. 2002. Arabidopsis onset of leaf death mutants identify a regulatory pathway controlling leaf senescence. Plant J. 32: 51-63.
  • Johnston JS, Pepper AE, Hall AE, Chen ZJ, Hodnett G, Drabek J, et al. 2005. Evolution of genome size in Brassicaceae. Ann. Bot. 95: 229-235.
  • Kersey PJ, Allen JE, Armean I, Boddu S, Bolt BJ, Carvalho-Silva D, et al. 2016. Ensembl Genomes 2016: More genomes, more complexity. Nucleic Acids Res. 44(D1): D574-D580.
  • Khare D, Mitsuda N, Lee S, Song WY, Hwang D, Ohme-Takagi M, et al. 2017. Root avoidance of toxic metals requires the GeBP-LIKE 4 transcription factor in Arabidopsis thaliana. New Phytol. 213: 1257-1273.
  • Kong X, Lv W, Jiang S, Zhang D, Cai G, Pan J, et al. 2013. Genome-wide identification and expression analysis of calcium-dependent protein kinase in maize. BMC Genomics. 14: 1-15.
  • Lescot M, Déhais P, Thijs G, Marchal K, Moreau Y, Van de Peer Y, et al. 2002. PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences. Nucleic Acids Res. 30: 325-327.
  • Letunic I, Bork P. 2018. 20 years of the SMART protein domain annotation resource. Nucleic Acids Res. 46(D1): D493-D496.
  • Li Z, Jiang H, Zhou L, Deng L, Lin Y, Peng X, et al. 2014. Molecular evolution of the HD-ZIP I gene family in legume genomes. Gene. 533: 218-228.
  • Liu RX, Li HL, Qiao ZW, Liu HF, Zhao LL, Wang XF, et al. 2023. Genome-wide analysis of MdGeBP family and functional identification of MdGeBP3 in Malus domestica. Environ. Exp. Bot. 208: 105262
  • Liu S, Liu Y, Liu C, Zhang F, Wei J, Li B. 2022. Genome-wide characterization and expression analysis of GeBP family genes in soybean. Plants. 11: 1848
  • Liu S, Liu Y, Yang X, Tong C, Edwards D, Parkin IA, et al. 2014. The Brassica oleracea genome reveals the asymmetrical evolution of polyploid genomes. Nat. Commun. 5: 3930.
  • Lukens L, Zou F, Lydiate D, Parkin I, Osborn TJG. 2003. Comparison of a Brassica oleracea genetic map with the genome of Arabidopsis thaliana. Genetics. 164: 359-372.
  • Ma C, Chen Q, Wang S, Lers A. 2021. Downregulation of GeBP-like α factor by MiR827 suggests their involvement in senescence and phosphate homeostasis. BMC Biol. 19: 90
  • Nagaharu U, Nagaharu NJJJB. 1935. Genome analysis in Brassica with special reference to the experimental formation of B. napus and peculiar mode of fertilization. Jpn. J. Bot. 7: 389-452.
  • Oppenheimer DG, Herman PL, Sivakumaran S, Esch J, Marks MD. 1991. A myb gene required for leaf trichome differentiation in Arabidopsis is expressed in stipules. Cell. 67: 483-493.
  • Perazza D, Laporte F, Balagué C, Chevalier F, Remo S, Bourge M, et al. 2011. GeBP/GPL transcription factors regulate a subset of CPR5-dependent processes. Plant Physiol. 157: 1232-1242.
  • Perazza D, Vachon G, Herzog M. 1998. Gibberellins promote trichome formation by up-regulating GLABROUS1 in Arabidopsis. Plant Physiol. 117: 375-383.
  • Petryszak R, Keays M, Tang YA, Fonseca NA, Barrera E, Burdett T, et al. 2016. Expression Atlas update-an integrated database of gene and protein expression in humans, animals and plants. Nucleic Acids Res. 44(D1): D746-D752.
  • Pickett FB, Meeks-Wagner DR. 1995. Seeing double: appreciating genetic redundancy. Plant. Cell. 7: 1347
  • Potter SC, Luciani A, Eddy SR, Park Y, Lopez R, Finn RD. 2018. HMMER web server: 2018 update. Nucleic Acids Res. 46(W1): W200-W204.
  • Saeediazar S, Zarrini HN, Ranjbar G, Heidari P. 2014. Identification and study of cis regulatory elements and phylogenetic relationship of TaSRG and other salt response genes. JBES. 5: 1-5.
  • Saitou N, Nei M. 1987. The neighbor-joining method: A new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4: 406-425.
  • Samad AF, Sajad M, Nazaruddin N, Fauzi IA, Murad AM, Zainal Z, et al. 2017. MicroRNA and transcription factor: key players in plant regulatory network. Front. Plant Sci. 8: 565
  • Shan X, Yang K, Shi J, Zhu C, Gao Z. 2020. Genome-wide identification and expression analysis of GeBP transcription factor gene family in moso bamboo. J. Nanjing For. Univ. 44: 41.
  • Song XM, Huang ZN, Duan WK, Ren J, Liu TK, Li Y, et al. 2014. Genome-wide analysis of the bHLH transcription factor family in Chinese cabbage (Brassica rapa ssp. pekinensis). Mol. Genet. Genomic. 289: 77-91.
  • Tong C, Wang X, Yu J, Wu J, Li W, Huang J, et al. 2013. Comprehensive analysis of RNA-seq data reveals the complexity of the transcriptome in Brassica rapa. BMC Genomics. 14: 1-10.
  • Wang R, Wu X, Wang Z, Zhang X, Chen L, Duan Q, et al. 2023. Genome-wide identification and expression analysis of BrGeBP genes reveal their potential roles in cold and drought stress tolerance in Brassica rapa. Int. J. Mol. Sci. 24: 13597
  • Wang XiaoWu WX, Wang HanZhong WH, Wang Jun WJ, Sun RiFei SR, Wu Jian WJ, Liu ShengYi LS, et al. 2011. The genome of the mesopolyploid crop species Brassica rapa. Nat. Genet. 43: 1035-1039.
  • Wang Y, Feng L, Zhu Y, Li Y, Yan H, Xiang Y. 2015. Comparative genomic analysis of the WRKY III gene family in populus, grape, Arabidopsis and rice. Biol. Direct. 10: 1-27.
  • Wolters H, Jürgens G. 2009. Survival of the flexible: hormonal growth control and adaptation in plant development. Nat. Rev. Genet. 10: 305-317.
  • Wu J, Peng Z, Liu S, He Y, Cheng L, Kong F, et al. 2012. Genome-wide analysis of Aux/IAA gene family in Solanaceae species using tomato as a model. Mol. Genet. Genomics. 287: 295-311.
  • Yanai O, Shani E, Dolezal K, Tarkowski P, Sablowski R, Sandberg G, et al. 2005. Arabidopsis KNOXI proteins activate cytokinin biosynthesis. Curr. Biol. 15: 1566-1571.
  • Zhang W, Ruan J, Ho T-hD, You Y, Yu T, Quatrano RS. 2005. Cis-regulatory element based targeted gene finding: genome-wide identification of abscisic acid-and abiotic stress-responsive genes in Arabidopsis thaliana. Bioinformatics. 21: 3074-3081.
  • Zhang Z, Miller W, Lipman DJ. 1997. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25: 3389-402.
  • Zheng Y, Jiao C, Sun H, Rosli HG, Pombo MA, Zhang P, et al. 2016. iTAK: a program for genome-wide prediction and classification of plant transcription factors, transcriptional regulators, and protein kinases. Mol. Plant. 9: 1667-1670.
  • Zhou H, Zhou W, Yao X, Zhao Q, Lu L. 2023. Genome--wide investigation and functional analysis reveal that CsGeBP4 is required for tea plant trichome formation. Int. J. Mol. Sci. 24: 5207
  • Zhu Y, Wu N, Song W, Yin G, Qin Y, Yan Y, et al. 2014. Soybean (Glycine max) expansin gene superfamily origins: segmental and tandem duplication events followed by divergent selection among subfamilies. BMC Plant Biol. 14: 1-19.

Download Citation

Download a citation file in RIS format that can be imported by all major citation management software, including EndNote, ProCite, RefWorks, and Reference Manager.

Format:

Include:

Genome-Wide Characterization and Expression Profiling of GeBP Gene Family in Brassica rapa
Plant Breed. Biotech.. 2025;13:53-70.   Published online March 19, 2025
Download Citation

Download a citation file in RIS format that can be imported by all major citation management software, including EndNote, ProCite, RefWorks, and Reference Manager.

Format:
Include:
Genome-Wide Characterization and Expression Profiling of GeBP Gene Family in Brassica rapa
Plant Breed. Biotech.. 2025;13:53-70.   Published online March 19, 2025
Close

Figure

  • 0
  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
Genome-Wide Characterization and Expression Profiling of GeBP Gene Family in Brassica rapa
Image Image Image Image Image Image Image
Fig. 1 Phylogenetic analysis of GeBP proteins from B. rapa, B. oleracea, B. napus, Arabidopsis thaliana, and Raphanus sativus. With 1000 bootstrap replicates, the neighbor joining method was used in MEGA-7 to create the phylogenetic tree.
Fig. 2 The syntenic relationship among Brassica rapa, Brassica oleracea, Brassica napus, and Arabidopsis thaliana is depicted using green and orange colors to represent the highest sequence similarity between the genes of these crops.
Fig. 3 10 conserved motifs were found with the aid of the MEME-suite tool. The different colored boxes indicate different types of motifs.
Fig. 4 Exon-intron arrangement of GeBP genes was forecasted from GSDS 2.0. This structural arrangement was generated using gene and CDS sequences.
Fig. 5 Localization of the GeBP gene family on the chromosome of B. rapa. The relative length of a chromosome serves as a measure of its size. Each chromosome has a number at the top that corresponds to its chromosome. Megabases (Mb) is the scale on the left side of the image that was used to estimate the sizes of the chromosomes and the locations of the genes.
Fig. 6 Cis-regulatory elements were detected in the promoter region of BrGeBP genes. Numerical values indicate the number of cis-regulatory elements in each gene. These elements are non-coding DNA regions that control the neighboring genes' transcription.
Fig. 7 Expression of the BrGeBP genes in specific organs. Leaf, stem, flower, siliqua, root, and callus were present. The heat map was generated using the conditional formatting option in Microsoft Excel. The colors red, white, and green indicate high, moderate, and low levels of gene expression, respectively.
Genome-Wide Characterization and Expression Profiling of GeBP Gene Family in Brassica rapa

A comprehensive summary table consolidating data sources

Data Source
CDS, protein, and genomic sequences of B. rapa BRAD database (http://brassicadb.cn/#/)
Molecular characteristics: length, Mw, GRAVY, pI ProtParam tool (https://web.expasy.org/protparam/)
Open Reading Frame (ORF) Ensembl Plants database (https://plants.ensembl.org/index.html)
Subcellular localization Cell-PLoc 2.0 (http://www.csbio.sjtu.edu.cn/bioinf/Cell-PLoc-2/)
Domain SMART conserved domain search tool (http://smart.embl-heidelberg.de/)
Protein sequences of Arabidopsis (20), B. oleracea (18), B. napus (36), and Radish (20) iTAk database (http://itak.feilab.net/cgi-bin/itak/index.cgi)
Cis-acting elements PlantCARE web-based tool
(https://bioinformatics.psb.ugent.be/webtools/plantcare/html/)
RNA-sequencing data Expression Atlas database (https://www.ebi.ac.uk/gxa/home)

Comprehensive details on the Brassica rapa BrGeBP genes and associated proteins

Gene name Gene ID Chromosome location Protein
length
Mol. wt.
(kDa)
pI GRAVY ORF
(bp)
Subcellular
localization
Domain
DUF573

Number Start End Length
BrGeBP1 Bra009986 A06 19087069 19088313 26271742 414 44.96 4.87 -0.587 1245 Nuclear 123-219
BrGeBP2 Bra013842 A01 8042900 8046434 28605136 609 69.62 8.95 -0.263 1830 ER 494-581
BrGeBP3 Bra013873 A01 8185027 8186103 28605136 358 39.31 7.53 -0.785 1077 Nuclear 138-235
BrGeBP4 Bra017244 A04 15879572 15881487 18965343 377 43.16 4.56 -1.020 1134 Nuclear 56-150
BrGeBP5 Bra019183 A03 25857483 25858583 31715688 366 40.52 5.72 -0.924 1101 Nuclear 146-243
BrGeBP6 Bra019847 A06 4034905 4035975 26271742 356 39.16 7.66 -0.945 1071 Nuclear 108-198
BrGeBP7 Bra021317 A01 21515533 92151645 28605136 308 34.58 8.89 -1.001 927 Nuclear 106-193
BrGeBP8 Bra025445 A04 9053857 9054780 18965343 307 34.64 5.61 -0.892 924 Nuclear 100-195
BrGeBP9 Bra027078 A09 8683856 8684998 18965343 380 42.10 5.15 -1.017 1143 Nuclear 156-243
BrGeBP10 Bra028105 A04 6123535 6126692 18965343 640 72.81 8.98 -1.087 1923 Nuclear 89-177,
419-508
BrGeBP11 Bra028106 A04 6127732 6128706 18965343 299 33.63 6.17 -0.957 900 Nuclear 85-162
BrGeBP12 Bra028108 A04 6131824 6132723 18965343 299 33.91 5.91 -0.945 900 Nuclear 87-176
BrGeBP13 Bra031402 A01 17604367 17605365 28605136 332 36.94 5.09 -1.055 999 Nuclear 127-214
BrGeBP14 Bra032717 A04 5586257 5587255 18965343 332 36.61 5.31 -0.683 735 Nuclear 97-187
BrGeBP15 Bra034488 Scaffold 356960 357928 0 322 35.63 5.35 -0.668 969 Nuclear 97-187
BrGeBP16 Bra036116 A09 2403139 2404306 18965343 358 38.40 4.69 -0.570 1077 Nuclear -
BrGeBP17 Bra038583 Scaffold 250721 251980 0 419 46.35 5.08 -1.013 1260 Nuclear 229-324
BrGeBP18 Bra039500 A05 8899379 8900517 18965343 300 34.16 5.71 -0.870 903 Nuclear 36-130
BrGeBP19 Bra040116 A01 25918538 25919720 28605136 371 40.83 5.17 -0.700 1116 Nuclear -
BrGeBP20 Bra040416 Scaffold 97200 98477 0 425 46.79 4.84 -0.638 1278 Nuclear -
Table 1 A comprehensive summary table consolidating data sources
Table 2 Comprehensive details on the Brassica rapa BrGeBP genes and associated proteins

Mol. wt. - Molecular weight, pI- Iso-electric point, GRAVY- Grand average of hydropathy, ORF- Open reading frame, bp- base pair, kDa= Kilo Dalton, ER= Endoplasmic Reticulum