
Rice (
Nitrogen metabolism in plants involves several key processes, including uptake, transport, reduction (conversion to usable forms), assimilation, translocation, and remobilization (Lee 2021). These processes ensure the efficient acquisition and utilization of nitrogen from the soil to sustain plant growth and reproduction. Nitrogen is initially absorbed by the rice roots in the form of nitrate (NO₃⁻) or ammonium (NH₄⁺). The absorbed nitrogen is then transported through the plant to key tissues such as the leaves, where it is reduced and assimilated into amino acids, proteins, and other vital macromolecules. The nitrate reduction pathway involves the enzymatic conversion of nitrate to nitrite by nitrate reductase (NR) in the cytoplasm, followed by the reduction of nitrite to ammonium by nitrite reductase (NiR) in the chloroplasts (Gao et al. 2019). The resulting ammonium is incorporated into amino acids via the glutamine synthetase-glutamate synthase (GS-GOGAT) cycle, which is essential for maintaining cellular nitrogen homeostasis and supporting metabolic functions critical for plant growth (Tamura et al. 2011). However, despite the importance of these processes, nitrogen uptake in rice is often suboptimal, with current NUE levels ranging from 30-40% (Hakeem et al. 2011). As a result, the development of rice varieties with higher NUE and improved NDT has become a critical focus for agricultural research.
The genome-wide association study (GWAS) has emerged as a powerful tool to explore the genetic basis of complex traits, including NUE and NDT, in crops such as rice (Huang et al. 2012; Zhao et al. 2011). GWAS enables the identification of single nucleotide polymorphisms (SNPs) associated with phenotypic variation across genetically diverse populations. This method has several advantages over traditional quantitative trait locus (QTL) mapping, particularly for studying complex traits with high genetic diversity. GWAS can provide high mapping resolution due to the large number of genetic recombination events and the use of genetically diverse populations, making it possible to identify genetic loci with a strong correlation to the trait of interest (Famoso et al. 2011; Li et al. 2013). Recent studies have applied GWAS to identify genes associated with NUE in rice. For instance,
In this study, we used the North Korean (NK) rice population, which has been relatively underexplored in genetic research, despite its significant potential for the discovery of new genetic traits. NK rice is genetically distinct due to its isolated breeding history and unique environmental conditions. Understanding the genetic makeup of NK rice offers an opportunity to uncover novel genes and pathways that enhance NDT and NUE. Given its genetic diversity, this population serves as an excellent resource for GWAS to identify genes that can be leveraged in breeding programs to develop rice cultivars suited for nitrogen-limited environments. This study aims to investigate the genetic basis of NDT in a diverse population of 190 NK rice accessions. By applying GWAS, we seek to identify candidate genes and genetic markers associated with NDT, which could potentially lead to the development of rice varieties with improved NUE and tolerance to nitrogen-deficient soils. Additionally, this study will provide valuable insights into the genetic diversity of NK rice, contributing to a broader understanding of rice genetics and supporting future rice breeding efforts.
A population of 190 NK rice genotypes was provided by the International Rice Research Institute (IRRI). The rice seeds were surface-sterilized by shaking them in 1,000 ppm prochloraz for 30 minutes, then incubated at 30°C with daily water changes for two days to promote germination. The plants were then hydroponically grown in a greenhouse at Hankyong National University in Anseong-si, Gyeonggi-do, South Korea, beginning on March 31, 2023. Each variety was grown hydroponically in six biological replicates using Yoshida solution as the nutritional medium (Yoshida et al. 1971). The Yoshida solution contained the following nutrients: 40 ppm NH4NO3 for nitrogen, 10 ppm NaH2PO4⋅H2O for phosphorus, 40 ppm K2SO4 for potassium, 40 ppm CaCl2 for calcium, 40 ppm MgSO4⋅7H2O for magnesium, 0.5 ppm MnCl2⋅4H2O for manganese, 0.05 ppm (NH4)6Mo7O24⋅4H2O for molybdenum, 0.2 ppm H3BO3 for boron, 0.01 ppm ZnSO4⋅7H2O for zinc, 0.01 ppm CuSO4⋅5H2O for copper, 2 ppm FeCl3⋅6H2O for iron, 0.02 ppm C6H8O7⋅H2O, and 0.006 ppm H2SO4.
The hydroponic system maintained a pH of 5.0, and the Yoshida solution was refreshed weekly. The control group received the standard Yoshida solution under normal-nitrogen (NN) conditions, whereas the treatment group received a low-nitrogen (LN) solution with nitrogen levels reduced to 10% of those in NN (1 ppm NH4NO3). After 31 days of cultivation, the plants were evaluated for six agronomic traits: soil plant analysis development (SPAD), shoot length (SL), root length (RL), shoot fresh weight (SFW), root fresh weight (RFW), and tiller number (TN). Extreme observations were identified as outliers and removed due to high variability. Outliers from the six biological replicates were removed due to high variability, and the mean values were calculated to generate a phenotypic dataset for GWAS analysis. The relative low-N (ReLN) value was calculated as (LN/NN)×100.
The genomic sequencing data for the 190 NK rice genotypes used in this study were previously generated and reported by Jadamba et al. (2022). In that study, genomic DNA was extracted, libraries were prepared, and sequencing was performed using the MGI sequencing platform with 150-bp paired-end reads. Detailed protocols for DNA isolation, library preparation, and sequencing can be found in their study.
GWAS was conducted using the fixed and random model circulating probability unification (FarmCPU), implemented in the rMVP R package. FarmCPU is particularly well-suited for large-scale GWAS investigations as it iteratively updates fixed effects and random effects models, effectively reducing false positive errors typical in mixed linear models (MLM) (Liu et al. 2016). The rMVP package, recognized for its memory efficiency, enhanced visualization, and parallel acceleration capabilities (Lipka et al. 2012), facilitated the analysis. This comprehensive approach ensured robust results, accounting for agricultural and environmental conditions to yield consistent and biologically meaningful insights. FarmCPU accounts for confounding factors such as population structure and environmental impacts, enabling more accurate genetic signal extraction. Consequently, FarmCPU is widely adopted in agricultural and life sciences research and was selected as the GWAS model for this study. The analysis included a total of 2,394,361 SNPs. We filtered the data using PLINK with the following criteria: minor allele frequency (MAF)≥0.05, missing data rate≤25%, minimum genotype quality≥30, and minimum depth≥4. After filtering, the Bonferroni multiple test adjustment was applied to control for false positives, resulting in a threshold value of -log10 (P)=5.9 (0.05 divided by the number of independent SNPs at a 5% significance level). Lead SNPs were identified based on significance, and linkage disequilibrium (LD) plots were generated to pinpoint genomic regions of importance for each lead SNP. Genes located within 1 Mb of the lead SNPs were considered potential candidates. Haplotype analysis was then performed on genes physically adjacent to the lead SNPs. To evaluate haplotype-specific differences in phenotype, Duncan's test was applied to determine statistical significance.
Haplotyping of candidate genes identified through GWAS was performed to explicitly evaluate SNPs that cause significant phenotypic differences, as well as the distribution of genotypes harboring these SNPs across the 190 NK rice population. To capture all SNP variants, including insertion and deletion (InDel) calls, haplotype analysis was performed on raw, unfiltered genotyping data. After removing heterozygotes and missing data, SNPs and InDels in exon regions were used to conduct haplotype variation studies. We used annotation version 6.1 to scan the genomic areas of the important loci with japonica cv. Nipponbare genomic pseudomolecules from the Michigan State University (MSU) rice database. The candidate genes identified from the GWAS results were then evaluated using Duncan's test.
Nitrogen deficiency significantly hindered the growth of rice seedlings, as demonstrated by the phenotypic analysis of six agronomic traits after 31 days of hydroponic cultivation under NN and LN conditions (Fig. 1). The phenotypic values of five traits declined considerably under LN conditions (Figs. 1c, 1d, 1f, 1h). Notably, the decrease in SPAD value was evident in the color of the rice seedling shoots (Figs. 1a, 1b), indicating that nitrogen is essential for chlorophyll accumulation. Interestingly, RL significantly increased under LN conditions, likely as an adaptive response to enhance nitrogen uptake (Fig. 1e). However, although RL increased under nitrogen deficiency conditions, RFW significantly declined. This was likely due to physiological changes that elongated the roots but did not allocate sufficient energy or resources to increase root width or thickness.
The frequencies of the twelve phenotypes mostly followed a normal distribution, except for TN and ReTN, which exhibited significant skewness. This skewness is likely due to the strong suppression of tiller development caused by nitrogen deficiency (Fig. 2). The normal distribution of most traits suggests that the 190 NK rice population is genetically diverse, with some genotypes demonstrating superior plant growth under nitrogen-deficient conditions. These genotypes could serve as valuable resources for breeding cultivars with high NDT.
The correlation analysis for the six agronomic traits was performed between LN and ReLN values (Table 1). We found that SFW was positively correlated with SL and RFW in both LN and ReLN. Among them, ReSFW showed the highest correlation with ReRFW (0.9241) in ReLN. Notably, RL was negatively correlated with SPAD in both LN and ReLN.
Table 1 . Correlations among six traits used in GWAS.
SPAD | SL | RL | SFW | RFW | |
---|---|---|---|---|---|
SPAD | 1 | ||||
SL | 0.1987 | 1 | |||
RL | -0.1648* | 0.1351* | 1 | ||
SFW | 0.2303 | 0.5892*** | 0.0806 | 1 | |
RFW | 0.1934 | 0.4034 | 0.1348* | 0.7607*** | 1 |
ReSPAD | ReSL | ReRL | ReSFW | ReRFW | |
ReSPAD | 1 | ||||
ReSL | 0.4951*** | 1 | |||
ReRL | -0.0230 | 0.0622 | 1 | ||
ReSFW | 0.4195 | 0.7887*** | 0.1134 | 1 | |
ReRFW | 0.3873 | 0.7359 | 0.1425 | 0.9241*** | 1 |
To identify candidate genes associated with low-N tolerance, we performed a GWAS with 190 NK rice accessions. GWAS was performed using FarmCPU implemented in the rMVP R package, with 2,394,361 SNPs filtered using PLINK. A total of 107 significant lead SNPs were found across the 12 chromosomes. The quantile-quantile (Q-Q) plot revealed that while some features exhibited skewed distributions, there was an overall alignment between the expected and observed -log10 (P) values. LD analysis was used to identify candidate genomic regions associated with each significant peak. Candidate genes underlying the lead SNPs were determined by examining genomic regions 100 kb to 1 Mb adjacent to the lead SNPs. This analysis utilized pairwise R² values for SNPs, and gene annotations were sourced from the Rice Genome Annotation Project (RGAP; http://rice.uga.edu).
GWAS and LD analyses identified twelve known genes (
The twelve known genes identified through GWAS were categorized according to nitrogen metabolic processes. Nine of the twelve genes (
Table 2 . List of twelve known candidate genes linked to lead SNPs identified by GWAS.
Lead SNP | -Log10 (P) | Trait | Candidate gene | Gene ID | Description | Ref. |
---|---|---|---|---|---|---|
Chr10:23174935 | 29.38 | ReSPAD | Os10g0579600 | Nitrate transporter | (Huang et al. 2018) | |
Chr03:19112695 | 10.78 | ReRFW | Os03g0790600 | Transcription factor | (Zhu et al. 2022) | |
Chr01:1012388 | 10.66 6.52 | ReRFW ReSFW | Os01g0103100 | Nitrate transporter | (Tang et al, 2019) | |
Chr03:5664428 | 8.66 | ReSPAD | Os03g0223400 | Glutamine synthetase | (Tabuchi et al. 2005) | |
Chr01:37123160 | 7.94 | SPAD | Os01g0872500 | Nitrate transporter | (Wang et al. 2022) | |
Chr12:3917349 | 7.93 | SPAD | Os12g0194900 | Amino acid permease | (Fang et al. 2021) | |
Chr02:24491851 | 7.47 6.76 | ReSFW ReRFW | Os02g0620600 | Ammonium transporter | (Sonoda et al. 2003) | |
Chr02:28607552 | 7.44 | SPAD | Os02g0699000 | Nitrate transporter | (Hu et al. 2016) | |
Chr10:21719487 | 7.33 6.24 | SFW SL | Os10g0554200 | Nitrate transporter | (Hu et al. 2015) | |
Chr11:7816560 | 7.84 7.17 | ReSL ReRL | Os11g0235200 | Nitrate transporter | (Li et al. 2009) | |
Chr02:33539436 | 6.55 | ReSL | Os02g0797500 | Aminotransferase | (Zhou et al. 2009) | |
Chr01:7938122 | 6.19 | SFW | Os01g0236700 | Transcription factor | (Hu et al. 2019) |
Among the six unknown genes identified by GWAS,
Table 3 . List of six unknown candidate genes linked to lead SNPs identified by GWAS.
Lead SNP | -Log10 (P) | Trait | Candidate gene | Gene ID | Description | Ref. |
---|---|---|---|---|---|---|
Chr10:3108237 | 13.95 10.60 | ReSFW ReRFW | Os10g0151500 | Kinase | (Zhang et al. 2005) | |
Chr05:3016061 | 12.45 | SL | Os05g0161500 | (p)ppGpp synthetase | (Tozawa et al. 2007) | |
Chr07:2925373 | 12.28 | SL | Os07g0150700 | Kinase | (Yang et al. 2008) | |
Chr04:3959459 | 12.13 | SL | Os04g0165200 | Transcription factor | (Che et al. 2018) | |
Chr10:12552737 | 11.71 | SL | Os10g0390500 | Aminotransferase | (Yang et al. 2015) | |
Chr06:18103083 | 10.68 | ReRFW | Os06g0504900 | Transcription factor | (Zhang et al. 2008) |
Haplotype analysis was conducted on all twelve known genes identified by GWAS. The haplotypes of the candidate genes were classified based on differences in SNPs and Indels located in the genomic DNA sequence of each gene. The haplotypes and their positions on the LD plot are shown for five known genes associated with the most significant lead SNPs (Fig. 5).
The second known gene,
The third known gene,
The fourth known gene,
The fifth known gene,
Haplotype analysis of the six unknown genes identified by GWAS was performed and statistical significance was observed for all of them. The haplotypes and positions on the LD plot are shown for three unknown genes associated with the most significant lead SNPs (Fig. 6).
The first candidate gene
The second candidate gene,
The third candidate gene
This study highlights the identification of candidate genes associated with NDT in the 190 NK rice genotypes through GWAS. By analyzing six agronomic traits under NN and LN conditions, we identified 18 significant genes, including 12 known genes related to NUE and six novel candidates. These findings underscore the potential of 190 NK rice genotypes as valuable genetic resources for improving NUE and NDT. The results not only provide new insights into the genetic basis of NDT but also offer promising targets for future breeding programs aimed at enhancing rice productivity in nitrogen-limited environments.
Shoot-related traits, such as SL and SFW, are critical phenotypes in NUE studies (Sharma et al. 2021), as they are directly linked to biomass production and overall plant growth—both of which are primary indicators of NUE. Under LN conditions, two known genes were significantly associated with shoot-related traits:
Nitrogen deficiency significantly affects rice growth and development, resulting in changes to both morphological and physiological traits. A notable reduction in SPAD value, an indicator of chlorophyll content, was observed under LN conditions, reflecting decreased photosynthetic capacity. This decline in SPAD value was accompanied by an increase in RL, a common adaptive response to nitrogen deficiency (Liu et al. 2023). Correlation analysis results further supported this trend, revealing a negative correlation between RL and SPAD (Table 1). Likewise, ReSPAD also exhibited a negative correlation with RL (Table 1), suggesting that under severe nitrogen deficiency, rice plants tend to prioritize root elongation over chlorophyll biosynthesis to enhance nitrogen uptake. Nitrogen deficiency is known to promote primary root growth by promoting cell elongation and division, thereby increasing root surface area for nitrogen absorption (Zhang et al. 2012). These findings align with previous reports describing chlorotic leaf symptoms and elongated, thinner roots in nitrogen-deficient rice plants (Hsieh et al. 2018). This adaptive strategy underscores a trade-off between enhanced root development and reduced photosynthetic capacity, enabling rice plants to cope with nitrogen stress.
The GWAS analysis for NDT identified a total of 18 candidate genes, including 12 previously known genes (Table 2) associated with NUE and six novel candidates (Table 3). Notably, more genes were identified in ReLN traits (seven known genes) than in LN traits (five known genes), emphasizing the advantage of using relative traits in identifying key genetic components of NUE. The ReLN value, calculated as (LN/NN) x 100, enabled for the assessment of how well the 190 NK rice genotypes perform under LN conditions compared to NN conditions. Furthermore, the repeated identification of genes across different ReLN traits (Fig. 4), such as
In this study, we identified six candidate genes associated with NDT through GWAS analysis (Table 3), highlighting their potential roles in NDT. Notably, three genes—
This work was supported by the Cooperative Research Program for Agriculture Science and Technology Development Project (RS-2022-RD010405) of the Rural Development Administration and the National Research Foundation (NRF) of Republic of Korea (No. RS-2023-00253851).
Download Form
![]() |
![]() |