Skip to main content

Bacterial and archaeal communities within the alkaline soda Langaco Lake in the Qinghai-Tibet Plateau



Langaco Lake (LGL) is a soda lake located at an altitude of 4548 m in the Qinghai-Tibet Plateau in China. LGL exhibits unique hydrochemical characteristics among soda lakes, but little is known about the microbial diversity of LGL and the microbial interactions with environmental factors.


The water samples were filtered using chemical-grade cellulose acetate membrane (pore size of 0.45 μm), and the hydrochemical characteristics were analyzed. Community DNA was extracted, and then high-throughput sequencing of 16S rRNA genes was conducted to evaluate the composition of the microbial community.


The high-throughput sequencing of 16S rRNA genes revealed that the bacterial diversity in LGL consisted of 327 genera in 24 phyla (4871 operational taxonomic units (OTUs); Shannon index values of 5.20–6.07), with a significantly higher diversity than that of the Archaea (eight phyla and 29 genera comprising 1008 OTUs; Shannon index values of 2.98–3.30). The bacterial communities were dominated by Proteobacteria (relative abundances of 42.79–53.70%), followed by Bacteroidetes (11.13–15.18%), Planctomycetes (4.20–12.82%), Acidobacteria (5.91–9.50%), Actinobacteria (2.60–5.80%), and Verrucomicrobia (2.11–4.08%). Furthermore, the archaeal communities were dominated by Crenarchaeota (35.97–58.29%), Euryarchaeota (33.02–39.89%), and Woesearchaeota (6.50–21.57%). The dominant bacterial genus was Thiobacillus (8.92–16.78%), and its abundances were most strongly correlated with the total phosphorus (TP) content, pH value, CO32− concentration, and temperature. The most abundant archaeal genus was Methanoregula (21.40–28.29%), and its abundances were the most highly correlated with the total organic carbon (TOC) content, total salinity (TS), and K+ and Na+ concentrations.


The results of this study provide valuable insights for developing a more comprehensive understanding of microbial diversity in these unique carbonate alkaline environments, as well as a better understanding of the microbial resources on the Qinghai-Tibet Plateau.


Soda lakes are exceptional among aquatic ecosystems because they simultaneously exhibit high productivity rates (i.e., carbon and the large amount of dissolved organic matter produced by photosynthesis) and high pH values (9.5–11.0) (Banda et al. 2019; Paul et al. 2015). The lakes are naturally occurring alkaline environments that contain high concentrations of sodium carbonate owing to evaporation. In addition, high concentrations of other salts can also accumulate, especially sodium chloride, leading to the formation of alkaline saline lakes (Namsaraev et al. 2015). Soda lakes have likely massively contributed to global primary productivity in Earth’s geological past and these soda lake environments represent examples of contemporary extreme environments. Generally, they are inland lakes and have a propensity to become meromictic due to regional and local hydrologic events. In addition, they are highly productive due to elevated temperatures, high sunlight incidence, and large supplies of CO2. Furthermore, diverse microbial populations are abundant in soda lakes (Lanzen et al. 2013). Many regional examples of soda lakes have been reported, including in the East African Rift Zone, the rain-shadowed regions in California and Nevada, the Kulunda steppe in Russia, and on the Cariboo Plateau in Canada. Similarly, many microorganisms have been isolated from such lakes, including Cyanobacteria, chemolithoautotrophic sulfide oxidizing bacteria, sulfate-reducing/nitrifying/denitrifying bacteria, aerobic heterotrophic bacteria, fermentative bacteria, methanotrophs, and methanogens (Zorz et al. 2019; Tiodjio et al. 2014).

The underlying basaltic rocks in some areas of these plateaus originate from Miocene and Pliocene volcanic activity and have led to ideal conditions for the formation of soda lakes owing to the low solubility of calcium and magnesium in the basaltic formations. Thus, soda lakes are important components of terrestrial ecosystems, contain abundant microbial resources, and play critical roles in geochemical cycles by promoting material exchange, such as those in the Qinghai-Tibet Plateau in China (Xing et al. 2019). Soda lakes are widely distributed in terrestrial plateau ecosystems with extreme environmental conditions, such as the persistence of extreme droughts, intense solar ultraviolet radiation, extreme daily temperature changes, and low partial pressures of dissolved and atmospheric oxygen (Namsaraev et al. 2015; Lanzen et al. 2013). Consequently, changes in these environmental conditions in association with elevation can lead to changes in bacterial or archaeal lake community diversity in high-altitude areas (Liu et al. 2010).

Langaco Lake (LGL) is one of the highest typical soda lakes in the Qinghai-Tibet Plateau. However, the microbial structure and diversity of LGL have not been previously investigated. LGL is also minimally directly affected by human activities and thus remains ecologically intact, thereby providing a natural laboratory for scientific studies. These rarely explored lakes may harbor new microbial species, and thus, an understanding of the bacterial diversity in these high-altitude lakes is critical for species protection and ecosystem conservation (Mesbah et al. 2007). High-throughput sequencing has been extensively used to investigate microbial communities in recent years via 16S rRNA gene compositional analysis of samples from natural environments. These methodologies have become increasingly used to determine differences in microbial community diversity and structure among environments, thereby helping to reveal the interactions among microorganisms in such environments, as well as their adaptions to specific environments (Paul et al. 2015). In this study, Illumina high-throughput sequencing analysis of community 16S rRNA genes was used to comprehensively investigate the bacterial and archaeal communities in LGL, and to explore the dominant genera in LGL and their associations with environmental factors. Therefore, the results of this study provide a theoretical framework for understanding the relationships among microorganisms and environments under alkaline conditions on plateaus.

Materials and methods

Sample sites and sample collection

Langaco Lake is located in the northwestern margin of the Tibetan Plateau (30°40′30.6″N, 81°18′32.3″E) in Pulan County in the Ngari Region at an altitude of 4548 m. LGL has an area of 256.2 km2 and experiences a frigid semi-arid plateau climate (Mianping 1997). The lake is irregularly, slightly spoon-shaped. There are several islands exposed within the lake, and the terrain around the islands is very steep. The northern portion of the lake is a smaller open lake area that is connected to the open southern part by a narrow channel, while the center is flat (Wang et al. 2013). The lake has a sodium concentration of 106.24 mg/L, a total salinity of 1.00 mg/L, and a pH of 8.62 (Mianping 1997). The total area of the lake has decreased since the 1970s, especially in the northwestern region, while the temperatures have generally increased and precipitation has significantly decreased in this region (Dai 2020). Consequently, the lake water is primarily replenished by meltwater from the glaciers to the north (Wang et al. 2013).

Four samples were collected from LGL in mid-July 2018 from a sediment depth of 30–40 cm. These samples were mixtures of water and sediment (about 4 L total). The distance between any two samples was greater than 4 km (Fig. 1), and they were all collected about 5 m from the coastline. In addition, approximately 2 L of water was immediately filtered through a 0.22-μm filter (Millipore, USA) on site for subsequent DNA extraction. A portable pH meter (LEICI/PHBJ-261L, Shanghai) was used to measure the pH in situ. The filters were taken back to the laboratory on ice, while the water samples collected for physicochemical analysis were stored at 4°C.

Fig. 1
figure 1

Geographic location of Langaco Lake and the location of the four sampling sites in the lake. The map is sourced from Esri, Digital Globe, GeoEye, Earthstar Geographics, CNES/Airbus DS, USDA, USGS, AeroGRID, IGN, and the GIS User Community

Hydrochemical analyses

The water samples were filtered through a chemical-grade cellulose acetate membrane (pore size of 0.45 μm, Millipore, USA), and the physical and chemical properties of water samples were determined according to the general rules of analytical methods (JY/T020-1996). The major cation concentrations (Na+, K+, Ca2+, and Mg2+) were measured using atomic absorption spectrometry (CE 3000 series spectrometer, Thermo Scientific, USA). The anion (Cl and SO42−) concentrations were measured using an ion chromatograph (Dionex/ICS-6000, Thermo Scientific, USA). The concentrations of CO32− and HCO3 were detected by titration. The total salinity (TS) was determined by the drying gravimetric method (HJ/T51-1999), while the total organic carbon (TOC) and total nitrogen (TN) were detected by the total organic carbon/total nitrogen analyzer (Multi N/C2100, Jena, Germany). Finally, ammonium molybdate spectrophotometry (GB11893-891) was used to determine the concentration of total phosphate (TP).

Microbial community DNA extraction and PCR amplification

The 0.22-μm filter membranes (Millipore, USA) used to filter the water samples via vacuum filtration were sectioned according to the manufacturer’s instructions. Then, the community DNA was extracted using an E.Z.N.A Mag-Bind Soil DNA Kit (Omega Bio-Tek, USA). The integrity of the extracted DNA was evaluated using 1% agarose gel electrophoresis, and a Qubit® 2.0 Fluorometer Q32866 type (Invitrogen, USA) was used to determine the DNA concentrations followed by stored at −80°C.

To evaluate the microbial community’s composition, the bacterial and archaeal 16S rRNA genes from the water DNA extracts were amplified using polymerase chain reaction (PCR) with domain-specific primers. The PCR amplifications of the V3–V4 hypervariable regions of the bacterial 16S rRNA genes were conducted with primers 341F (5′-ACTCCTACGGGAGCAGCA-3′) and 805R (5′-GGACTACHVGGGTWTCTAAT-3′) (Han et al. 2017). Similarly, the universal primers 349F (5′-ACGGGGYGCAGCAGGCGCGA-3′) and 806R (5′-GACTGGAGTTCCTTGGCACCCGAGAAT-3′) (Deng et al. 2012) were used to amplify the V3–V4 hypervariable regions of the archaeal 16S rRNA genes. The PCR mixtures (30 μL volume) consisted of 15 μL of 2×Taq master mixture (containing 0.1 U/μL Taq DNA polymerase EP0406 (Thermo Scientific, USA), 0.4 mM per dNTP, and 2×Taq buffer), 10–20 ng of community genomic DNA, 1 μL of each primer (10 μM concentrations), and 12 μL of ultrapure H2O. PCRs were repeated in triplicate for each sample on a T100TM thermal cycler PCR system (Bio-RAD, USA). The PCR conditions were 95°C for 3 min, by 32 cycles of denaturation at 95°C for 30 s, annealing at 55°C for 30 s, extension at 72°C for 45 s, and extension at 72°C for 10 min. The amplified PCR fragments were purified using an Agencourt AMPure XP Kit (A63882, Beckman, USA), and then, they were quantified using a Qubit 2.0 DNA quantification Kit (InvitGen, USA). The 16S rRNA gene sequencing was conducted using the Illumina MiSeq 300PE platform (Sangong Biotechnology Co., Ltd., Shanghai, China).

Processing of Raw Sequence Reads and Statistical Analyses

The raw 16S rRNA gene sequences (ranging from 45,000 to 55,000 sequences per sample) were processed and analyzed using the procedure described by Han et al. (2017). Briefly, the Cutadapt software program (v.1.2.1) was used to combine the original paired-end sequences into overlapping contig sequences (Edgar et al. 2011). The Uchime software program (v.4.2.40) was then used to detect and remove chimeric sequences (Martin 2011). In addition, the Usearch (v.5.2.236) program (Edgar 2010) was also used to detect chimeric sequences by comparison against the SILVA (release 132; and Ribosomal Database Project (RDP) (v.11.3; databases. The taxonomic classification of the operational taxonomic units (OTUs) was conducted using the Quantitative Insights into Microbial Ecology (QIIME; V.1.8.0) pipeline, along with the RDP Bayesian classification method (v.2.12) and a 97% classification confidence level threshold (Wang et al. 2007; Caporaso et al. 2010).

The community alpha-diversity was calculated using the Mothur software package (v.1.30.1) (Schloss et al. 2009), including the abundance-based coverage estimation (ACE), terminal richness estimation (Chao1), Simpson index, Shannon-Weiner index, rarefaction analysis, and Good’s coverage estimation. Venn diagrams were used to assess the numbers of shared and unique OTUs among the samples. The beta-diversity was measured based on the Bray-Curtis distances between the samples, and the overall community differences were evaluated through a full linkage cluster analysis of the Bray-Curtis distances. Canonical correspondence analysis (CCA) was performed using the CCA function in the vegetarian R package, the community distance matrix, and the hydrochemical factors (Han et al. 2017). The variables that significantly explained the differences in the community composition were evaluated using permutation tests under a simplified model.

Taxonomic classification analysis

The relative abundances of the bacterial and archaeal communities were summarized at the phylum, class, and genus levels. The R software suite was used to construct boxplots of the relative taxonomic abundances among the samples, while the GraPhlAn software package (Asnicar et al. 2015); and the online Tree Of Life interactive tool (ITOL, v.3.2.1) (Letunic and Bork 2015) were used for phylogenetic tree visualization of the 100 most abundant OTUs.

Sequence accession numbers

The raw 16S rRNA gene sequences were deposited in the National Center for Biotechnology Information (NCBI) database under the BioSample accession Nos. SAMN20703607 to SAMN20703610 for Bacteria, and SAMN20703611 to SAMN20703614 for Archaea.


Hydrochemical characteristics

LGL is a sodium carbonate-type lake. Its major water cations were Na+ (105.90–106.62 mg/L) and Ca2+ (84.24–85.80 mg/L), and its major anions were HCO3 (472.34–476.17 mg/L) and CO32− (75.97–76.43 mg/L). The average water temperature in mid-July was 16.7°C, while the pH of the water was alkaline, ranging from 8.54 to 8.62. The average TOC, TN, and TP values were 6.21, 104.21, and 1.91 mg/L, respectively (Table 1).

Table 1 Chemical characteristics of surface water and sediment samples from Langaco Lake

Bacterial and archaeal community diversity

The bacterial and archaeal community compositions of the four LGL samples were investigated through high-throughput Illumina sequencing of the community 16S rRNA genes (Table 2). A total of 5879 OTUs were recovered, including 4871 bacterial OTUs and 1008 archaeal OTUs. Among the bacterial samples, the richness and diversity of samples L2 and L4 were slightly higher than those of samples L1 and L3. The observed bacterial community OTU richness, Shannon index, and ACE values were 1025–1293, 5.20–6.07, and 2148.23–2376.17, respectively. In contrast, the archaeal diversity (221–269 observed OTUs, Shannon index values of 2.98–3.30, and ACE index values of 417.63–536.42) was significantly lower than that of the bacterial communities.

Table 2 Operational taxonomic unit (OTU) richness and diversity measures from Langaco Lake communities

Taxonomic composition of LGL microbial communities

Venn diagrams were used to visualize the overlap in the OTUs among the lake water samples (Fig. S1). The total numbers of unique bacterial genera observed in the four samples were 219 (L1), 270 (L2), 291 (L3), and 271 (L4), while the numbers of unique archaeal genera were significantly lower at 77 (L1a), 56 (L2a), 72 (L3a), and 74 (L4a). There were 472 and 81 shared bacterial genera among the bacterial and archaeal communities. The most abundant sequences were selected as representative sequences of the OTUs, and the relative abundances of the bacterial and archaeal populations among the communities were compared at the phylum, class, and genus taxonomic levels (Fig. 2 and S2). In addition, community clustering based on community compositional differences revealed that there were three different bacterial community groups, with L3 and L1 each comprising a single community, and L4 and L2 forming a community together. In contrast, the archaeal communities included two groups with L4 and L1 comprising one group and L3 and L2 comprising the second group. Overall, 24 bacterial phyla were detected in the LGL microbial communities, including 50 classes and 327 genera. Additionally, eight archaeal phyla were detected that comprised nine classes and 29 genera (Tables S1, S2, and S3).

Fig. 2
figure 2

Taxonomic classification and community clustering of LGL bacterial (A) and archaeal (B) communities. The top, middle, and bottom panels show taxonomic distributions at the phylum, class, and genus levels, respectively

At the phylum level, the predominant bacteria (>1% relative abundance) were Proteobacteria (relative abundance of 42.79–53.70%), Bacteroidetes (11.13–15.18%), Planctomycetes (4.20–12.82%), and Acidobacteria (5.91–9.50%), followed by Actinobacteria (2.60–5.80%), Verrucomicrobia (2.11–4.08%), Chloroflexi (2.00–2.54%), Parcubacteria (0.83–2.16%), Firmicutes (0.63–1.14%), and Nitrospirae (0.21–1.53%). The dominant archaeal phyla among all of the samples were Crenarchaeota (35.97–58.29%), Euryarchaeota (33.02–39.89%), Woesearchaeota (6.50–21.57%), and Pacearchaeota (0.63–1.13%). The relative abundances of Woesearchaeota were greater than 20% in samples L2a, L3a, and L4a, but it was only 6.50% in L1a.

At the class level, the dominant Bacteria (>1% relative abundance) among the four samples were Betaproteobacteria (21.46–28.52%), followed by Alphaproteobacteria (9.13–12.34%), Gammaproteobacteria (7.91–10.70%), Sphingobacteriia (5.90–7.61%), Actinobacteria (2.60–5.76%), Deltaproteobacteria (1.87–2.95%), and Planctomycetia (3.97–12.00%). The dominant Archaea among the four samples were Thermoprotei (35.97–58.29%), Methanomicrobia (24.08–32.84%), Thermoplasmata (6.94–8.86%), and unclassified Woesearchaeota groups (6.50–21.57%).

At the genus level, the dominant bacterial genera (excluding the unclassified genera) among the four samples were Thiobacillus (8.92–16.78%), Hydrogenophaga (1.76–3.71%), Gemmobacter (1.38–3.56%), Algoriphagus (0.87–3.15%), Pirellula (1.07–2.85%), Parcubacteria-genera-incertae-sedis (0.83–2.16%), Ilumatobacter (1.25–1.70%), Sphingorhabdus (0.69–1.74%), Phaeodactylibacter (1.07–1.40%), and acidobacterial groups GP16, GP6, GP3, and GP7 (0.67–2.42%). In addition to the above bacterial genera, other Bacteria exhibited higher abundances within the individual samples, including Lacibacter (1.15%) in L1, Saccharibacterial genera incertae-sedis (0.59–0.80%) in L2 and L3, Thermomonas (1.03–3.21%) in L1 and L3, and Litorilinea (0.55–0.68%) in L2 and L4. The dominant archaeal genera were Methanoregula (21.40–28.29%), Thermocladium (4.17–12.75%), Methanomassiliicoccus (6.92–8.77%), the Woesearchaeota incertae-sedis-AR16 group (2.46–5.96%), and Methanothrix (1.02–1.91%). Each of the four samples exhibited unique archaeal genera, including Aridibacter (0.01%) in L1a, Aquisphaera (0.01%) in L4a, and Halovenus (0.01%) in L3a and L4a.

Associations between environmental factors and dominant genera

The differences in the abundances among the samples were investigated (Fig. 3) while considering the twelve and nine most abundant bacterial and archaeal phyla. The most abundant (relative abundances of > 1%) bacterial genera among the four samples were the betaproteobacterial genera Thiobacillus and Hydrogenophaga, the alphaproteobacterial genus Gemmobacter, and the gammaproteobacterial genus Thermomonas. In addition, Methanoregula was the most abundant archaeal genus, and other archaeal genera exhibited higher abundances in individual samples, including Thermocladium and Methanomassiliicoccus.

Fig. 3
figure 3

Boxplots showing the distributions of dominant bacterial (A) and archaeal (B) genera abundances among LGL sites. Different phyla are indicated by different colors (as indicated in the top right corner), while genera within the same phylum are indicated with the same colors. Dominant taxa are indicated in red font

CCA was conducted to evaluate the relationships among community structures and environmental parameters, yielding numerous associations between the overall community composition and several environmental parameters. Consequently, the environmental parameters were analyzed in the context of the representative genera (Fig. 4). The abundances of the dominant bacterial genus Thiobacillus were most strongly correlated with the TP content, followed by the pH, CO32− concentration, and temperature. The abundances of the next most dominant bacterial genera (Hydrogenophaga and Gemmobacter) were associated with the TP and HCO3 content. The moderately abundant (1.50–4.00%) bacterial genera Thermomonas, Algoriphagus, and Sphingorhabdus in sample L1 were correlated with the TP content, pH, and HCO3 concentration. The abundances of the Parcubacteria genera incertae sedis and group GP16 were strongly correlated with the pH and Cl concentration, respectively. The less abundant (1.5–2.5%) genera Rheinheimera, Nitrospira, and Gp6 in sample L2 were significantly correlated with the TN content, as well as the Ca2+, SO42−, and Cl concentrations. Furthermore, the abundances of the genera Pirellula, Gimesia, and Spartobacteria genera incertae sedis were related to the K+, Na+, and TOC concentrations and the TS in samples L3 and L4. The abundances of the dominant (>20%) archaeal genera Methanoregula, Methanothrix, Methanomassiliicoccus, Pacearchaeota incertae sedis-AR13, and Woesearchaeota incertae sedis-AR16 were highly correlated with the TOC, K+, and Na+ contents and TS. In addition, the Thermocladium abundances were particularly highly associated with the HCO3 concentration.

Fig. 4
figure 4

Canonical correspondence analysis (CCA) showing the relationships among dominant genera and hydrochemical variables in LGL. Samples are indicated by red circles, while genera are indicated by blue triangles, and environmental variables are indicated by arrows. TS, TOC, TN, and TP correspond to total salinity, total organic carbon, total nitrogen, and total phosphate, respectively


Characteristics of the LGL soda lake

Soda lakes are globally distributed and are predominantly found in arid and semi-arid environments, including Hungary, Egypt, America, and China (Namsaraev et al. 2015; Rojas et al. 2018). Soda lakes have recently been classified into soda and soda-salt types based on differences in their carbonate and bicarbonate contents. Soda types are those that are dominated by sodium, bicarbonate, and carbonate ions, while soda-salt types are those dominated by sodium ions, as well as other ions besides bicarbonate/carbonate (Boros and Kolpakova 2018). Soda lakes generally form in hydrologically closed lake basins, and their water has high Na+ concentrations, in addition to high CO32− (0.023–63.20 g/L) or HCO3 (0.11–20.40 g/L) concentrations (Table 3); however, their water may contain abundant and variable concentrations of SO42−, K+, and/or Cl (Boros and Kolpakova 2018; Schagerl and Renaut 2016). LGL is an example of an extreme soda lake with a lower ionic content, with CO32−, and HCO3 concentrations of 76.30 and 475.56 mg/L, respectively. Soda lake water also usually contains high concentrations of Cl (73,000 mg/L) (Schagerl and Renaut 2016). In contrast, LGL has a Cl concentration of 55 mg/L. Thus, LGL is considered a weakly ionic soda lake. The formation of alkalinity is closely related to the hydrology, climate, and regional geology, which ultimately affect the diversity of microorganisms within these systems. The pH of the water in LGL was 8.62; whereas it is greater than 9.00 in most other soda lakes (Table 3). Due to the extreme conditions within these systems, the characteristics of soda lakes (including high pH values) provide unique environments for microbial communities (Table 3). Thus, a predominance of carbonate and bicarbonate (in addition to the associated alkalinity) are hallmarks of soda lake ecosystems, and diversity has been observed in their hydrochemical characteristics.

Table 3 Hydrochemical characteristics and dominant genera within worldwide soda lakes

Bacterial and archaeal diversity within LGL

Despite the extreme environments within soda lakes, they have high levels of microbial diversity. This is evinced by the 327 and 29 bacterial and archaeal genera, respectively, within LGL, which corresponded to 4871 and 1008 16S rRNA gene OTUs, respectively. Under the condition of using the same sequencing method, LGL exhibited a higher diversity than other soda lakes that have been previously investigated including four alkaline soda lakes in the Cariboo Plateau (bacterial OTUs: 1662), Doroninskoe Lake (bacterial OTUs: 2254), and Lonar Lake (bacterial OTUs: 1,568) (Table 3). Furthermore, the Shannon diversity index values calculated in this study (Bacteria: 5.20–6.07; Archaea: 2.98–3.30) were higher than previously calculated for soda lakes, including Doroninskoe Lake (Bacteria: 1.49–3.46) and five soda lakes in the Badain Jaran Desert (Bacteria: 1.15–3.24) (Table 3). Thus, the microbial diversity of LGL appears to be considerably higher than those of other previously studied soda lakes.

Microbial community structure of LGL

High-throughput sequencing of community 16S rRNA genes was used to conduct a comprehensive investigation of the microbial community diversity in LGL. Twenty-four bacterial phyla and 50 classes were detected in the LGL microbial communities, representing a significantly higher level of taxonomic diversity than in other soda lakes, including 17 bacterial phyla in Mono Lake (CA, USA) and 11 bacterial phyla in Doroninskoe Lake (Transbaikalia, Russia) (Rojas et al. 2018; Matyugina et al. 2018). The most dominant bacterial phylum was Proteobacteria (42.79–53.70%), followed by Bacteroidetes (11.13–15.18%), Planctomycetes (4.20–12.82%), and Acidobacteria (5.91–9.50%). Proteobacteria, Bacteroidetes, and Firmicutes are typically the dominant bacterial taxa in soda lakes (Paul et al. 2015; Mesbah et al. 2007). The unique presence of sodium carbonate may be one of the factors affecting the differences in the microbial diversity in LGL compared to other soda lakes. As was previously mentioned, Proteobacteria and Bacteroidetes have been detected in many soda lakes (Paul et al. 2015; Mesbah et al. 2007), but their relative abundances considerably vary in these systems. The relative abundance of Proteobacteria was reported to be 29.5% in Lonar Lake (India) (Paul et al. 2015), which is significantly lower than that observed in LGL and also in the Soda Lake of Inner Mongolia (Namsaraev et al. 2015). In addition, the abundances of some bacterial groups were lower than those observed in LGL, including Bacteroidetes (8.25%) and Planctomycetes (6.8%) (Banda et al. 2019). Planctomycetes and Acidobacteria have rarely been observed in other soda lakes (Namsaraev et al. 2015; Aguirre-Garrido et al. 2016). Our results indicate that the compositions of the LGL bacterial communities were primarily affected by the environmental factors, which is most likely due to the long residence time of this lake. These effects are likely reflected in the differences in the dominant taxonomic classes within LGL relative to those in other soda lakes. Cytophagia and Flavobacteria (phylum: Bacteroidetes) are the dominant classes in other soda lakes (Szabó et al. 2017), while the most abundant class of Bacteroidetes in LGL was Sphingobacteria, which may be related to the unique hydrochemical characteristics of LGL. Similarly, Alphaproteobacteria are typically the dominant proteobacterial class among soda lakes (Szabó et al. 2017); however, the Gammaproteobacteria class was dominant in LGL. Thus, the LGL bacterial communities exhibit unique compositional differences relative to other soda lakes.

The archaeal diversity in LGL was much lower than that observed for the bacterial communities, but it was unique among soda lakes. A total of eight archaeal phyla were detected in LGL comprising nine classes and 29 genera. The LGL archaeal communities exhibited a higher level of diversity than those of other soda lakes. For example, only three archaeal phyla were detected in Doroninskoe Lake (Matyugina et al. 2018). Interestingly, the dominant archaeal phyla also vary with the sediment salinity in some salt lakes. For example, Crenarchaeota are generally dominant in hyposaline sediments, while Halobacteriales (phylum Euryarchaeota) are dominant in hypersaline sediments (Jiang et al. 2007). Methanogens and other Euryarchaeota are important contributors to global organic carbon cycling (Vavourakis et al. 2016). The relative abundances of the Archaea in LGL also varied compared to those in other soda lakes. Crenarchaeota were dominant (43.96%), followed by Euryarchaeota (36.56%). In contrast, Crenarchaeota have been either undetected (Matyugina et al. 2018) or had very low levels (Rojas et al. 2018) in other soda lakes. Similar to the levels observed in LGL, Euryarchaeota (35.15%) has been reported to be the most abundant phylum in the soda lakes in the Badain Jaran Desert (Banda et al. 2019). Nevertheless, the overall microbial diversity in LGL was higher than those reported in other studies, and the unique hydrochemical characteristics of LGL may be an important factor contributing to this high diversity and unique microbial community structure.

In addition, the unclassified Bacteria and Archaea accounted for a considerable proportion of the LGL communities, contributing 12.67% and 18.31% to the overall communities, respectively. The unclassified Bacteria included groups GP6, GP16, GP7, GP3, and GP4 of the Acidobacteria and the unclassified Parcubacteria. Acidobacteria are one of the most abundant phyla in soils, and their OTU richness, phylogenetic diversity, and community composition are significantly related to the pH of the soil (Wei et al. 2018). Previous studies have reported that human disturbances and activities can reduce the abundances of soil Acidobacteria (Qin et al. 2019). Moreover, sediment bacterial communities, including Acidobacteria, have been shown to be sensitive to fluctuations in environmental factors, especially the external water supply.

Dominant Bacterial Genera Unique to LGL

The most dominant bacterial genus in the LGL communities was Thiobacillus (8.92–16.78%), which featured uniquely higher abundances than in other soda lakes (e.g., those in eastern China: 1.39–2.47%) (Duan et al. 2020). The abundances of Thiobacillus have also been reported to be negatively correlated with the sedimentary sulfate and total sulfur contents (Duan et al. 2020). Thiobacillus can be one of the most dominant groups in freshwater sediments, and it can be used as a biomarker to predict the intensity of subsequent blooms in such environments (Chen et al. 2015). Furthermore, Thiobacillus has also been detected in some typical habitat types (e.g., soil, water, and duck and fish farm) (Yi et al. 2021) and has been observed to be a unique member of coastal bacterial benthic communities (Sherysheva et al. 2020). Intriguingly, Thiobacillus has rarely been observed in other soda lakes (Table 3).

Thiobacillus is an autotrophic bacterium. It is one of the primary iron-reducing bacterial taxa in lake sediments, and its abundances and diversity are closely related to the degree of water eutrophication (Fan et al. 2018). Through these processes, Thiobacillus participates in the redox cycling of heavy metals by producing ferrous iron and accelerating the oxidation of ferric ion in localized areas, such as anaerobic sedimentary environments with high concentrations of heavy metals, in which they contribute to a large proportion of the communities (Ding et al. 2017). Additionally, Thiobacillus can be a key mediator of S2− oxidation coupled to denitrification, thereby playing an important role in NO3 reduction under S2− enrichment conditions when organic carbon is scarce (Pang et al. 2021). Our CCA analysis of the LGL communities also revealed that the Thiobacillus abundances were most highly correlated with the variations in the temperature, pH, and CO32− and TP concentrations. Thus, the association of Thiobacillus with these factors warrants further investigation.

Dominant archaeal genera unique to LGL

Members of the archaeal family Methanoregulaceae, have been isolated in samples from various habitats, including acidic peat bogs, anaerobic organic waste treatment reactors, submerged sinkhole ecosystems (e.g., oil fields, paddy soils, and mud volcanoes), and freshwater lakes (Savvichev et al. 2021). Methanoregula was the most abundant taxa in LGL (21.40–28.29%), even though it has low sodium requirements (Rosenberg et al. 2014). Furthermore, the CCA revealed that the Methanoregula abundances were significantly correlated with the TOC content and TS, in addition to the K+ and Na+ concentrations. Methanoregula is a nitrogen-fixing archaeal taxon that dominates freshwater lakes (Stoeva et al. 2014) and is a dominant methane producer within communities. Other studies have shown that it has a significant genetic potential for nitrogen metabolism (e.g., nitrate transport, denitrification, nitrite assimilation, and nitrogen fixation) in methyl-methanogenesis bacterial genomes (Biderre-Petit et al. 2019). In addition, Methanoregula has been detected at different temperatures in wetland soils near alkali lakes (Deng et al. 2019), but it has not been detected in most soda lakes, because it has an optimal pH growth range of 4.50 to 5.55 (Rosenberg et al. 2014). Finally, Methanoregula has also been reported to be the dominant member of a methanogenic community and is well adapted to hypoxic conditions (Savvichev et al. 2021).


The LGL soda lake is a unique sodium carbonate ecosystem that may serve as an excellent model for understanding microbial diversity and microbial adaptation to carbonate habitats. Here, high-throughput 16S rRNA gene sequencing was conducted to comprehensively characterize the bacterial and archaeal communities of LGL. The bacterial diversity in LGL was significantly higher than the archaeal diversity and was mostly dominated by Proteobacteria, Bacteroidetes, and Planctomycetes; while the dominant archaeal groups were Crenarchaeota and Euryarchaeota. The presence and high abundances of Crenarchaeota was uniquely different compared to other soda lakes. Moreover, the characteristics and metabolism of Thiobacillus provide more possibilities for bacterial diversity, and their abundances were most strongly correlated with the pH, temperature, and CO32− and TP concentrations. The high abundance of Methanoregula in LGL was also a unique observation for the LGL archaeal communities, and their abundances were correlated with the TOC content, TS, and K+ and Na+ concentrations. Finally, the minimal anthropogenic effects on LGL and its extreme environmental conditions provide a unique context for understanding the interactions between microorganisms and extreme soda environments, while also furthering our understanding of microbial resources on the Tibetan Plateau.

Availability of data and materials

We have not reproduced any pre-published information/material for this study.


Download references


This research was supported by the National Natural Science Foundation of China (grant numbers 21967018), the Applied Basic Research Program of Qinghai Province (grant numbers 2022ZJ771), and the Team’s Research Program of Microbial Resources in Salt-lakes in the Qinghai-Tibetan Plateau grant number 2018KYT1. We thank LetPub ( for providing linguistic assistance during the preparation of this manuscript.

Author information

Authors and Affiliations



MW performed most of the experiments and wrote the manuscript. XZ, CL and ZS supervised the execution of the experiments and analyzed the data. DZ,GS, YT, and ZW performed the sample collection. DZ and GS provided the bioinformatics technical assistance and evaluated the data. All of the authors have read and approved the final version of the manuscript.

Corresponding author

Correspondence to Guoping Shen.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Fig. S1.

Venn diagram of OTU sample distribution of LGL bacterial (A) and archaeal (B) communities. Fig. S2. Taxonomic and phylogenetic tree representations of bacterial taxa (A) and archaeal taxa (B) in LGL sample. Table S1. Phylum abundance percentage (%) of Bacteria and Archaea in LGL. Table S2. Class abundance percentage (%) of Bacteria and Archaea in LGL. Table S3. Genus abundance percentage (%) of Bacteria and Archaea in LGL.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Wang, M., Zhang, X., Shu, Z. et al. Bacterial and archaeal communities within the alkaline soda Langaco Lake in the Qinghai-Tibet Plateau. Ann Microbiol 72, 33 (2022).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Community diversity
  • Soda lake
  • High-altitude lake
  • Langaco Lake
  • Qinghai-Tibet Plateau