Muhammad Abrar Yousaf1*, Muhammad Asjad khan2
1Department of Neurosciences, Biomedicine and Movement Sciences, University of Verona, Verona, Italy
2Department of Biological Sciences, Virtual University of Pakistan, Lahore, Pakistan
Background.. Superoxide dismutase 1 (SOD1) is crucial for cellular defense against oxidative stress induced by superoxide radicals, particularly in challenging conditions, such as elevated temperature and humidity. This study investigated SOD1 in Bactrian camel (Camelus bactrianus), Wild Bactrian camel (C. ferus), and Arabian camel (C. dromedarius) to understand its role in heat tolerance.
Methodology.The current study employed bioinformatics analysis to assess the genomic features including GC% content. It also investigated the structure and location of the SOD1 gene on the chromosomes. Phylogenetic analysis was conducted to elucidate the evolutionary relationships based on SOD1 protein sequences. Structural analyses encompassed secondary and tertiary structure predictions, emphasizing stability and potential functional implications. Subcellular localization of the SOD1 protein was also explored.
Results.. C. dromedarius displayed the highest GC% in its genome, indicating improved thermostability. With the exception of C. bactrianus whose chromosomal location was unknown, all other species contained SOD1 gene on their first chromosome. Based upon SOD1 protein sequences, phylogenetic investigation emphasized the close evolutionary link within the Camelidae family. Structurally, all three species of camel shared an acidic, globular, and thermally-stable SOD1 protein having high glycine content and lack of cleavage sites. Analysis of secondary structure indicated a frequency of random coils, highlighting the adaptability and evolutionary conservation of protein. Predictions of tertiary structure verified that SOD1 was stable in all species. The protein is predominantly found in cytoplasm although, also present in nucleus, extracellular region, and mitochondria.
Conclusion.IThis inclusive analysis of SOD1 in three different species of camel highlighted their strong adaptation to desert environment by elucidating their genomic and proteomic stability. Further research is necessary to investigate the biochemical mechanisms behind camels’ extraordinary ability to thrive in desert conditions and respond to the challenges posed by climate change
Reactive Oxygen Species (ROS) are naturally formed during regular metabolic processes in organisms [1]. Yet, oxidative stress may result from an excessive amount of ROS if the body’s antioxidant defense systems are incapable to eliminate them. Thus, cells possess both enzymatic and nonenzymatic antioxidant defense systems to avoid oxidative harm to proteins, DNA, and lipids [2]. One essential defense mechanism comprises antioxidant enzymes, such as superoxide dismutase (SOD), catalase, and glutathione peroxidase that are vital to defend cells against injurious effects of ROS [2].
SOD enzymes help in the conversion of superoxide anion radicals into hydrogen peroxide as well as oxygen, thus defending against oxidative harm [3]. Both prokaryotic and eukaryotic species have these enzymatic traits. Even though, variations in amino acid sequencing, protein structure, subcellular localization, and metabolic cofactors cause differences in SOD enzymes in various species. Based upon particular metallic cofactor utilized by SOD enzymes, various categories are determined. This classification of Copper-zinc SOD or SOD1 is extensively distributed in many mammalian cells, accounting for 50–80% of total SOD activity in blood vessels [4].
Although SOD1 is widely studied in several mammalian species, still there is insufficient data available about its properties in distinct camel species [5]. In desert areas, camels are highly valuable due to their services in transportation, agricultural activities, and production [6]. Furthermore, research has shown that camel milk has therapeutic effects against diabetes, ulcers, allergies, cancer, autism, and infections [6, 7]. Along with extraordinary adaptations of camels to desert life, they also exhibit diverse biochemical, physiological, and anatomical features, distinguishing them from other mammalian species [8]. These creatures demonstrate the capability of survival under extremely harsh desert conditions cauterized by intense heat and drought [9]. However, continued exposure to these circumstances may disturb the balance between ROS and antioxidant defenses, leading to oxidative stress and impairment to biomolecules [10]. The unique biochemistry of camels suggests that SOD1 is crucial for safeguarding cells from oxidative damage due to O2⁻ production, especially under conditions of high temperature and humidity [5, 11].
The primary goal of the current study was to analyze the SOD1 protein and its related genes within the Wild Bactrian camel (C. ferus), Bactrian camel (C. bactrianus), and Arabian camel (C. dromedarius) using bioinformatics and computational practices. By exploring SOD1 in these species of camels, the study aimed to understand the biochemical processes underlying their adaptations to harsh desert conditions. Additionally, the study is noteworthy in the context of modern climate changes and rising temperatures, providing significant knowledge about the adaptive stratagems of camels.
The SOD1 protein sequences corresponding to C. bactrianus (A0A9W3ES54), C. ferus (A0A8B8SSA1), and C. dromedarius (H6BDU4) were acquired by assessing the UniProt database [12]. After obtaining the protein sequences in FASTA format, they were saved for consequent bioinformatic analysis.
Data was retrieved from the Eukaryotic Genome Annotation section of the NCBI database to determine the megabase (Mb) size of genome, median protein, and median GC% values for the camel species.
To discover the genomic location of the SOD1 gene, the gene ID linked to the respective unique protein was employed in order to examine the gene’s genomic as well as chromosomal coordinates.
The aim of this phylogenetic study was to investigate the evolutionary conservation and variations in the SOD1 gene across different animal species. Since SOD1 homologs may share a conserved sequence structure, this study sought to identify sequence alterations, mutations, or adaptive modifications, particularly in camels, which are known for their exceptional heat and oxidative stress tolerance. The phylogenetic analysis of the SOD1 protein was conducted using diverse species, including camelid species (C. dromedarius, C. bactrianus, C. ferus, Vicugna pacos), other artiodactyls (Bos taurus, Capra hircus, Sus scrofa), mammals (Homo sapiens, Mus musculus, Canis lupus familiaris, Macaca mulatta), birds (Gallus gallus, Anas platyrhynchos), and outgroup species (Xenopus tropicalis, Danio rerio, Drosophila melanogaster). Protein sequences were retrieved from UniProt, aligned using MUSCLE in MEGA 11, and visually inspected for accuracy. A phylogenetic tree was constructed using the Maximum Likelihood method in MEGA 11, with 1000 bootstrap replicates to assess the statistical support. Drosophila melanogaster was designated as the outgroup to root the tree.
Using ExPASy-ProtParam tool, various physiochemical characteristics of the SOD1 protein in camel species were assessed [13]. This computational tool analyzes various parameters encircling molecular weight, amino acid, molecular formula, aliphatic index, instability index (II), theoretical isoelectric point (pI), and grand average of hydrophobicity (GRAVY). Additionally, it analyzes the percentage composition of each amino acid in the protein. Furthermore, the SignalP-4.1 tool [14] was employed to predict the presence of signal peptides in the protein sequences.
The SOPMA tool [15] was utilized to analyze the secondary structure of the SOD1 proteins. This analysis ascertained the proportional representation of alpha helices, extended strands, beta-turns, and random coil structures within the protein sequence.
The experimental three-dimensional (3D) structure of SOD1 protein of camel species was not available at RCSB PDB [16]. Hence, different methodologies were applied for the prediction of protein 3D models. Firstly, the SWISS-MODEL [17] was used to generate the structure based on a template (PDB ID: 7WX0, Cu/Zn-superoxide dismutase from dog (Canis familiaris)). Then, the structure was created using AlphaFold [18]. Lastly, the structure was predicted using Phyre2 tool [19]. After the structure prediction, the quality of each model was evaluated using the Ramachandran graph via the Ramachandran Plot Server by Zlab (https://swift.cmbi.umcn.nl/servers/html/ramaplot.html).
Accurate prediction of subcellular localization is important to determine the relative expression levels of proteins across different cell types as well as to explore their functions. Therefore, the CELLO2GO online tool [20] was utilized with an E-value of 0.001 to predict the subcellular localization of SOD1.
Figure 1 illustrates the genome characteristics including the overall size, average GC content, and average number of proteins for three different species of camels. The genome of C. bacteria spans 1780.72 megabases (Mb) with an average GC content of 40.45% and a median number of proteins at 28,601. For C. ferus, the genome is larger, measuring 2048.14 Mb with a GC content average of 41.4819% and a protein tally of roughly 37,385. Meanwhile, C. dromedaries have a genome that measures 2084.54 Mb, featuring an average GC content of 41.5% and hosting 51,736 proteins.
C. bactrianus SOD1 exhibited a total of nine introns and five exons. Unfortunately, the precise chromosome localization was not determined. C. ferus also possessed nine introns and five exons. SOD1 gene of C. ferus was located on chromosome 1. Likewise, the C. dromedarius displayed nine introns and five exons in its genomic composition. C. dromedarius SOD1 was also localized on chromosome 1 (Figure 2).
The phylogenetic tree (Figure 3) represents the evolutionary relationships of the (SOD1) protein among different taxonomic groups. The camelid species (C. dromedarius, C. ferus, and C. bactrianus) form a well-supported clade, closely related to Vicugna pacos (alpaca), which is consistent with their known evolutionary history within the family Camelidae. This clustering suggests strong conservation of the SOD1 protein within Camelidae. Given the critical role of SOD1 in oxidative stress defense, this conservation may be linked to adaptations to extreme desert environments. The artiodactyl species (Bos taurus, Capra hircus, and Sus scrofa) form a distinct lineage, separate from the camelids, further supporting their divergent evolutionary paths.
Among mammals, Canis lupus familiaris (dog) clusters with artiodactyls, indicating a closer relationship with hoofed mammals than primates and rodents. Mus musculus (mouse), Homo sapiens (human), and Macaca mulatta (rhesus macaque) form a primate-rodent clade, reflecting their shared ancestry. The avian species (Gallus gallus and Anas platyrhynchos) cluster with high support, confirming their evolutionary relationship. Xenopus tropicalis (frog) diverges earlier, representing a transition between aquatic and terrestrial vertebrates. Danio rerio (zebrafish) forms a separate vertebrate lineage. Drosophila melanogaster (fruit fly) is the most distantly related, serving as an outgroup that roots the tree and defines evolutionary relationships. This analysis provides insights into evolutionary relationships, emphasizing the diversity and interconnections within the animal kingdom based on their SOD1 protein sequences.
ExPASy-ProtParam was used to calculate the physiochemical characteristics for the proteins encoded by the SOD1 gene in camels. The SOD1 protein from C. bactrianus had the following chemical make-up: C673H1079N203O221S6, a molecular weight of 15742.56 Da, 2182 atoms, and 153 amino acids. The theoretical pI was found to be 5, the aliphatic index, GRAVY, and values of instability index (II) to be 17.52, 75.75, and -S0.348, respectively. The SOD1 protein from C. ferus was analyzed, and it was discovered to have 153, 2183 atoms of amino acids, the molecular formula C673H1079N203O221S7, and the molecular weight 15774.62 Da. The aliphatic index was 73.86, the instability index (II) was 18.33, the theoretical pI value was 5.94, and GRAVY was calculated to be -0.363. The SOD1 protein from C. dromedarius had a molecular weight of 15742.56 Da, 153 amino acids, 2182 atoms, and the chemical formula C673H1079N203O221S6. Theoretical calculations yielded a predicted isoelectric point (pl) value of 5.94, an instability index (II) value of 17.52, an aliphatic index of 75.75, and a Grand Average of Hydropathy (GRAVY) SCORE OF -0.348 for the SOD1 protein. The distribution of various amino acids in the SOD1 protein across camel species is illustrated in Figure 4. Notably, glycine (Gly) constituted the highest proportion in the amino acid sequence of the SOD1 protein across all species, while the absence of tryptophan (Trp), tyrosine (Tyr), pyrrolysine (Pyl), and selenocysteine (Sec) was observed in all analyzed proteins. There were differences of Methionine (Met) and valine (Val) amino acids among three SOD1 proteins.
According to the analysis using SignalP 4.1, no cleavage site was detected in SOD1 proteins of all camel species (Figure 5). The value of D, which indicated the probability of a cleavage site, was 0.136 for all SOD1 proteins, and was lower than the D-cutoff value of 0.450 (indicated by the purple line). This suggested that the likelihood of a cleavage site in the SOD1 proteins was very low. This low probability suggested that the SOD1 proteins in camel species were unlikely to undergo cleavage, which may influence their stability and functional integrity. Understanding the absence of cleavage sites may provide insights into the evolutionary conservation and functional adaptations of SOD1 proteins in camels, potentially impacting their role in oxidative stress response and disease mechanisms.
The secondary structure of the SOD1 protein in camel species was analyzed using the SOPMA tool, revealing variations in the proportions of alpha helices, extended strands, beta-turns, and random coils (Figure 6). In C. bactrianus, SOD1 consists of 54.99% random coils, 33.9% extended strands, 5.88% beta-turns, and 5.23% alpha helices. Similarly, C. ferus SOD1 comprises 52.9% random coils, 33.9% extended strands, 6.54% beta-turns, and 6.54% alpha helices. While, C. dromedarius SOD1 contains 54.9% random coils, 33.99% extended strands, 5.88% beta-turns, and 5.23% alpha helices. In all three species, random coils are the most dominant structural features, followed by extended strands. The high proportion of random coils suggests that SOD1 maintains structural flexibility, which may contribute to its functional stability under different environmental conditions. The extended strand regions likely play a role in maintaining the protein stability overall, while the presence of alpha-helices and beta-turns may contribute to specific functional interactions. A visual representation of these structural compositions is provided in Figure 7, highlighting the predominance of random coils. Furthermore, solubility predictions using the SOSUI tool indicate that SOD1 is soluble in all three species, which is essential for its enzymatic activity and proper cellular functioning.
The tertiary structure models of C. bactrianus, C. ferus, and C. dromedarius are displayed in Figure 8, 9, and 10. These structures were predicted using three different modeling approaches: SWISS-MODEL, AlphaFold, and Phyre2. Each model was evaluated using Ramachandran plot analysis to assess the structural quality. The SWISS-MODEL prediction provided a template-based homology model, while the AlphaFold model utilized deep learning-based predictions for structural accuracy. The Phyre2 model incorporated both homology modeling and fold recognition techniques. All models exhibited a high proportion (more than 99%) of residues in favored regions of the Ramachandran plot, suggesting reliable structural predictions. All generated structures were comparable as shown by the superimposed models. However, SWISS-MODEL structure was the best as it also predicted the copper along with the structure for all proteins. These generated structures could be further used to investigate the interactions of camel SOD1 protein with other molecules and for the functional annotation of protein, specifically the structure/function relationship. This would also help to understand the SOD1 structural interactions with copper and zinc.
The subcellular localization of SOD1 protein was determined using CELLO2GO (Figure 11), which predicted the protein to be predominantly cytoplasmic for all three species. Specifically, C. bactrianus SOD1 protein was predicted to be cytoplasmic (77.6%), with smaller percentages present in the nucleus (7.0%), extracellular space (4.6%), and mitochondria (4.0%). Similarly, C. ferus SOD1 was mainly present in the cytoplasm (75.8%), with smaller percentages found in the nucleus (8.3%), extracellular space (4.8%), and mitochondria (4.2%). Finally, C. dromedarius SOD1 was also predominantly located in the cytoplasm (77.6%), with smaller percentages found in the nucleus (7.0%), extracellular space (4.6%), and mitochondria (4.0%). The predominant cytoplasmic localization of SOD1 suggests that it plays a key role in neutralizing reactive oxygen species produced during heat stress. This localization ensures efficient detoxification in cytoplasm, which is crucial to maintain cellular homeostasis under high-temperature conditions. The minor presence of SOD1 in the nucleus, mitochondria, and extracellular space further supports a coordinated defense against heat-induced oxidative damage.
Camels are a valuable resource for human beings, providing substantial cultural, economic, and medicinal advantages. Camels are of significant importance in dry regions. Nonetheless, there is a lack of detailed data on the mechanism or biological processes that enable them to survive such harsh conditions. Therefore, this study investigated how SOD1 contributes for the survival of three camel species, namely C. bactrianus, C. ferus, and C. dromedarius in extreme temperatures, dehydration, and arid habitats.
The current investigation employed genome analysis to examine the attributes of distinct camel species, uncovering dissimilarities in genome size and GC composition. While the genome size and GC% of C. ferus and C. dromedarius exhibited minimal variation, it is noteworthy that GC content is widely acknowledged as a reliable predictor of DNA thermostability, gene regulation, and adaptation to extreme temperatures [21]. C. dromedaries demonstrates the highest genomic GC content, whereas C. bactrianus shows the lowest proportion of GC nucleotides. The greater number of hydrogen bonds in GC pairs enhances DNA thermal stability [22], suggesting this species-specific GC content may account for the observed difference in thermostability between the two species. For instance, C. dromedarius, which predominantly inhabits hot regions of Africa and the Middle East [23], has genome which is more thermostable as compared to the C. bactrianus, which is typically found in colder regions of China and Central Asia [24], with lower GC content and decreased genome thermostability. This analysis indicates how molecular insights into GC composition could drive evolutionary adaptation of camels in different temperature settings with genetic foundation for heat tolerance.
The examination of the primary structure indicated that the SOD1 in every camel species is comprised of a total of 153 amino acids. The highest molecular weight and atomic count of 15774.62 Da and 2183 was exhibited by C. ferus SOD1. On the other hand, the SOD1 of the two other species exhibited the molecular weight of 15742.56 Da and consisted of a total of 2182 atoms. Due to the presence of additional sulfur atom in the molecular formula of C. ferus, its SOD1 molecular weight and atomic count are relatively higher than other species. The pH value at which a protein surface has net charge of zero in spite of having both positive and negative charges is termed as isoelectric point (pI), which plays a significant role in protein purification. This is because protein exhibits slight solubility and no mobility at the specific pH value of the pI in an electro-focusing system [25]. C. bactrianus SOD1 exhibited pI of 5, while pI of 5.94 was shown by both C. dromedarius and C. ferus. The current study revealed the acidic property of SOD1 in camel species. The instability index of protein measures the stability of protein in in vivo and in vitro [26]. For proteins having half-life less than five hours, the instability index exceeds 40, while those having half-life longer than 16 hours, instability index is lower than 40. A protein structure exhibiting an instability index below 40 is suggestive of stability. The stability of SOD1 proteins among all camel species is notably high. Specifically, C. bactrianus and C. dromedarius exhibit an instability index (II) of 17.52, while C. ferus has a value of 18.0. The aliphatic index serves as a quantitative measure, indicating the fraction of protein composition attributed to aliphatic amino acid, namely valine (V), alanine(A), leucine (L), and isoleucine (I). A higher alphabetic index value is indicative of enhanced protein thermostability [27]. Remarkably, elevated aliphatic index values observed in the SOD1 protein of C. bactrianus, C. ferus, and C. dromedaries suggest a broad spectrum of protein stability across varying temperatures. The thermostability of the genome GC content in C. dromedarius was observed to be higher than that of C. bactrianus. However, the thermostability of SOD1 was found to be equivalent in both species. Thus, it is imperative to explore further proteins associated with thermostability. The computation of the GRAVY index involves the division of the summation of the hydrophobicity values of all amino acids by the overall number of residues present in the protein sequence [28]. A score lower than zero signifies a higher probability of the protein being globular, which is a hydrophilic protein. On the other hand, a score greater than zero suggests that protein is more likely to be membrane bound, a hydrophobic protein. All species of SOD1 showed negative GRAVY values, supporting their solubility and potential cytoplasmic functionality.
The secondary structure analysis showed that the SOD1 protein in all species of camel is primarily composed of random coils, subsequently extended strands, beta turns, and alpha helices. The substantial prevalence of random coils indicates that the protein is highly conserved throughout its evolutionary history. Random coils, characterized by their structural flexibility and dynamic nature, are crucial for the protein's conformational adaptability and its ability to interact functionally within various contexts [29]. This inherent flexibility is likely a key to SOD1’s stability and effectiveness, particularly under varied environmental stresses. Such structural characteristics enhance the protein capacity to maintain functionality and stability, thereby bolstering its role in combating oxidative stress in camels.
The tertiary 3D structure of SOD1 in camel species was modelled using three methodologies, namely SWISS-MODEL, AlphaFold, and Phyre2, working on different techniques. For instance, the SWISS-MODEL [17] works on the homology-based prediction using known protein templates. The AlphaFold [18] provides an advanced approach based on deep learning, while the Phyre2 [19] works on homology modeling with fold recognition. However, the structure predicted by all tools was comparable as depicted by the superimposition. Structural validation via Ramachandran plot analysis confirmed that over 99% of the amino acid residues in all models fell within favored conformations, reflecting high stereochemical quality and structural integrity. This consensus shows the accuracy of all structures by different tools. Furthermore, structures from different species are also comparable, showing evolutionary conservation and preserving its function despite varied environmental pressures. While computational models provide highly accurate structural predictions, experimental validation via techniques, such as X-ray crystallography or NMR spectroscopy remains essential to verify these structures and to explore the structural dynamics of SOD1 under conditions of thermal stress.
Subcellular localization studies reveal that SOD1 is primarily located in the cytoplasm of camels, however, also appears significantly in the nucleus, mitochondria, and extracellular regions. Traditionally recognized as a cytosolic enzyme, the discovery of SOD1 in various cellular compartments implies a more extensive functional role that extends beyond mere cytoplasmic oxidative stress defense, where the enzyme superoxide radicals into oxygen and hydrogen peroxide [30]. Research, including recent studies conducted by Martinelli et al. [31], highlights that SOD1 is also present in the nucleus, mitochondria, and peroxisomes, contributing to critical processes, such as redox homeostasis, protection against oxidative damage, and involvement in cellular signaling pathways. This widespread distribution indicates SOD1’s role in defending against environmental stresses at multiple cellular levels. Moreover, it also suggests evolutionary adaptations that enhance camels’ resilience under extreme conditions.
This research provided valuable insights into the structural and functional aspects of SOD1 in camels, although further investigation is needed. Enhancing computational predictions with experimental techniques, such as X-ray crystallography or enzymatic assays would improve the study's reliability. Additionally, studying post-translational modifications of SOD1 and comparing these across different mammals could broaden the evolutionary understanding and reveal species-specific adaptations. Integrating transcriptomic and proteomic data under real environmental conditions would further clarify SOD1's role in heat stress adaptation. While the study lays a strong foundation to understand camel resilience to extreme conditions, expanding these areas of research could deepen our knowledge of adaptive mechanisms in camels and similar species, potentially informing broader applications in environmental biology and conservation science.
The current study adeptly characterized the SOD1 enzyme in three distinct camel species, emphasizing their noteworthy stability and capacity to endure elevated temperatures. These findings provided valuable insights into the antioxidant defense system of camels, elucidating their adaptation to harsh desert conditions. Furthermore, the study underscored the importance of further research to enhance our understanding of the biochemical mechanisms involved in camels’ resilience to the desert environment and their ability to cope with climate change-induced temperature changes.
The authors of the manuscript have no financial or non-financial conflict of interest in the subject matter or materials discussed in this manuscript.
Data will be provided by corresponding author upon reasonable request
No funding has been received for this research.