1. Data Preparation
Computational studies of molecular mimicry require careful selection of both host and microbial proteins to ensure biological relevance. This chapter provides an overview of how raw biological data were selected and prepared for downstream computational analysis.
a. Selection of Data Source
The data preparation workflow focused on selecting relevant protein sources followed by generation of peptide sequences. Two protein sources were considered in this study:
- A human protein serving as the source of self peptides
- A microbial proteome serving as the source of non-self (foreign) peptides
Human Protein Source (Self Peptides)
Annexins are a family of calcium-dependent phospholipid-binding proteins involved in membrane organization, intracellular trafficking, inflammation, and immune regulation. Several members of the annexin family have been associated with autoimmune and inflammatory disorders through altered expression or immune recognition (Li et al., 2016).
Based on this immunological relevance, Annexin A2 (UniProt accession P07355, abbreviated ANX in this chapter) was selected as the representative human self protein in this study.
FASTA file used:
P07355_ANXA2_HUMAN_Annexin_A2.fasta
Microbial Protein Source (Non-Self Peptides)
Klebsiella pneumoniae has been widely implicated as a potential microbial trigger in ankylosing spondylitis, particularly in individuals carrying the HLA-B27 allele. Several studies have proposed that immune responses against K. pneumoniae antigens may cross-react with host peptides through molecular mimicry, contributing to chronic inflammation and autoimmunity (Puccetti et al., 2017).
Due to this reported association with HLA-B27–linked autoimmune disease, the complete proteome of K. pneumoniae was selected as the microbial peptide source.
FASTA file used:
uniprotkb_proteome_UP000000265_K_pneumoniae_strain_ATCC700721_MGH_78578.fasta
Basis for Data Selection
Combining a human self protein with immunological relevance (ANX) and a microbial proteome (K. pneumoniae) with a reported association with HLA-B27–linked autoimmunity enables systematic investigation of peptide-level molecular mimicry. This approach allows identification of microbial peptides that may bind HLA-B27 in a manner comparable to human peptides, potentially contributing to autoimmune cross-reactivity.
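As an illustration of how the two FASTA files listed above might be read before peptide generation, the following is a minimal sketch using only the Python standard library. The helper names `parse_fasta` and `load_fasta` are hypothetical and are not taken from the study's own code; the sketch assumes standard FASTA layout (a `>` header line followed by one or more sequence lines).

```python
from typing import Dict, Iterable, List, Optional


def parse_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Parse FASTA-formatted lines into a {header: sequence} mapping.

    Hypothetical helper for illustration: headers are stored without the
    leading '>' and multi-line sequences are joined into one string.
    """
    records: Dict[str, str] = {}
    header: Optional[str] = None
    chunks: List[str] = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            if header is not None:
                records[header] = "".join(chunks)  # close the previous record
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        records[header] = "".join(chunks)  # close the final record
    return records


def load_fasta(path: str) -> Dict[str, str]:
    """Read a FASTA file from disk (the path is supplied by the caller)."""
    with open(path, encoding="utf-8") as handle:
        return parse_fasta(handle)


# Toy input (not real data) showing the expected record structure.
toy = parse_fasta([">toy_protein example", "MKVL", "LAGG"])
assert toy == {"toy_protein example": "MKVLLAGG"}
```

Under these assumptions, `load_fasta("P07355_ANXA2_HUMAN_Annexin_A2.fasta")` would return a single record for the human protein, while the K. pneumoniae proteome file would return one record per protein.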
b. Peptide Generation and Extraction
Peptide identification was performed in two sequential steps:
i. Peptide Slicing
ii. Peptide Filtering and Similarity Screening
This two-step approach first generates a broad peptide pool by slicing the source sequences, then narrows that pool through filtering and similarity screening.
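The two steps listed above can be sketched roughly as follows. This is an illustration under stated assumptions, not the workflow used in the study: the peptide length of 9 residues, the restriction to the 20 standard amino acids, the position-wise identity score, and the 0.6 threshold are all placeholder choices, and the function names are hypothetical.

```python
from typing import Iterable, Iterator, List, Tuple

PEPTIDE_LENGTH = 9  # assumption: 9-mers, a common length for HLA class I ligands
STANDARD_AA = set("ACDEFGHIKLMNPQRSTVWY")


def slice_peptides(sequence: str, k: int = PEPTIDE_LENGTH) -> Iterator[str]:
    """Step i: yield every overlapping k-mer (sliding window, step size 1)."""
    for start in range(len(sequence) - k + 1):
        yield sequence[start:start + k]


def is_standard(peptide: str) -> bool:
    """Filter: keep only peptides made of the 20 standard amino acids."""
    return set(peptide) <= STANDARD_AA


def identity(a: str, b: str) -> float:
    """Fraction of aligned positions with identical residues."""
    if len(a) != len(b):
        raise ValueError("peptides must have equal length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def screen_similar(
    self_peptides: Iterable[str],
    foreign_peptides: Iterable[str],
    min_identity: float = 0.6,  # assumption: placeholder threshold
) -> List[Tuple[str, str, float]]:
    """Step ii: return (self, foreign, identity) triples at or above the threshold."""
    foreign = [p for p in foreign_peptides if is_standard(p)]
    hits = []
    for s in self_peptides:
        if not is_standard(s):
            continue
        for f in foreign:
            score = identity(s, f)
            if score >= min_identity:
                hits.append((s, f, score))
    return hits


# Toy sequences (not real data) to show the call pattern.
self_pool = list(slice_peptides("ACDEFGHIKLM"))
foreign_pool = list(slice_peptides("ACDEFGHIRLM"))
pairs = screen_similar(self_pool, foreign_pool)
```

The all-against-all comparison is quadratic in the number of peptides, so it is suitable only for showing the idea; a full proteome would likely call for indexing or a dedicated alignment tool.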
The workflow used to extract peptide sequences from the source protein and proteome is explained in detail in the next section.
- Li, D.-H., He, C.-R., Liu, F.-P., Li, J., Gao, J.-W., Li, Y., & Xu, W.-D. (2016). Annexin A2, up-regulated by IL-6, promotes the ossification of ligament fibroblasts from ankylosing spondylitis patients. Biomedicine & Pharmacotherapy, 84, 674–679. https://doi.org/10.1016/j.biopha.2016.09.091
- Puccetti, A., Dolcino, M., Tinazzi, E., Moretta, F., D’Angelo, S., Olivieri, I., & Lunardi, C. (2017). Antibodies Directed against a Peptide Epitope of a Klebsiella pneumoniae-Derived Protein Are Present in Ankylosing Spondylitis. PLOS ONE, 12(1), 1–12. https://doi.org/10.1371/journal.pone.0171073