This page was produced as an assignment for Genetics 677, an undergraduate course at UW-Madison.
Homologous Proteins
Homo sapiens - human
cyclin-dependent kinase inhibitor 2A 156 amino acids NP_000068.1 Pan troglodytes - chimpanzee cyclin-dependent kinase inhibitor 2A 156 amino acids NP_001139762.1 Bos taurus - cow cyclin-dependent kinase inhibitor 2A 109 amino acids XP_873468.5 |
Mus musculus - mouse
cyclin-dependent kinase inhibitor 2A 168 amino acids NP_001035744.1 Rattus norvegicus - rat cyclin-dependent kinase inhibitor 2A 159 amino acids NM_031550.1 Danio rerio - zebrafish cyclin-dependent kinase inhibitor 2A-like 124 amino acids XP_002660514.1 |
Drosophila melanogaster - fruit fly
tankyrase 1181 amino acids NP_651410.1 Caenorhabditis elegans- worm D2021.8 209 amino acids NP_509448.1 Arabidopsis thaliana - thale cress 26S proteasome non-ATPase regulatory subunit 10 240 amino acids NP_178442.2 |
Alignment
ClustalW2 Alignment
The image below shows the ClustalW2 alignment of protein sequences in all of the previously mentioned organisms with the exception of Drosophila. The amino acid sequence of Drosophila is quite long, and can be seen aligned with the other organisms in the file below the colored alignment.
The image below shows the ClustalW2 alignment of protein sequences in all of the previously mentioned organisms with the exception of Drosophila. The amino acid sequence of Drosophila is quite long, and can be seen aligned with the other organisms in the file below the colored alignment.
clustalw2_protein_alignment.docx | |
File Size: | 113 kb |
File Type: | docx |
MUSCLE Alignment
The file below shows the MUSCLE multiple sequence alignment of the homologous proteins in the nine organisms previously mentioned.
The file below shows the MUSCLE multiple sequence alignment of the homologous proteins in the nine organisms previously mentioned.
muscle_protein_alignment.docx | |
File Size: | 108 kb |
File Type: | docx |
Analysis
The table to the right shows the pairwise scores of the protein homologs as computed in ClustalW2. The CDKN2A protein in humans is most similar to its homolog in chimpanzees with a score of 99%. The score between the human and cow protein was also quite high at 83%. When compared to the human protein, the rat, mouse, and zebrafish scores fall in the middle of the range at 61%, 59%, and 45%, respectively.
The human homolog is most dissimilar to the homologs in both C. elegans and Drosophila, sharing a score of 23% in each organism. The human protein is more dissimilar to the worm and fly protein than it is to the Arabidopsis protein, which may come as a surprise to some people. The Drosophila protein was by far the greatest sequence length having 1181 amino acids. The next largest protein has just over 200 amino acids, so the Drosophila protein is really the outlier in that respect. The shortest protein sequence was the cow homolog at 109 amino acids. |