FASTA searches a protein or DNA sequence data bank, version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE2758, 654 aa
1>>>pF1KE2758 654 - 654 aa - 654 aa
Library: human.CCDS.faa
18921897 residues in 33420 sequences
Statistics: Expectation_n fit: rho(ln(x))= 9.9379+/-0.00123; mu= 5.7077+/- 0.075
mean_var=551.2697+/-111.752, 0's: 0 Z-trim(115.1): 180 B-trim: 0 in 0/56
Lambda= 0.054625
statistics sampled from 15652 (15824) to 15652 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.736), E-opt: 0.2 (0.473), width: 16
Scan time: 2.050
The best scores are: opt bits E(33420)
CCDS43258.1 COL25A1 gene_id:84570|Hs109|chr4 ( 654) 4805 393.8 4.2e-109
CCDS43259.1 COL25A1 gene_id:84570|Hs109|chr4 ( 642) 4497 369.5 8.4e-102
CCDS58922.1 COL25A1 gene_id:84570|Hs109|chr4 ( 645) 3242 270.6 5e-72
CCDS44424.2 COL13A1 gene_id:1305|Hs109|chr10 ( 695) 1558 137.9 4.6e-32
CCDS44423.2 COL13A1 gene_id:1305|Hs109|chr10 ( 668) 1552 137.4 6.3e-32
CCDS75932.1 COL5A1 gene_id:1289|Hs109|chr9 (1838) 1511 134.8 1e-30
CCDS6982.1 COL5A1 gene_id:1289|Hs109|chr9 (1838) 1511 134.8 1e-30
CCDS44425.2 COL13A1 gene_id:1305|Hs109|chr10 ( 686) 1487 132.3 2.2e-30
CCDS44428.2 COL13A1 gene_id:1305|Hs109|chr10 ( 610) 1478 131.5 3.4e-30
CCDS8759.1 COL2A1 gene_id:1280|Hs109|chr12 (1418) 1473 131.7 7.1e-30
CCDS41778.1 COL2A1 gene_id:1280|Hs109|chr12 (1487) 1473 131.7 7.3e-30
CCDS2773.1 COL7A1 gene_id:1294|Hs109|chr3 (2944) 1460 131.1 2.1e-29
CCDS12222.1 COL5A3 gene_id:50509|Hs109|chr19 (1745) 1436 128.9 6e-29
CCDS44427.2 COL13A1 gene_id:1305|Hs109|chr10 ( 645) 1411 126.3 1.4e-28
CCDS9511.1 COL4A1 gene_id:1282|Hs109|chr13 (1669) 1395 125.6 5.5e-28
CCDS42828.1 COL4A4 gene_id:1286|Hs109|chr2 (1690) 1389 125.2 7.7e-28
CCDS44419.1 COL13A1 gene_id:1305|Hs109|chr10 ( 717) 1376 123.6 9.8e-28
CCDS41353.1 COL24A1 gene_id:255631|Hs109|chr1 (1714) 1337 121.1 1.3e-26
CCDS4436.1 COL23A1 gene_id:91522|Hs109|chr5 ( 540) 1313 118.4 2.6e-26
CCDS6802.1 COL27A1 gene_id:85301|Hs109|chr9 (1860) 1299 118.2 1.1e-25
CCDS6376.1 COL22A1 gene_id:169044|Hs109|chr8 (1626) 1297 117.9 1.1e-25
CCDS13505.1 COL9A3 gene_id:1299|Hs109|chr20 ( 684) 1287 116.5 1.2e-25
CCDS35366.1 COL4A5 gene_id:1287|Hs109|chrX (1691) 1282 116.7 2.6e-25
CCDS41297.1 COL16A1 gene_id:1307|Hs109|chr1 (1604) 1269 115.7 5.2e-25
CCDS14542.1 COL4A6 gene_id:1288|Hs109|chrX (1690) 1261 115.1 8.3e-25
CCDS14541.1 COL4A6 gene_id:1288|Hs109|chrX (1691) 1261 115.1 8.3e-25
CCDS76010.1 COL4A6 gene_id:1288|Hs109|chrX (1707) 1261 115.1 8.4e-25
CCDS14543.1 COL4A5 gene_id:1287|Hs109|chrX (1685) 1251 114.3 1.4e-24
CCDS450.1 COL9A2 gene_id:1298|Hs109|chr1 ( 689) 1231 112.1 2.6e-24
CCDS76008.1 COL4A6 gene_id:1288|Hs109|chrX (1633) 1229 112.5 4.7e-24
CCDS76009.1 COL4A6 gene_id:1288|Hs109|chrX (1666) 1229 112.6 4.8e-24
CCDS11561.1 COL1A1 gene_id:1277|Hs109|chr17 (1464) 1213 111.2 1.1e-23
CCDS2297.1 COL3A1 gene_id:1281|Hs109|chr2 (1466) 1206 110.7 1.6e-23
CCDS41907.1 COL4A2 gene_id:1284|Hs109|chr13 (1712) 1171 108.0 1.1e-22
CCDS42971.1 COL18A1 gene_id:80781|Hs109|chr21 (1339) 1126 104.3 1.2e-21
CCDS42972.1 COL18A1 gene_id:80781|Hs109|chr21 (1519) 1126 104.4 1.3e-21
CCDS77643.1 COL18A1 gene_id:80781|Hs109|chr21 (1754) 1126 104.5 1.4e-21
CCDS7554.1 COL17A1 gene_id:1308|Hs109|chr10 (1497) 1087 101.3 1e-20
CCDS42829.1 COL4A3 gene_id:1285|Hs109|chr2 (1670) 1067 99.8 3.3e-20
CCDS4970.1 COL19A1 gene_id:1310|Hs109|chr6 (1142) 1008 94.9 6.8e-19
CCDS43553.1 COL28A1 gene_id:340267|Hs109|chr7 (1125) 952 90.5 1.4e-17
CCDS780.2 COL11A1 gene_id:1301|Hs109|chr1 (1690) 878 84.9 1e-15
CCDS53348.1 COL11A1 gene_id:1301|Hs109|chr1 (1767) 878 84.9 1e-15
CCDS778.1 COL11A1 gene_id:1301|Hs109|chr1 (1806) 878 85.0 1.1e-15
CCDS34682.1 COL1A2 gene_id:1278|Hs109|chr7 (1366) 855 83.0 3.2e-15
CCDS43452.1 COL11A2 gene_id:1302|Hs109|chr6 (1650) 839 81.8 8.4e-15
CCDS35081.1 COL15A1 gene_id:1306|Hs109|chr9 (1388) 837 81.5 8.6e-15
CCDS76649.1 COL4A1 gene_id:1282|Hs109|chr13 ( 519) 808 78.6 2.5e-14
CCDS33350.1 COL5A2 gene_id:1290|Hs109|chr2 (1499) 810 79.5 3.9e-14
CCDS55025.1 COL21A1 gene_id:81578|Hs109|chr6 ( 957) 777 76.6 1.9e-13
>>CCDS43258.1 COL25A1 gene_id:84570|Hs109|chr4 (654 aa)
initn: 4805 init1: 4805 opt: 4805 Z-score: 2072.5 bits: 393.8 E(33420): 4.2e-109
Smith-Waterman score: 4805; 100.0% identity (100.0% similar) in 654 aa overlap (1-654:1-654)
10 20 30 40 50 60
pF1KE2 MLLKKHAGKGGGREPRSEDPTPAEQHCARTMPPCAVLAALLSVVAVVSCLYLGVKTNDLQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 MLLKKHAGKGGGREPRSEDPTPAEQHCARTMPPCAVLAALLSVVAVVSCLYLGVKTNDLQ
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE2 ARIAALESAKGAPSIHLLPDTLDHLKTMVQEKVERLLAQKSYEHMAKIRIAREAPSECNC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 ARIAALESAKGAPSIHLLPDTLDHLKTMVQEKVERLLAQKSYEHMAKIRIAREAPSECNC
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE2 PAGPPGKRGKRGRRGESGPPGQPGPQGPPGPKGDKGEQGDQGPRMVFPKINHGFLSADQQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 PAGPPGKRGKRGRRGESGPPGQPGPQGPPGPKGDKGEQGDQGPRMVFPKINHGFLSADQQ
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE2 LIKRRLIKGDQGQAGPPGPPGPPGPRGPPGDTGKDGPRGMPGVPGEPGKPGEQGLMGPLG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 LIKRRLIKGDQGQAGPPGPPGPPGPRGPPGDTGKDGPRGMPGVPGEPGKPGEQGLMGPLG
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE2 PPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQNGIPGPKGEPGEQGEKGDAGENGPKGDTG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 PPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQNGIPGPKGEPGEQGEKGDAGENGPKGDTG
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE2 EKGDPGSSAAGIKGEPGESGRPGQKGEPGLPGLPGLPGIKGEPGFIGPQGEPGLPGLPGT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 EKGDPGSSAAGIKGEPGESGRPGQKGEPGLPGLPGLPGIKGEPGFIGPQGEPGLPGLPGT
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE2 KGERGEAGPPGRGERGEPGAPGPKGKQGESGTRGPKGSKGDRGEKGDSGAQGPRGPPGQK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 KGERGEAGPPGRGERGEPGAPGPKGKQGESGTRGPKGSKGDRGEKGDSGAQGPRGPPGQK
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE2 GDQGATEIIDYNGNLHEALQRITTLTVTGPPGPPGPQGLQGPKGEQGSPGIPGMDGEQGL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 GDQGATEIIDYNGNLHEALQRITTLTVTGPPGPPGPQGLQGPKGEQGSPGIPGMDGEQGL
430 440 450 460 470 480
490 500 510 520 530 540
pF1KE2 KGSKGDMGDPGMTGEKGGIGLPGLPGANGMKGEKGDSGMPGPQGPSIIGPPGPPGPHGPP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 KGSKGDMGDPGMTGEKGGIGLPGLPGANGMKGEKGDSGMPGPQGPSIIGPPGPPGPHGPP
490 500 510 520 530 540
550 560 570 580 590 600
pF1KE2 GPMGPHGLPGPKGTDGPMGPHGPAGPKGERGEKGAMGEPGPRGPYGLPGKDGEPGLDGFP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 GPMGPHGLPGPKGTDGPMGPHGPAGPKGERGEKGAMGEPGPRGPYGLPGKDGEPGLDGFP
550 560 570 580 590 600
610 620 630 640 650
pF1KE2 GPRGEKGDLGEKGEKGFRGVKGEKGEPGQPGLDGLDAPCQLGPDGLPMPGCWQK
::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 GPRGEKGDLGEKGEKGFRGVKGEKGEPGQPGLDGLDAPCQLGPDGLPMPGCWQK
610 620 630 640 650
>>CCDS43259.1 COL25A1 gene_id:84570|Hs109|chr4 (642 aa)
initn: 4497 init1: 4497 opt: 4497 Z-score: 1941.4 bits: 369.5 E(33420): 8.4e-102
Smith-Waterman score: 4497; 100.0% identity (100.0% similar) in 615 aa overlap (1-615:1-615)
10 20 30 40 50 60
pF1KE2 MLLKKHAGKGGGREPRSEDPTPAEQHCARTMPPCAVLAALLSVVAVVSCLYLGVKTNDLQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 MLLKKHAGKGGGREPRSEDPTPAEQHCARTMPPCAVLAALLSVVAVVSCLYLGVKTNDLQ
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE2 ARIAALESAKGAPSIHLLPDTLDHLKTMVQEKVERLLAQKSYEHMAKIRIAREAPSECNC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 ARIAALESAKGAPSIHLLPDTLDHLKTMVQEKVERLLAQKSYEHMAKIRIAREAPSECNC
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE2 PAGPPGKRGKRGRRGESGPPGQPGPQGPPGPKGDKGEQGDQGPRMVFPKINHGFLSADQQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 PAGPPGKRGKRGRRGESGPPGQPGPQGPPGPKGDKGEQGDQGPRMVFPKINHGFLSADQQ
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE2 LIKRRLIKGDQGQAGPPGPPGPPGPRGPPGDTGKDGPRGMPGVPGEPGKPGEQGLMGPLG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 LIKRRLIKGDQGQAGPPGPPGPPGPRGPPGDTGKDGPRGMPGVPGEPGKPGEQGLMGPLG
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE2 PPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQNGIPGPKGEPGEQGEKGDAGENGPKGDTG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 PPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQNGIPGPKGEPGEQGEKGDAGENGPKGDTG
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE2 EKGDPGSSAAGIKGEPGESGRPGQKGEPGLPGLPGLPGIKGEPGFIGPQGEPGLPGLPGT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 EKGDPGSSAAGIKGEPGESGRPGQKGEPGLPGLPGLPGIKGEPGFIGPQGEPGLPGLPGT
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE2 KGERGEAGPPGRGERGEPGAPGPKGKQGESGTRGPKGSKGDRGEKGDSGAQGPRGPPGQK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 KGERGEAGPPGRGERGEPGAPGPKGKQGESGTRGPKGSKGDRGEKGDSGAQGPRGPPGQK
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE2 GDQGATEIIDYNGNLHEALQRITTLTVTGPPGPPGPQGLQGPKGEQGSPGIPGMDGEQGL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 GDQGATEIIDYNGNLHEALQRITTLTVTGPPGPPGPQGLQGPKGEQGSPGIPGMDGEQGL
430 440 450 460 470 480
490 500 510 520 530 540
pF1KE2 KGSKGDMGDPGMTGEKGGIGLPGLPGANGMKGEKGDSGMPGPQGPSIIGPPGPPGPHGPP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 KGSKGDMGDPGMTGEKGGIGLPGLPGANGMKGEKGDSGMPGPQGPSIIGPPGPPGPHGPP
490 500 510 520 530 540
550 560 570 580 590 600
pF1KE2 GPMGPHGLPGPKGTDGPMGPHGPAGPKGERGEKGAMGEPGPRGPYGLPGKDGEPGLDGFP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 GPMGPHGLPGPKGTDGPMGPHGPAGPKGERGEKGAMGEPGPRGPYGLPGKDGEPGLDGFP
550 560 570 580 590 600
610 620 630 640 650
pF1KE2 GPRGEKGDLGEKGEKGFRGVKGEKGEPGQPGLDGLDAPCQLGPDGLPMPGCWQK
:::::::::::::::
CCDS43 GPRGEKGDLGEKGEKVTSPSQHVPCLILLLLSALLFSLCDSI
610 620 630 640
>>CCDS58922.1 COL25A1 gene_id:84570|Hs109|chr4 (645 aa)
initn: 3269 init1: 1584 opt: 3242 Z-score: 1406.8 bits: 270.6 E(33420): 5e-72
Smith-Waterman score: 3941; 86.1% identity (86.9% similar) in 656 aa overlap (1-615:1-618)
10 20 30 40 50 60
pF1KE2 MLLKKHAGKGGGREPRSEDPTPAEQHCARTMPPCAVLAALLSVVAVVSCLYLGVKTNDLQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 MLLKKHAGKGGGREPRSEDPTPAEQHCARTMPPCAVLAALLSVVAVVSCLYLGVKTNDLQ
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE2 ARIAALESAKGAPSIHLLPDTLDHLKTMVQEKVERLLAQKSYEHMAKIRIAREAPSECNC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 ARIAALESAKGAPSIHLLPDTLDHLKTMVQEKVERLLAQKSYEHMAKIRIAREAPSECNC
70 80 90 100 110 120
130 140 150 160 170
pF1KE2 PAGPPGKRGKRGRRGESGPPGQPGPQGPPGPKGDKGEQGDQGPRMV--FPKINHGFLSAD
:::::::::::::::::::::::::::::::::::::::::::: . :: . : ..
CCDS58 PAGPPGKRGKRGRRGESGPPGQPGPQGPPGPKGDKGEQGDQGPRGLPGFPTVAA--LHSN
130 140 150 160 170
180 190 200 210 220 230
pF1KE2 QQLIKRRLIKGDQGQAGPPGPPGPPGPRGPPGDTGKDGPRGMPGVPGEPGKPGEQGLMGP
: : .:::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 QILT----VKGDQGQAGPPGPPGPPGPRGPPGDTGKDGPRGMPGVPGEPGKPGEQGLMGP
180 190 200 210 220 230
240 250 260 270 280 290
pF1KE2 LGPPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQNGIPGPKGEPGEQGEKGDAGENGPKGD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 LGPPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQNGIPGPKGEPGEQGEKGDAGENGPKGD
240 250 260 270 280 290
300 310 320 330 340 350
pF1KE2 TGEKGDPGSSAAGIKGEPGESGRPGQKGEPGLPGLPGLPGIKGEPGFIGPQGEPGLPGLP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 TGEKGDPGSSAAGIKGEPGESGRPGQKGEPGLPGLPGLPGIKGEPGFIGPQGEPGLPGLP
300 310 320 330 340 350
360 370 380 390 400 410
pF1KE2 GTKGERGEAGPPGRGERGEPGAPGPKGKQGESGTRGPKGSKGDRGEKGDSGAQGPRGPPG
::::::::::::::::::::::::::: ::::::::::::::::::
CCDS58 GTKGERGEAGPPGRGERGEPGAPGPKG---------------DRGEKGDSGAQGPRGPPG
360 370 380 390
420 430 440 450 460 470
pF1KE2 QKGDQGATEIIDYNGNLHEALQRITTLTVTGPPGPPGPQGLQGPKGEQGSPGIPGMDGEQ
:::::::::::::::::::::: ::::::::::::::::::::::::::::::
CCDS58 QKGDQGATEIIDYNGNLHEALQ--------GPPGPPGPQGLQGPKGEQGSPGIPGMDGEQ
400 410 420 430 440 450
480 490 500 510 520 530
pF1KE2 GLKGSKGDMGDPGMTGEKGGIGLPGLPGANGMKGEKGDSGMPGPQGPSIIGPPGPPGPHG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 GLKGSKGDMGDPGMTGEKGGIGLPGLPGANGMKGEKGDSGMPGPQGPSIIGPPGPPGPHG
460 470 480 490 500 510
540 550
pF1KE2 PPGPMGPHGLPGPKG---------------------------------------TDGPMG
::::::::::::::: ::::::
CCDS58 PPGPMGPHGLPGPKGEPGLNGVKGLKGEPGQKGDRGPLGLPGASGLDGKPGSRGTDGPMG
520 530 540 550 560 570
560 570 580 590 600 610
pF1KE2 PHGPAGPKGERGEKGAMGEPGPRGPYGLPGKDGEPGLDGFPGPRGEKGDLGEKGEKGFRG
:::::::::::::::::::::::::::::: :::::::::::::::::
CCDS58 PHGPAGPKGERGEKGAMGEPGPRGPYGLPG---------FPGPRGEKGDLGEKGEKVTSP
580 590 600 610 620
620 630 640 650
pF1KE2 VKGEKGEPGQPGLDGLDAPCQLGPDGLPMPGCWQK
CCDS58 SQHVPCLILLLLSALLFSLCDSI
630 640
>>CCDS44424.2 COL13A1 gene_id:1305|Hs109|chr10 (695 aa)
initn: 4103 init1: 944 opt: 1558 Z-score: 689.3 bits: 137.9 E(33420): 4.6e-32
Smith-Waterman score: 1651; 43.0% identity (57.0% similar) in 691 aa overlap (28-644:37-685)
10 20 30 40 50
pF1KE2 MLLKKHAGKGGGREPRSEDPTPAEQHCARTMPP--CAVLAALLSVVAVVSCLYLGVK
:: : :..:. : .:. : .
CCDS44 HKAAATGARGPGELGAPGTVALVAARAERGARLPSPGSCGLLTLALCSLAL--SLLAHFR
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE2 TNDLQARIAALESAKGAPSIHLLPDTLDHLKTMVQEKVERLLAQKSYEHMAKIRIAREAP
: .::::. ::. .: ....: . .:..:: .: : . : : ..
CCDS44 TAELQARVLRLEAERGE----------QQMETAILGRVNQLLDEKWKLHSRRRREAPKTS
70 80 90 100 110
120 130 140 150 160 170
pF1KE2 SECNCPAGPPGKRGKRGRRGESGPPGQPGPQGPPGPKG-----DKGEQGDQGPRMVFPKI
:::: :::: :. : :..: :.:: : :: : .: :. : : :: .
CCDS44 PGCNCPPGPPGPTGRPGLPGDKGAIGMPGRVGSPGDAGLSIIGPRGPPGQPGTRG-FPGF
120 130 140 150 160 170
180 190 200 210 220
pF1KE2 NHGFLSADQQLIKRRLIKGDQGQAGPPGPPGPPGPRGPPGDTGKDGPR----GMPG----
: .. : . . :::.: .:::: ::: : .: :. :. : .::.
CCDS44 P-GPIGLDGK-PGHPGPKGDMGLTGPPGQPGPQGQKGEKGQCGEYPHRECLSSMPAALRS
180 190 200 210 220 230
230 240 250 260 270
pF1KE2 -----VPGEPGKPGEQGLMGPLGPPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQNGIPGP
. :: .. . :: :: :::: .: .: ::.:: : ::::: :::
CCDS44 SQIIALKGEQSQASIQGPPGPPGPPGPSGPLGHPGLPG---PMGPPGLPGP------PGP
240 250 260 270 280
280 290 300 310 320 330
pF1KE2 KGEPGEQGEKGDAGENG-P--KGDTGEKGDPGSSAAGIKGEPGESGRPGQKGEPGLPGLP
::.:: :: .: :: : : : : :: :: ..::.::::: : :.:: : ::::
CCDS44 KGDPGIQGYHGRKGERGMPGMPGKHGAKGAPGIAVAGMKGEPGIPGTKGEKGAEGSPGLP
290 300 310 320 330 340
340 350 360 370 380 390
pF1KE2 GLPGIKGEPGFIGPQGEPGLPGLPGTKGERGEAGPPGRGERGEPGAPGPKGKQGESGTRG
:: : ::: .:. : . : ::: :::: :: :::::. : .: :
CCDS44 GLLGQKGE------KGDAG----NSIGGGRGEPGPPGL-----PGPPGPKGEAGVDGQVG
350 360 370 380
400 410 420 430 440 450
pF1KE2 PKGSKGDRGEKGDSGAQGPRGPPGQKGDQGATEIIDYNGNLHEALQRITTLTVTGPPG--
: :. ::.::.: .: ::: :: :.::. : :..:::::..::::.: ::.. ::::
CCDS44 PPGQPGDKGERGAAGEQGPDGPKGSKGEPGKGEMVDYNGNINEALQEIRTLALMGPPGLP
390 400 410 420 430 440
460 470 480 490
pF1KE2 ----PPGPQGLQGPKGEQGSPGIPGMDGEQGLKGSKGDMGDPGMTGEKG-----GI----
::: :. : ::: : :: :: :::.: .:. :::: :: : : :.
CCDS44 GQIGPPGAPGIPGQKGEIGLPGPPGHDGEKGPRGKPGDMGPPGPQGPPGKDGPPGVKGEN
450 460 470 480 490 500
500 510 520 530 540 550
pF1KE2 GLPGLPGANGMKGEKGDSGMPGPQGPSI-IGPPG---P--PGPHGPPGPMGPHGLPGPKG
: :: :: .: ::: :..: :: .: . : :: : :::.::::: : .:.:::::
CCDS44 GHPGSPGEKGEKGETGQAGSPGEKGEAGEKGNPGAEVPGLPGPEGPPGPPGLQGVPGPKG
510 520 530 540 550 560
560 570 580 590
pF1KE2 T---DGPMGPHGPAGPKGERGE------------------KGAMGEPGPRGPYGLPGKDG
:: : .: : ::.:: : .: ::: :: : :. :
CCDS44 EAGLDGAKGEKGFQGEKGDRGPLGLPGASGLDGRPGPPGTPGPIGVPGPAGPKGERGSKG
570 580 590 600 610 620
600 610 620 630 640
pF1KE2 EPGLDG------FPG---PRGEKGDLGEKGEKGFRGVKGEKGEPGQPGLDGLDAPCQLGP
.::. : .:: : :.::. ::.:.:: :: ::.::. : :: ::::: ::
CCDS44 DPGMTGPTGAAGLPGLHGPPGDKGNRGERGKKGSRGPKGDKGDQGAPG---LDAPCPLGE
630 640 650 660 670 680
650
pF1KE2 DGLPMPGCWQK
:
CCDS44 DGLPVQGCWNK
690
>>CCDS44423.2 COL13A1 gene_id:1305|Hs109|chr10 (668 aa)
initn: 4618 init1: 1048 opt: 1552 Z-score: 686.9 bits: 137.4 E(33420): 6.3e-32
Smith-Waterman score: 1761; 44.5% identity (58.5% similar) in 674 aa overlap (28-654:37-668)
10 20 30 40 50
pF1KE2 MLLKKHAGKGGGREPRSEDPTPAEQHCARTMPP--CAVLAALLSVVAVVSCLYLGVK
:: : :..:. : .:. : .
CCDS44 HKAAATGARGPGELGAPGTVALVAARAERGARLPSPGSCGLLTLALCSLAL--SLLAHFR
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE2 TNDLQARIAALESAKGAPSIHLLPDTLDHLKTMVQEKVERLLAQKSYEHMAKIRIAREAP
: .::::. ::. .: ....: . .:..:: .: : . : : ..
CCDS44 TAELQARVLRLEAERGE----------QQMETAILGRVNQLLDEKWKLHSRRRREAPKTS
70 80 90 100 110
120 130 140 150 160 170
pF1KE2 SECNCPAGPPGKRGKRGRRGESGPPGQPGPQGPPGPKG-----DKGEQGDQGPRMVFPKI
:::: :::: :. : :..: :.:: : :: : .: :. : : :: .
CCDS44 PGCNCPPGPPGPTGRPGLPGDKGAIGMPGRVGSPGDAGLSIIGPRGPPGQPGTRG-FPGF
120 130 140 150 160 170
180 190 200 210 220
pF1KE2 NHGFLSADQQLIKRRLIKGDQGQAGPPGPPGPPGPRGPPGDTGKDGPR----GMPG----
: .. : . . :::.: .:::: ::: : .: :. :. : .::.
CCDS44 P-GPIGLDGK-PGHPGPKGDMGLTGPPGQPGPQGQKGEKGQCGEYPHRECLSSMPAALRS
180 190 200 210 220 230
230 240 250 260 270
pF1KE2 -----VPGEPGKPGEQGLMGPLGPPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQNGIPGP
. :: .. . :: :: :::: .: .: ::.:: : ::::: :::
CCDS44 SQIIALKGEQSQASIQGPPGPPGPPGPSGPLGHPGLPG---PMGPPGLPGP------PGP
240 250 260 270 280
280 290 300 310 320 330
pF1KE2 KGEPGEQGEKGDAGENG-P--KGDTGEKGDPGSSAAGIKGEPGESGRPGQKGEPGLPGLP
::.:: :: .: :: : : : : :: :: ..::.::::: : :.:: : ::::
CCDS44 KGDPGIQGYHGRKGERGMPGMPGKHGAKGAPGIAVAGMKGEPGIPGTKGEKGAEGSPGLP
290 300 310 320 330 340
340 350 360 370 380 390
pF1KE2 GLPGIKGEPGFIGPQGEPGLPGLPGTKGERGEAGPPGRGERGEPGAPGPKGKQGESGTRG
:: : ::: : : . : ::: :::: :: :::::. : .: :
CCDS44 GLLGQKGEKGDAGNS----------IGGGRGEPGPPGL-----PGPPGPKGEAGVDGQVG
350 360 370 380
400 410 420 430 440 450
pF1KE2 PKGSKGDRGEKGDSGAQGPRGPPGQKGDQGATEIIDYNGNLHEALQRITTLTVTGPPG--
: :. ::.::.: .: ::: :: :.::. : :..:::::..::::.: ::.. ::::
CCDS44 PPGQPGDKGERGAAGEQGPDGPKGSKGEPGKGEMVDYNGNINEALQEIRTLALMGPPGLP
390 400 410 420 430 440
460 470 480 490
pF1KE2 ----PPGPQGLQGPKGEQGSPGIPGMDGEQGLKGSKGDMGDPGMTGEKG-----GI----
::: :. : ::: : :: :: :::.: .:. :::: :: : : :.
CCDS44 GQIGPPGAPGIPGQKGEIGLPGPPGHDGEKGPRGKPGDMGPPGPQGPPGKDGPPGVKGEN
450 460 470 480 490 500
500 510 520 530 540 550
pF1KE2 GLPGLPGANGMKGEKGDSGMPGPQGPSIIGPPGPPGPHGPPGPMGPHGLPGPKGTDGPMG
: :: :: .: ::: :..: : : :. ::::::: .: ::: : :: : :: : .:
CCDS44 GHPGSPGEKGEKGETGQAGSPVPGLPGPEGPPGPPGLQGVPGPKGEAGLDGAKGEKGFQG
510 520 530 540 550 560
560 570 580 590 600 610
pF1KE2 PHGPAGPKGERGEKGAMGEPGPRGPYGLPGKDGEPGLDG------FPG---PRGEKGDLG
.: :: : : : .: ::: :: : :. :.::. : .:: : :.::. :
CCDS44 EKGDRGPLGLPGTPGPIGVPGPAGPKGERGSKGDPGMTGPTGAAGLPGLHGPPGDKGNRG
570 580 590 600 610 620
620 630 640 650
pF1KE2 EKGEKGFRGVKGEKGEPGQPGLDGLDAPCQLGPDGLPMPGCWQK
:.:.:: :: ::.::. : :: ::::: :: ::::. :::.:
CCDS44 ERGKKGSRGPKGDKGDQGAPG---LDAPCPLGEDGLPVQGCWNK
630 640 650 660
>>CCDS75932.1 COL5A1 gene_id:1289|Hs109|chr9 (1838 aa)
initn: 2556 init1: 562 opt: 1511 Z-score: 665.1 bits: 134.8 E(33420): 1e-30
Smith-Waterman score: 1539; 43.7% identity (56.7% similar) in 600 aa overlap (113-649:474-1067)
90 100 110 120 130 140
pF1KE2 DHLKTMVQEKVERLLAQKSYEHMAKIRIAREAPSECNCPAGPPGKRGKRGRRGESGPPGQ
:.:. : : : :. : :: ::::.
CCDS75 GIGGPRGEKGQKGEPAIIEPGMLIEGPPGPEGPAGLPGPPGTMGPTGQVGDPGERGPPGR
450 460 470 480 490 500
150 160 170 180
pF1KE2 PG-P-----QGPPGPK-------GDKGEQGDQGPRMVFPKINHGFLSADQQLIKRRLIKG
:: : :::: : :. :..:: :: . ... .: . : :
CCDS75 PGLPGADGLPGPPGTMLMLPFRFGGGGDAGSKGP-MVSAQESQAQAILQQARLALRGPAG
510 520 530 540 550 560
190 200 210 220 230
pF1KE2 DQGQAGPPGPPGPPGP---RGPPGDTGKDGPRGMPGVPGEPGKPGEQG---------LMG
.: .: ::: :::: .: :::.: .::::. : :: ::::..: . :
CCDS75 PMGLTGRPGPVGPPGSGGLKGEPGDVGPQGPRGVQGPPGPAGKPGRRGRAGSDGARGMPG
570 580 590 600 610 620
240 250 260 270 280 290
pF1KE2 PLGPPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQNGIPGPKGEPGEQGEKGDAGENGPKG
:: :..: : :.:: .:..:.:: : : : : .:. :: : .: :: ::.:
CCDS75 QTGPKGDRGFDGLAGLPGEKGHRGDPGPSGPPGPPGDDGERGDDGEVGPRGLPGEPGPRG
630 640 650 660 670 680
300 310 320 330 340
pF1KE2 DTGEKGDPGS----SAAGIKGEPG---------ESGRPGQKGEPGLPGLPGLPGIKGEPG
: :: :: ...:. :.:: : : :::.:.:: :::: : : ::
CCDS75 LLGPKGPPGPPGPPGVTGMDGQPGPKGNVGPQGEPGPPGQQGNPGAQGLPGPQGAIGPPG
690 700 710 720 730 740
350 360 370 380 390 400
pF1KE2 FIGPQGEPGLPGLPGTKG---ERGEAGPPGRGERGEPGAPGPKGKQGESGTRGPKGSKGD
:: :.:::::.::. : . :. :::: :.: : :::.: : : :: ::. :
CCDS75 EKGPLGKPGLPGMPGADGPPGHPGKEGPPG--EKGGQGPPGPQGPIGYPGPRGVKGADGI
750 760 770 780 790 800
410 420 430 440 450
pF1KE2 RGEKGDSGAQGPRGPPGQKGDQGATEIIDYNGNLHEALQRITTLTVT-----GPPGPPGP
:: :: .: .: : :: :::.: : :.. : :: : :::
CCDS75 RGLKGTKGEKGEDGFPGFKGDMG---IKGDRGEIGPPGPRGEDGPEGPKGRGGPNGDPGP
810 820 830 840 850
460 470 480 490 500 510
pF1KE2 QGLQGPKGEQGSPGIPGMDGEQGLKGSKGDMGDPGMTGEKGGIGLPGLPGANGMKG----
: : ::. : ::.::. :.:: ::: : : :: .::::: : :: :: :..:
CCDS75 LGPPGEKGKLGVPGLPGYPGRQGPKGSIGFPGFPGANGEKGGRGTPGKPGPRGQRGPTGP
860 870 880 890 900 910
520 530 540 550 560
pF1KE2 --EKGDSGM---PGPQGPSII-GPPGPPGPHGPPGPMGPHGLPGPKGTDGPMGPHGPAGP
:.: :. :::.: : :: :::: .:: ::.:: :.::::: :: : : :
CCDS75 RGERGPRGITGKPGPKGNSGGDGPAGPPGERGPNGPQGPTGFPGPKGPPGPPGKDGLPGH
920 930 940 950 960 970
570 580 590 600 610 620
pF1KE2 KGERGEKGAMGEPGPRGPYGLPGKDGEPGLDGFPGPRGEKGDLGEKGEKGFRGVKGE---
:.::: : .:. :: :: :. : .: : : : ::. : : ::.:. :. :.
CCDS75 PGQRGETGFQGKTGPPGPPGVVGPQGPTGETGPMGERGHPGPPGPPGEQGLPGLAGKEGT
980 990 1000 1010 1020 1030
630 640 650
pF1KE2 KGEPGQPGLDGLDAPCQL----GPDGLPMPGCWQK
::.:: :: : :.: : : ::: :
CCDS75 KGDPGPAGLPGKDGPPGLRGFPGDRGLPGPVGALGLKGNEGPPGPPGPAGSPGERGPAGA
1040 1050 1060 1070 1080 1090
>--
initn: 2506 init1: 561 opt: 889 Z-score: 400.2 bits: 85.8 E(33420): 5.8e-16
Smith-Waterman score: 1402; 46.9% identity (57.5% similar) in 475 aa overlap (123-594:1162-1575)
100 110 120 130 140 150
pF1KE2 VERLLAQKSYEHMAKIRIAREAPSECNCPAGPPGKRGKRGRRGESGPPGQPGPQGPPGPK
: ::..:..: .::.:::: ::::: :
CCDS75 GRDGLQGPVGLPGPAGPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGPPGPTGPQGPIGQP
1140 1150 1160 1170 1180 1190
160 170 180 190 200 210
pF1KE2 GDKGEQGDQGPRMVFPKINHGFLSADQQLIKRRLIKGDQGQAGPPGPPGPPGPRGPPGDT
: .: .:. ::: ..: :. :..:. :: : :::::: :
CCDS75 GPSGADGEPGPR------------GQQGLF------GQKGDEGPRGFPGPPGPVGL----
1200 1210 1220
220 230 240 250 260 270
pF1KE2 GKDGPRGMPGVPGEPGKPGEQGLMGPLGPPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQN
.:.:: ::: :. :. : ::: :::: .: :::: : .: : : :::::..
CCDS75 -----QGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGAPGADGPQGPPGGIGNPGAVGEK
1230 1240 1250 1260 1270 1280
280 290 300 310 320 330
pF1KE2 GIPGPKGEPGEQGEKGDAGENGPKGDTGEKGDPGSSAAGIKGEPGESGRPGQKGEPGLPG
: :: :::: :: .: ::::. ::::. : : : : :: .: ::. : : ::
CCDS75 GEPGEAGEPGLPGE---GGPPGPKGERGEKGESGPS--GAAGPPGPKGPPGDDGPKGSPG
1290 1300 1310 1320 1330
340 350 360 370 380 390
pF1KE2 LPGLPGIKGEPGFIGPQGEPGLPGLPGTKGERGEAGPPGRGERGEPGAPGPKGKQGESGT
:.:: : :: :: :. : :: : :: :..: : : :::: :: ::.: :
CCDS75 PVGFPGDPGPPGEPGPAGQDGPPGDKGDDGEPGQTGSP--GPTGEPGPSGPPGKRGPPGP
1340 1350 1360 1370 1380 1390
400 410 420 430 440 450
pF1KE2 RGPKGSKGDRGEKGDSGAQGPRGPPGQKGDQGATEIIDYNGNLHEALQRITTLTVTGPPG
::.: .:..: ::..: .:: : : : ::: : ::
CCDS75 AGPEGRQGEKGAKGEAGLEGPPGKTGPIGPQGAP----------------------GKPG
1400 1410 1420 1430
460 470 480 490 500
pF1KE2 PPGPQGLQGPKGEQGSPGIPGMDGEQGLKGSKGD---MGDPGMTGEKGGIGLPGLPGANG
: : .:. :: :::: :: :: :: : : : :: : :::: :: :: : :
CCDS75 PDGLRGIPGPVGEQGLPGSPGPDGPPGPMGPPGLPGLKGDSGPKGEKGHPGLIGLIGPPG
1440 1450 1460 1470 1480 1490
510 520 530 540 550 560
pF1KE2 MKGEKGDSGMPGPQGPSIIGPPGPPGPHGPPGPMGPHGLPGPKGTDGPMGPHGPAGPKGE
.::::: :.::::: : :: : : :: ::.:: ::: : :: ::.: : .:
CCDS75 EQGEKGDRGLPGPQGSS--GPKGEQGITGPSGPIGP---PGPPGLPGPPGPKGAKGSSGP
1500 1510 1520 1530 1540 1550
570 580 590 600 610 620
pF1KE2 RGEKGAMGEPGPRGPYGLPGKDGEPGLDGFPGPRGEKGDLGEKGEKGFRGVKGEKGEPGQ
: :: :.::: :: : ::. .:
CCDS75 TGPKGEAGHPGPPGPPGPPGEVIQPLPIQASRTRRNIDASQLLDDGNGENYVDYADGMEE
1560 1570 1580 1590 1600 1610
>>CCDS6982.1 COL5A1 gene_id:1289|Hs109|chr9 (1838 aa)
initn: 2556 init1: 562 opt: 1511 Z-score: 665.1 bits: 134.8 E(33420): 1e-30
Smith-Waterman score: 1539; 43.7% identity (56.7% similar) in 600 aa overlap (113-649:474-1067)
90 100 110 120 130 140
pF1KE2 DHLKTMVQEKVERLLAQKSYEHMAKIRIAREAPSECNCPAGPPGKRGKRGRRGESGPPGQ
:.:. : : : :. : :: ::::.
CCDS69 GIGGPRGEKGQKGEPAIIEPGMLIEGPPGPEGPAGLPGPPGTMGPTGQVGDPGERGPPGR
450 460 470 480 490 500
150 160 170 180
pF1KE2 PG-P-----QGPPGPK-------GDKGEQGDQGPRMVFPKINHGFLSADQQLIKRRLIKG
:: : :::: : :. :..:: :: . ... .: . : :
CCDS69 PGLPGADGLPGPPGTMLMLPFRFGGGGDAGSKGP-MVSAQESQAQAILQQARLALRGPAG
510 520 530 540 550 560
190 200 210 220 230
pF1KE2 DQGQAGPPGPPGPPGP---RGPPGDTGKDGPRGMPGVPGEPGKPGEQG---------LMG
.: .: ::: :::: .: :::.: .::::. : :: ::::..: . :
CCDS69 PMGLTGRPGPVGPPGSGGLKGEPGDVGPQGPRGVQGPPGPAGKPGRRGRAGSDGARGMPG
570 580 590 600 610 620
240 250 260 270 280 290
pF1KE2 PLGPPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQNGIPGPKGEPGEQGEKGDAGENGPKG
:: :..: : :.:: .:..:.:: : : : : .:. :: : .: :: ::.:
CCDS69 QTGPKGDRGFDGLAGLPGEKGHRGDPGPSGPPGPPGDDGERGDDGEVGPRGLPGEPGPRG
630 640 650 660 670 680
300 310 320 330 340
pF1KE2 DTGEKGDPGS----SAAGIKGEPG---------ESGRPGQKGEPGLPGLPGLPGIKGEPG
: :: :: ...:. :.:: : : :::.:.:: :::: : : ::
CCDS69 LLGPKGPPGPPGPPGVTGMDGQPGPKGNVGPQGEPGPPGQQGNPGAQGLPGPQGAIGPPG
690 700 710 720 730 740
350 360 370 380 390 400
pF1KE2 FIGPQGEPGLPGLPGTKG---ERGEAGPPGRGERGEPGAPGPKGKQGESGTRGPKGSKGD
:: :.:::::.::. : . :. :::: :.: : :::.: : : :: ::. :
CCDS69 EKGPLGKPGLPGMPGADGPPGHPGKEGPPG--EKGGQGPPGPQGPIGYPGPRGVKGADGI
750 760 770 780 790 800
410 420 430 440 450
pF1KE2 RGEKGDSGAQGPRGPPGQKGDQGATEIIDYNGNLHEALQRITTLTVT-----GPPGPPGP
:: :: .: .: : :: :::.: : :.. : :: : :::
CCDS69 RGLKGTKGEKGEDGFPGFKGDMG---IKGDRGEIGPPGPRGEDGPEGPKGRGGPNGDPGP
810 820 830 840 850
460 470 480 490 500 510
pF1KE2 QGLQGPKGEQGSPGIPGMDGEQGLKGSKGDMGDPGMTGEKGGIGLPGLPGANGMKG----
: : ::. : ::.::. :.:: ::: : : :: .::::: : :: :: :..:
CCDS69 LGPPGEKGKLGVPGLPGYPGRQGPKGSIGFPGFPGANGEKGGRGTPGKPGPRGQRGPTGP
860 870 880 890 900 910
520 530 540 550 560
pF1KE2 --EKGDSGM---PGPQGPSII-GPPGPPGPHGPPGPMGPHGLPGPKGTDGPMGPHGPAGP
:.: :. :::.: : :: :::: .:: ::.:: :.::::: :: : : :
CCDS69 RGERGPRGITGKPGPKGNSGGDGPAGPPGERGPNGPQGPTGFPGPKGPPGPPGKDGLPGH
920 930 940 950 960 970
570 580 590 600 610 620
pF1KE2 KGERGEKGAMGEPGPRGPYGLPGKDGEPGLDGFPGPRGEKGDLGEKGEKGFRGVKGE---
:.::: : .:. :: :: :. : .: : : : ::. : : ::.:. :. :.
CCDS69 PGQRGETGFQGKTGPPGPPGVVGPQGPTGETGPMGERGHPGPPGPPGEQGLPGLAGKEGT
980 990 1000 1010 1020 1030
630 640 650
pF1KE2 KGEPGQPGLDGLDAPCQL----GPDGLPMPGCWQK
::.:: :: : :.: : : ::: :
CCDS69 KGDPGPAGLPGKDGPPGLRGFPGDRGLPGPVGALGLKGNEGPPGPPGPAGSPGERGPAGA
1040 1050 1060 1070 1080 1090
>--
initn: 2506 init1: 561 opt: 889 Z-score: 400.2 bits: 85.8 E(33420): 5.8e-16
Smith-Waterman score: 1402; 46.9% identity (57.5% similar) in 475 aa overlap (123-594:1162-1575)
100 110 120 130 140 150
pF1KE2 VERLLAQKSYEHMAKIRIAREAPSECNCPAGPPGKRGKRGRRGESGPPGQPGPQGPPGPK
: ::..:..: .::.:::: ::::: :
CCDS69 GRDGLQGPVGLPGPAGPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGPPGPTGPQGPIGQP
1140 1150 1160 1170 1180 1190
160 170 180 190 200 210
pF1KE2 GDKGEQGDQGPRMVFPKINHGFLSADQQLIKRRLIKGDQGQAGPPGPPGPPGPRGPPGDT
: .: .:. ::: ..: :. :..:. :: : :::::: :
CCDS69 GPSGADGEPGPR------------GQQGLF------GQKGDEGPRGFPGPPGPVGL----
1200 1210 1220
220 230 240 250 260 270
pF1KE2 GKDGPRGMPGVPGEPGKPGEQGLMGPLGPPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQN
.:.:: ::: :. :. : ::: :::: .: :::: : .: : : :::::..
CCDS69 -----QGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGAPGADGPQGPPGGIGNPGAVGEK
1230 1240 1250 1260 1270 1280
280 290 300 310 320 330
pF1KE2 GIPGPKGEPGEQGEKGDAGENGPKGDTGEKGDPGSSAAGIKGEPGESGRPGQKGEPGLPG
: :: :::: :: .: ::::. ::::. : : : : :: .: ::. : : ::
CCDS69 GEPGEAGEPGLPGE---GGPPGPKGERGEKGESGPS--GAAGPPGPKGPPGDDGPKGSPG
1290 1300 1310 1320 1330
340 350 360 370 380 390
pF1KE2 LPGLPGIKGEPGFIGPQGEPGLPGLPGTKGERGEAGPPGRGERGEPGAPGPKGKQGESGT
:.:: : :: :: :. : :: : :: :..: : : :::: :: ::.: :
CCDS69 PVGFPGDPGPPGEPGPAGQDGPPGDKGDDGEPGQTGSP--GPTGEPGPSGPPGKRGPPGP
1340 1350 1360 1370 1380 1390
400 410 420 430 440 450
pF1KE2 RGPKGSKGDRGEKGDSGAQGPRGPPGQKGDQGATEIIDYNGNLHEALQRITTLTVTGPPG
::.: .:..: ::..: .:: : : : ::: : ::
CCDS69 AGPEGRQGEKGAKGEAGLEGPPGKTGPIGPQGAP----------------------GKPG
1400 1410 1420 1430
460 470 480 490 500
pF1KE2 PPGPQGLQGPKGEQGSPGIPGMDGEQGLKGSKGD---MGDPGMTGEKGGIGLPGLPGANG
: : .:. :: :::: :: :: :: : : : :: : :::: :: :: : :
CCDS69 PDGLRGIPGPVGEQGLPGSPGPDGPPGPMGPPGLPGLKGDSGPKGEKGHPGLIGLIGPPG
1440 1450 1460 1470 1480 1490
510 520 530 540 550 560
pF1KE2 MKGEKGDSGMPGPQGPSIIGPPGPPGPHGPPGPMGPHGLPGPKGTDGPMGPHGPAGPKGE
.::::: :.::::: : :: : : :: ::.:: ::: : :: ::.: : .:
CCDS69 EQGEKGDRGLPGPQGSS--GPKGEQGITGPSGPIGP---PGPPGLPGPPGPKGAKGSSGP
1500 1510 1520 1530 1540 1550
570 580 590 600 610 620
pF1KE2 RGEKGAMGEPGPRGPYGLPGKDGEPGLDGFPGPRGEKGDLGEKGEKGFRGVKGEKGEPGQ
: :: :.::: :: : ::. .:
CCDS69 TGPKGEAGHPGPPGPPGPPGEVIQPLPIQASRTRRNIDASQLLDDGNGENYVDYADGMEE
1560 1570 1580 1590 1600 1610
>>CCDS44425.2 COL13A1 gene_id:1305|Hs109|chr10 (686 aa)
initn: 2495 init1: 939 opt: 1487 Z-score: 659.1 bits: 132.3 E(33420): 2.2e-30
Smith-Waterman score: 1704; 43.3% identity (57.7% similar) in 691 aa overlap (28-650:37-682)
10 20 30 40 50
pF1KE2 MLLKKHAGKGGGREPRSEDPTPAEQHCARTMPP--CAVLAALLSVVAVVSCLYLGVK
:: : :..:. : .:. : .
CCDS44 HKAAATGARGPGELGAPGTVALVAARAERGARLPSPGSCGLLTLALCSLAL--SLLAHFR
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE2 TNDLQARIAALESAKGAPSIHLLPDTLDHLKTMVQEKVERLLAQKSYEHMAKIRIAREAP
: .::::. ::. .: ....: . .:..:: .: : . : : ..
CCDS44 TAELQARVLRLEAERGE----------QQMETAILGRVNQLLDEKWKLHSRRRREAPKTS
70 80 90 100 110
120 130 140 150 160
pF1KE2 SECNCPAGPPGKRGKRGRRGESGPPGQPGPQGPPG--------PKGDKGEQGDQGPRMVF
:::: :::: :. : :..: :.:: : :: :.: :. : .: :
CCDS44 PGCNCPPGPPGPTGRPGLPGDKGAIGMPGRVGSPGDAGLSIIGPRGPPGQPGTRG----F
120 130 140 150 160 170
170 180 190 200 210 220
pF1KE2 PKINHGFLSADQQLIKRRLIKGDQGQAGPPGPPGPPGPRGPPGDTGKDGPRGMPGV----
: . : .. : . . :::.: .:::: ::: : .: :. :. : .: .
CCDS44 PGFP-GPIGLDGK-PGHPGPKGDMGLTGPPGQPGPQGQKGEKGQCGEYPHRLLPLLNSVR
180 190 200 210 220
230 240 250 260 270
pF1KE2 --P----------GEPGKPGEQGLMGPLGPPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQ
: :: .. . :: :: :::: .: .: ::.:: : :::::
CCDS44 LAPPPVIKRRTFQGEQSQASIQGPPGPPGPPGPSGPLGHPGLPG---PMGPPGLPGP---
230 240 250 260 270 280
280 290 300 310 320
pF1KE2 NGIPGPKGEPGEQGEKGDAGENG-P--KGDTGEKGDPGSSAAGIKGEPGESGRPGQKGEP
:::::.:: :: .: :: : : : : :: :: ..::.::::: : :.::
CCDS44 ---PGPKGDPGIQGYHGRKGERGMPGMPGKHGAKGAPGIAVAGMKGEPGIPGTKGEKGAE
290 300 310 320 330
330 340 350 360 370 380
pF1KE2 GLPGLPGLPGIKGEPGFIGPQGEPGLPGLPGTKGERGEAGPPGRGERGEPGAPGPKGKQG
: :::::: : ::: : : . : ::: :::: :: :::::. :
CCDS44 GSPGLPGLLGQKGEKGDAG----------NSIGGGRGEPGPPGL-----PGPPGPKGEAG
340 350 360 370 380
390 400 410 420 430 440
pF1KE2 ESGTRGPKGSKGDRGEKGDSGAQGPRGPPGQKGDQGATEIIDYNGNLHEALQRITTLTVT
.: :: :. ::.::.: .: ::: :: :.::. : :..:::::..::::.: ::..
CCDS44 VDGQVGPPGQPGDKGERGAAGEQGPDGPKGSKGEPGKGEMVDYNGNINEALQEIRTLALM
390 400 410 420 430 440
450 460 470 480 490
pF1KE2 GPPG------PPGPQGLQGPKGEQGSPGIPGMDGEQGLKGSKGDMGDPGMTGEKG-----
:::: ::: :. : ::: : :: :: :::.: .:. :::: :: : :
CCDS44 GPPGLPGQIGPPGAPGIPGQKGEIGLPGPPGHDGEKGPRGKPGDMGPPGPQGPPGKDGPP
450 460 470 480 490 500
500 510 520 530 540
pF1KE2 GI----GLPGLPGANGMKGEKGDSGMPGPQGPSI-IGPPG---P--PGPHGPPGPMGPHG
:. : :: :: .: ::: :..: :: .: . : :: : :::.::::: : .:
CCDS44 GVKGENGHPGSPGEKGEKGETGQAGSPGEKGEAGEKGNPGAEVPGLPGPEGPPGPPGLQG
510 520 530 540 550 560
550 560 570 580 590
pF1KE2 LPGPKGTDGPMGPHGPAGPKGERGEKGAMGEPGPRGPYGLPG---------KDGEPGLDG
.::::: : : .: : .::.:..: .: :: :: :.:: . :.::. :
CCDS44 VPGPKGEAGLDGAKGEKGFQGEKGDRGPLGLPGTPGPIGVPGPAGPKGERGSKGDPGMTG
570 580 590 600 610 620
600 610 620 630 640
pF1KE2 ------FPG---PRGEKGDLGEKGEKGFRGVKGEKGEPGQPGLDGLDAPCQLGPDGLPMP
.:: : :.::. ::.:.:: :: ::.::. : ::: :::: :: ::::.
CCDS44 PTGAAGLPGLHGPPGDKGNRGERGKKGSRGPKGDKGDQGAPGL---DAPCPLGEDGLPVQ
630 640 650 660 670 680
650
pF1KE2 GCWQK
:
CCDS44 GCWNK
>>CCDS44428.2 COL13A1 gene_id:1305|Hs109|chr10 (610 aa)
initn: 2997 init1: 1034 opt: 1478 Z-score: 655.7 bits: 131.5 E(33420): 3.4e-30
Smith-Waterman score: 1640; 43.8% identity (55.8% similar) in 651 aa overlap (28-654:37-610)
10 20 30 40 50
pF1KE2 MLLKKHAGKGGGREPRSEDPTPAEQHCARTMPP--CAVLAALLSVVAVVSCLYLGVK
:: : :..:. : .:. : .
CCDS44 HKAAATGARGPGELGAPGTVALVAARAERGARLPSPGSCGLLTLALCSLAL--SLLAHFR
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE2 TNDLQARIAALESAKGAPSIHLLPDTLDHLKTMVQEKVERLLAQKSYEHMAKIRIAREAP
: .::::. ::. .: ....: . .:..:: .: : . : : ..
CCDS44 TAELQARVLRLEAERGE----------QQMETAILGRVNQLLDEKWKLHSRRRREAPKTS
70 80 90 100 110
120 130 140 150 160 170
pF1KE2 SECNCPAGPPGKRGKRGRRGESGPPGQPGPQGPPGPKGDKGEQGDQGPRMVFPKINHGFL
:::: :::: : : : : :: : : ::: : :. : ::
CCDS44 PGCNCPPGPPGPTG---RPGLPGQPGTRGFPGFPGPIGLDGKPGHPGP------------
120 130 140 150
180 190 200 210 220
pF1KE2 SADQQLIKRRLIKGDQGQAGPPGPPGPPGPRGPPGDTGKDGPR----GMPG---------
:::.: .:::: ::: : .: :. :. : .::.
CCDS44 ------------KGDMGLTGPPGQPGPQGQKGEKGQCGEYPHRECLSSMPAALRSSQIIA
160 170 180 190 200
230 240 250 260 270 280
pF1KE2 VPGEPGKPGEQGLMGPLGPPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQNGIPGPKGEPG
. :: .. . :: :: :::: .: .: ::.: : : ::::: :::::.::
CCDS44 LKGEQSQASIQGPPGPPGPPGPSGPLGHPGLP---GPMGPPGLPGP------PGPKGDPG
210 220 230 240 250
290 300 310 320 330
pF1KE2 EQGEKGDAGENG-P--KGDTGEKGDPGSSAAGIKGEPGESGRPGQKGEPGLPGLPGLPGI
:: .: :: : : : : :: :: ..::.::::: : :.:: : :::::: :
CCDS44 IQGYHGRKGERGMPGMPGKHGAKGAPGIAVAGMKGEPGIPGTKGEKGAEGSPGLPGLLGQ
260 270 280 290 300 310
340 350 360 370 380 390
pF1KE2 KGEPGFIGPQGEPGLPGLPGTKGERGEAGPPGRGERGEPGAPGPKGKQGESGTRGPKGSK
::: : : . : ::: :::: :: :::::. : .: :: :.
CCDS44 KGEKGDAGNS----------IGGGRGEPGPPGL-----PGPPGPKGEAGVDGQVGPPGQP
320 330 340 350 360
400 410 420 430 440 450
pF1KE2 GDRGEKGDSGAQGPRGPPGQKGDQGATEIIDYNGNLHEALQRITTLTVTGPPG------P
::.::.: .: ::: :: :.::. : :..:::::..::::.: ::.. :::: :
CCDS44 GDKGERGAAGEQGPDGPKGSKGEPGKGEMVDYNGNINEALQEIRTLALMGPPGLPGQIGP
370 380 390 400 410 420
460 470 480 490 500 510
pF1KE2 PGPQGLQGPKGEQGSPGIPGMDGEQGLKGSKGDMGDPGMTGEKGGIGLPGLPGANGMKGE
:: :. : ::: : :: :: :::.: .:. :::: :: : : : ::. : :: :
CCDS44 PGAPGIPGQKGEIGLPGPPGHDGEKGPRGKPGDMGPPGPQGPPGKDGPPGVKGENGHPGS
430 440 450 460 470 480
520 530 540 550 560 570
pF1KE2 KGDSGMPGPQGPSIIGPPGPPGPHGPPGPMGPHGLPGPKGTDGPMGPHGPAGPKGERGEK
:..: : : . :: :::.::::: : .:.::::: :: : .:::
CCDS44 PGEKGEKGETGQAGSPVPGLPGPEGPPGPPGLQGVPGPKGE---------AGLDGAKGEK
490 500 510 520 530
580 590 600 610 620 630
pF1KE2 GAMGEPGPRGPYGLPGKDGEPGLDGFPGPRGEKGDLGEKGEKGFRGVKGEKGEPGQPGLD
: .:: : ::: :::: :: : ::: : ::. : ::. :. : : : :: :
CCDS44 GFQGEKGDRGPLGLPGT---PGPIGVPGPAGPKGERGSKGDPGMTGPTGAAGLPGLHGPP
540 550 560 570 580 590
640 650
pF1KE2 GLDAPCQLGPDGLPMPGCWQK
: : . : ::::. :::.:
CCDS44 G-DKGNR-GEDGLPVQGCWNK
600 610
>>CCDS8759.1 COL2A1 gene_id:1280|Hs109|chr12 (1418 aa)
initn: 3019 init1: 544 opt: 1473 Z-score: 650.0 bits: 131.7 E(33420): 7.1e-30
Smith-Waterman score: 1535; 44.4% identity (54.9% similar) in 572 aa overlap (121-650:145-684)
100 110 120 130 140 150
pF1KE2 EKVERLLAQKSYEHMAKIRIAREAPSECNCPAGPPGKRGKRGRRGESGPPGQPGPQGPPG
::: :: .: .: :: : :: ::.:: :
CCDS87 AGGFDEKAGGAQLGVMQGPMGPMGPRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPMGPRG
120 130 140 150 160 170
160 170 180 190 200
pF1KE2 PKGDKGEQGDQGPRMVFPKINHGFLSADQQLIKRRLIKGDQGQAGPPGPPGPPG---PRG
: : :. ::.: : .. : : :: : :: :: :: ::
CCDS87 PPGPPGKPGDDGEAGKPGKAGE------------RGPPGPQGARGFPGTPGLPGVKGHRG
180 190 200 210 220
210 220 230 240 250 260
pF1KE2 PPGDTGKDGPRGMPGVPGEPGKPGEQGLMGPLGP---PGQKGSIGAPGIPGMNGQKGEPG
:: : : : ::: :: :.:::.: ::.:: ::..: : : : :. :.::
CCDS87 YPGLDGAKGEAGAPGVKGESGSPGENGSPGPMGPRGLPGERGRTGPAGAAGARGNDGQPG
230 240 250 260 270 280
270 280 290 300 310 320
pF1KE2 L---PGAVGQNGIPGPKGEPGEQGEKGDAGENGPKGDTGEKGDPGSSAAGIKGEPGESGR
:: :: : :: : :: .:: : .: ::.: : .:.::. : : : ::
CCDS87 PAGPPGPVGPAGGPGFPGAPGAKGEAGPTGARGPEGAQGPRGEPGT--PGSPGPAGASGN
290 300 310 320 330 340
330 340 350 360 370
pF1KE2 PGQKGEPGLPGLPGLPGIKGEPGFIGPQGEPGLPGLPGTKGERGEAGPPG----RGERGE
:: : :: : : ::: : ::: ::.: :: : : : .:..: :: .::.:
CCDS87 PGTDGIPGAKGSAGAPGIAGAPGFPGPRGPPGPQGATGPLGPKGQTGEPGIAGFKGEQGP
350 360 370 380 390 400
380 390 400 410 420 430
pF1KE2 PGAPGPKGKQGESGTRGPKGSKGDRGEKGDSGAQGPRGPPGQKGDQGATEIIDYNGNLHE
: ::: : :: :: : .: :: .:. :. :: ::::..: : . .:
CCDS87 KGEPGPAGPQGAP---GPAGEEGKRGARGEPGGVGPIGPPGERGAPGNRGFPGQDG----
410 420 430 440 450
440 450 460 470 480 490
pF1KE2 ALQRITTLTVTGPPGPPG---PQGLQGPKG---EQGSPGIPGMDGEQGLKGSKGDMGDPG
..:: : :: :.:: :::: . : :: ::. : .:: : :: : :
CCDS87 ---------LAGPKGAPGERGPSGLAGPKGANGDPGRPGEPGLPGARGLTGRPGDAGPQG
460 470 480 490 500
500 510 520 530 540
pF1KE2 MTGEKGGIGL---PGLPGANGMKGEKGDSGMPGPQGPSIIGPPGPPGPHGPPGPMGPHGL
.: .:. : :: :: .: .:. : :.:::.: . : :: : .: :: : .::
CCDS87 KVGPSGAPGEDGRPGPPGPQGARGQPGVMGFPGPKGAN--GEPGKAGEKGLPGAPGLRGL
510 520 530 540 550 560
550 560 570 580 590
pF1KE2 PGPKGTDGPMGPHGPAGPKGERGEKGA------MGEPGPRGPYGL---PGKDGEPGLDGF
:: : : :: ::::: :::::.:: .: ::: :: : :: .: :: :
CCDS87 PGKDGETGAAGPPGPAGPAGERGEQGAPGPSGFQGLPGPPGPPGEGGKPGDQGVPGEAGA
570 580 590 600 610 620
600 610 620 630 640
pF1KE2 PG---PRGEKGDLGEKGEKGFRGVKGEKGEPGQPGLDG-------LDAPCQLGPDGLP-M
:: ::::.: ::.: : .:..: .: :: :: :: : :: :: :
CCDS87 PGLVGPRGERGFPGERGSPGAQGLQGPRGLPGTPGTDGPKGASGPAGPPGAQGPPGLQGM
630 640 650 660 670 680
650
pF1KE2 PGCWQK
::
CCDS87 PGERGAAGIAGPKGDRGDVGEKGPEGAPGKDGGRGLTGPIGPPGPAGANGEKGEVGPPGP
690 700 710 720 730 740
>--
initn: 2466 init1: 521 opt: 1246 Z-score: 553.4 bits: 113.8 E(33420): 1.7e-24
Smith-Waterman score: 1378; 44.6% identity (57.4% similar) in 500 aa overlap (126-607:687-1163)
100 110 120 130 140 150
pF1KE2 LLAQKSYEHMAKIRIAREAPSECNCPAGPPGKRGKRGRRGESGPPGQPGPQGPPGPKGDK
: : : .:. : :. ::.: :: : .
CCDS87 GTDGPKGASGPAGPPGAQGPPGLQGMPGERGAAGIAGPKGDRGDVGEKGPEGAPGKDGGR
660 670 680 690 700 710
160 170 180 190 200 210
pF1KE2 GEQGDQGPRMVFPKINHGFLSADQQLIKRRLIKGDQGQAGPPGPPGPPGPRGPPGDTGKD
: : :: : : .:. :..:..::::: : : :: ::. :.
CCDS87 GLTGPIGP----P----GPAGAN----------GEKGEVGPPGPAGSAGARGAPGERGET
720 730 740 750
220 230 240 250 260 270
pF1KE2 GPRGMPGVPGEPGKPGEQGLMGPLGPPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQNGIP
:: : : : :: :. : : : ::::. :::: : .: : : :..: .:
CCDS87 GPPGPAGFAGPPGADGQPGAKGEQGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGAR
760 770 780 790 800 810
280 290 300 310 320 330
pF1KE2 GPKGEPGEQGEKGDAGENGPKGDTGEKGDPG----SSAAGIKGEPGESGRPGQKGEPGLP
: .: :: : : ::. :: :..:. : :: :. : :: :.:: ::. :::::
CCDS87 GAQGPPGATGFPGAAGRVGPPGSNGNPGPPGPPGPSGKDGPKGARGDSGPPGRAGEPGLQ
820 830 840 850 860 870
340 350 360 370 380 390
pF1KE2 GLPGLPGIKGEPGFIGPQGEPGLPGLPGTKGERGEAGPPG-RGERGEPGAPGPKGKQGES
: : :: ::::: ::.: : :: : :.:: .: :: ::::: :: :::.:. :..
CCDS87 GPAGPPGEKGEPGDDGPSGAEGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQ
880 890 900 910 920 930
400 410 420 430 440
pF1KE2 GTRGPKGSKGDRGEKGDSGAQGPRGPPGQKGDQGATEIIDYNGNLHEALQRITTLTV---
:. : .:..: : : : :: : ::..:. :: .: .: : .:
CCDS87 GAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGAVGAP
940 950 960 970 980 990
450 460 470 480 490 500
pF1KE2 --TGPPGPPGPQGLQGPKGEQGSPGIPGMDGEQGLKGSKGDMGDPGMTGEKGGIGLPGLP
:::: ::: : : .:..: : : : .: :..: .: : :.:: : ::
CCDS87 GAPGPPGSPGPAGPTGKQGDRGEAGAQGPMGPSGPAGARGIQGPQGPRGDKGEAGEPGER
1000 1010 1020 1030 1040 1050
510 520 530 540 550 560
pF1KE2 GANGMKGEKGDSGMPGPQGPS----IIGPPGPPGPHGPPGPMGPHGLPGPKGTDGPMGPH
: .: .: : .:.::: ::: :: :: ::.:::::.:: : : .: ::.::
CCDS87 GLKGHRGFTGLQGLPGPPGPSGDQGASGPAGPSGPRGPPGPVGPSGKDGANGIPGPIGPP
1060 1070 1080 1090 1100 1110
570 580 590 600 610
pF1KE2 GPAGPKGERGEKGAMGEPGPRGPYGLPGKDGEPGLD--GFPG--PRGEKGDLGEKGEKGF
:: : .:: : : :.::: :: : :: ::.: .: : :: :::
CCDS87 GPRGRSGETGPAGPPGNPGPPGPPGPPG----PGIDMSAFAGLGPR-EKGPDPLQYMRAD
1120 1130 1140 1150 1160 1170
620 630 640 650
pF1KE2 RGVKGEKGEPGQPGLDGLDAPCQLGPDGLPMPGCWQK
CCDS87 QAAGGLRQHDAEVDATLKSLNNQIESIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDP
1180 1190 1200 1210 1220 1230
654 residues in 1 query sequences
18921897 residues in 33420 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Aug 1 17:01:44 2019 done: Thu Aug 1 17:01:44 2019
Total Scan time: 2.050 Total Display time: 0.150
Function used was FASTA [36.3.4 Apr, 2011]