FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1339, 520 aa
1>>>pF1KE1339 520 - 520 aa - 520 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 10.5809+/-0.00119; mu= -1.3916+/- 0.072
mean_var=450.7090+/-92.148, 0's: 0 Z-trim(113.8): 180 B-trim: 0 in 0/53
Lambda= 0.060412
statistics sampled from 14262 (14438) to 14262 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.727), E-opt: 0.2 (0.444), width: 16
Scan time: 3.590
The best scores are: opt bits E(32554)
CCDS2124.1 MARCO gene_id:8685|Hs108|chr2 ( 520) 3627 330.5 2.8e-90
CCDS2297.1 COL3A1 gene_id:1281|Hs108|chr2 (1466) 944 97.2 1.3e-19
CCDS12222.1 COL5A3 gene_id:50509|Hs108|chr19 (1745) 907 94.1 1.4e-18
CCDS72756.1 COL8A2 gene_id:1296|Hs108|chr1 ( 638) 889 92.0 2.2e-18
CCDS403.1 COL8A2 gene_id:1296|Hs108|chr1 ( 703) 889 92.0 2.3e-18
CCDS33350.1 COL5A2 gene_id:1290|Hs108|chr2 (1499) 886 92.2 4.5e-18
CCDS450.1 COL9A2 gene_id:1298|Hs108|chr1 ( 689) 875 90.8 5.4e-18
CCDS47447.1 COL9A1 gene_id:1297|Hs108|chr6 ( 678) 868 90.2 8.1e-18
CCDS75932.1 COL5A1 gene_id:1289|Hs108|chr9 (1838) 877 91.5 8.9e-18
CCDS6982.1 COL5A1 gene_id:1289|Hs108|chr9 (1838) 877 91.5 8.9e-18
CCDS780.2 COL11A1 gene_id:1301|Hs108|chr1 (1690) 876 91.4 9e-18
CCDS53348.1 COL11A1 gene_id:1301|Hs108|chr1 (1767) 876 91.4 9.2e-18
CCDS778.1 COL11A1 gene_id:1301|Hs108|chr1 (1806) 876 91.4 9.3e-18
CCDS4971.1 COL9A1 gene_id:1297|Hs108|chr6 ( 921) 868 90.4 9.9e-18
CCDS6802.1 COL27A1 gene_id:85301|Hs108|chr9 (1860) 870 90.9 1.4e-17
CCDS11561.1 COL1A1 gene_id:1277|Hs108|chr17 (1464) 865 90.3 1.6e-17
CCDS8759.1 COL2A1 gene_id:1280|Hs108|chr12 (1418) 861 90.0 2e-17
CCDS41778.1 COL2A1 gene_id:1280|Hs108|chr12 (1487) 861 90.0 2e-17
CCDS43452.1 COL11A2 gene_id:1302|Hs108|chr6 (1650) 855 89.5 3.1e-17
CCDS42829.1 COL4A3 gene_id:1285|Hs108|chr2 (1670) 854 89.4 3.4e-17
CCDS83099.1 COL21A1 gene_id:81578|Hs108|chr6 ( 954) 847 88.5 3.6e-17
CCDS55025.1 COL21A1 gene_id:81578|Hs108|chr6 ( 957) 847 88.5 3.6e-17
CCDS41297.1 COL16A1 gene_id:1307|Hs108|chr1 (1604) 841 88.3 7.2e-17
CCDS41353.1 COL24A1 gene_id:255631|Hs108|chr1 (1714) 840 88.2 8e-17
CCDS9511.1 COL4A1 gene_id:1282|Hs108|chr13 (1669) 839 88.1 8.3e-17
CCDS6376.1 COL22A1 gene_id:169044|Hs108|chr8 (1626) 837 87.9 9.2e-17
CCDS13505.1 COL9A3 gene_id:1299|Hs108|chr20 ( 684) 816 85.7 1.9e-16
CCDS34682.1 COL1A2 gene_id:1278|Hs108|chr7 (1366) 816 86.0 2.9e-16
CCDS13730.1 COL6A2 gene_id:1292|Hs108|chr21 ( 828) 792 83.7 9.1e-16
CCDS13729.1 COL6A2 gene_id:1292|Hs108|chr21 ( 918) 792 83.7 9.7e-16
CCDS13728.1 COL6A2 gene_id:1292|Hs108|chr21 (1019) 792 83.8 1e-15
CCDS76649.1 COL4A1 gene_id:1282|Hs108|chr13 ( 519) 777 82.1 1.7e-15
CCDS44428.2 COL13A1 gene_id:1305|Hs108|chr10 ( 610) 775 82.0 2.1e-15
CCDS2934.1 COL8A1 gene_id:1295|Hs108|chr3 ( 744) 775 82.1 2.4e-15
CCDS14543.1 COL4A5 gene_id:1287|Hs108|chrX (1685) 781 83.1 2.8e-15
CCDS35366.1 COL4A5 gene_id:1287|Hs108|chrX (1691) 781 83.1 2.8e-15
CCDS2773.1 COL7A1 gene_id:1294|Hs108|chr3 (2944) 785 83.7 3.1e-15
CCDS42828.1 COL4A4 gene_id:1286|Hs108|chr2 (1690) 775 82.6 4e-15
CCDS13727.1 COL6A1 gene_id:1291|Hs108|chr21 (1028) 760 81.0 7.2e-15
CCDS41907.1 COL4A2 gene_id:1284|Hs108|chr13 (1712) 757 81.0 1.2e-14
CCDS7554.1 COL17A1 gene_id:1308|Hs108|chr10 (1497) 748 80.1 1.9e-14
CCDS4436.1 COL23A1 gene_id:91522|Hs108|chr5 ( 540) 737 78.7 1.9e-14
CCDS43553.1 COL28A1 gene_id:340267|Hs108|chr7 (1125) 722 77.7 7.6e-14
CCDS5105.1 COL10A1 gene_id:1300|Hs108|chr6 ( 680) 709 76.3 1.2e-13
CCDS76008.1 COL4A6 gene_id:1288|Hs108|chrX (1633) 717 77.5 1.3e-13
CCDS76009.1 COL4A6 gene_id:1288|Hs108|chrX (1666) 717 77.5 1.3e-13
CCDS14542.1 COL4A6 gene_id:1288|Hs108|chrX (1690) 717 77.5 1.3e-13
CCDS14541.1 COL4A6 gene_id:1288|Hs108|chrX (1691) 717 77.5 1.3e-13
CCDS76010.1 COL4A6 gene_id:1288|Hs108|chrX (1707) 717 77.5 1.3e-13
CCDS46948.1 OTOL1 gene_id:131149|Hs108|chr3 ( 477) 686 74.1 3.9e-13
>>CCDS2124.1 MARCO gene_id:8685|Hs108|chr2 (520 aa)
initn: 3627 init1: 3627 opt: 3627 Z-score: 1734.3 bits: 330.5 E(32554): 2.8e-90
Smith-Waterman score: 3627; 100.0% identity (100.0% similar) in 520 aa overlap (1-520:1-520)
               10        20        30        40        50        60
pF1KE1 MRNKKILKEDELLSETQQAAFHQIAMEPFEINVPKPKRRNGVNFSLAVVVIYLILLTAGA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS21 MRNKKILKEDELLSETQQAAFHQIAMEPFEINVPKPKRRNGVNFSLAVVVIYLILLTAGA
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE1 GLLVVQVLNLQARLRVLEMYFLNDTLAAEDSPSFSLLQSAHPGEHLAQGASRLQVLQAQL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS21 GLLVVQVLNLQARLRVLEMYFLNDTLAAEDSPSFSLLQSAHPGEHLAQGASRLQVLQAQL
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE1 TWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS21 TWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAK
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KE1 GAMGRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS21 GAMGRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPK
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KE1 GETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS21 GETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQ
              250       260       270       280       290       300
              310       320       330       340       350       360
pF1KE1 GQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS21 GQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQ
              310       320       330       340       350       360
              370       380       390       400       410       420
pF1KE1 GRKGESGVPGPAGVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGEN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS21 GRKGESGVPGPAGVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGEN
              370       380       390       400       410       420
              430       440       450       460       470       480
pF1KE1 SVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSDAIVFCRMLGYSKGRALYKVGAGTGQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS21 SVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSDAIVFCRMLGYSKGRALYKVGAGTGQ
              430       440       450       460       470       480
              490       500       510       520
pF1KE1 IWLDNVQCRGTESTLWSCTKNSWGHHDCSHEEDAGVECSV
       ::::::::::::::::::::::::::::::::::::::::
CCDS21 IWLDNVQCRGTESTLWSCTKNSWGHHDCSHEEDAGVECSV
              490       500       510       520
>>CCDS2297.1 COL3A1 gene_id:1281|Hs108|chr2 (1466 aa)
initn: 2388 init1: 859 opt: 944 Z-score: 465.3 bits: 97.2 E(32554): 1.3e-19
Smith-Waterman score: 959; 50.3% identity (64.7% similar) in 286 aa overlap (148-418:717-993)
120 130 140 150 160 170
pF1KE1 AQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQG---HKGAMGMPGAPGPPGPP
: :.::::: ..:..: :: : : :
CCDS22 GLAGAPGLRGGAGPPGPEGGKGAAGPPGPPGAAGTPGLQGMPGERGGLGSPGPKGDKGEP
690 700 710 720 730 740
180 190 200 210 220
pF1KE1 AEKGAKGAMGRDGATGPSGPQGPPGV------KGEAG------LQGPQGAPGKQGATGTP
. :: :. :.:: ::.:: :::: :::.: . ::.:.::..: :: :
CCDS22 GGPGADGVPGKDGPRGPTGPIGPPGPAGQPGDKGEGGAPGLPGIAGPRGSPGERGETGPP
750 760 770 780 790 800
230 240 250 260 270 280
pF1KE1 GPQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDF
:: : :. :..: : ::: :. ::::. : :: : : .: :: ::: :: ::.
CCDS22 GPAGFPGAPGQNGEPGGKGERGAPGEKGEGGPPGVAGPPGGSGPAG---PPGPQGVKGER
810 820 830 840 850 860
290 300 310 320 330 340
pF1KE1 GRPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSP
: :: :: ::::::.: : :: .: ::::: : :: : :: ::. : .::::
CCDS22 GSPGGPGAAGFPGARGLPGPPGSNGNPGPPGPSGSPGKDGPPGPAGNTG------APGSP
870 880 890 900 910
350 360 370 380 390 400
pF1KE1 GATGLKGSKGDTGLQGQQGRKGESGVPGPAGVKGEQGSPGLAGPKGAPGQAGQKGDQGVK
:..: ::. :. : .:. : .: :.::: :. : :. ::::: : :: :. : ::::
CCDS22 GVSGPKGDAGQPGEKGSPGAQGPPGAPGPLGIAGITGARGLAGPPGMPGPRGSPGPQGVK
920 930 940 950 960 970
410 420 430 440 450 460
pF1KE1 GSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSDAIVFCRM
: ::. :..: .::::
CCDS22 GESGKPGANGLSGERGPPGPQGLPGLAGTAGEPGRDGNPGSDGLPGRDGSPGGKGDRGEN
980 990 1000 1010 1020 1030
>--
initn: 812 init1: 812 opt: 865 Z-score: 428.1 bits: 90.3 E(32554): 1.6e-17
Smith-Waterman score: 894; 46.1% identity (62.2% similar) in 304 aa overlap (136-418:150-453)
110 120 130 140 150 160
pF1KE1 LAQGASRLQVLQAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMP
:.. . . .:. .. :: :. : : :
CCDS22 NGDPGIPGQPGSPGSPGPPGICESCPTGPQNYSPQYDSYDVKSGVAVGGLAGYPGPAGPP
120 130 140 150 160 170
170 180 190 200 210
pF1KE1 GAPGPPG----P--PAEKGAKGAMGRDGATGPSGPQGPPGV---KGEAGLQGPQGAPGKQ
: ::::: : :. : .: :. : .::::: ::::. .: :: .: .: ::.
CCDS22 GPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPSGPPGPPGAIGPSGPAGKDGESGRPGRP
180 190 200 210 220 230
220 230 240 250 260 270
pF1KE1 GATGTPGPQGEKGSKGDGGLIGPKGETG---TKGEKGDLGLPGSKGDRGMKGDAGVMGPP
: : ::: : :: : :. : ::. : .::::. : :: ::. :. :. :. ::
CCDS22 GERGLPGPPGIKGPAGIPGFPGMKGHRGFDGRNGEKGETGAPGLKGENGLPGENGAPGPM
240 250 260 270 280 290
280 290 300 310 320 330
pF1KE1 GAQGSKGDFGRPGPPGLAGF---PGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSAGSP
: .:. :. :::: :: :: ::.:..:::: : :: : : :::::: : ::::
CCDS22 GPRGAPGERGRPGLPGAAGARGNDGARGSDGQPGPPGPPGTAGFPGSPGAKGEVGPAGSP
300 310 320 330 340 350
340 350 360 370 380
pF1KE1 GRAGLPGS---PGSPGATGLKGSKGDTGLQGQQGRKGE---SGVPGPAGVKGEQGSPGLA
: : ::. :: : .: .: : :..:. : ::: .:.:: :. : .: :: :
CCDS22 GSNGAPGQRGEPGPQGHAGAQGPPGPPGINGSPGGKGEMGPAGIPGAPGLMGARGPPGPA
360 370 380 390 400 410
390 400 410 420 430 440
pF1KE1 GPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWG
: .:::: : :. : .:..:: : .::.:: :
CCDS22 GANGAPGLRGGAGEPGKNGAKGEPGPRGERGEAGIPGVPGAKGEDGKDGSPGEPGANGLP
420 430 440 450 460 470
450 460 470 480 490 500
pF1KE1 TICDDEWQNSDAIVFCRMLGYSKGRALYKVGAGTGQIWLDNVQCRGTESTLWSCTKNSWG
CCDS22 GAAGERGAPGFRGPAGPNGIPGEKGPAGERGAPGPAGPRGAAGEPGRDGVPGGPGMRGMP
480 490 500 510 520 530
>--
initn: 1441 init1: 740 opt: 758 Z-score: 377.7 bits: 81.0 E(32554): 1e-14
Smith-Waterman score: 758; 46.7% identity (61.1% similar) in 244 aa overlap (142-382:477-714)
120 130 140 150 160 170
pF1KE1 RLQVLQAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPP
:. ::.::::..: : :.:: ::
CCDS22 GERGEAGIPGVPGAKGEDGKDGSPGEPGANGLPGAAGERGAPGFRGPAGPNGIPGEKGPA
450 460 470 480 490 500
180 190 200 210 220 230
pF1KE1 GPPAEKGAKGAMGRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSK
: :.:: : : ::.: : .: :: : :. : :.::..: : :: :::.:
CCDS22 G---ERGAPGPAGPRGAAGEPGRDGVPGGPGMRGMPGSPGGPGSDGKPGPPGSQGESGRP
510 520 530 540 550 560
240 250 260 270 280
pF1KE1 GDGGLIGPKGETGT---KGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPP
: : ::.:. :. : ::. : ::..:.:: : : .::: :..:. : :::
CCDS22 GPPGPSGPRGQPGVMGFPGPKGNDGAPGKNGERGGPGGPGPQGPP---GKNGETGPQGPP
570 580 590 600 610 620
290 300 310 320 330 340
pF1KE1 GLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGLK
: .: : ::: : :: ::. : ::. : :: .:.:: : : :: ::.::. : .:
CCDS22 GPTGPGGDKGDTGPPGPQGLQGLPGTGGPPGENGKPGEPGPKGDAGAPGAPGGKGDAGAP
630 640 650 660 670 680
350 360 370 380 390 400
pF1KE1 GSKGDTGLQGQQGRKGESGVPGPAGVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQ
: .: :: : : .: .: ::: : :: : ::
CCDS22 GERGPPGLAGAPGLRGGAGPPGPEGGKGAAGPPGPPGAAGTPGLQGMPGERGGLGSPGPK
690 700 710 720 730 740
410 420 430 440 450 460
pF1KE1 GVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSDAIVFCRMLGYSKG
CCDS22 GDKGEPGGPGADGVPGKDGPRGPTGPIGPPGPAGQPGDKGEGGAPGLPGIAGPRGSPGER
750 760 770 780 790 800
>>CCDS12222.1 COL5A3 gene_id:50509|Hs108|chr19 (1745 aa)
initn: 4669 init1: 868 opt: 907 Z-score: 447.0 bits: 94.1 E(32554): 1.4e-18
Smith-Waterman score: 915; 46.8% identity (61.4% similar) in 293 aa overlap (141-418:496-788)
120 130 140 150 160 170
pF1KE1 SRLQVLQAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGP
:: .:::.:: : :: .: .: : ::
CCDS12 AQAVLQQTQLSMKGPPGPVGLTGRPGPVGLPGHPGLKGEEGAEGPQGPRGLQGPHGPPGR
470 480 490 500 510 520
180 190 200 210
pF1KE1 PGPPAEKGAKGAMGRDGATGPSGPQG----P--PGVKGE------AGLQGPQGAPGKQGA
: .. :: :: : : :::.: .: : :: ::. .: :: : :..::
CCDS12 VGKMGRPGADGARGLPGDTGPKGDRGFDGLPGLPGEKGQRGDFGHVGQPGPPGEDGERGA
530 540 550 560 570 580
220 230 240 250 260 270
pF1KE1 TGTPGPQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGS
: ::: :. : : ::.::.: : :. : :. :. : .: : : :::: ::.
CCDS12 EGPPGPTGQAGEPGPRGLLGPRGSPGPTGRPGVTGIDGAPGAKGNVGPPGEPGPPGQQGN
590 600 610 620 630 640
280 290 300 310 320 330
pF1KE1 KGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGS
.:. : ::: :: : :: :: :.::. :.:: : .:::: .: : :. : : :
CCDS12 HGSQGLPGPQGLIGTPGEKGPPGNPGIPGLPGSDGPLGHPGHEGPTGEKGAQGPPGSAGP
650 660 670 680 690 700
340 350 360 370 380 390
pF1KE1 PGSPGATGLKGSKGDTGLQGQQGRKGESGVPG---PAGVKGEQGSPGLAGPKGAPGQAGQ
:: :: :.::..:. ::::..:.:::.: :: .:.::.::.:: ::.: : :
CCDS12 PGYPGPRGVKGTSGNRGLQGEKGEKGEDGFPGFKGDVGLKGDQGKPGAPGPRGEDGPEGP
710 720 730 740 750 760
400 410 420 430 440 450
pF1KE1 KGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSD
::. : : : : ::::. :
CCDS12 KGQAGQAGEEGPPGSAGEKGKLGVPGLPGYPGRPGPKGSIGFPGPLGPIGEKGKSGKTGQ
770 780 790 800 810 820
>--
initn: 1602 init1: 858 opt: 859 Z-score: 424.4 bits: 89.9 E(32554): 2.6e-17
Smith-Waterman score: 862; 45.3% identity (56.0% similar) in 318 aa overlap (141-418:1126-1442)
120 130 140 150 160 170
pF1KE1 SRLQVLQAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGP
::.: ::..:. :. : : :. : :::
CCDS12 AGPPGQPGIRGPAGHPGPPGADGAQGRRGPPGLFGQKGDDGVRGFVGVIGPPGLQGLPGP
1100 1110 1120 1130 1140 1150
180 190 200 210
pF1KE1 PGPPAEKGAKGAMGRDGATGPSGPQGP---------------PGVKGEAGLQGPQGAPGK
:: .: : :.:: :: :: ::::: ::. :: : .: : ::
CCDS12 PGEKGEVGDVGSMGPHGAPGPRGPQGPTGSEGTPGLPGGVGQPGAVGEKGERGDAGDPGP
1160 1170 1180 1190 1200 1210
220 230 240 250 260
pF1KE1 QGATGTPGPQGEKGSKGDGG---------LIGPKGETGTKGEKGDLGLPGSKGDRGMKGD
:: : :::.:. : :::.: :: :: :.:: : ::::. : : :
CCDS12 PGAPGIPGPKGDIGEKGDSGPSGAAGPPGKKGPPGEDGAKGSVGPTGLPGDLGPPGDPGV
1220 1230 1240 1250 1260 1270
270 280 290 300 310 320
pF1KE1 AGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGS
.:. : :: .:. :: : ::::: .: ::: : :. : .: : : :. :::::::
CCDS12 SGIDGSPGEKGDPGDVGGPGPPGASGEPGAPGPPGKRGPSGHMGREGREGEKGAKGEPGP
1280 1290 1300 1310 1320 1330
330 340 350 360 370
pF1KE1 AGSPGRAGLP----GSPGSPGATGLKGSKGDTGLQGQQGRKGESGVPGP-----------
: :::.: : : :: : ::.: : .: : : :. : :::
CCDS12 DGPPGRTG-PMGARGPPGRVGPEGLRGIPGPVGEPGLLGAPGQMGPPGPLGPSGLPGLKG
1340 1350 1360 1370 1380 1390
380 390 400 410 420 430
pF1KE1 -AGVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSS
.: :::.: :: : : ::.::.:::::. : .: : ::. : :
CCDS12 DTGPKGEKGHIGLIGLIGPPGEAGEKGDQGLPGVQGPPGPKGDPGPPGPIGSLGHPGPPG
1400 1410 1420 1430 1440 1450
440 450 460 470 480 490
pF1KE1 NRGRAEVYYSGTWGTICDDEWQNSDAIVFCRMLGYSKGRALYKVGAGTGQIWLDNVQCRG
CCDS12 VAGPLGQKGSKGSPGSMGPRGDTGPAGPPGPPGAPAELHGLRRRRRFVPVPLPVVEGGLE
1460 1470 1480 1490 1500 1510
>--
initn: 1424 init1: 757 opt: 835 Z-score: 413.1 bits: 87.8 E(32554): 1.1e-16
Smith-Waterman score: 835; 44.3% identity (59.1% similar) in 296 aa overlap (138-418:823-1112)
110 120 130 140 150 160
pF1KE1 QGASRLQVLQAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGA
: .::. .::.: :: .:..: : :
CCDS12 PGYPGRPGPKGSIGFPGPLGPIGEKGKSGKTGQPGL---EGERGPPGSRGERGQPGATGQ
800 810 820 830 840
170 180 190 200 210 220
pF1KE1 PGPPGPPAEKGAKGAMGRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGE
::: : .. :: : :. : : .:: : :: :: : :: .: ::. : : : ::.
CCDS12 PGPKGDVGQDGAPGIPGEKGLPGLQGPPGFPGPKGPPGHQGKDGRPGHPGQRGELGFQGQ
850 860 870 880 890 900
230 240 250 260 270 280
pF1KE1 KGSKGDGGLIGPKGETGTKG---EKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGR
: : .:..::.:.:: : :.: : :: :..:. : : : : : : .:.
CCDS12 TGPPGPAGVLGPQGKTGEVGPLGERGPPGPPGPPGEQGLPGLEGREGAKGELGPPGPLGK
910 920 930 940 950 960
290 300 310 320 330 340
pF1KE1 PGPPGLAGFPGAKGDQGQPG---LQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGS
:: :: :::: :: :.:: :.: :::: :: :. :: : : : ::::. ::
CCDS12 EGPAGLRGFPGPKGGPGDPGPTGLKGDKGPPGPVGANGSPGERGPLGPAGGIGLPGQSGS
970 980 990 1000 1010 1020
350 360 370 380 390
pF1KE1 PGATGLKGSKGDTGLQGQQGRKGESGVPGPAGVKGEQGSPGLAGPKG---------APGQ
: .: :.::. : .: : :..:.::: : : :: :::.: :::.
CCDS12 EGPVGPAGKKGSRGERGPPGPTGKDGIPGPLG---PLGPPGAAGPSGEEGDKGDVGAPGH
1030 1040 1050 1060 1070 1080
400 410 420 430 440 450
pF1KE1 AGQKGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQ
:.:::.: : :. :..: :. :
CCDS12 KGSKGDKGDAGPPGQPGIRGPAGHPGPPGADGAQGRRGPPGLFGQKGDDGVRGFVGVIGP
1090 1100 1110 1120 1130 1140
>>CCDS72756.1 COL8A2 gene_id:1296|Hs108|chr1 (638 aa)
initn: 1622 init1: 562 opt: 889 Z-score: 443.6 bits: 92.0 E(32554): 2.2e-18
Smith-Waterman score: 889; 47.6% identity (59.7% similar) in 290 aa overlap (141-418:168-454)
120 130 140 150 160 170
pF1KE1 SRLQVLQAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGP
:. . : .: :: : :: : ::.:::
CCDS72 GLKGDNGVGQPGLPGAPGQGGAPGPPGLPGPAGLGKPGLDGLPGAPGDKGESGPPGVPGP
140 150 160 170 180 190
180 190 200 210 220
pF1KE1 PGPPAEKGAKGAMGRDG-----ATGPSGPQGPPGVKGEAGLQGPQG--APGKQGATGTPG
: :. : :: : :: :.: ::::: :.::: : .:: : .: : : ::
CCDS72 RGEPGAVGPKGPPGVDGVGVPGAAGLPGPQGPSGAKGEPGTRGPPGLIGPTGYGMPGLPG
200 210 220 230 240 250
230 240 250 260 270 280
pF1KE1 PQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFG
:.:..: : ::.: .:: : :: :. : : : :. :.::. : : : ::. :
CCDS72 PKGDRGPAGVPGLLGDRGEPGEDGEPGEQGPQGLGGPPGLPGSAGLPGRRGPPGPKGEAG
260 270 280 290 300 310
290 300 310 320 330 340
pF1KE1 RPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPG
::::. :: .:::: :: : :: :: : :::.: :: .: :. :. : ::.::
CCDS72 PGGPPGV---PGIRGDQGPSGLAGKPGVPGERGLPGAHGPPGPTGPKGEPGFTGRPGGPG
320 330 340 350 360 370
350 360 370 380 390
pF1KE1 ATGLKGSKGDTGLQGQQGRKGESGVPG---PAGVKGEQGSPGLAGPKGAPGQAGQ--KGD
..: :.::: :: :: : .: ::.:: ::: : :: ::: : : :: :. :.
CCDS72 VAGALGQKGDLGLPGQPGLRGPSGIPGLQGPAGPIGPQGLPGLKGEPGLPGPPGEGRAGE
380 390 400 410 420 430
400 410 420 430 440 450
pF1KE1 QGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSDAIV
:. : .: :: : : :
CCDS72 PGTAGPTGPPGVPGSPGITGPPGPPGPPGPPGAPGAFDETGIAGLHLPNGGVEGAVLGKG
440 450 460 470 480 490
>>CCDS403.1 COL8A2 gene_id:1296|Hs108|chr1 (703 aa)
initn: 1622 init1: 562 opt: 889 Z-score: 443.1 bits: 92.0 E(32554): 2.3e-18
Smith-Waterman score: 889; 47.6% identity (59.7% similar) in 290 aa overlap (141-418:233-519)
120 130 140 150 160 170
pF1KE1 SRLQVLQAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGP
:. . : .: :: : :: : ::.:::
CCDS40 GLKGDNGVGQPGLPGAPGQGGAPGPPGLPGPAGLGKPGLDGLPGAPGDKGESGPPGVPGP
210 220 230 240 250 260
180 190 200 210 220
pF1KE1 PGPPAEKGAKGAMGRDG-----ATGPSGPQGPPGVKGEAGLQGPQG--APGKQGATGTPG
: :. : :: : :: :.: ::::: :.::: : .:: : .: : : ::
CCDS40 RGEPGAVGPKGPPGVDGVGVPGAAGLPGPQGPSGAKGEPGTRGPPGLIGPTGYGMPGLPG
270 280 290 300 310 320
230 240 250 260 270 280
pF1KE1 PQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFG
:.:..: : ::.: .:: : :: :. : : : :. :.::. : : : ::. :
CCDS40 PKGDRGPAGVPGLLGDRGEPGEDGEPGEQGPQGLGGPPGLPGSAGLPGRRGPPGPKGEAG
330 340 350 360 370 380
290 300 310 320 330 340
pF1KE1 RPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPG
::::. :: .:::: :: : :: :: : :::.: :: .: :. :. : ::.::
CCDS40 PGGPPGV---PGIRGDQGPSGLAGKPGVPGERGLPGAHGPPGPTGPKGEPGFTGRPGGPG
390 400 410 420 430
350 360 370 380 390
pF1KE1 ATGLKGSKGDTGLQGQQGRKGESGVPG---PAGVKGEQGSPGLAGPKGAPGQAGQ--KGD
..: :.::: :: :: : .: ::.:: ::: : :: ::: : : :: :. :.
CCDS40 VAGALGQKGDLGLPGQPGLRGPSGIPGLQGPAGPIGPQGLPGLKGEPGLPGPPGEGRAGE
440 450 460 470 480 490
400 410 420 430 440 450
pF1KE1 QGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSDAIV
:. : .: :: : : :
CCDS40 PGTAGPTGPPGVPGSPGITGPPGPPGPPGPPGAPGAFDETGIAGLHLPNGGVEGAVLGKG
500 510 520 530 540 550
>>CCDS33350.1 COL5A2 gene_id:1290|Hs108|chr2 (1499 aa)
initn: 3133 init1: 835 opt: 886 Z-score: 437.9 bits: 92.2 E(32554): 4.5e-18
Smith-Waterman score: 896; 45.3% identity (59.0% similar) in 300 aa overlap (140-418:415-714)
110 120 130 140 150 160
pF1KE1 ASRLQVLQAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPG
.::. : .:.:: .: :. : : ::
CCDS33 MKGEAGPTGARGPEGPQGQRGETGPPGPVGSPGLPGAIGTDGTPGAKGPTGSPGTSGPPG
390 400 410 420 430 440
170 180 190 200 210 220
pF1KE1 PPGPPAEKGAKGAMGRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKG
:::. : .:. : .: : : : :: ::::: .: : : :: : :: .:..:
CCDS33 SAGPPGSPGPQGSTGPQGIRGQPGDPGVPGFKGEAGPKGEPGPHGIQGPIGPPGEEGKRG
450 460 470 480 490 500
230 240 250 260 270 280
pF1KE1 SKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPPG
.:: : .:: : .: .: :. :.::: : : :: : :: :..: ::. : :: ::
CCDS33 PRGDPGTVGPPGPVGERGAPGNRGFPGSDGLPGPKGAQGERGPVGSSGPKGSQGDPGRPG
510 520 530 540 550 560
290 300 310 320 330 340
pF1KE1 LAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSAGS------PGRAGLPGSPGSPG
:.:::.: :.::.:: : : .: :: :.:: :: :: :::: :: :
CCDS33 EPGLPGARGLTGNPGVQGPEGKLGPLGAPGEDGRPGPPGSIGIRGQPGSMGLPGPKGSSG
570 580 590 600 610 620
350 360 370 380
pF1KE1 ATGLKGSKGDTGLQGQQGRKGESGVPGP---------AGVKGEQGSPG------LAGPKG
: : :..:. ::.: :..: :: :: .:::: :: : :: :
CCDS33 DPGKPGEAGNAGVPGQRGAPGKDGEVGPSGPVGPPGLAGERGEQGPPGPTGFQGLPGPPG
630 640 650 660 670 680
390 400 410 420 430 440
pF1KE1 APGQAGQKGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICD
::..:. ::::: :. : : : .::::
CCDS33 PPGEGGKPGDQGVPGDPGAVGPLGPRGERGNPGERGEPGITGLPGEKGMAGGHGPDGPKG
690 700 710 720 730 740
>--
initn: 1568 init1: 804 opt: 839 Z-score: 415.7 bits: 88.1 E(32554): 7.8e-17
Smith-Waterman score: 878; 44.8% identity (57.5% similar) in 306 aa overlap (135-419:740-1045)
110 120 130 140 150 160
pF1KE1 HLAQGASRLQVLQAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGM
:. .:: :. : ::::: : :.
CCDS33 RGERGNPGERGEPGITGLPGEKGMAGGHGPDGPKGSPGPSGTPGDTGPPGLQGMPGERGI
710 720 730 740 750 760
170 180 190 200 210
pF1KE1 PGAPGPPGPPA---EKGAKGAMGRDGATG---PSGPQGPPGVKGEAGLQGPQGAPGKQGA
:.::: : . ::::.:. : ::: : : :: :: : :: : ::.: : :.
CCDS33 AGTPGPKGDRGGIGEKGAEGTAGNDGARGLPGPLGPPGPAGPTGEKGEPGPRGLVGPPGS
770 780 790 800 810 820
220 230 240 250 260 270
pF1KE1 TGTPGPQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGD------RGMKGDAGVMGP
:.:: .::.: : :. ::.: : : ::. : ::.::: .:. :. : ::
CCDS33 RGNPGSRGENGPTGAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGLAGSPGPHGP
830 840 850 860 870 880
280 290 300 310 320 330
pF1KE1 PGAQGSKGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGR
:. : :: : :::: .::::. : : :: :.::: : .:.:: .: :: :.::
CCDS33 NGVPGLKGGRGTQGPPGATGFPGSAGRVGPPGPAGAPGPAGPLGEPGKEGPPGLRGDPGS
890 900 910 920 930 940
340 350 360 370 380
pF1KE1 ---------AGLPGSPGSPGATGLKGSKGDTGLQGQQGRKGESGVPGPAGVKGEQGSPGL
:: ::.::. : : :. : : : : :. :. : : .::.: :::
CCDS33 HGRVGDRGPAGPPGGPGDKGDPGEDGQPGPDGPPGPAGTTGQRGIVGMPGQRGERGMPGL
950 960 970 980 990 1000
390 400 410 420 430 440
pF1KE1 AGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTW
:: :.::..: : : :: : : : .: ::
CCDS33 PGPAGTPGKVGPTGATGDKGPPGPVGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGERGD
1010 1020 1030 1040 1050 1060
450 460 470 480 490 500
pF1KE1 GTICDDEWQNSDAIVFCRMLGYSKGRALYKVGAGTGQIWLDNVQCRGTESTLWSCTKNSW
CCDS33 RGDPGPAGLPGSQGAPGTPGPVGAPGDAGQRGDPGSRGPIGPPGRAGKRGLPGPQGPRGD
1070 1080 1090 1100 1110 1120
>--
initn: 1877 init1: 641 opt: 704 Z-score: 352.2 bits: 76.3 E(32554): 2.7e-13
Smith-Waterman score: 750; 40.7% identity (56.9% similar) in 297 aa overlap (147-419:112-403)
120 130 140 150 160 170
pF1KE1 QAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAE
::..: ::: . :. : ::: :::
CCDS33 CADPVTPPGECCPVCSQTPGGGNTNFGRGRKGQKGEPGLV--PVVTGIRGRPGPAGPP--
90 100 110 120 130
180 190 200 210 220
pF1KE1 KGAKGAMGRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQG----------
:..: :. : : ::.:: :. :: :. : :::: : . :::.:
CCDS33 -GSQGPRGERGPKGRPGPRGPQGIDGEPGVPGQPGAPGPPGHPSHPGPDGLSRPFSAQMA
140 150 160 170 180 190
230 240 250 260 270
pF1KE1 ---EKGSKGDG-----GLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGS
::.. :. : .:: : : .: .:. : : : : :: : ::: :..:
CCDS33 GLDEKSGLGSQVGLMPGSVGPVGPRGPQGLQGQQGGAGPTGPPGEPGDPGPMGPIGSRGP
200 210 220 230 240 250
280 290 300 310 320 330
pF1KE1 KGDFGRPGP---PGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGL
.: :.:: :: : :: : :.:: .: :: :: : : .:. : : :..:
CCDS33 EGPPGKPGEDGEPGRNGNPGEVGFAGSPGARGFPGAPGLPGLKGHRGHKGLEGPKGEVGA
260 270 280 290 300 310
340 350 360 370 380 390
pF1KE1 PGSPGSPGATGLKGSKGDTG---LQGQQGRKGESGVPGPAGVKGEQGSPGLAGPKGAPGQ
::: : : :: :. : : . :..:: : .:.:: :..: :.:: :: : ::.
CCDS33 PGSKGEAGPTGPMGAMGPLGPRGMPGERGRLGPQGAPGQRGAHGMPGKPGPMGPLGIPGS
320 330 340 350 360 370
400 410 420 430 440 450
pF1KE1 AGQKGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQ
.: :. :.:: .: :..: .: .:.
CCDS33 SGFPGNPGMKGEAGPTGARGPEGPQGQRGETGPPGPVGSPGLPGAIGTDGTPGAKGPTGS
380 390 400 410 420 430
>>CCDS450.1 COL9A2 gene_id:1298|Hs108|chr1 (689 aa)
initn: 783 init1: 783 opt: 875 Z-score: 436.6 bits: 90.8 E(32554): 5.4e-18
Smith-Waterman score: 894; 45.0% identity (60.5% similar) in 311 aa overlap (141-444:180-481)
120 130 140 150 160 170
pF1KE1 SRLQVLQAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGP
::: : ::. : :..: .: :: :
CCDS45 PPGPPGKPGRPGTIQGLEGSADFLCPTNCPPGMKGPPGLQGVKGHAGKRGILGDPGHQGK
150 160 170 180 190 200
180 190 200 210 220 230
pF1KE1 PGPPAEKGAKGAMGRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGS
::: .. ::.: .: : ::.: .: ::. : : ::.: : :: :. :: ::.:
CCDS45 PGPKGDVGASGEQGIPGPPGPQGIRGYPGMAGPKGETGPHGYKGMVGAIGATGPPGEEG-
210 220 230 240 250 260
240 250 260 270 280 290
pF1KE1 KGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPPGL
:.: : ::::: : :: .: .:. : :. :::: .:. : : :: :
CCDS45 --------PRGPPGRAGEKGDEGSPGIRGPQGITGPKGATGPPGINGKDGTPGTPGMKGS
270 280 290 300 310 320
300 310 320 330 340 350
pF1KE1 AGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGLKGS
:: : :. :. :: :::: ::. : :: .:::: : :: .: ::. : :: : :
CCDS45 AGQAGQPGSPGHQGLAGVPGQPGTKGGPGDQGEPGPQGLPGFSGPPGKEGEPGPRGEIGP
330 340 350 360 370 380
360 370 380 390 400
pF1KE1 KGDTGLQGQQGRKGESGVPGPAGV---KGEQGSPGLAGPKGAPGQAGQKGDQGV---KGS
.: : .:.::..: : ::: : ::::: ::. ::.: :: :.::. : .:.
CCDS45 QGIMGQKGDQGERGPVGQPGPQGRQGPKGEQGPPGIPGPQGLPGVKGDKGSPGKTGPRGK
390 400 410 420 430 440
410 420 430 440 450 460
pF1KE1 SGEQGVKGEKGERGENSVSVRIVGSSNRG-RAEVYYSGTWGTICDDEWQNSDAIVFCRML
:. :: : ::.::.. : . ....: :.: : : :
CCDS45 VGDPGVAGLPGEKGEKGESGEPGPKGQQGVRGEPGYPGPSGDAGAPGVQGYPGPPGPRGL
450 460 470 480 490 500
470 480 490 500 510 520
pF1KE1 GYSKGRALYKVGAGTGQIWLDNVQCRGTESTLWSCTKNSWGHHDCSHEEDAGVECSV
CCDS45 AGNRGVPGQPGRQGVEGRDATDQHIVDVALKMLQEQLAEVAVSAKREALGAVGMMGPPGP
510 520 530 540 550 560
>>CCDS47447.1 COL9A1 gene_id:1297|Hs108|chr6 (678 aa)
initn: 827 init1: 827 opt: 868 Z-score: 433.4 bits: 90.2 E(32554): 8.1e-18
Smith-Waterman score: 874; 46.8% identity (63.7% similar) in 278 aa overlap (148-419:178-446)
120 130 140 150 160 170
pF1KE1 AQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEK
: : ::..::::: : : :: : .:.
CCDS47 GPPGPPGPRGTIGFHDGDPLCPNACPPGRSGYPGLPGMRGHKGAKGEIGEPGRQGHKGEE
150 160 170 180 190 200
180 190 200 210 220 230
pF1KE1 GAKGAMGRDGATGPSGPQGPPGV------KGEAGLQGPQGAPGKQGATGTPGPQGEKGSK
: .: .:. :: :: : :: :. ::: : .: .: :: :: :.:: ::..:
CCDS47 GDQGELGEVGAQGPPGAQGLRGITGIVGDKGEKGARGLDGEPGPQGLPGAPGDQGQRGPP
210 220 230 240 250 260
240 250 260 270 280 290
pF1KE1 GDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLA
:..: :::. :..: .: :::: ::: :. : : : :: :.::. :.::::: :
CCDS47 GEAG---PKGDRGAEGARGIPGLPGPKGDTGLPGVDGRDGIPGMPGTKGEPGKPGPPGDA
270 280 290 300 310 320
300 310 320 330 340 350
pF1KE1 GFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGLKGSK
:. :: ::. :.:: :..:. :. : ::. :. : .: ::. : :: .: .: .
CCDS47 GL------QGLPGVPGIPGAKGVAGEKGSTGAPGKPGQMGNSGKPGQQGPPGEVGPRGPQ
330 340 350 360 370
360 370 380 390 400 410
pF1KE1 GDTGLQGQQGRKGESGVPGPAGVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVK
: : .:. : : :.:: : : : ::: :: : ::. :..: : : .::::..
CCDS47 GLPGSRGELGPVGSPGLPGKLGSLGSPGLPGLPGPPGLPGMKGDRGVVGEPGPKGEQGAS
380 390 400 410 420 430
420 430 440 450 460 470
pF1KE1 GEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSDAIVFCRMLGYSKGRAL
::.:: ::
CCDS47 GEEGEAGERGELGDIGLPGPKGSAGNPGEPGLRGPEGSRGLPGVEGPRGPPGPRGVQGEQ
440 450 460 470 480 490
>>CCDS75932.1 COL5A1 gene_id:1289|Hs108|chr9 (1838 aa)
initn: 3985 init1: 847 opt: 877 Z-score: 432.6 bits: 91.5 E(32554): 8.9e-18
Smith-Waterman score: 899; 45.0% identity (56.9% similar) in 327 aa overlap (147-446:1203-1527)
120 130 140 150 160 170
pF1KE1 QAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGP------
.:.:: : .: .: :.:: :::
CCDS75 KGEQGPPGPTGPQGPIGQPGPSGADGEPGPRGQQGLFGQKGDEGPRGFPGPPGPVGLQGL
1180 1190 1200 1210 1220 1230
180 190 200 210 220
pF1KE1 PGPPAEKGAKGAMGRDGATGPSGPQGPPGVKGEAGLQGPQGA---PGKQGATGTPGPQGE
::::.::: : .:. : :: ::.:: :. : : ::: :. :: : : :: ::
CCDS75 PGPPGEKGETGDVGQMGPPGPPGPRGPSGAPGADGPQGPPGGIGNPGAVGEKGEPGEAGE
1240 1250 1260 1270 1280 1290
230 240 250 260 270 280
pF1KE1 KGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGP
: :.:: ::::: : :::.: : : : .: :: : : :: : :: : ::
CCDS75 PGLPGEGGPPGPKGERGEKGESGPSGAAGPPGPKGPPGDDGPKGSPGPVGFPGDPGPPGE
1300 1310 1320 1330 1340 1350
290 300 310 320 330 340
pF1KE1 PGLAGF---PGAKGDQGQPGLQGVPGP---PGAVGHPGAKGEPGSAGSPGRAGLPGSPGS
:: :: :: :::.:.:: : ::: :: : :: .: :: :: :: : :. :
CCDS75 PGPAGQDGPPGDKGDDGEPGQTGSPGPTGEPGPSGPPGKRGPPGPAGPEGRQGEKGAKGE
1360 1370 1380 1390 1400 1410
350 360 370 380 390
pF1KE1 PGATGLKGSKGDTGLQGQQGRKGESG---VPGPAGVKGEQGSPGLAGPKGA---PGQAGQ
: : :. : : :: :. : .: .:::.: .: :::: :: : :: :
CCDS75 AGLEGPPGKTGPIGPQGAPGKPGPDGLRGIPGPVGEQGLPGSPGPDGPPGPMGPPGLPGL
1420 1430 1440 1450 1460 1470
400 410 420 430 440
pF1KE1 KGDQGVKGSSGEQGV------KGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICDD
:::.: :: .:. :. ::.::.:. .. .:. .:. . .: : :
CCDS75 KGDSGPKGEKGHPGLIGLIGPPGEQGEKGDRGLPGPQGSSGPKGEQGI--TGPSGPIGPP
1480 1490 1500 1510 1520 1530
450 460 470 480 490 500
pF1KE1 EWQNSDAIVFCRMLGYSKGRALYKVGAGTGQIWLDNVQCRGTESTLWSCTKNSWGHHDCS
CCDS75 GPPGLPGPPGPKGAKGSSGPTGPKGEAGHPGPPGPPGPPGEVIQPLPIQASRTRRNIDAS
1540 1550 1560 1570 1580 1590
>--
initn: 4632 init1: 822 opt: 839 Z-score: 414.7 bits: 88.2 E(32554): 8.8e-17
Smith-Waterman score: 883; 48.1% identity (58.8% similar) in 291 aa overlap (147-422:714-998)
120 130 140 150 160 170
pF1KE1 QAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAE
.:: : :: ::. ::.:.:: : :::.:
CCDS75 LGPKGPPGPPGPPGVTGMDGQPGPKGNVGPQGEPGPPGQQGNPGAQGLPGPQGAIGPPGE
690 700 710 720 730 740
180 190 200 210 220 230
pF1KE1 KGAKGAMGRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGL
:: : : : : .:: : :: .: : .: :: :: :: : :::.: ::. : ::
CCDS75 KGPLGKPGLPGMPGADGPPGHPGKEGPPGEKGGQGPPGPQGPIGYPGPRGVKGADGIRGL
750 760 770 780 790 800
240 250 260 270 280 290
pF1KE1 IGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGS---KGDFGRPGP---PGL
:::::::. :.:: ::: :.::: : .:::: .: .: :: :: ::
CCDS75 ------KGTKGEKGEDGFPGFKGDMGIKGDRGEIGPPGPRGEDGPEGPKGRGGPNGDPGP
810 820 830 840 850
300 310 320 330 340
pF1KE1 AGFPGAKGDQGQPGLQGVPG---PPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGL
: :: :: : ::: : :: : :..: :: : : :. : : :: :. : ::
CCDS75 LGPPGEKGKLGVPGLPGYPGRQGPKGSIGFPGFPGANGEKGGRGTPGKPGPRGQRGPTGP
860 870 880 890 900 910
350 360 370 380 390 400
pF1KE1 KGSKGDTGLQGQQGRKGESGVPGPAGVKGE------QGSPGLAGPKGAPGQAGQKGDQGV
.: .: :. :. : ::.:: :::: :: :: :. :::: :: :. : :
CCDS75 RGERGPRGITGKPGPKGNSGGDGPAGPPGERGPNGPQGPTGFPGPKGPPGPPGKDGLPGH
920 930 940 950 960 970
410 420 430 440 450 460
pF1KE1 KGSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSDAIVFCR
:. :: : .:. : : .:
CCDS75 PGQRGETGFQGKTGPPGPPGVVGPQGPTGETGPMGERGHPGPPGPPGEQGLPGLAGKEGT
980 990 1000 1010 1020 1030
>>CCDS6982.1 COL5A1 gene_id:1289|Hs108|chr9 (1838 aa)
initn: 3985 init1: 847 opt: 877 Z-score: 432.6 bits: 91.5 E(32554): 8.9e-18
Smith-Waterman score: 899; 45.0% identity (56.9% similar) in 327 aa overlap (147-446:1203-1527)
120 130 140 150 160 170
pF1KE1 QAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGP------
.:.:: : .: .: :.:: :::
CCDS69 KGEQGPPGPTGPQGPIGQPGPSGADGEPGPRGQQGLFGQKGDEGPRGFPGPPGPVGLQGL
1180 1190 1200 1210 1220 1230
180 190 200 210 220
pF1KE1 PGPPAEKGAKGAMGRDGATGPSGPQGPPGVKGEAGLQGPQGA---PGKQGATGTPGPQGE
::::.::: : .:. : :: ::.:: :. : : ::: :. :: : : :: ::
CCDS69 PGPPGEKGETGDVGQMGPPGPPGPRGPSGAPGADGPQGPPGGIGNPGAVGEKGEPGEAGE
1240 1250 1260 1270 1280 1290
230 240 250 260 270 280
pF1KE1 KGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGP
: :.:: ::::: : :::.: : : : .: :: : : :: : :: : ::
CCDS69 PGLPGEGGPPGPKGERGEKGESGPSGAAGPPGPKGPPGDDGPKGSPGPVGFPGDPGPPGE
1300 1310 1320 1330 1340 1350
290 300 310 320 330 340
pF1KE1 PGLAGF---PGAKGDQGQPGLQGVPGP---PGAVGHPGAKGEPGSAGSPGRAGLPGSPGS
:: :: :: :::.:.:: : ::: :: : :: .: :: :: :: : :. :
CCDS69 PGPAGQDGPPGDKGDDGEPGQTGSPGPTGEPGPSGPPGKRGPPGPAGPEGRQGEKGAKGE
1360 1370 1380 1390 1400 1410
350 360 370 380 390
pF1KE1 PGATGLKGSKGDTGLQGQQGRKGESG---VPGPAGVKGEQGSPGLAGPKGA---PGQAGQ
: : :. : : :: :. : .: .:::.: .: :::: :: : :: :
CCDS69 AGLEGPPGKTGPIGPQGAPGKPGPDGLRGIPGPVGEQGLPGSPGPDGPPGPMGPPGLPGL
1420 1430 1440 1450 1460 1470
400 410 420 430 440
pF1KE1 KGDQGVKGSSGEQGV------KGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICDD
:::.: :: .:. :. ::.::.:. .. .:. .:. . .: : :
CCDS69 KGDSGPKGEKGHPGLIGLIGPPGEQGEKGDRGLPGPQGSSGPKGEQGI--TGPSGPIGPP
1480 1490 1500 1510 1520 1530
450 460 470 480 490 500
pF1KE1 EWQNSDAIVFCRMLGYSKGRALYKVGAGTGQIWLDNVQCRGTESTLWSCTKNSWGHHDCS
CCDS69 GPPGLPGPPGPKGAKGSSGPTGPKGEAGHPGPPGPPGPPGEVIQPLPIQASRTRRNIDAS
1540 1550 1560 1570 1580 1590
>--
initn: 4632 init1: 822 opt: 839 Z-score: 414.7 bits: 88.2 E(32554): 8.8e-17
Smith-Waterman score: 883; 48.1% identity (58.8% similar) in 291 aa overlap (147-422:714-998)
120 130 140 150 160 170
pF1KE1 QAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAE
.:: : :: ::. ::.:.:: : :::.:
CCDS69 LGPKGPPGPPGPPGVTGMDGQPGPKGNVGPQGEPGPPGQQGNPGAQGLPGPQGAIGPPGE
690 700 710 720 730 740
180 190 200 210 220 230
pF1KE1 KGAKGAMGRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGL
:: : : : : .:: : :: .: : .: :: :: :: : :::.: ::. : ::
CCDS69 KGPLGKPGLPGMPGADGPPGHPGKEGPPGEKGGQGPPGPQGPIGYPGPRGVKGADGIRGL
750 760 770 780 790 800
240 250 260 270 280 290
pF1KE1 IGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGS---KGDFGRPGP---PGL
:::::::. :.:: ::: :.::: : .:::: .: .: :: :: ::
CCDS69 ------KGTKGEKGEDGFPGFKGDMGIKGDRGEIGPPGPRGEDGPEGPKGRGGPNGDPGP
810 820 830 840 850
300 310 320 330 340
pF1KE1 AGFPGAKGDQGQPGLQGVPG---PPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGL
: :: :: : ::: : :: : :..: :: : : :. : : :: :. : ::
CCDS69 LGPPGEKGKLGVPGLPGYPGRQGPKGSIGFPGFPGANGEKGGRGTPGKPGPRGQRGPTGP
860 870 880 890 900 910
350 360 370 380 390 400
pF1KE1 KGSKGDTGLQGQQGRKGESGVPGPAGVKGE------QGSPGLAGPKGAPGQAGQKGDQGV
.: .: :. :. : ::.:: :::: :: :: :. :::: :: :. : :
CCDS69 RGERGPRGITGKPGPKGNSGGDGPAGPPGERGPNGPQGPTGFPGPKGPPGPPGKDGLPGH
920 930 940 950 960 970
410 420 430 440 450 460
pF1KE1 KGSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSDAIVFCR
:. :: : .:. : : .:
CCDS69 PGQRGETGFQGKTGPPGPPGVVGPQGPTGETGPMGERGHPGPPGPPGEQGLPGLAGKEGT
980 990 1000 1010 1020 1030
520 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 22:50:03 2016 done: Sun Nov 6 22:50:04 2016
Total Scan time: 3.590 Total Display time: 0.130
Function used was FASTA [36.3.4 Apr, 2011]