FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1339, 520 aa 1>>>pF1KE1339 520 - 520 aa - 520 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 10.5809+/-0.00119; mu= -1.3916+/- 0.072 mean_var=450.7090+/-92.148, 0's: 0 Z-trim(113.8): 180 B-trim: 0 in 0/53 Lambda= 0.060412 statistics sampled from 14262 (14438) to 14262 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.727), E-opt: 0.2 (0.444), width: 16 Scan time: 3.590 The best scores are: opt bits E(32554) CCDS2124.1 MARCO gene_id:8685|Hs108|chr2 ( 520) 3627 330.5 2.8e-90 CCDS2297.1 COL3A1 gene_id:1281|Hs108|chr2 (1466) 944 97.2 1.3e-19 CCDS12222.1 COL5A3 gene_id:50509|Hs108|chr19 (1745) 907 94.1 1.4e-18 CCDS72756.1 COL8A2 gene_id:1296|Hs108|chr1 ( 638) 889 92.0 2.2e-18 CCDS403.1 COL8A2 gene_id:1296|Hs108|chr1 ( 703) 889 92.0 2.3e-18 CCDS33350.1 COL5A2 gene_id:1290|Hs108|chr2 (1499) 886 92.2 4.5e-18 CCDS450.1 COL9A2 gene_id:1298|Hs108|chr1 ( 689) 875 90.8 5.4e-18 CCDS47447.1 COL9A1 gene_id:1297|Hs108|chr6 ( 678) 868 90.2 8.1e-18 CCDS75932.1 COL5A1 gene_id:1289|Hs108|chr9 (1838) 877 91.5 8.9e-18 CCDS6982.1 COL5A1 gene_id:1289|Hs108|chr9 (1838) 877 91.5 8.9e-18 CCDS780.2 COL11A1 gene_id:1301|Hs108|chr1 (1690) 876 91.4 9e-18 CCDS53348.1 COL11A1 gene_id:1301|Hs108|chr1 (1767) 876 91.4 9.2e-18 CCDS778.1 COL11A1 gene_id:1301|Hs108|chr1 (1806) 876 91.4 9.3e-18 CCDS4971.1 COL9A1 gene_id:1297|Hs108|chr6 ( 921) 868 90.4 9.9e-18 CCDS6802.1 COL27A1 gene_id:85301|Hs108|chr9 (1860) 870 90.9 1.4e-17 CCDS11561.1 COL1A1 gene_id:1277|Hs108|chr17 (1464) 865 90.3 1.6e-17 CCDS8759.1 COL2A1 gene_id:1280|Hs108|chr12 (1418) 861 90.0 2e-17 CCDS41778.1 COL2A1 gene_id:1280|Hs108|chr12 (1487) 861 90.0 2e-17 CCDS43452.1 COL11A2 gene_id:1302|Hs108|chr6 (1650) 855 89.5 3.1e-17 CCDS42829.1 COL4A3 gene_id:1285|Hs108|chr2 (1670) 854 89.4 3.4e-17 CCDS83099.1 COL21A1 gene_id:81578|Hs108|chr6 ( 954) 847 88.5 3.6e-17 CCDS55025.1 COL21A1 gene_id:81578|Hs108|chr6 ( 957) 847 88.5 3.6e-17 CCDS41297.1 COL16A1 gene_id:1307|Hs108|chr1 (1604) 841 88.3 7.2e-17 CCDS41353.1 COL24A1 gene_id:255631|Hs108|chr1 (1714) 840 88.2 8e-17 CCDS9511.1 COL4A1 gene_id:1282|Hs108|chr13 (1669) 839 88.1 8.3e-17 CCDS6376.1 COL22A1 gene_id:169044|Hs108|chr8 (1626) 837 87.9 9.2e-17 CCDS13505.1 COL9A3 gene_id:1299|Hs108|chr20 ( 684) 816 85.7 1.9e-16 CCDS34682.1 COL1A2 gene_id:1278|Hs108|chr7 (1366) 816 86.0 2.9e-16 CCDS13730.1 COL6A2 gene_id:1292|Hs108|chr21 ( 828) 792 83.7 9.1e-16 CCDS13729.1 COL6A2 gene_id:1292|Hs108|chr21 ( 918) 792 83.7 9.7e-16 CCDS13728.1 COL6A2 gene_id:1292|Hs108|chr21 (1019) 792 83.8 1e-15 CCDS76649.1 COL4A1 gene_id:1282|Hs108|chr13 ( 519) 777 82.1 1.7e-15 CCDS44428.2 COL13A1 gene_id:1305|Hs108|chr10 ( 610) 775 82.0 2.1e-15 CCDS2934.1 COL8A1 gene_id:1295|Hs108|chr3 ( 744) 775 82.1 2.4e-15 CCDS14543.1 COL4A5 gene_id:1287|Hs108|chrX (1685) 781 83.1 2.8e-15 CCDS35366.1 COL4A5 gene_id:1287|Hs108|chrX (1691) 781 83.1 2.8e-15 CCDS2773.1 COL7A1 gene_id:1294|Hs108|chr3 (2944) 785 83.7 3.1e-15 CCDS42828.1 COL4A4 gene_id:1286|Hs108|chr2 (1690) 775 82.6 4e-15 CCDS13727.1 COL6A1 gene_id:1291|Hs108|chr21 (1028) 760 81.0 7.2e-15 CCDS41907.1 COL4A2 gene_id:1284|Hs108|chr13 (1712) 757 81.0 1.2e-14 CCDS7554.1 COL17A1 gene_id:1308|Hs108|chr10 (1497) 748 80.1 1.9e-14 CCDS4436.1 COL23A1 gene_id:91522|Hs108|chr5 ( 540) 737 78.7 1.9e-14 CCDS43553.1 COL28A1 gene_id:340267|Hs108|chr7 (1125) 722 77.7 7.6e-14 CCDS5105.1 COL10A1 gene_id:1300|Hs108|chr6 ( 680) 709 76.3 1.2e-13 CCDS76008.1 COL4A6 gene_id:1288|Hs108|chrX (1633) 717 77.5 1.3e-13 CCDS76009.1 COL4A6 gene_id:1288|Hs108|chrX (1666) 717 77.5 1.3e-13 CCDS14542.1 COL4A6 gene_id:1288|Hs108|chrX (1690) 717 77.5 1.3e-13 CCDS14541.1 COL4A6 gene_id:1288|Hs108|chrX (1691) 717 77.5 1.3e-13 CCDS76010.1 COL4A6 gene_id:1288|Hs108|chrX (1707) 717 77.5 1.3e-13 CCDS46948.1 OTOL1 gene_id:131149|Hs108|chr3 ( 477) 686 74.1 3.9e-13 >>CCDS2124.1 MARCO gene_id:8685|Hs108|chr2 (520 aa) initn: 3627 init1: 3627 opt: 3627 Z-score: 1734.3 bits: 330.5 E(32554): 2.8e-90 Smith-Waterman score: 3627; 100.0% identity (100.0% similar) in 520 aa overlap (1-520:1-520) 10 20 30 40 50 60 pF1KE1 MRNKKILKEDELLSETQQAAFHQIAMEPFEINVPKPKRRNGVNFSLAVVVIYLILLTAGA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS21 MRNKKILKEDELLSETQQAAFHQIAMEPFEINVPKPKRRNGVNFSLAVVVIYLILLTAGA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 GLLVVQVLNLQARLRVLEMYFLNDTLAAEDSPSFSLLQSAHPGEHLAQGASRLQVLQAQL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS21 GLLVVQVLNLQARLRVLEMYFLNDTLAAEDSPSFSLLQSAHPGEHLAQGASRLQVLQAQL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 TWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS21 TWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAK 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 GAMGRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS21 GAMGRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPK 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 GETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS21 GETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQ 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 GQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS21 GQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQ 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE1 GRKGESGVPGPAGVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGEN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS21 GRKGESGVPGPAGVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGEN 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE1 SVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSDAIVFCRMLGYSKGRALYKVGAGTGQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS21 SVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSDAIVFCRMLGYSKGRALYKVGAGTGQ 430 440 450 460 470 480 490 500 510 520 pF1KE1 IWLDNVQCRGTESTLWSCTKNSWGHHDCSHEEDAGVECSV :::::::::::::::::::::::::::::::::::::::: CCDS21 IWLDNVQCRGTESTLWSCTKNSWGHHDCSHEEDAGVECSV 490 500 510 520 >>CCDS2297.1 COL3A1 gene_id:1281|Hs108|chr2 (1466 aa) initn: 2388 init1: 859 opt: 944 Z-score: 465.3 bits: 97.2 E(32554): 1.3e-19 Smith-Waterman score: 959; 50.3% identity (64.7% similar) in 286 aa overlap (148-418:717-993) 120 130 140 150 160 170 pF1KE1 AQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQG---HKGAMGMPGAPGPPGPP : :.::::: ..:..: :: : : : CCDS22 GLAGAPGLRGGAGPPGPEGGKGAAGPPGPPGAAGTPGLQGMPGERGGLGSPGPKGDKGEP 690 700 710 720 730 740 180 190 200 210 220 pF1KE1 AEKGAKGAMGRDGATGPSGPQGPPGV------KGEAG------LQGPQGAPGKQGATGTP . :: :. :.:: ::.:: :::: :::.: . ::.:.::..: :: : CCDS22 GGPGADGVPGKDGPRGPTGPIGPPGPAGQPGDKGEGGAPGLPGIAGPRGSPGERGETGPP 750 760 770 780 790 800 230 240 250 260 270 280 pF1KE1 GPQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDF :: : :. :..: : ::: :. ::::. : :: : : .: :: ::: :: ::. CCDS22 GPAGFPGAPGQNGEPGGKGERGAPGEKGEGGPPGVAGPPGGSGPAG---PPGPQGVKGER 810 820 830 840 850 860 290 300 310 320 330 340 pF1KE1 GRPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSP : :: :: ::::::.: : :: .: ::::: : :: : :: ::. : .:::: CCDS22 GSPGGPGAAGFPGARGLPGPPGSNGNPGPPGPSGSPGKDGPPGPAGNTG------APGSP 870 880 890 900 910 350 360 370 380 390 400 pF1KE1 GATGLKGSKGDTGLQGQQGRKGESGVPGPAGVKGEQGSPGLAGPKGAPGQAGQKGDQGVK :..: ::. :. : .:. : .: :.::: :. : :. ::::: : :: :. : :::: CCDS22 GVSGPKGDAGQPGEKGSPGAQGPPGAPGPLGIAGITGARGLAGPPGMPGPRGSPGPQGVK 920 930 940 950 960 970 410 420 430 440 450 460 pF1KE1 GSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSDAIVFCRM : ::. :..: .:::: CCDS22 GESGKPGANGLSGERGPPGPQGLPGLAGTAGEPGRDGNPGSDGLPGRDGSPGGKGDRGEN 980 990 1000 1010 1020 1030 >-- initn: 812 init1: 812 opt: 865 Z-score: 428.1 bits: 90.3 E(32554): 1.6e-17 Smith-Waterman score: 894; 46.1% identity (62.2% similar) in 304 aa overlap (136-418:150-453) 110 120 130 140 150 160 pF1KE1 LAQGASRLQVLQAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMP :.. . . .:. .. :: :. : : : CCDS22 NGDPGIPGQPGSPGSPGPPGICESCPTGPQNYSPQYDSYDVKSGVAVGGLAGYPGPAGPP 120 130 140 150 160 170 170 180 190 200 210 pF1KE1 GAPGPPG----P--PAEKGAKGAMGRDGATGPSGPQGPPGV---KGEAGLQGPQGAPGKQ : ::::: : :. : .: :. : .::::: ::::. .: :: .: .: ::. CCDS22 GPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPSGPPGPPGAIGPSGPAGKDGESGRPGRP 180 190 200 210 220 230 220 230 240 250 260 270 pF1KE1 GATGTPGPQGEKGSKGDGGLIGPKGETG---TKGEKGDLGLPGSKGDRGMKGDAGVMGPP : : ::: : :: : :. : ::. : .::::. : :: ::. :. :. :. :: CCDS22 GERGLPGPPGIKGPAGIPGFPGMKGHRGFDGRNGEKGETGAPGLKGENGLPGENGAPGPM 240 250 260 270 280 290 280 290 300 310 320 330 pF1KE1 GAQGSKGDFGRPGPPGLAGF---PGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSAGSP : .:. :. :::: :: :: ::.:..:::: : :: : : :::::: : :::: CCDS22 GPRGAPGERGRPGLPGAAGARGNDGARGSDGQPGPPGPPGTAGFPGSPGAKGEVGPAGSP 300 310 320 330 340 350 340 350 360 370 380 pF1KE1 GRAGLPGS---PGSPGATGLKGSKGDTGLQGQQGRKGE---SGVPGPAGVKGEQGSPGLA : : ::. :: : .: .: : :..:. : ::: .:.:: :. : .: :: : CCDS22 GSNGAPGQRGEPGPQGHAGAQGPPGPPGINGSPGGKGEMGPAGIPGAPGLMGARGPPGPA 360 370 380 390 400 410 390 400 410 420 430 440 pF1KE1 GPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWG : .:::: : :. : .:..:: : .::.:: : CCDS22 GANGAPGLRGGAGEPGKNGAKGEPGPRGERGEAGIPGVPGAKGEDGKDGSPGEPGANGLP 420 430 440 450 460 470 450 460 470 480 490 500 pF1KE1 TICDDEWQNSDAIVFCRMLGYSKGRALYKVGAGTGQIWLDNVQCRGTESTLWSCTKNSWG CCDS22 GAAGERGAPGFRGPAGPNGIPGEKGPAGERGAPGPAGPRGAAGEPGRDGVPGGPGMRGMP 480 490 500 510 520 530 >-- initn: 1441 init1: 740 opt: 758 Z-score: 377.7 bits: 81.0 E(32554): 1e-14 Smith-Waterman score: 758; 46.7% identity (61.1% similar) in 244 aa overlap (142-382:477-714) 120 130 140 150 160 170 pF1KE1 RLQVLQAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPP :. ::.::::..: : :.:: :: CCDS22 GERGEAGIPGVPGAKGEDGKDGSPGEPGANGLPGAAGERGAPGFRGPAGPNGIPGEKGPA 450 460 470 480 490 500 180 190 200 210 220 230 pF1KE1 GPPAEKGAKGAMGRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSK : :.:: : : ::.: : .: :: : :. : :.::..: : :: :::.: CCDS22 G---ERGAPGPAGPRGAAGEPGRDGVPGGPGMRGMPGSPGGPGSDGKPGPPGSQGESGRP 510 520 530 540 550 560 240 250 260 270 280 pF1KE1 GDGGLIGPKGETGT---KGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPP : : ::.:. :. : ::. : ::..:.:: : : .::: :..:. : ::: CCDS22 GPPGPSGPRGQPGVMGFPGPKGNDGAPGKNGERGGPGGPGPQGPP---GKNGETGPQGPP 570 580 590 600 610 620 290 300 310 320 330 340 pF1KE1 GLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGLK : .: : ::: : :: ::. : ::. : :: .:.:: : : :: ::.::. : .: CCDS22 GPTGPGGDKGDTGPPGPQGLQGLPGTGGPPGENGKPGEPGPKGDAGAPGAPGGKGDAGAP 630 640 650 660 670 680 350 360 370 380 390 400 pF1KE1 GSKGDTGLQGQQGRKGESGVPGPAGVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQ : .: :: : : .: .: ::: : :: : :: CCDS22 GERGPPGLAGAPGLRGGAGPPGPEGGKGAAGPPGPPGAAGTPGLQGMPGERGGLGSPGPK 690 700 710 720 730 740 410 420 430 440 450 460 pF1KE1 GVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSDAIVFCRMLGYSKG CCDS22 GDKGEPGGPGADGVPGKDGPRGPTGPIGPPGPAGQPGDKGEGGAPGLPGIAGPRGSPGER 750 760 770 780 790 800 >>CCDS12222.1 COL5A3 gene_id:50509|Hs108|chr19 (1745 aa) initn: 4669 init1: 868 opt: 907 Z-score: 447.0 bits: 94.1 E(32554): 1.4e-18 Smith-Waterman score: 915; 46.8% identity (61.4% similar) in 293 aa overlap (141-418:496-788) 120 130 140 150 160 170 pF1KE1 SRLQVLQAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGP :: .:::.:: : :: .: .: : :: CCDS12 AQAVLQQTQLSMKGPPGPVGLTGRPGPVGLPGHPGLKGEEGAEGPQGPRGLQGPHGPPGR 470 480 490 500 510 520 180 190 200 210 pF1KE1 PGPPAEKGAKGAMGRDGATGPSGPQG----P--PGVKGE------AGLQGPQGAPGKQGA : .. :: :: : : :::.: .: : :: ::. .: :: : :..:: CCDS12 VGKMGRPGADGARGLPGDTGPKGDRGFDGLPGLPGEKGQRGDFGHVGQPGPPGEDGERGA 530 540 550 560 570 580 220 230 240 250 260 270 pF1KE1 TGTPGPQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGS : ::: :. : : ::.::.: : :. : :. :. : .: : : :::: ::. CCDS12 EGPPGPTGQAGEPGPRGLLGPRGSPGPTGRPGVTGIDGAPGAKGNVGPPGEPGPPGQQGN 590 600 610 620 630 640 280 290 300 310 320 330 pF1KE1 KGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGS .:. : ::: :: : :: :: :.::. :.:: : .:::: .: : :. : : : CCDS12 HGSQGLPGPQGLIGTPGEKGPPGNPGIPGLPGSDGPLGHPGHEGPTGEKGAQGPPGSAGP 650 660 670 680 690 700 340 350 360 370 380 390 pF1KE1 PGSPGATGLKGSKGDTGLQGQQGRKGESGVPG---PAGVKGEQGSPGLAGPKGAPGQAGQ :: :: :.::..:. ::::..:.:::.: :: .:.::.::.:: ::.: : : CCDS12 PGYPGPRGVKGTSGNRGLQGEKGEKGEDGFPGFKGDVGLKGDQGKPGAPGPRGEDGPEGP 710 720 730 740 750 760 400 410 420 430 440 450 pF1KE1 KGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSD ::. : : : : ::::. : CCDS12 KGQAGQAGEEGPPGSAGEKGKLGVPGLPGYPGRPGPKGSIGFPGPLGPIGEKGKSGKTGQ 770 780 790 800 810 820 >-- initn: 1602 init1: 858 opt: 859 Z-score: 424.4 bits: 89.9 E(32554): 2.6e-17 Smith-Waterman score: 862; 45.3% identity (56.0% similar) in 318 aa overlap (141-418:1126-1442) 120 130 140 150 160 170 pF1KE1 SRLQVLQAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGP ::.: ::..:. :. : : :. : ::: CCDS12 AGPPGQPGIRGPAGHPGPPGADGAQGRRGPPGLFGQKGDDGVRGFVGVIGPPGLQGLPGP 1100 1110 1120 1130 1140 1150 180 190 200 210 pF1KE1 PGPPAEKGAKGAMGRDGATGPSGPQGP---------------PGVKGEAGLQGPQGAPGK :: .: : :.:: :: :: ::::: ::. :: : .: : :: CCDS12 PGEKGEVGDVGSMGPHGAPGPRGPQGPTGSEGTPGLPGGVGQPGAVGEKGERGDAGDPGP 1160 1170 1180 1190 1200 1210 220 230 240 250 260 pF1KE1 QGATGTPGPQGEKGSKGDGG---------LIGPKGETGTKGEKGDLGLPGSKGDRGMKGD :: : :::.:. : :::.: :: :: :.:: : ::::. : : : CCDS12 PGAPGIPGPKGDIGEKGDSGPSGAAGPPGKKGPPGEDGAKGSVGPTGLPGDLGPPGDPGV 1220 1230 1240 1250 1260 1270 270 280 290 300 310 320 pF1KE1 AGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGS .:. : :: .:. :: : ::::: .: ::: : :. : .: : : :. ::::::: CCDS12 SGIDGSPGEKGDPGDVGGPGPPGASGEPGAPGPPGKRGPSGHMGREGREGEKGAKGEPGP 1280 1290 1300 1310 1320 1330 330 340 350 360 370 pF1KE1 AGSPGRAGLP----GSPGSPGATGLKGSKGDTGLQGQQGRKGESGVPGP----------- : :::.: : : :: : ::.: : .: : : :. : ::: CCDS12 DGPPGRTG-PMGARGPPGRVGPEGLRGIPGPVGEPGLLGAPGQMGPPGPLGPSGLPGLKG 1340 1350 1360 1370 1380 1390 380 390 400 410 420 430 pF1KE1 -AGVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSS .: :::.: :: : : ::.::.:::::. : .: : ::. : : CCDS12 DTGPKGEKGHIGLIGLIGPPGEAGEKGDQGLPGVQGPPGPKGDPGPPGPIGSLGHPGPPG 1400 1410 1420 1430 1440 1450 440 450 460 470 480 490 pF1KE1 NRGRAEVYYSGTWGTICDDEWQNSDAIVFCRMLGYSKGRALYKVGAGTGQIWLDNVQCRG CCDS12 VAGPLGQKGSKGSPGSMGPRGDTGPAGPPGPPGAPAELHGLRRRRRFVPVPLPVVEGGLE 1460 1470 1480 1490 1500 1510 >-- initn: 1424 init1: 757 opt: 835 Z-score: 413.1 bits: 87.8 E(32554): 1.1e-16 Smith-Waterman score: 835; 44.3% identity (59.1% similar) in 296 aa overlap (138-418:823-1112) 110 120 130 140 150 160 pF1KE1 QGASRLQVLQAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGA : .::. .::.: :: .:..: : : CCDS12 PGYPGRPGPKGSIGFPGPLGPIGEKGKSGKTGQPGL---EGERGPPGSRGERGQPGATGQ 800 810 820 830 840 170 180 190 200 210 220 pF1KE1 PGPPGPPAEKGAKGAMGRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGE ::: : .. :: : :. : : .:: : :: :: : :: .: ::. : : : ::. CCDS12 PGPKGDVGQDGAPGIPGEKGLPGLQGPPGFPGPKGPPGHQGKDGRPGHPGQRGELGFQGQ 850 860 870 880 890 900 230 240 250 260 270 280 pF1KE1 KGSKGDGGLIGPKGETGTKG---EKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGR : : .:..::.:.:: : :.: : :: :..:. : : : : : : .:. CCDS12 TGPPGPAGVLGPQGKTGEVGPLGERGPPGPPGPPGEQGLPGLEGREGAKGELGPPGPLGK 910 920 930 940 950 960 290 300 310 320 330 340 pF1KE1 PGPPGLAGFPGAKGDQGQPG---LQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGS :: :: :::: :: :.:: :.: :::: :: :. :: : : : ::::. :: CCDS12 EGPAGLRGFPGPKGGPGDPGPTGLKGDKGPPGPVGANGSPGERGPLGPAGGIGLPGQSGS 970 980 990 1000 1010 1020 350 360 370 380 390 pF1KE1 PGATGLKGSKGDTGLQGQQGRKGESGVPGPAGVKGEQGSPGLAGPKG---------APGQ : .: :.::. : .: : :..:.::: : : :: :::.: :::. CCDS12 EGPVGPAGKKGSRGERGPPGPTGKDGIPGPLG---PLGPPGAAGPSGEEGDKGDVGAPGH 1030 1040 1050 1060 1070 1080 400 410 420 430 440 450 pF1KE1 AGQKGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQ :.:::.: : :. :..: :. : CCDS12 KGSKGDKGDAGPPGQPGIRGPAGHPGPPGADGAQGRRGPPGLFGQKGDDGVRGFVGVIGP 1090 1100 1110 1120 1130 1140 >>CCDS72756.1 COL8A2 gene_id:1296|Hs108|chr1 (638 aa) initn: 1622 init1: 562 opt: 889 Z-score: 443.6 bits: 92.0 E(32554): 2.2e-18 Smith-Waterman score: 889; 47.6% identity (59.7% similar) in 290 aa overlap (141-418:168-454) 120 130 140 150 160 170 pF1KE1 SRLQVLQAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGP :. . : .: :: : :: : ::.::: CCDS72 GLKGDNGVGQPGLPGAPGQGGAPGPPGLPGPAGLGKPGLDGLPGAPGDKGESGPPGVPGP 140 150 160 170 180 190 180 190 200 210 220 pF1KE1 PGPPAEKGAKGAMGRDG-----ATGPSGPQGPPGVKGEAGLQGPQG--APGKQGATGTPG : :. : :: : :: :.: ::::: :.::: : .:: : .: : : :: CCDS72 RGEPGAVGPKGPPGVDGVGVPGAAGLPGPQGPSGAKGEPGTRGPPGLIGPTGYGMPGLPG 200 210 220 230 240 250 230 240 250 260 270 280 pF1KE1 PQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFG :.:..: : ::.: .:: : :: :. : : : :. :.::. : : : ::. : CCDS72 PKGDRGPAGVPGLLGDRGEPGEDGEPGEQGPQGLGGPPGLPGSAGLPGRRGPPGPKGEAG 260 270 280 290 300 310 290 300 310 320 330 340 pF1KE1 RPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPG ::::. :: .:::: :: : :: :: : :::.: :: .: :. :. : ::.:: CCDS72 PGGPPGV---PGIRGDQGPSGLAGKPGVPGERGLPGAHGPPGPTGPKGEPGFTGRPGGPG 320 330 340 350 360 370 350 360 370 380 390 pF1KE1 ATGLKGSKGDTGLQGQQGRKGESGVPG---PAGVKGEQGSPGLAGPKGAPGQAGQ--KGD ..: :.::: :: :: : .: ::.:: ::: : :: ::: : : :: :. :. CCDS72 VAGALGQKGDLGLPGQPGLRGPSGIPGLQGPAGPIGPQGLPGLKGEPGLPGPPGEGRAGE 380 390 400 410 420 430 400 410 420 430 440 450 pF1KE1 QGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSDAIV :. : .: :: : : : CCDS72 PGTAGPTGPPGVPGSPGITGPPGPPGPPGPPGAPGAFDETGIAGLHLPNGGVEGAVLGKG 440 450 460 470 480 490 >>CCDS403.1 COL8A2 gene_id:1296|Hs108|chr1 (703 aa) initn: 1622 init1: 562 opt: 889 Z-score: 443.1 bits: 92.0 E(32554): 2.3e-18 Smith-Waterman score: 889; 47.6% identity (59.7% similar) in 290 aa overlap (141-418:233-519) 120 130 140 150 160 170 pF1KE1 SRLQVLQAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGP :. . : .: :: : :: : ::.::: CCDS40 GLKGDNGVGQPGLPGAPGQGGAPGPPGLPGPAGLGKPGLDGLPGAPGDKGESGPPGVPGP 210 220 230 240 250 260 180 190 200 210 220 pF1KE1 PGPPAEKGAKGAMGRDG-----ATGPSGPQGPPGVKGEAGLQGPQG--APGKQGATGTPG : :. : :: : :: :.: ::::: :.::: : .:: : .: : : :: CCDS40 RGEPGAVGPKGPPGVDGVGVPGAAGLPGPQGPSGAKGEPGTRGPPGLIGPTGYGMPGLPG 270 280 290 300 310 320 230 240 250 260 270 280 pF1KE1 PQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFG :.:..: : ::.: .:: : :: :. : : : :. :.::. : : : ::. : CCDS40 PKGDRGPAGVPGLLGDRGEPGEDGEPGEQGPQGLGGPPGLPGSAGLPGRRGPPGPKGEAG 330 340 350 360 370 380 290 300 310 320 330 340 pF1KE1 RPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPG ::::. :: .:::: :: : :: :: : :::.: :: .: :. :. : ::.:: CCDS40 PGGPPGV---PGIRGDQGPSGLAGKPGVPGERGLPGAHGPPGPTGPKGEPGFTGRPGGPG 390 400 410 420 430 350 360 370 380 390 pF1KE1 ATGLKGSKGDTGLQGQQGRKGESGVPG---PAGVKGEQGSPGLAGPKGAPGQAGQ--KGD ..: :.::: :: :: : .: ::.:: ::: : :: ::: : : :: :. :. CCDS40 VAGALGQKGDLGLPGQPGLRGPSGIPGLQGPAGPIGPQGLPGLKGEPGLPGPPGEGRAGE 440 450 460 470 480 490 400 410 420 430 440 450 pF1KE1 QGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSDAIV :. : .: :: : : : CCDS40 PGTAGPTGPPGVPGSPGITGPPGPPGPPGPPGAPGAFDETGIAGLHLPNGGVEGAVLGKG 500 510 520 530 540 550 >>CCDS33350.1 COL5A2 gene_id:1290|Hs108|chr2 (1499 aa) initn: 3133 init1: 835 opt: 886 Z-score: 437.9 bits: 92.2 E(32554): 4.5e-18 Smith-Waterman score: 896; 45.3% identity (59.0% similar) in 300 aa overlap (140-418:415-714) 110 120 130 140 150 160 pF1KE1 ASRLQVLQAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPG .::. : .:.:: .: :. : : :: CCDS33 MKGEAGPTGARGPEGPQGQRGETGPPGPVGSPGLPGAIGTDGTPGAKGPTGSPGTSGPPG 390 400 410 420 430 440 170 180 190 200 210 220 pF1KE1 PPGPPAEKGAKGAMGRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKG :::. : .:. : .: : : : :: ::::: .: : : :: : :: .:..: CCDS33 SAGPPGSPGPQGSTGPQGIRGQPGDPGVPGFKGEAGPKGEPGPHGIQGPIGPPGEEGKRG 450 460 470 480 490 500 230 240 250 260 270 280 pF1KE1 SKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPPG .:: : .:: : .: .: :. :.::: : : :: : :: :..: ::. : :: :: CCDS33 PRGDPGTVGPPGPVGERGAPGNRGFPGSDGLPGPKGAQGERGPVGSSGPKGSQGDPGRPG 510 520 530 540 550 560 290 300 310 320 330 340 pF1KE1 LAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSAGS------PGRAGLPGSPGSPG :.:::.: :.::.:: : : .: :: :.:: :: :: :::: :: : CCDS33 EPGLPGARGLTGNPGVQGPEGKLGPLGAPGEDGRPGPPGSIGIRGQPGSMGLPGPKGSSG 570 580 590 600 610 620 350 360 370 380 pF1KE1 ATGLKGSKGDTGLQGQQGRKGESGVPGP---------AGVKGEQGSPG------LAGPKG : : :..:. ::.: :..: :: :: .:::: :: : :: : CCDS33 DPGKPGEAGNAGVPGQRGAPGKDGEVGPSGPVGPPGLAGERGEQGPPGPTGFQGLPGPPG 630 640 650 660 670 680 390 400 410 420 430 440 pF1KE1 APGQAGQKGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICD ::..:. ::::: :. : : : .:::: CCDS33 PPGEGGKPGDQGVPGDPGAVGPLGPRGERGNPGERGEPGITGLPGEKGMAGGHGPDGPKG 690 700 710 720 730 740 >-- initn: 1568 init1: 804 opt: 839 Z-score: 415.7 bits: 88.1 E(32554): 7.8e-17 Smith-Waterman score: 878; 44.8% identity (57.5% similar) in 306 aa overlap (135-419:740-1045) 110 120 130 140 150 160 pF1KE1 HLAQGASRLQVLQAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGM :. .:: :. : ::::: : :. CCDS33 RGERGNPGERGEPGITGLPGEKGMAGGHGPDGPKGSPGPSGTPGDTGPPGLQGMPGERGI 710 720 730 740 750 760 170 180 190 200 210 pF1KE1 PGAPGPPGPPA---EKGAKGAMGRDGATG---PSGPQGPPGVKGEAGLQGPQGAPGKQGA :.::: : . ::::.:. : ::: : : :: :: : :: : ::.: : :. CCDS33 AGTPGPKGDRGGIGEKGAEGTAGNDGARGLPGPLGPPGPAGPTGEKGEPGPRGLVGPPGS 770 780 790 800 810 820 220 230 240 250 260 270 pF1KE1 TGTPGPQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGD------RGMKGDAGVMGP :.:: .::.: : :. ::.: : : ::. : ::.::: .:. :. : :: CCDS33 RGNPGSRGENGPTGAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGLAGSPGPHGP 830 840 850 860 870 880 280 290 300 310 320 330 pF1KE1 PGAQGSKGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGR :. : :: : :::: .::::. : : :: :.::: : .:.:: .: :: :.:: CCDS33 NGVPGLKGGRGTQGPPGATGFPGSAGRVGPPGPAGAPGPAGPLGEPGKEGPPGLRGDPGS 890 900 910 920 930 940 340 350 360 370 380 pF1KE1 ---------AGLPGSPGSPGATGLKGSKGDTGLQGQQGRKGESGVPGPAGVKGEQGSPGL :: ::.::. : : :. : : : : :. :. : : .::.: ::: CCDS33 HGRVGDRGPAGPPGGPGDKGDPGEDGQPGPDGPPGPAGTTGQRGIVGMPGQRGERGMPGL 950 960 970 980 990 1000 390 400 410 420 430 440 pF1KE1 AGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTW :: :.::..: : : :: : : : .: :: CCDS33 PGPAGTPGKVGPTGATGDKGPPGPVGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGERGD 1010 1020 1030 1040 1050 1060 450 460 470 480 490 500 pF1KE1 GTICDDEWQNSDAIVFCRMLGYSKGRALYKVGAGTGQIWLDNVQCRGTESTLWSCTKNSW CCDS33 RGDPGPAGLPGSQGAPGTPGPVGAPGDAGQRGDPGSRGPIGPPGRAGKRGLPGPQGPRGD 1070 1080 1090 1100 1110 1120 >-- initn: 1877 init1: 641 opt: 704 Z-score: 352.2 bits: 76.3 E(32554): 2.7e-13 Smith-Waterman score: 750; 40.7% identity (56.9% similar) in 297 aa overlap (147-419:112-403) 120 130 140 150 160 170 pF1KE1 QAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAE ::..: ::: . :. : ::: ::: CCDS33 CADPVTPPGECCPVCSQTPGGGNTNFGRGRKGQKGEPGLV--PVVTGIRGRPGPAGPP-- 90 100 110 120 130 180 190 200 210 220 pF1KE1 KGAKGAMGRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQG---------- :..: :. : : ::.:: :. :: :. : :::: : . :::.: CCDS33 -GSQGPRGERGPKGRPGPRGPQGIDGEPGVPGQPGAPGPPGHPSHPGPDGLSRPFSAQMA 140 150 160 170 180 190 230 240 250 260 270 pF1KE1 ---EKGSKGDG-----GLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGS ::.. :. : .:: : : .: .:. : : : : :: : ::: :..: CCDS33 GLDEKSGLGSQVGLMPGSVGPVGPRGPQGLQGQQGGAGPTGPPGEPGDPGPMGPIGSRGP 200 210 220 230 240 250 280 290 300 310 320 330 pF1KE1 KGDFGRPGP---PGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGL .: :.:: :: : :: : :.:: .: :: :: : : .:. : : :..: CCDS33 EGPPGKPGEDGEPGRNGNPGEVGFAGSPGARGFPGAPGLPGLKGHRGHKGLEGPKGEVGA 260 270 280 290 300 310 340 350 360 370 380 390 pF1KE1 PGSPGSPGATGLKGSKGDTG---LQGQQGRKGESGVPGPAGVKGEQGSPGLAGPKGAPGQ ::: : : :: :. : : . :..:: : .:.:: :..: :.:: :: : ::. CCDS33 PGSKGEAGPTGPMGAMGPLGPRGMPGERGRLGPQGAPGQRGAHGMPGKPGPMGPLGIPGS 320 330 340 350 360 370 400 410 420 430 440 450 pF1KE1 AGQKGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQ .: :. :.:: .: :..: .: .:. CCDS33 SGFPGNPGMKGEAGPTGARGPEGPQGQRGETGPPGPVGSPGLPGAIGTDGTPGAKGPTGS 380 390 400 410 420 430 >>CCDS450.1 COL9A2 gene_id:1298|Hs108|chr1 (689 aa) initn: 783 init1: 783 opt: 875 Z-score: 436.6 bits: 90.8 E(32554): 5.4e-18 Smith-Waterman score: 894; 45.0% identity (60.5% similar) in 311 aa overlap (141-444:180-481) 120 130 140 150 160 170 pF1KE1 SRLQVLQAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGP ::: : ::. : :..: .: :: : CCDS45 PPGPPGKPGRPGTIQGLEGSADFLCPTNCPPGMKGPPGLQGVKGHAGKRGILGDPGHQGK 150 160 170 180 190 200 180 190 200 210 220 230 pF1KE1 PGPPAEKGAKGAMGRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGS ::: .. ::.: .: : ::.: .: ::. : : ::.: : :: :. :: ::.: CCDS45 PGPKGDVGASGEQGIPGPPGPQGIRGYPGMAGPKGETGPHGYKGMVGAIGATGPPGEEG- 210 220 230 240 250 260 240 250 260 270 280 290 pF1KE1 KGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPPGL :.: : ::::: : :: .: .:. : :. :::: .:. : : :: : CCDS45 --------PRGPPGRAGEKGDEGSPGIRGPQGITGPKGATGPPGINGKDGTPGTPGMKGS 270 280 290 300 310 320 300 310 320 330 340 350 pF1KE1 AGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGLKGS :: : :. :. :: :::: ::. : :: .:::: : :: .: ::. : :: : : CCDS45 AGQAGQPGSPGHQGLAGVPGQPGTKGGPGDQGEPGPQGLPGFSGPPGKEGEPGPRGEIGP 330 340 350 360 370 380 360 370 380 390 400 pF1KE1 KGDTGLQGQQGRKGESGVPGPAGV---KGEQGSPGLAGPKGAPGQAGQKGDQGV---KGS .: : .:.::..: : ::: : ::::: ::. ::.: :: :.::. : .:. CCDS45 QGIMGQKGDQGERGPVGQPGPQGRQGPKGEQGPPGIPGPQGLPGVKGDKGSPGKTGPRGK 390 400 410 420 430 440 410 420 430 440 450 460 pF1KE1 SGEQGVKGEKGERGENSVSVRIVGSSNRG-RAEVYYSGTWGTICDDEWQNSDAIVFCRML :. :: : ::.::.. : . ....: :.: : : : CCDS45 VGDPGVAGLPGEKGEKGESGEPGPKGQQGVRGEPGYPGPSGDAGAPGVQGYPGPPGPRGL 450 460 470 480 490 500 470 480 490 500 510 520 pF1KE1 GYSKGRALYKVGAGTGQIWLDNVQCRGTESTLWSCTKNSWGHHDCSHEEDAGVECSV CCDS45 AGNRGVPGQPGRQGVEGRDATDQHIVDVALKMLQEQLAEVAVSAKREALGAVGMMGPPGP 510 520 530 540 550 560 >>CCDS47447.1 COL9A1 gene_id:1297|Hs108|chr6 (678 aa) initn: 827 init1: 827 opt: 868 Z-score: 433.4 bits: 90.2 E(32554): 8.1e-18 Smith-Waterman score: 874; 46.8% identity (63.7% similar) in 278 aa overlap (148-419:178-446) 120 130 140 150 160 170 pF1KE1 AQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEK : : ::..::::: : : :: : .:. CCDS47 GPPGPPGPRGTIGFHDGDPLCPNACPPGRSGYPGLPGMRGHKGAKGEIGEPGRQGHKGEE 150 160 170 180 190 200 180 190 200 210 220 230 pF1KE1 GAKGAMGRDGATGPSGPQGPPGV------KGEAGLQGPQGAPGKQGATGTPGPQGEKGSK : .: .:. :: :: : :: :. ::: : .: .: :: :: :.:: ::..: CCDS47 GDQGELGEVGAQGPPGAQGLRGITGIVGDKGEKGARGLDGEPGPQGLPGAPGDQGQRGPP 210 220 230 240 250 260 240 250 260 270 280 290 pF1KE1 GDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLA :..: :::. :..: .: :::: ::: :. : : : :: :.::. :.::::: : CCDS47 GEAG---PKGDRGAEGARGIPGLPGPKGDTGLPGVDGRDGIPGMPGTKGEPGKPGPPGDA 270 280 290 300 310 320 300 310 320 330 340 350 pF1KE1 GFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGLKGSK :. :: ::. :.:: :..:. :. : ::. :. : .: ::. : :: .: .: . CCDS47 GL------QGLPGVPGIPGAKGVAGEKGSTGAPGKPGQMGNSGKPGQQGPPGEVGPRGPQ 330 340 350 360 370 360 370 380 390 400 410 pF1KE1 GDTGLQGQQGRKGESGVPGPAGVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVK : : .:. : : :.:: : : : ::: :: : ::. :..: : : .::::.. CCDS47 GLPGSRGELGPVGSPGLPGKLGSLGSPGLPGLPGPPGLPGMKGDRGVVGEPGPKGEQGAS 380 390 400 410 420 430 420 430 440 450 460 470 pF1KE1 GEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSDAIVFCRMLGYSKGRAL ::.:: :: CCDS47 GEEGEAGERGELGDIGLPGPKGSAGNPGEPGLRGPEGSRGLPGVEGPRGPPGPRGVQGEQ 440 450 460 470 480 490 >>CCDS75932.1 COL5A1 gene_id:1289|Hs108|chr9 (1838 aa) initn: 3985 init1: 847 opt: 877 Z-score: 432.6 bits: 91.5 E(32554): 8.9e-18 Smith-Waterman score: 899; 45.0% identity (56.9% similar) in 327 aa overlap (147-446:1203-1527) 120 130 140 150 160 170 pF1KE1 QAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGP------ .:.:: : .: .: :.:: ::: CCDS75 KGEQGPPGPTGPQGPIGQPGPSGADGEPGPRGQQGLFGQKGDEGPRGFPGPPGPVGLQGL 1180 1190 1200 1210 1220 1230 180 190 200 210 220 pF1KE1 PGPPAEKGAKGAMGRDGATGPSGPQGPPGVKGEAGLQGPQGA---PGKQGATGTPGPQGE ::::.::: : .:. : :: ::.:: :. : : ::: :. :: : : :: :: CCDS75 PGPPGEKGETGDVGQMGPPGPPGPRGPSGAPGADGPQGPPGGIGNPGAVGEKGEPGEAGE 1240 1250 1260 1270 1280 1290 230 240 250 260 270 280 pF1KE1 KGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGP : :.:: ::::: : :::.: : : : .: :: : : :: : :: : :: CCDS75 PGLPGEGGPPGPKGERGEKGESGPSGAAGPPGPKGPPGDDGPKGSPGPVGFPGDPGPPGE 1300 1310 1320 1330 1340 1350 290 300 310 320 330 340 pF1KE1 PGLAGF---PGAKGDQGQPGLQGVPGP---PGAVGHPGAKGEPGSAGSPGRAGLPGSPGS :: :: :: :::.:.:: : ::: :: : :: .: :: :: :: : :. : CCDS75 PGPAGQDGPPGDKGDDGEPGQTGSPGPTGEPGPSGPPGKRGPPGPAGPEGRQGEKGAKGE 1360 1370 1380 1390 1400 1410 350 360 370 380 390 pF1KE1 PGATGLKGSKGDTGLQGQQGRKGESG---VPGPAGVKGEQGSPGLAGPKGA---PGQAGQ : : :. : : :: :. : .: .:::.: .: :::: :: : :: : CCDS75 AGLEGPPGKTGPIGPQGAPGKPGPDGLRGIPGPVGEQGLPGSPGPDGPPGPMGPPGLPGL 1420 1430 1440 1450 1460 1470 400 410 420 430 440 pF1KE1 KGDQGVKGSSGEQGV------KGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICDD :::.: :: .:. :. ::.::.:. .. .:. .:. . .: : : CCDS75 KGDSGPKGEKGHPGLIGLIGPPGEQGEKGDRGLPGPQGSSGPKGEQGI--TGPSGPIGPP 1480 1490 1500 1510 1520 1530 450 460 470 480 490 500 pF1KE1 EWQNSDAIVFCRMLGYSKGRALYKVGAGTGQIWLDNVQCRGTESTLWSCTKNSWGHHDCS CCDS75 GPPGLPGPPGPKGAKGSSGPTGPKGEAGHPGPPGPPGPPGEVIQPLPIQASRTRRNIDAS 1540 1550 1560 1570 1580 1590 >-- initn: 4632 init1: 822 opt: 839 Z-score: 414.7 bits: 88.2 E(32554): 8.8e-17 Smith-Waterman score: 883; 48.1% identity (58.8% similar) in 291 aa overlap (147-422:714-998) 120 130 140 150 160 170 pF1KE1 QAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAE .:: : :: ::. ::.:.:: : :::.: CCDS75 LGPKGPPGPPGPPGVTGMDGQPGPKGNVGPQGEPGPPGQQGNPGAQGLPGPQGAIGPPGE 690 700 710 720 730 740 180 190 200 210 220 230 pF1KE1 KGAKGAMGRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGL :: : : : : .:: : :: .: : .: :: :: :: : :::.: ::. : :: CCDS75 KGPLGKPGLPGMPGADGPPGHPGKEGPPGEKGGQGPPGPQGPIGYPGPRGVKGADGIRGL 750 760 770 780 790 800 240 250 260 270 280 290 pF1KE1 IGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGS---KGDFGRPGP---PGL :::::::. :.:: ::: :.::: : .:::: .: .: :: :: :: CCDS75 ------KGTKGEKGEDGFPGFKGDMGIKGDRGEIGPPGPRGEDGPEGPKGRGGPNGDPGP 810 820 830 840 850 300 310 320 330 340 pF1KE1 AGFPGAKGDQGQPGLQGVPG---PPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGL : :: :: : ::: : :: : :..: :: : : :. : : :: :. : :: CCDS75 LGPPGEKGKLGVPGLPGYPGRQGPKGSIGFPGFPGANGEKGGRGTPGKPGPRGQRGPTGP 860 870 880 890 900 910 350 360 370 380 390 400 pF1KE1 KGSKGDTGLQGQQGRKGESGVPGPAGVKGE------QGSPGLAGPKGAPGQAGQKGDQGV .: .: :. :. : ::.:: :::: :: :: :. :::: :: :. : : CCDS75 RGERGPRGITGKPGPKGNSGGDGPAGPPGERGPNGPQGPTGFPGPKGPPGPPGKDGLPGH 920 930 940 950 960 970 410 420 430 440 450 460 pF1KE1 KGSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSDAIVFCR :. :: : .:. : : .: CCDS75 PGQRGETGFQGKTGPPGPPGVVGPQGPTGETGPMGERGHPGPPGPPGEQGLPGLAGKEGT 980 990 1000 1010 1020 1030 >>CCDS6982.1 COL5A1 gene_id:1289|Hs108|chr9 (1838 aa) initn: 3985 init1: 847 opt: 877 Z-score: 432.6 bits: 91.5 E(32554): 8.9e-18 Smith-Waterman score: 899; 45.0% identity (56.9% similar) in 327 aa overlap (147-446:1203-1527) 120 130 140 150 160 170 pF1KE1 QAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGP------ .:.:: : .: .: :.:: ::: CCDS69 KGEQGPPGPTGPQGPIGQPGPSGADGEPGPRGQQGLFGQKGDEGPRGFPGPPGPVGLQGL 1180 1190 1200 1210 1220 1230 180 190 200 210 220 pF1KE1 PGPPAEKGAKGAMGRDGATGPSGPQGPPGVKGEAGLQGPQGA---PGKQGATGTPGPQGE ::::.::: : .:. : :: ::.:: :. : : ::: :. :: : : :: :: CCDS69 PGPPGEKGETGDVGQMGPPGPPGPRGPSGAPGADGPQGPPGGIGNPGAVGEKGEPGEAGE 1240 1250 1260 1270 1280 1290 230 240 250 260 270 280 pF1KE1 KGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGP : :.:: ::::: : :::.: : : : .: :: : : :: : :: : :: CCDS69 PGLPGEGGPPGPKGERGEKGESGPSGAAGPPGPKGPPGDDGPKGSPGPVGFPGDPGPPGE 1300 1310 1320 1330 1340 1350 290 300 310 320 330 340 pF1KE1 PGLAGF---PGAKGDQGQPGLQGVPGP---PGAVGHPGAKGEPGSAGSPGRAGLPGSPGS :: :: :: :::.:.:: : ::: :: : :: .: :: :: :: : :. : CCDS69 PGPAGQDGPPGDKGDDGEPGQTGSPGPTGEPGPSGPPGKRGPPGPAGPEGRQGEKGAKGE 1360 1370 1380 1390 1400 1410 350 360 370 380 390 pF1KE1 PGATGLKGSKGDTGLQGQQGRKGESG---VPGPAGVKGEQGSPGLAGPKGA---PGQAGQ : : :. : : :: :. : .: .:::.: .: :::: :: : :: : CCDS69 AGLEGPPGKTGPIGPQGAPGKPGPDGLRGIPGPVGEQGLPGSPGPDGPPGPMGPPGLPGL 1420 1430 1440 1450 1460 1470 400 410 420 430 440 pF1KE1 KGDQGVKGSSGEQGV------KGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICDD :::.: :: .:. :. ::.::.:. .. .:. .:. . .: : : CCDS69 KGDSGPKGEKGHPGLIGLIGPPGEQGEKGDRGLPGPQGSSGPKGEQGI--TGPSGPIGPP 1480 1490 1500 1510 1520 1530 450 460 470 480 490 500 pF1KE1 EWQNSDAIVFCRMLGYSKGRALYKVGAGTGQIWLDNVQCRGTESTLWSCTKNSWGHHDCS CCDS69 GPPGLPGPPGPKGAKGSSGPTGPKGEAGHPGPPGPPGPPGEVIQPLPIQASRTRRNIDAS 1540 1550 1560 1570 1580 1590 >-- initn: 4632 init1: 822 opt: 839 Z-score: 414.7 bits: 88.2 E(32554): 8.8e-17 Smith-Waterman score: 883; 48.1% identity (58.8% similar) in 291 aa overlap (147-422:714-998) 120 130 140 150 160 170 pF1KE1 QAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAE .:: : :: ::. ::.:.:: : :::.: CCDS69 LGPKGPPGPPGPPGVTGMDGQPGPKGNVGPQGEPGPPGQQGNPGAQGLPGPQGAIGPPGE 690 700 710 720 730 740 180 190 200 210 220 230 pF1KE1 KGAKGAMGRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGL :: : : : : .:: : :: .: : .: :: :: :: : :::.: ::. : :: CCDS69 KGPLGKPGLPGMPGADGPPGHPGKEGPPGEKGGQGPPGPQGPIGYPGPRGVKGADGIRGL 750 760 770 780 790 800 240 250 260 270 280 290 pF1KE1 IGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGS---KGDFGRPGP---PGL :::::::. :.:: ::: :.::: : .:::: .: .: :: :: :: CCDS69 ------KGTKGEKGEDGFPGFKGDMGIKGDRGEIGPPGPRGEDGPEGPKGRGGPNGDPGP 810 820 830 840 850 300 310 320 330 340 pF1KE1 AGFPGAKGDQGQPGLQGVPG---PPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGL : :: :: : ::: : :: : :..: :: : : :. : : :: :. : :: CCDS69 LGPPGEKGKLGVPGLPGYPGRQGPKGSIGFPGFPGANGEKGGRGTPGKPGPRGQRGPTGP 860 870 880 890 900 910 350 360 370 380 390 400 pF1KE1 KGSKGDTGLQGQQGRKGESGVPGPAGVKGE------QGSPGLAGPKGAPGQAGQKGDQGV .: .: :. :. : ::.:: :::: :: :: :. :::: :: :. : : CCDS69 RGERGPRGITGKPGPKGNSGGDGPAGPPGERGPNGPQGPTGFPGPKGPPGPPGKDGLPGH 920 930 940 950 960 970 410 420 430 440 450 460 pF1KE1 KGSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSDAIVFCR :. :: : .:. : : .: CCDS69 PGQRGETGFQGKTGPPGPPGVVGPQGPTGETGPMGERGHPGPPGPPGEQGLPGLAGKEGT 980 990 1000 1010 1020 1030 520 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 22:50:03 2016 done: Sun Nov 6 22:50:04 2016 Total Scan time: 3.590 Total Display time: 0.130 Function used was FASTA [36.3.4 Apr, 2011]