FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB7280, 1336 aa 1>>>pF1KB7280 1336 - 1336 aa - 1336 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 11.7938+/-0.0014; mu= 0.5222+/- 0.085 mean_var=681.7630+/-143.329, 0's: 0 Z-trim(115.5): 225 B-trim: 420 in 1/52 Lambda= 0.049120 statistics sampled from 15891 (16111) to 15891 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.739), E-opt: 0.2 (0.495), width: 16 Scan time: 7.060 The best scores are: opt bits E(32554) CCDS42971.1 COL18A1 gene_id:80781|Hs108|chr21 (1339) 9734 706.4 1.3e-202 CCDS42972.1 COL18A1 gene_id:80781|Hs108|chr21 (1519) 9493 689.4 2e-197 CCDS77643.1 COL18A1 gene_id:80781|Hs108|chr21 (1754) 9480 688.5 4.1e-197 CCDS2297.1 COL3A1 gene_id:1281|Hs108|chr2 (1466) 1946 154.5 1.9e-36 CCDS75932.1 COL5A1 gene_id:1289|Hs108|chr9 (1838) 1919 152.7 8.3e-36 CCDS6982.1 COL5A1 gene_id:1289|Hs108|chr9 (1838) 1919 152.7 8.3e-36 CCDS35366.1 COL4A5 gene_id:1287|Hs108|chrX (1691) 1895 151.0 2.6e-35 CCDS780.2 COL11A1 gene_id:1301|Hs108|chr1 (1690) 1875 149.6 6.9e-35 CCDS53348.1 COL11A1 gene_id:1301|Hs108|chr1 (1767) 1875 149.6 7.1e-35 CCDS778.1 COL11A1 gene_id:1301|Hs108|chr1 (1806) 1875 149.6 7.2e-35 CCDS33350.1 COL5A2 gene_id:1290|Hs108|chr2 (1499) 1852 147.9 2e-34 CCDS14543.1 COL4A5 gene_id:1287|Hs108|chrX (1685) 1810 145.0 1.7e-33 CCDS42828.1 COL4A4 gene_id:1286|Hs108|chr2 (1690) 1802 144.4 2.5e-33 CCDS9511.1 COL4A1 gene_id:1282|Hs108|chr13 (1669) 1784 143.1 6e-33 CCDS42829.1 COL4A3 gene_id:1285|Hs108|chr2 (1670) 1780 142.8 7.3e-33 CCDS43452.1 COL11A2 gene_id:1302|Hs108|chr6 (1650) 1759 141.3 2e-32 CCDS34682.1 COL1A2 gene_id:1278|Hs108|chr7 (1366) 1722 138.6 1.1e-31 CCDS12222.1 COL5A3 gene_id:50509|Hs108|chr19 (1745) 1706 137.6 2.8e-31 CCDS6802.1 COL27A1 gene_id:85301|Hs108|chr9 (1860) 1705 137.6 3.1e-31 CCDS2773.1 COL7A1 gene_id:1294|Hs108|chr3 (2944) 1697 137.3 6e-31 CCDS41353.1 COL24A1 gene_id:255631|Hs108|chr1 (1714) 1646 133.4 5.3e-30 CCDS41907.1 COL4A2 gene_id:1284|Hs108|chr13 (1712) 1603 130.3 4.4e-29 CCDS41778.1 COL2A1 gene_id:1280|Hs108|chr12 (1487) 1561 127.2 3.2e-28 CCDS8759.1 COL2A1 gene_id:1280|Hs108|chr12 (1418) 1553 126.7 4.6e-28 CCDS11561.1 COL1A1 gene_id:1277|Hs108|chr17 (1464) 1549 126.4 5.7e-28 CCDS450.1 COL9A2 gene_id:1298|Hs108|chr1 ( 689) 1539 125.2 6e-28 CCDS76008.1 COL4A6 gene_id:1288|Hs108|chrX (1633) 1535 125.5 1.2e-27 CCDS76009.1 COL4A6 gene_id:1288|Hs108|chrX (1666) 1535 125.5 1.2e-27 CCDS14542.1 COL4A6 gene_id:1288|Hs108|chrX (1690) 1535 125.5 1.2e-27 CCDS14541.1 COL4A6 gene_id:1288|Hs108|chrX (1691) 1535 125.5 1.2e-27 CCDS35081.1 COL15A1 gene_id:1306|Hs108|chr9 (1388) 1509 123.5 4e-27 CCDS41297.1 COL16A1 gene_id:1307|Hs108|chr1 (1604) 1509 123.6 4.3e-27 CCDS76010.1 COL4A6 gene_id:1288|Hs108|chrX (1707) 1456 119.9 6e-26 CCDS43553.1 COL28A1 gene_id:340267|Hs108|chr7 (1125) 1326 110.4 2.8e-23 CCDS47447.1 COL9A1 gene_id:1297|Hs108|chr6 ( 678) 1283 107.1 1.7e-22 CCDS4971.1 COL9A1 gene_id:1297|Hs108|chr6 ( 921) 1285 107.4 1.9e-22 CCDS13505.1 COL9A3 gene_id:1299|Hs108|chr20 ( 684) 1270 106.2 3.3e-22 CCDS44419.1 COL13A1 gene_id:1305|Hs108|chr10 ( 717) 1215 102.3 5e-21 CCDS58922.1 COL25A1 gene_id:84570|Hs108|chr4 ( 645) 1187 100.2 1.9e-20 CCDS4970.1 COL19A1 gene_id:1310|Hs108|chr6 (1142) 1185 100.4 2.9e-20 CCDS7554.1 COL17A1 gene_id:1308|Hs108|chr10 (1497) 1170 99.5 7e-20 CCDS83099.1 COL21A1 gene_id:81578|Hs108|chr6 ( 954) 1144 97.4 1.9e-19 CCDS55025.1 COL21A1 gene_id:81578|Hs108|chr6 ( 957) 1144 97.4 2e-19 CCDS43258.1 COL25A1 gene_id:84570|Hs108|chr4 ( 654) 1125 95.9 4e-19 CCDS4436.1 COL23A1 gene_id:91522|Hs108|chr5 ( 540) 1110 94.7 7.4e-19 CCDS76649.1 COL4A1 gene_id:1282|Hs108|chr13 ( 519) 1104 94.2 9.8e-19 CCDS6376.1 COL22A1 gene_id:169044|Hs108|chr8 (1626) 1110 95.3 1.4e-18 CCDS44427.2 COL13A1 gene_id:1305|Hs108|chr10 ( 645) 1068 91.8 6.5e-18 CCDS43259.1 COL25A1 gene_id:84570|Hs108|chr4 ( 642) 1067 91.7 6.8e-18 CCDS2934.1 COL8A1 gene_id:1295|Hs108|chr3 ( 744) 1006 87.5 1.5e-16 >>CCDS42971.1 COL18A1 gene_id:80781|Hs108|chr21 (1339 aa) initn: 7194 init1: 7194 opt: 9734 Z-score: 3750.8 bits: 706.4 E(32554): 1.3e-202 Smith-Waterman score: 9734; 99.8% identity (99.8% similar) in 1339 aa overlap (1-1336:1-1339) 10 20 30 40 50 60 pF1KB7 MAPRCPWPWPRRRRLLDVLAPLVLLLGVRAASAEPERISEEVGLLQLLGDPPPQQVTQTD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 MAPRCPWPWPRRRRLLDVLAPLVLLLGVRAASAEPERISEEVGLLQLLGDPPPQQVTQTD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 DPDVGLAYVFGPDANSGQVARYHFPSLFFRDFSLLFHIRPATEGPGVLFAITDSAQAMVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 DPDVGLAYVFGPDANSGQVARYHFPSLFFRDFSLLFHIRPATEGPGVLFAITDSAQAMVL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 LGVKLSGVQDGHQDISLLYTEPGAGQTHTAASFRLPAFVGQWTHLALSVAGGFVALYVDC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 LGVKLSGVQDGHQDISLLYTEPGAGQTHTAASFRLPAFVGQWTHLALSVAGGFVALYVDC 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 EEFQRMPLARSSRGLELEPGAGLFVAQAGGADPDKFQGVIAELKVRRDPQVSPMHCLDEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 EEFQRMPLARSSRGLELEPGAGLFVAQAGGADPDKFQGVIAELKVRRDPQVSPMHCLDEE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB7 GDDSDGASGDSGSGLGDARELLREETGAALKPRLPAPPPVTTPPLAGGSSTEDSRSEEVE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 GDDSDGASGDSGSGLGDARELLREETGAALKPRLPAPPPVTTPPLAGGSSTEDSRSEEVE 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB7 EQTTVASLGAQTLPGSDSVSTWDGSVRTPGGRVKEGGLKGQKGEPGVPGPPGRAGPPGSP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 EQTTVASLGAQTLPGSDSVSTWDGSVRTPGGRVKEGGLKGQKGEPGVPGPPGRAGPPGSP 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB7 CLPGPPGLPCPVSPLGPAGPALQTVPGPQGPPGPPGRDGTPGRDGEPGDPGEDGKPGDTG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 CLPGPPGLPCPVSPLGPAGPALQTVPGPQGPPGPPGRDGTPGRDGEPGDPGEDGKPGDTG 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB7 PQGFPGTPGDVGPKGDKGDPGVGERGPPGPQGPPGPPGPSFRHDKLTFIDMEGSGFGGDL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 PQGFPGTPGDVGPKGDKGDPGVGERGPPGPQGPPGPPGPSFRHDKLTFIDMEGSGFGGDL 430 440 450 460 470 480 490 500 510 520 530 540 pF1KB7 EALRGPRGFPGPPGPPGVPGLPGEPGRFGVNSSDVPGPAGLPGVPGREGPPGFPGLPGPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 EALRGPRGFPGPPGPPGVPGLPGEPGRFGVNSSDVPGPAGLPGVPGREGPPGFPGLPGPP 490 500 510 520 530 540 550 560 570 580 590 600 pF1KB7 GPPGREGPPGRTGQKGSLGEAGAPGHKGSKGAPGPAGARGESGLAGAPGPAGPPGPPGPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 GPPGREGPPGRTGQKGSLGEAGAPGHKGSKGAPGPAGARGESGLAGAPGPAGPPGPPGPP 550 560 570 580 590 600 610 620 630 640 650 660 pF1KB7 GPPGPGLPAGFDDMEGSGGPFWSTARSADGPQGPPGLPGLKGDPGVPGLPGAKGEVGADG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 GPPGPGLPAGFDDMEGSGGPFWSTARSADGPQGPPGLPGLKGDPGVPGLPGAKGEVGADG 610 620 630 640 650 660 670 680 690 700 710 720 pF1KB7 VPGFPGLPGREGIAGPQGPKGDRGSRGEKGDPGKDGVGQPGLPGPPGPPGPVVYVSEQDG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 VPGFPGLPGREGIAGPQGPKGDRGSRGEKGDPGKDGVGQPGLPGPPGPPGPVVYVSEQDG 670 680 690 700 710 720 730 740 750 760 770 780 pF1KB7 SVLSVPGPEGRPGFAGFPGPAGPKGNLGSKGERGSPGPKGEKGEPGSIFSPDGGALGPAQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 SVLSVPGPEGRPGFAGFPGPAGPKGNLGSKGERGSPGPKGEKGEPGSIFSPDGGALGPAQ 730 740 750 760 770 780 790 800 810 820 830 840 pF1KB7 KGAKGEPGFRGPPGPYGRPGYKGEIGFPGRPGRPGMNGLKGEKGEPGDASLGFGMRGMPG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 KGAKGEPGFRGPPGPYGRPGYKGEIGFPGRPGRPGMNGLKGEKGEPGDASLGFGMRGMPG 790 800 810 820 830 840 850 860 870 880 890 900 pF1KB7 PPGPPGPPGPPGTPVYDSNVFAESSRPGPPGLPGNQGPPGPKGAKGEVGPPGPPGQFPFD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 PPGPPGPPGPPGTPVYDSNVFAESSRPGPPGLPGNQGPPGPKGAKGEVGPPGPPGQFPFD 850 860 870 880 890 900 910 920 930 940 950 pF1KB7 FLQLEAEMKGEKGDRGDAGQKGERGEPGGGGFFGSSLPGPPGPPGP---RGYPGIPGPKG :::::::::::::::::::::::::::::::::::::::::::::: ::::::::::: CCDS42 FLQLEAEMKGEKGDRGDAGQKGERGEPGGGGFFGSSLPGPPGPPGPPGPRGYPGIPGPKG 910 920 930 940 950 960 960 970 980 990 1000 1010 pF1KB7 ESIRGQPGPPGPQGPPGIGYEGRQGPPGPPGPPGPPSFPGPHRQTISVPGPPGPPGPPGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 ESIRGQPGPPGPQGPPGIGYEGRQGPPGPPGPPGPPSFPGPHRQTISVPGPPGPPGPPGP 970 980 990 1000 1010 1020 1020 1030 1040 1050 1060 1070 pF1KB7 PGTMGASSGVRLWATRQAMLGQVHEVPEGWLIFVAEQEELYVRVQNGFRKVQLEARTPLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 PGTMGASSGVRLWATRQAMLGQVHEVPEGWLIFVAEQEELYVRVQNGFRKVQLEARTPLP 1030 1040 1050 1060 1070 1080 1080 1090 1100 1110 1120 1130 pF1KB7 RGTDNEVAALQPPVVQLHDSNPYPRREHPHPTARPWRADDILASPPRLPEPQPYPGAPHH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 RGTDNEVAALQPPVVQLHDSNPYPRREHPHPTARPWRADDILASPPRLPEPQPYPGAPHH 1090 1100 1110 1120 1130 1140 1140 1150 1160 1170 1180 1190 pF1KB7 SSYVHLRPARPTSPPAHSHRDFQPVLHLVALNSPLSGGMRGIRGADFQCFQQARAVGLAG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 SSYVHLRPARPTSPPAHSHRDFQPVLHLVALNSPLSGGMRGIRGADFQCFQQARAVGLAG 1150 1160 1170 1180 1190 1200 1200 1210 1220 1230 1240 1250 pF1KB7 TFRAFLSSRLQDLYSIVRRADRAAVPIVNLKDELLFPSWEALFSGSEGPLKPGARIFSFD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 TFRAFLSSRLQDLYSIVRRADRAAVPIVNLKDELLFPSWEALFSGSEGPLKPGARIFSFD 1210 1220 1230 1240 1250 1260 1260 1270 1280 1290 1300 1310 pF1KB7 GKDVLRHPTWPQKSVWHGSDPNGRRLTESYCETWRTEAPSATGQASSLLGGRLLGQSAAS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 GKDVLRHPTWPQKSVWHGSDPNGRRLTESYCETWRTEAPSATGQASSLLGGRLLGQSAAS 1270 1280 1290 1300 1310 1320 1320 1330 pF1KB7 CHHAYIVLCIENSFMTASK ::::::::::::::::::: CCDS42 CHHAYIVLCIENSFMTASK 1330 >>CCDS42972.1 COL18A1 gene_id:80781|Hs108|chr21 (1519 aa) initn: 6940 init1: 6940 opt: 9493 Z-score: 3657.9 bits: 689.4 E(32554): 2e-197 Smith-Waterman score: 9493; 98.1% identity (98.3% similar) in 1334 aa overlap (6-1336:189-1519) 10 20 30 pF1KB7 MAPRCPWPWPRRRRLLDVLAPLVLLLGVRAASAEP : : : : :::. : . CCDS42 STPQENGTTLWPSRGIPSSPGAHTTEAGTLPAPTPSPPSLGRPWAPLT---GPSVPPPSS 160 170 180 190 200 210 40 50 60 70 80 90 pF1KB7 ERISEEVGLLQLLGDPPPQQVTQTDDPDVGLAYVFGPDANSGQVARYHFPSLFFRDFSLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 ERISEEVGLLQLLGDPPPQQVTQTDDPDVGLAYVFGPDANSGQVARYHFPSLFFRDFSLL 220 230 240 250 260 270 100 110 120 130 140 150 pF1KB7 FHIRPATEGPGVLFAITDSAQAMVLLGVKLSGVQDGHQDISLLYTEPGAGQTHTAASFRL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 FHIRPATEGPGVLFAITDSAQAMVLLGVKLSGVQDGHQDISLLYTEPGAGQTHTAASFRL 280 290 300 310 320 330 160 170 180 190 200 210 pF1KB7 PAFVGQWTHLALSVAGGFVALYVDCEEFQRMPLARSSRGLELEPGAGLFVAQAGGADPDK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 PAFVGQWTHLALSVAGGFVALYVDCEEFQRMPLARSSRGLELEPGAGLFVAQAGGADPDK 340 350 360 370 380 390 220 230 240 250 260 270 pF1KB7 FQGVIAELKVRRDPQVSPMHCLDEEGDDSDGASGDSGSGLGDARELLREETGAALKPRLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 FQGVIAELKVRRDPQVSPMHCLDEEGDDSDGASGDSGSGLGDARELLREETGAALKPRLP 400 410 420 430 440 450 280 290 300 310 320 330 pF1KB7 APPPVTTPPLAGGSSTEDSRSEEVEEQTTVASLGAQTLPGSDSVSTWDGSVRTPGGRVKE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 APPPVTTPPLAGGSSTEDSRSEEVEEQTTVASLGAQTLPGSDSVSTWDGSVRTPGGRVKE 460 470 480 490 500 510 340 350 360 370 380 390 pF1KB7 GGLKGQKGEPGVPGPPGRAGPPGSPCLPGPPGLPCPVSPLGPAGPALQTVPGPQGPPGPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 GGLKGQKGEPGVPGPPGRAGPPGSPCLPGPPGLPCPVSPLGPAGPALQTVPGPQGPPGPP 520 530 540 550 560 570 400 410 420 430 440 450 pF1KB7 GRDGTPGRDGEPGDPGEDGKPGDTGPQGFPGTPGDVGPKGDKGDPGVGERGPPGPQGPPG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 GRDGTPGRDGEPGDPGEDGKPGDTGPQGFPGTPGDVGPKGDKGDPGVGERGPPGPQGPPG 580 590 600 610 620 630 460 470 480 490 500 510 pF1KB7 PPGPSFRHDKLTFIDMEGSGFGGDLEALRGPRGFPGPPGPPGVPGLPGEPGRFGVNSSDV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 PPGPSFRHDKLTFIDMEGSGFGGDLEALRGPRGFPGPPGPPGVPGLPGEPGRFGVNSSDV 640 650 660 670 680 690 520 530 540 550 560 570 pF1KB7 PGPAGLPGVPGREGPPGFPGLPGPPGPPGREGPPGRTGQKGSLGEAGAPGHKGSKGAPGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 PGPAGLPGVPGREGPPGFPGLPGPPGPPGREGPPGRTGQKGSLGEAGAPGHKGSKGAPGP 700 710 720 730 740 750 580 590 600 610 620 630 pF1KB7 AGARGESGLAGAPGPAGPPGPPGPPGPPGPGLPAGFDDMEGSGGPFWSTARSADGPQGPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 AGARGESGLAGAPGPAGPPGPPGPPGPPGPGLPAGFDDMEGSGGPFWSTARSADGPQGPP 760 770 780 790 800 810 640 650 660 670 680 690 pF1KB7 GLPGLKGDPGVPGLPGAKGEVGADGVPGFPGLPGREGIAGPQGPKGDRGSRGEKGDPGKD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 GLPGLKGDPGVPGLPGAKGEVGADGVPGFPGLPGREGIAGPQGPKGDRGSRGEKGDPGKD 820 830 840 850 860 870 700 710 720 730 740 750 pF1KB7 GVGQPGLPGPPGPPGPVVYVSEQDGSVLSVPGPEGRPGFAGFPGPAGPKGNLGSKGERGS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 GVGQPGLPGPPGPPGPVVYVSEQDGSVLSVPGPEGRPGFAGFPGPAGPKGNLGSKGERGS 880 890 900 910 920 930 760 770 780 790 800 810 pF1KB7 PGPKGEKGEPGSIFSPDGGALGPAQKGAKGEPGFRGPPGPYGRPGYKGEIGFPGRPGRPG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 PGPKGEKGEPGSIFSPDGGALGPAQKGAKGEPGFRGPPGPYGRPGYKGEIGFPGRPGRPG 940 950 960 970 980 990 820 830 840 850 860 870 pF1KB7 MNGLKGEKGEPGDASLGFGMRGMPGPPGPPGPPGPPGTPVYDSNVFAESSRPGPPGLPGN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 MNGLKGEKGEPGDASLGFGMRGMPGPPGPPGPPGPPGTPVYDSNVFAESSRPGPPGLPGN 1000 1010 1020 1030 1040 1050 880 890 900 910 920 930 pF1KB7 QGPPGPKGAKGEVGPPGPPGQFPFDFLQLEAEMKGEKGDRGDAGQKGERGEPGGGGFFGS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 QGPPGPKGAKGEVGPPGPPGQFPFDFLQLEAEMKGEKGDRGDAGQKGERGEPGGGGFFGS 1060 1070 1080 1090 1100 1110 940 950 960 970 980 990 pF1KB7 SLPGPPGPPGP---RGYPGIPGPKGESIRGQPGPPGPQGPPGIGYEGRQGPPGPPGPPGP ::::::::::: :::::::::::::::::::::::::::::::::::::::::::::: CCDS42 SLPGPPGPPGPPGPRGYPGIPGPKGESIRGQPGPPGPQGPPGIGYEGRQGPPGPPGPPGP 1120 1130 1140 1150 1160 1170 1000 1010 1020 1030 1040 1050 pF1KB7 PSFPGPHRQTISVPGPPGPPGPPGPPGTMGASSGVRLWATRQAMLGQVHEVPEGWLIFVA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 PSFPGPHRQTISVPGPPGPPGPPGPPGTMGASSGVRLWATRQAMLGQVHEVPEGWLIFVA 1180 1190 1200 1210 1220 1230 1060 1070 1080 1090 1100 1110 pF1KB7 EQEELYVRVQNGFRKVQLEARTPLPRGTDNEVAALQPPVVQLHDSNPYPRREHPHPTARP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 EQEELYVRVQNGFRKVQLEARTPLPRGTDNEVAALQPPVVQLHDSNPYPRREHPHPTARP 1240 1250 1260 1270 1280 1290 1120 1130 1140 1150 1160 1170 pF1KB7 WRADDILASPPRLPEPQPYPGAPHHSSYVHLRPARPTSPPAHSHRDFQPVLHLVALNSPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 WRADDILASPPRLPEPQPYPGAPHHSSYVHLRPARPTSPPAHSHRDFQPVLHLVALNSPL 1300 1310 1320 1330 1340 1350 1180 1190 1200 1210 1220 1230 pF1KB7 SGGMRGIRGADFQCFQQARAVGLAGTFRAFLSSRLQDLYSIVRRADRAAVPIVNLKDELL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 SGGMRGIRGADFQCFQQARAVGLAGTFRAFLSSRLQDLYSIVRRADRAAVPIVNLKDELL 1360 1370 1380 1390 1400 1410 1240 1250 1260 1270 1280 1290 pF1KB7 FPSWEALFSGSEGPLKPGARIFSFDGKDVLRHPTWPQKSVWHGSDPNGRRLTESYCETWR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 FPSWEALFSGSEGPLKPGARIFSFDGKDVLRHPTWPQKSVWHGSDPNGRRLTESYCETWR 1420 1430 1440 1450 1460 1470 1300 1310 1320 1330 pF1KB7 TEAPSATGQASSLLGGRLLGQSAASCHHAYIVLCIENSFMTASK :::::::::::::::::::::::::::::::::::::::::::: CCDS42 TEAPSATGQASSLLGGRLLGQSAASCHHAYIVLCIENSFMTASK 1480 1490 1500 1510 >>CCDS77643.1 COL18A1 gene_id:80781|Hs108|chr21 (1754 aa) initn: 6940 init1: 6940 opt: 9480 Z-score: 3652.3 bits: 688.5 E(32554): 4.1e-197 Smith-Waterman score: 9480; 99.8% identity (99.8% similar) in 1304 aa overlap (36-1336:451-1754) 10 20 30 40 50 60 pF1KB7 PWPWPRRRRLLDVLAPLVLLLGVRAASAEPERISEEVGLLQLLGDPPPQQVTQTDDPDVG :::::::::::::::::::::::::::::: CCDS77 SRLGGGRLPVACASLPTQEDGYCVLIGPAAERISEEVGLLQLLGDPPPQQVTQTDDPDVG 430 440 450 460 470 480 70 80 90 100 110 120 pF1KB7 LAYVFGPDANSGQVARYHFPSLFFRDFSLLFHIRPATEGPGVLFAITDSAQAMVLLGVKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 LAYVFGPDANSGQVARYHFPSLFFRDFSLLFHIRPATEGPGVLFAITDSAQAMVLLGVKL 490 500 510 520 530 540 130 140 150 160 170 180 pF1KB7 SGVQDGHQDISLLYTEPGAGQTHTAASFRLPAFVGQWTHLALSVAGGFVALYVDCEEFQR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 SGVQDGHQDISLLYTEPGAGQTHTAASFRLPAFVGQWTHLALSVAGGFVALYVDCEEFQR 550 560 570 580 590 600 190 200 210 220 230 240 pF1KB7 MPLARSSRGLELEPGAGLFVAQAGGADPDKFQGVIAELKVRRDPQVSPMHCLDEEGDDSD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 MPLARSSRGLELEPGAGLFVAQAGGADPDKFQGVIAELKVRRDPQVSPMHCLDEEGDDSD 610 620 630 640 650 660 250 260 270 280 290 300 pF1KB7 GASGDSGSGLGDARELLREETGAALKPRLPAPPPVTTPPLAGGSSTEDSRSEEVEEQTTV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 GASGDSGSGLGDARELLREETGAALKPRLPAPPPVTTPPLAGGSSTEDSRSEEVEEQTTV 670 680 690 700 710 720 310 320 330 340 350 360 pF1KB7 ASLGAQTLPGSDSVSTWDGSVRTPGGRVKEGGLKGQKGEPGVPGPPGRAGPPGSPCLPGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 ASLGAQTLPGSDSVSTWDGSVRTPGGRVKEGGLKGQKGEPGVPGPPGRAGPPGSPCLPGP 730 740 750 760 770 780 370 380 390 400 410 420 pF1KB7 PGLPCPVSPLGPAGPALQTVPGPQGPPGPPGRDGTPGRDGEPGDPGEDGKPGDTGPQGFP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 PGLPCPVSPLGPAGPALQTVPGPQGPPGPPGRDGTPGRDGEPGDPGEDGKPGDTGPQGFP 790 800 810 820 830 840 430 440 450 460 470 480 pF1KB7 GTPGDVGPKGDKGDPGVGERGPPGPQGPPGPPGPSFRHDKLTFIDMEGSGFGGDLEALRG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 GTPGDVGPKGDKGDPGVGERGPPGPQGPPGPPGPSFRHDKLTFIDMEGSGFGGDLEALRG 850 860 870 880 890 900 490 500 510 520 530 540 pF1KB7 PRGFPGPPGPPGVPGLPGEPGRFGVNSSDVPGPAGLPGVPGREGPPGFPGLPGPPGPPGR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 PRGFPGPPGPPGVPGLPGEPGRFGVNSSDVPGPAGLPGVPGREGPPGFPGLPGPPGPPGR 910 920 930 940 950 960 550 560 570 580 590 600 pF1KB7 EGPPGRTGQKGSLGEAGAPGHKGSKGAPGPAGARGESGLAGAPGPAGPPGPPGPPGPPGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 EGPPGRTGQKGSLGEAGAPGHKGSKGAPGPAGARGESGLAGAPGPAGPPGPPGPPGPPGP 970 980 990 1000 1010 1020 610 620 630 640 650 660 pF1KB7 GLPAGFDDMEGSGGPFWSTARSADGPQGPPGLPGLKGDPGVPGLPGAKGEVGADGVPGFP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 GLPAGFDDMEGSGGPFWSTARSADGPQGPPGLPGLKGDPGVPGLPGAKGEVGADGVPGFP 1030 1040 1050 1060 1070 1080 670 680 690 700 710 720 pF1KB7 GLPGREGIAGPQGPKGDRGSRGEKGDPGKDGVGQPGLPGPPGPPGPVVYVSEQDGSVLSV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 GLPGREGIAGPQGPKGDRGSRGEKGDPGKDGVGQPGLPGPPGPPGPVVYVSEQDGSVLSV 1090 1100 1110 1120 1130 1140 730 740 750 760 770 780 pF1KB7 PGPEGRPGFAGFPGPAGPKGNLGSKGERGSPGPKGEKGEPGSIFSPDGGALGPAQKGAKG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 PGPEGRPGFAGFPGPAGPKGNLGSKGERGSPGPKGEKGEPGSIFSPDGGALGPAQKGAKG 1150 1160 1170 1180 1190 1200 790 800 810 820 830 840 pF1KB7 EPGFRGPPGPYGRPGYKGEIGFPGRPGRPGMNGLKGEKGEPGDASLGFGMRGMPGPPGPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 EPGFRGPPGPYGRPGYKGEIGFPGRPGRPGMNGLKGEKGEPGDASLGFGMRGMPGPPGPP 1210 1220 1230 1240 1250 1260 850 860 870 880 890 900 pF1KB7 GPPGPPGTPVYDSNVFAESSRPGPPGLPGNQGPPGPKGAKGEVGPPGPPGQFPFDFLQLE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 GPPGPPGTPVYDSNVFAESSRPGPPGLPGNQGPPGPKGAKGEVGPPGPPGQFPFDFLQLE 1270 1280 1290 1300 1310 1320 910 920 930 940 950 960 pF1KB7 AEMKGEKGDRGDAGQKGERGEPGGGGFFGSSLPGPPGPPGP---RGYPGIPGPKGESIRG ::::::::::::::::::::::::::::::::::::::::: :::::::::::::::: CCDS77 AEMKGEKGDRGDAGQKGERGEPGGGGFFGSSLPGPPGPPGPPGPRGYPGIPGPKGESIRG 1330 1340 1350 1360 1370 1380 970 980 990 1000 1010 1020 pF1KB7 QPGPPGPQGPPGIGYEGRQGPPGPPGPPGPPSFPGPHRQTISVPGPPGPPGPPGPPGTMG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 QPGPPGPQGPPGIGYEGRQGPPGPPGPPGPPSFPGPHRQTISVPGPPGPPGPPGPPGTMG 1390 1400 1410 1420 1430 1440 1030 1040 1050 1060 1070 1080 pF1KB7 ASSGVRLWATRQAMLGQVHEVPEGWLIFVAEQEELYVRVQNGFRKVQLEARTPLPRGTDN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 ASSGVRLWATRQAMLGQVHEVPEGWLIFVAEQEELYVRVQNGFRKVQLEARTPLPRGTDN 1450 1460 1470 1480 1490 1500 1090 1100 1110 1120 1130 1140 pF1KB7 EVAALQPPVVQLHDSNPYPRREHPHPTARPWRADDILASPPRLPEPQPYPGAPHHSSYVH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 EVAALQPPVVQLHDSNPYPRREHPHPTARPWRADDILASPPRLPEPQPYPGAPHHSSYVH 1510 1520 1530 1540 1550 1560 1150 1160 1170 1180 1190 1200 pF1KB7 LRPARPTSPPAHSHRDFQPVLHLVALNSPLSGGMRGIRGADFQCFQQARAVGLAGTFRAF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 LRPARPTSPPAHSHRDFQPVLHLVALNSPLSGGMRGIRGADFQCFQQARAVGLAGTFRAF 1570 1580 1590 1600 1610 1620 1210 1220 1230 1240 1250 1260 pF1KB7 LSSRLQDLYSIVRRADRAAVPIVNLKDELLFPSWEALFSGSEGPLKPGARIFSFDGKDVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 LSSRLQDLYSIVRRADRAAVPIVNLKDELLFPSWEALFSGSEGPLKPGARIFSFDGKDVL 1630 1640 1650 1660 1670 1680 1270 1280 1290 1300 1310 1320 pF1KB7 RHPTWPQKSVWHGSDPNGRRLTESYCETWRTEAPSATGQASSLLGGRLLGQSAASCHHAY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 RHPTWPQKSVWHGSDPNGRRLTESYCETWRTEAPSATGQASSLLGGRLLGQSAASCHHAY 1690 1700 1710 1720 1730 1740 1330 pF1KB7 IVLCIENSFMTASK :::::::::::::: CCDS77 IVLCIENSFMTASK 1750 >>CCDS2297.1 COL3A1 gene_id:1281|Hs108|chr2 (1466 aa) initn: 785 init1: 475 opt: 1946 Z-score: 767.6 bits: 154.5 E(32554): 1.9e-36 Smith-Waterman score: 2082; 41.7% identity (54.1% similar) in 917 aa overlap (183-1026:343-1198) 160 170 180 190 200 210 pF1KB7 FRLPAFVGQWTHLALSVAGGFVALYVDCEEFQRMPLARSSRGLELEPGAGLFVAQAGGAD : : :.. : ::.. .: : CCDS22 LPGAAGARGNDGARGSDGQPGPPGPPGTAGFPGSPGAKGEVGPAGSPGSNGAPGQRGEPG 320 330 340 350 360 370 220 230 240 250 260 270 pF1KB7 PDKFQGVIAELKVRRDPQVSPMHCLDEEG-DDSDGASGDSG-SGLGDARELLREETGAAL :. :. . : : : . : :..:. : .:. : :. .: CCDS22 PQGHAGA-------QGPPGPP-------GINGSPGGKGEMGPAGIPGAPGLM-----GAR 380 390 400 410 280 290 300 310 320 330 pF1KB7 KPRLPAPPPVTTPPLAGGSSTEDSRSEEVEEQTTVASLGAQTLPGSDSVSTWDGSVRTPG : :.: .. : :.. : ... : . : .:: ... ::. .:: CCDS22 GP--PGPAGANGAPGLRGGAGEPGKNGAKGEPGPRGERGEAGIPGVPGAKGEDGKDGSPG 420 430 440 450 460 470 340 350 360 370 380 pF1KB7 GRVKEGGLKGQKGEPGVPGPPGRAGPPGSPCLPGPPG---LPCPVSPLGPAG-PALQTVP . .:: : :: :.:: : ::: : : :: : : :..: : :: :. . :: CCDS22 -EPGANGLPGAAGERGAPGFRGPAGPNGIPGEKGPAGERGAPGPAGPRGAAGEPGRDGVP 480 490 500 510 520 530 390 400 410 420 430 440 pF1KB7 GPQGPPGPPGRDGTPGRDGEPGDPGEDGK------PGDTGPQGFPGTPGDVGPKGDKGDP : : : :: : :: ::.:: :: .:. :: .::.: ::. : ::::. : : CCDS22 GGPGMRGMPGSPGGPGSDGKPGPPGSQGESGRPGPPGPSGPRGQPGVMGFPGPKGNDGAP 540 550 560 570 580 590 450 460 470 480 490 pF1KB7 GV-GERGPPGPQGPPGPPGPSFRHDKLTFIDMEG-SGFGGDLEALRGP------RGFPGP : :::: :: :: :::: .. . : .: ::: .. :: .:.:: CCDS22 GKNGERGGPGGPGPQGPPG---KNGETGPQGPPGPTGPGGD-KGDTGPPGPQGLQGLPGT 600 610 620 630 640 500 510 520 530 540 pF1KB7 PGPPGVPGLPGEPGRFG-VNSSDVPGPAGLPGVPGREGPPGFPGLPG------PPGPPGR :::: : ::::: : ... .:: : :.::..::::. : :: :::: : CCDS22 GGPPGENGKPGEPGPKGDAGAPGAPGGKGDAGAPGERGPPGLAGAPGLRGGAGPPGPEGG 650 660 670 680 690 700 550 560 570 580 590 pF1KB7 EG---PPGRTGQKGSLGEAGAPGHKGSKGAPGPAGARGESGLAGA---PGPAGPPGPPGP .: ::: : :. : : ::..:. :.::: : .:: : :: :: :: :: :: CCDS22 KGAAGPPGPPGAAGTPGLQGMPGERGGLGSPGPKGDKGEPGGPGADGVPGKDGPRGPTGP 710 720 730 740 750 760 600 610 620 630 640 pF1KB7 PGPPGP-GLPA-----GFDDMEGSGGPFWSTA-RSADGPQGP---PGLPGLKGDPGVPGL ::::: : :. : . : .:: : . :. :: :: :: :: .:.:: : CCDS22 IGPPGPAGQPGDKGEGGAPGLPGIAGPRGSPGERGETGPPGPAGFPGAPGQNGEPGGKGE 770 780 790 800 810 820 650 660 670 680 690 700 pF1KB7 PGAKGEVGADGVPGFPGLPGREGIAGPQGPKGDRGSRGEKGDPGKDGV-GQPGLPGPPG- :: :: : : :: : :: : ::: ::.: .: :: : :: : : ::::::: CCDS22 RGAPGEKGEGGPPGVAGPPGGSGPAGPPGPQGVKGERGSPGGPGAAGFPGARGLPGPPGS 830 840 850 860 870 880 710 720 730 740 750 760 pF1KB7 -----PPGPVVYVSEQDGSVLSVPGPEGRPGFAGFPGPAGPKGNLGSKGERGSPGPKGEK :::: .:: ::: : : : :: .::::. :. ::.:::: .: CCDS22 NGNPGPPGPS-GSPGKDGP----PGPAGNTGAPGSPGVSGPKGDAGQPGEKGSPGAQGPP 890 900 910 920 930 940 770 780 790 pF1KB7 GEPG-----------SIFSPDG-----GALGPAQ-KGAKGEPGF------RGPPGPYGRP : :: .. .: : :. :: :: .:.:: :::::: : : CCDS22 GAPGPLGIAGITGARGLAGPPGMPGPRGSPGPQGVKGESGKPGANGLSGERGPPGPQGLP 950 960 970 980 990 1000 800 810 820 830 840 850 pF1KB7 GYKGEIGFPGRPGRPGMNGLKGEKGEPGDASLGFGMRGMPGPPGPPGPPGPPGTPVYDSN : : : ::: : :: .:: :. : :: . : : :: :: :: ::::: :: .. CCDS22 GLAGTAGEPGRDGNPGSDGLPGRDGSPGGKG-DRGENGSPGAPGAPGHPGPPG-PVGPAG 1010 1020 1030 1040 1050 860 870 880 890 900 910 pF1KB7 VFAESSRPGPPGLPGNQGPPGPKGAKGEVGPPGPPGQFPFDFLQLEAEMKGEKGDRGDAG .. .. :: : :. : ::: :..: :: :: :. ::: :.:: :: CCDS22 KSGDRGESGPAG-PA--GAPGPAGSRGAPGPQGPRGD------------KGETGERGAAG 1060 1070 1080 1090 1100 920 930 940 950 960 970 pF1KB7 QKGERGEPGGGGFFGSSLPGPPGPPGPRGYPGIPGPKGESIRGQPGPPGPQGPPGIGYEG ::.:: ::. : :: ::: : .: : ::: : :: ::.:::: .: CCDS22 IKGHRGFPGNPG-----APGSPGPAGQQGAIGSPGPAGPR-----GPVGPSGPPG--KDG 1110 1120 1130 1140 1150 980 990 1000 1010 1020 1030 pF1KB7 RQGPPGPPGPPGPPSFPGPHRQTISVPGPPGPPGPPGPPGTMGASSGVRLWATRQAMLGQ .: ::: ::::: . : .: . . :: :: ::::::::. : : CCDS22 TSGHPGPIGPPGPRGNRG-ERGSEGSPGHPGQPGPPGPPGAPGPCCGGVGAAAIAGIGGE 1160 1170 1180 1190 1200 1210 1040 1050 1060 1070 1080 1090 pF1KB7 VHEVPEGWLIFVAEQEELYVRVQNGFRKVQLEARTPLPRGTDNEVAALQPPVVQLHDSNP CCDS22 KAGGFAPYYGDEPMDFKINTDEIMTSLKSVNGQIESLISPDGSRKNPARNCRDLKFCHPE 1220 1230 1240 1250 1260 1270 >>CCDS75932.1 COL5A1 gene_id:1289|Hs108|chr9 (1838 aa) initn: 899 init1: 486 opt: 1919 Z-score: 756.3 bits: 152.7 E(32554): 8.3e-36 Smith-Waterman score: 2091; 39.3% identity (52.3% similar) in 1048 aa overlap (246-1132:571-1577) 220 230 240 250 260 270 pF1KB7 FQGVIAELKVRRDPQVSPMHCLDEEGDDSDGASGDSGSGLGDARELLREETGAALKPRLP : : ::: : :. : : . :. CCDS75 AQESQAQAILQQARLALRGPAGPMGLTGRPGPVGPPGSG-G-----LKGEPGD-VGPQ-- 550 560 570 580 590 280 290 300 310 320 330 pF1KB7 APPPVTTPPLAGGSSTEDSRSEEVEEQTTVASLGAQTLPGSDSVST---WDGSVRTPGGR .: : :: .:. . .:. .: ::. .::. . . .:: . :: CCDS75 GPRGVQGPPGPAGKPGRRGRA---------GSDGARGMPGQTGPKGDRGFDGLAGLPG-- 600 610 620 630 640 340 350 360 370 380 pF1KB7 VKEGGLKGQKGEPGVPGPPG---------RAGPPGSPCLPGPPGLPCPVSPLGPAGP--- : : .:. : : ::::: ..:: : : ::: :: : .: :: :: CCDS75 --EKGHRGDPGPSGPPGPPGDDGERGDDGEVGPRGLPGEPGPRGLLGPKGPPGPPGPPGV 650 660 670 680 690 390 400 410 420 430 pF1KB7 -ALQTVPGP------QGPPGPPGRDGTPGRDGEPGDPGEDGKPGDTGPQGFPGTPGDVGP ... ::: :: :::::..:.:: .: :: : : ::. :: : :: :: : CCDS75 TGMDGQPGPKGNVGPQGEPGPPGQQGNPGAQGLPGPQGAIGPPGEKGPLGKPGLPGMPGA 700 710 720 730 740 750 440 450 460 470 pF1KB7 KGDKGDPGV----GERG---PPGPQGPPGPPGPSFRHDKLTFIDMEGS----------GF : : :: ::.: ::::::: : ::: . . ..:. :: CCDS75 DGPPGHPGKEGPPGEKGGQGPPGPQGPIGYPGPRGVKGADGIRGLKGTKGEKGEDGFPGF 760 770 780 790 800 810 480 490 500 510 520 pF1KB7 GGDLEALRGPRGFPGPPGPPGVPGLPGEPGRFGVNSSDVP-GPAG---------LPGVPG ::. ...: :: ::::: : : : :: : :.. : :: : ::: :: CCDS75 KGDM-GIKGDRGEIGPPGPRGEDGPEGPKGRGGPNGDPGPLGPPGEKGKLGVPGLPGYPG 820 830 840 850 860 870 530 540 550 560 570 pF1KB7 REGP------PGFPG---------LPGPPGPPGREGPPGRTGQKGSLGEAGAPGHKGSKG :.:: ::::: :: ::: :..:: : :..: : .: :: ::..: CCDS75 RQGPKGSIGFPGFPGANGEKGGRGTPGKPGPRGQRGPTGPRGERGPRGITGKPGPKGNSG 880 890 900 910 920 930 580 590 600 610 620 pF1KB7 APGPAGARGESGLAGAPGPAGPPGPPGPPGPPG----PGLPA-----GFDDMEGSGGPFW . :::: :: : : ::.: ::: ::::::: :: :. ::. : :: CCDS75 GDGPAGPPGERGPNGPQGPTGFPGPKGPPGPPGKDGLPGHPGQRGETGFQGKTGPPGPPG 940 950 960 970 980 990 630 640 650 660 pF1KB7 STA-------------RSADGPQGPPG---LPGL------KGDPGVPGLPGAKGEVGADG .. :. :: :::: :::: ::::: :::: : : : CCDS75 VVGPQGPTGETGPMGERGHPGPPGPPGEQGLPGLAGKEGTKGDPGPAGLPGKDGPPGLRG 1000 1010 1020 1030 1040 1050 670 680 690 700 710 pF1KB7 VPGFPGLPGR------EGIAGPQGPKGDRGSRGEKGDPGKDG-VGQPGLPGPPGPPGPVV :: :::: .: :: :: : :: ::.: : : .: :: ::: :::::. CCDS75 FPGDRGLPGPVGALGLKGNEGPPGPPGPAGSPGERGPAGAAGPIGIPGRPGPQGPPGPA- 1060 1070 1080 1090 1100 1110 720 730 740 750 760 pF1KB7 YVSEQDGSVLSVP-GPEGRPGF---AGFPGPAGP---------KGNLGSKGERGSPGPKG .:. . . : :: :: :. .:.:::::: ::..: :..:: : :: CCDS75 --GEKGAPGEKGPQGPAGRDGLQGPVGLPGPAGPVGPPGEDGDKGEIGEPGQKGSKGDKG 1120 1130 1140 1150 1160 1170 770 780 790 800 810 820 pF1KB7 EKGEPGSIFSPDGGALGPAQKGAKGEPGFRGPPGPYGRPGYKGEIGFPGRPGRPGMNGLK :.: :: .:.: :. .:: :::: :: : .:. : .: :::: :: :..:: CCDS75 EQGPPGPT-GPQGPIGQPGPSGADGEPGPRGQQGLFGQKGDEGPRGFPGPPGPVGLQGLP 1180 1190 1200 1210 1220 1230 830 840 850 860 pF1KB7 GEKGEPGDASLGFGMRGMPGPPGP------PGPPGPPGTP--VYDSNVFAESSRPGP--- : :: :... :. : :::::: :: :: : : . . .. .:...:: CCDS75 GPPGEKGETG-DVGQMGPPGPPGPRGPSGAPGADGPQGPPGGIGNPGAVGEKGEPGEAGE 1240 1250 1260 1270 1280 1290 870 880 890 900 pF1KB7 PGLPGNQGPPGPKGAKGE---------VGPPGP------------PGQFPFDFLQLEAEM :::::. ::::::: .:: .::::: :: : CCDS75 PGLPGEGGPPGPKGERGEKGESGPSGAAGPPGPKGPPGDDGPKGSPGPVGFPGDPGPPGE 1300 1310 1320 1330 1340 1350 910 920 930 940 950 960 pF1KB7 KGEKGDRGDAGQKGERGEPGGGGFFG-SSLPGPPGPPGPRGYPGIPGPKG-ESIRGQPGP : :. : :.::. :::: : : .. ::: :::: :: :: ::.: .. .: : CCDS75 PGPAGQDGPPGDKGDDGEPGQTGSPGPTGEPGPSGPPGKRGPPGPAGPEGRQGEKGAKGE 1360 1370 1380 1390 1400 1410 970 980 990 1000 1010 1020 pF1KB7 PGPQGPPG-IGYEGRQGPPGPPGPPGPPSFPGP-HRQTI-SVPGPPGPPGPPGPPGTMG- : .:::: : : :: :: ::: : ..::: .: . . ::: ::::: :::: : CCDS75 AGLEGPPGKTGPIGPQGAPGKPGPDGLRGIPGPVGEQGLPGSPGPDGPPGPMGPPGLPGL 1420 1430 1440 1450 1460 1470 1030 1040 1050 1060 1070 1080 pF1KB7 -ASSGVRLWATRQAMLGQVHEVPEGWLIFVAEQEELYVRVQNGFRKVQLEARTPLPRGTD ..:: . . ...: . : : :: : . . :. : . .: CCDS75 KGDSGPKGEKGHPGLIGLIG--PPG------EQGE---KGDRGLPGPQGSSGPKGEQGIT 1480 1490 1500 1510 1520 1090 1100 1110 1120 1130 pF1KB7 NEVAALQPPVVQLHDSNPYPR--REHPHPTARPWRADDILASPPRLPEP-----QPYPGA . . . :: . : :. . ::. : ... .:: : : :: : CCDS75 GPSGPIGPPGPPGLPGPPGPKGAKGSSGPTG-P-KGEAGHPGPPGPPGPPGEVIQPLPIQ 1530 1540 1550 1560 1570 1140 1150 1160 1170 1180 1190 pF1KB7 PHHSSYVHLRPARPTSPPAHSHRDFQPVLHLVALNSPLSGGMRGIRGADFQCFQQARAVG CCDS75 ASRTRRNIDASQLLDDGNGENYVDYADGMEEIFGSLNSLKLEIEQMKRPLGTQQNPARTC 1580 1590 1600 1610 1620 1630 >>CCDS6982.1 COL5A1 gene_id:1289|Hs108|chr9 (1838 aa) initn: 899 init1: 486 opt: 1919 Z-score: 756.3 bits: 152.7 E(32554): 8.3e-36 Smith-Waterman score: 2091; 39.3% identity (52.3% similar) in 1048 aa overlap (246-1132:571-1577) 220 230 240 250 260 270 pF1KB7 FQGVIAELKVRRDPQVSPMHCLDEEGDDSDGASGDSGSGLGDARELLREETGAALKPRLP : : ::: : :. : : . :. CCDS69 AQESQAQAILQQARLALRGPAGPMGLTGRPGPVGPPGSG-G-----LKGEPGD-VGPQ-- 550 560 570 580 590 280 290 300 310 320 330 pF1KB7 APPPVTTPPLAGGSSTEDSRSEEVEEQTTVASLGAQTLPGSDSVST---WDGSVRTPGGR .: : :: .:. . .:. .: ::. .::. . . .:: . :: CCDS69 GPRGVQGPPGPAGKPGRRGRA---------GSDGARGMPGQTGPKGDRGFDGLAGLPG-- 600 610 620 630 640 340 350 360 370 380 pF1KB7 VKEGGLKGQKGEPGVPGPPG---------RAGPPGSPCLPGPPGLPCPVSPLGPAGP--- : : .:. : : ::::: ..:: : : ::: :: : .: :: :: CCDS69 --EKGHRGDPGPSGPPGPPGDDGERGDDGEVGPRGLPGEPGPRGLLGPKGPPGPPGPPGV 650 660 670 680 690 390 400 410 420 430 pF1KB7 -ALQTVPGP------QGPPGPPGRDGTPGRDGEPGDPGEDGKPGDTGPQGFPGTPGDVGP ... ::: :: :::::..:.:: .: :: : : ::. :: : :: :: : CCDS69 TGMDGQPGPKGNVGPQGEPGPPGQQGNPGAQGLPGPQGAIGPPGEKGPLGKPGLPGMPGA 700 710 720 730 740 750 440 450 460 470 pF1KB7 KGDKGDPGV----GERG---PPGPQGPPGPPGPSFRHDKLTFIDMEGS----------GF : : :: ::.: ::::::: : ::: . . ..:. :: CCDS69 DGPPGHPGKEGPPGEKGGQGPPGPQGPIGYPGPRGVKGADGIRGLKGTKGEKGEDGFPGF 760 770 780 790 800 810 480 490 500 510 520 pF1KB7 GGDLEALRGPRGFPGPPGPPGVPGLPGEPGRFGVNSSDVP-GPAG---------LPGVPG ::. ...: :: ::::: : : : :: : :.. : :: : ::: :: CCDS69 KGDM-GIKGDRGEIGPPGPRGEDGPEGPKGRGGPNGDPGPLGPPGEKGKLGVPGLPGYPG 820 830 840 850 860 870 530 540 550 560 570 pF1KB7 REGP------PGFPG---------LPGPPGPPGREGPPGRTGQKGSLGEAGAPGHKGSKG :.:: ::::: :: ::: :..:: : :..: : .: :: ::..: CCDS69 RQGPKGSIGFPGFPGANGEKGGRGTPGKPGPRGQRGPTGPRGERGPRGITGKPGPKGNSG 880 890 900 910 920 930 580 590 600 610 620 pF1KB7 APGPAGARGESGLAGAPGPAGPPGPPGPPGPPG----PGLPA-----GFDDMEGSGGPFW . :::: :: : : ::.: ::: ::::::: :: :. ::. : :: CCDS69 GDGPAGPPGERGPNGPQGPTGFPGPKGPPGPPGKDGLPGHPGQRGETGFQGKTGPPGPPG 940 950 960 970 980 990 630 640 650 660 pF1KB7 STA-------------RSADGPQGPPG---LPGL------KGDPGVPGLPGAKGEVGADG .. :. :: :::: :::: ::::: :::: : : : CCDS69 VVGPQGPTGETGPMGERGHPGPPGPPGEQGLPGLAGKEGTKGDPGPAGLPGKDGPPGLRG 1000 1010 1020 1030 1040 1050 670 680 690 700 710 pF1KB7 VPGFPGLPGR------EGIAGPQGPKGDRGSRGEKGDPGKDG-VGQPGLPGPPGPPGPVV :: :::: .: :: :: : :: ::.: : : .: :: ::: :::::. CCDS69 FPGDRGLPGPVGALGLKGNEGPPGPPGPAGSPGERGPAGAAGPIGIPGRPGPQGPPGPA- 1060 1070 1080 1090 1100 1110 720 730 740 750 760 pF1KB7 YVSEQDGSVLSVP-GPEGRPGF---AGFPGPAGP---------KGNLGSKGERGSPGPKG .:. . . : :: :: :. .:.:::::: ::..: :..:: : :: CCDS69 --GEKGAPGEKGPQGPAGRDGLQGPVGLPGPAGPVGPPGEDGDKGEIGEPGQKGSKGDKG 1120 1130 1140 1150 1160 1170 770 780 790 800 810 820 pF1KB7 EKGEPGSIFSPDGGALGPAQKGAKGEPGFRGPPGPYGRPGYKGEIGFPGRPGRPGMNGLK :.: :: .:.: :. .:: :::: :: : .:. : .: :::: :: :..:: CCDS69 EQGPPGPT-GPQGPIGQPGPSGADGEPGPRGQQGLFGQKGDEGPRGFPGPPGPVGLQGLP 1180 1190 1200 1210 1220 1230 830 840 850 860 pF1KB7 GEKGEPGDASLGFGMRGMPGPPGP------PGPPGPPGTP--VYDSNVFAESSRPGP--- : :: :... :. : :::::: :: :: : : . . .. .:...:: CCDS69 GPPGEKGETG-DVGQMGPPGPPGPRGPSGAPGADGPQGPPGGIGNPGAVGEKGEPGEAGE 1240 1250 1260 1270 1280 1290 870 880 890 900 pF1KB7 PGLPGNQGPPGPKGAKGE---------VGPPGP------------PGQFPFDFLQLEAEM :::::. ::::::: .:: .::::: :: : CCDS69 PGLPGEGGPPGPKGERGEKGESGPSGAAGPPGPKGPPGDDGPKGSPGPVGFPGDPGPPGE 1300 1310 1320 1330 1340 1350 910 920 930 940 950 960 pF1KB7 KGEKGDRGDAGQKGERGEPGGGGFFG-SSLPGPPGPPGPRGYPGIPGPKG-ESIRGQPGP : :. : :.::. :::: : : .. ::: :::: :: :: ::.: .. .: : CCDS69 PGPAGQDGPPGDKGDDGEPGQTGSPGPTGEPGPSGPPGKRGPPGPAGPEGRQGEKGAKGE 1360 1370 1380 1390 1400 1410 970 980 990 1000 1010 1020 pF1KB7 PGPQGPPG-IGYEGRQGPPGPPGPPGPPSFPGP-HRQTI-SVPGPPGPPGPPGPPGTMG- : .:::: : : :: :: ::: : ..::: .: . . ::: ::::: :::: : CCDS69 AGLEGPPGKTGPIGPQGAPGKPGPDGLRGIPGPVGEQGLPGSPGPDGPPGPMGPPGLPGL 1420 1430 1440 1450 1460 1470 1030 1040 1050 1060 1070 1080 pF1KB7 -ASSGVRLWATRQAMLGQVHEVPEGWLIFVAEQEELYVRVQNGFRKVQLEARTPLPRGTD ..:: . . ...: . : : :: : . . :. : . .: CCDS69 KGDSGPKGEKGHPGLIGLIG--PPG------EQGE---KGDRGLPGPQGSSGPKGEQGIT 1480 1490 1500 1510 1520 1090 1100 1110 1120 1130 pF1KB7 NEVAALQPPVVQLHDSNPYPR--REHPHPTARPWRADDILASPPRLPEP-----QPYPGA . . . :: . : :. . ::. : ... .:: : : :: : CCDS69 GPSGPIGPPGPPGLPGPPGPKGAKGSSGPTG-P-KGEAGHPGPPGPPGPPGEVIQPLPIQ 1530 1540 1550 1560 1570 1140 1150 1160 1170 1180 1190 pF1KB7 PHHSSYVHLRPARPTSPPAHSHRDFQPVLHLVALNSPLSGGMRGIRGADFQCFQQARAVG CCDS69 ASRTRRNIDASQLLDDGNGENYVDYADGMEEIFGSLNSLKLEIEQMKRPLGTQQNPARTC 1580 1590 1600 1610 1620 1630 >>CCDS35366.1 COL4A5 gene_id:1287|Hs108|chrX (1691 aa) initn: 3059 init1: 930 opt: 1895 Z-score: 747.5 bits: 151.0 E(32554): 2.6e-35 Smith-Waterman score: 2110; 38.8% identity (53.2% similar) in 1063 aa overlap (229-1183:442-1444) 200 210 220 230 240 250 pF1KB7 PGAGLFVAQAGGADPDKFQGVIAELKVRRDPQVSPMHCLDEEGDDSD-GASGDSG----S :.. : . : : . :. ::.: . CCDS35 GPPGISIPGPPGLDGQPGAPGLPGPPGPAGPHIPPSDEICEPGPPGPPGSPGDKGLQGEQ 420 430 440 450 460 470 260 270 280 290 300 310 pF1KB7 GL-GDARELLREETGAALK--PRLPAPPPVTTPPLAGGSSTEDSRSEEVEEQTTVASLGA :. :: . . :.... : :. : . :: :: ... : . ... : CCDS35 GVKGDKGDTCFNCIGTGISGPPGQPGLPGLPGPP---GSLGFPGQKGEKGQAGATGPKGL 480 490 500 510 520 320 330 340 350 360 pF1KB7 QTLPGSDSVSTWDGSVRTPGGRVKEGGLKGQKGE---PGVPGPPGRAGPPGS---PCLPG .::. .. . :: :: . :.::.::: ::.:: :: : ::. : ::: CCDS35 PGIPGAPGAPGFPGSKGEPGDILTFPGMKGDKGELGSPGAPGLPGLPGTPGQDGLPGLPG 530 540 550 560 570 580 370 380 390 pF1KB7 P---------------------PGLPCPVSPLGPAG--P-------ALQTV---PGPQGP : :::: ..:.:: : : ..: : :: : CCDS35 PKGEPGGITFKGERGPPGNPGLPGLPGNIGPMGPPGFGPPGPVGEKGIQGVAGNPGQPGI 590 600 610 620 630 640 400 410 420 430 440 pF1KB7 PGPPGRDGT----PGRDGEPGDPGEDGK---PGDTGPQGFPGTPGDVGPKGDKGDPGVGE ::: : : ::. : ::.::.:: ::: : : :: :: : ::. : ::.: CCDS35 PGPKGDPGQTITQPGKPGLPGNPGRDGDVGLPGDPGLPGQPGLPGIPGSKGEPGIPGIGL 650 660 670 680 690 700 450 460 470 480 490 500 pF1KB7 RGPPGPQGPPGPPGPSFRHDKLTFIDMEGSGFGGDLEALRGPRGFPGPPGPPGVPGLPGE :::::.: :: ::: . : : . :. ::::::: :: :: CCDS35 PGPPGPKGFPGIPGPP-----------GAPGTPGRI-------GLEGPPGPPGFPGPKGE 710 720 730 740 750 510 520 530 540 550 560 pF1KB7 PGRFGVNSSDVPGPAGLPGVPGREGPPGFPGLPGPPGPPGREGPPGRTGQKGSLGEAGAP :: :.. . ::: :::: : :: : :.::::::::: : : : ::..: : : CCDS35 PG-FALPGP--PGPPGLPGFKGALGPKGDRGFPGPPGPPGRTGLDGLPGPKGDVGPNGQP 760 770 780 790 800 570 580 590 600 610 620 pF1KB7 GHKGSKGAPGPAGARGESGLAGAPGPAGPPGPPGPPGPPG-PGLPAGFDDMEGSGGPFWS : : : :: :..: : : ::: : :: : :: : :: : :.: . : : : CCDS35 GPMGPPGLPG-IGVQGPPGPPGIPGPIGQPGLHGIPGEKGDPG-PPGLD-VPGPPGERGS 810 820 830 840 850 860 630 640 650 660 670 pF1KB7 TA-RSADGPQGPPGLPGLKGDPGVPGLPGAKGEVGADGVPGFPG---LPGREGIAGPQGP . .: :: :::: ::: : :. :.::.:::.: : :: :: .::: :. : : CCDS35 PGIPGAPGPIGPPGSPGLPGKAGASGFPGTKGEMGMMGPPGPPGPLGIPGRSGV--P-GL 870 880 890 900 910 920 680 690 700 710 720 730 pF1KB7 KGDRGSRGEKGDPGKDGV-GQPGLPGPPGPPGPVVYVSEQDGSVLSVPGPEGRPGFAGFP ::: : .:. : :: : :. : :: ::::::. : ..:. : .:.::. :.: CCDS35 KGDDGLQGQPGLPGPTGEKGSKGEPGLPGPPGPM------DPNLLGSKGEKGEPGLPGIP 930 940 950 960 970 740 750 760 770 780 790 pF1KB7 GPAGPKGNLGSKGERGSPGPKGEKGEPGSIFSPDGGALGPAQKGAKGEPGFRGP------ : .:::: : :. :.:: .:. : :: .: :. :.: : : ::..: CCDS35 GVSGPKGYQGLPGDPGQPGLSGQPGLPGPP-GPKGNPGLPGQPGLIGPPGLKGTIGDMGF 980 990 1000 1010 1020 1030 800 810 820 830 840 850 pF1KB7 PGPYGRPGYKGEIGFPGRPGRPGMNGLKGEKGEPGDASLGFGMRGMPGPPGPPGPPGPPG ::: : : : : ::.:: ::. : ::.::.:: .:. :. :.::: : :: :: :: CCDS35 PGPQGVEGPPGPSGVPGQPGSPGLPGQKGDKGDPGISSI--GLPGLPGPKGEPGLPGYPG 1040 1050 1060 1070 1080 1090 860 870 880 890 900 pF1KB7 TP-----VYDSNVFAESSRPGP---PGLPGNQGPPGPKGAKGEVGPPGPPGQFPFDFLQL .: : : .. . . :: ::::: : ::: : :: :::: :: : CCDS35 NPGIKGSVGDPGLPGLPGTPGAKGQPGLPGFPGTPGPPGPKGISGPPGNPG------LPG 1100 1110 1120 1130 1140 910 920 930 940 950 pF1KB7 EAEMKGEKGDRGDAGQKGERGEPGGGGFFGSS----LPGPPG--PPGPRGYPGIPGPKGE : : : :. : ::.:.:: :. : . :: :: ::: : ::. : ::. CCDS35 EPGPVGGGGHPGQPGPPGEKGKPGQDGIPGPAGQKGEPGQPGFGNPGPPGLPGLSGQKGD 1150 1160 1170 1180 1190 1200 960 970 980 990 1000 pF1KB7 S----IRGQPGPPGPQGPPGI-GYEGRQGPPGPPGPPGP----PSF-PGPHRQTISVPGP . : :.:: :::.: ::. :. : :::::::: ::: :. :::. . ::: CCDS35 GGLPGIPGNPGLPGPKGEPGFHGFPGVQGPPGPPGSPGPALEGPKGNPGPQGPP-GRPGP 1210 1220 1230 1240 1250 1260 1010 1020 1030 1040 1050 pF1KB7 ------PGPPGPPGPPGTMGASSGVRLWATRQAMLG----QVHEVPEGWLI------FVA ::: :::: ::. :. .: . . .. : . . : : . . CCDS35 TGFQGLPGPEGPPGLPGN-GGIKGEKGNPGQPGLPGLPGLKGDQGPPGLQGNPGRPGLNG 1270 1280 1290 1300 1310 1320 1060 1070 1080 1090 1100 1110 pF1KB7 EQEELYVRVQNGFRKVQLEARTPLPRGTDNEVAALQPPVVQLHDSNPYPRREHPHPTARP . . . :: .. . .: : ..: . . :: .: : : . CCDS35 MKGDPGLPGVPGFPGMKGPSGVPGSAGPEGEPGLIGPP-------GP-PGLPGPSGQSII 1330 1340 1350 1360 1370 1120 1130 1140 1150 1160 1170 pF1KB7 WRADDILASPPRLPEPQPYPGAPHHSSYVHLRPARPTSPPAHSHRDFQPVLHLVA--LNS ..: :.:: .: : : .. : :. ::.::. :. : . .. .. CCDS35 IKGD---AGPPGIPGQPGLKGLPGPQGPQGL-PG-PTGPPGDPGRNGLPGFDGAGGRKGD 1380 1390 1400 1410 1420 1430 1180 1190 1200 1210 1220 1230 pF1KB7 PLSGGMRGIRGADFQCFQQARAVGLAGTFRAFLSSRLQDLYSIVRRADRAAVPIVNLKDE : :. : :: : CCDS35 PGLPGQPGTRGLDGPPGPDGLQGPPGPPGTSSVAHGFLITRHSQTTDAPQCPQGTLQVYE 1440 1450 1460 1470 1480 1490 >-- initn: 791 init1: 435 opt: 1108 Z-score: 446.1 bits: 95.2 E(32554): 1.6e-18 Smith-Waterman score: 1241; 43.8% identity (54.2% similar) in 504 aa overlap (501-992:19-439) 480 490 500 510 520 pF1KB7 MEGSGFGGDLEALRGPRGFPGPPGPPGVPGLPGEPGRFGVNSSDVPGP----AGLPGVPG : :.:.. .. . :: .:. : : CCDS35 MKLRGVSLAAGLFLLALSLWGQPAEAAACYGCSPGSKCDCSGIKGEKG 10 20 30 40 530 540 550 560 570 580 pF1KB7 REGPPGFPGLPGPPGPPGREGPPGRTGQKGSLGEAGAPGHKGSKGAPGPAGARGESGLAG ..: ::. : :: :: :: ::::: ::::. : : :: :: .: :: : : :: : CCDS35 ERGFPGLEGHPGLPGFPGPEGPPGPRGQKGDDGIPGPPGPKGIRGPPGLPGFPGTPGLPG 50 60 70 80 90 100 590 600 610 620 630 640 pF1KB7 APGPAGPPGPPGPPGPPGPGLPAGFDDMEGSGGPFWSTARSADGPQGPPGLPGLKGDPG- :: : ::: : :: : :: : : .:: ::::.::.::.:: CCDS35 MPGHDGAPGPQGIPGCNGTKGERGFPGSPGFPG--------LQGPPGPPGIPGMKGEPGS 110 120 130 140 150 160 650 660 670 680 690 700 pF1KB7 --VPGLPGAKGEVGADGVPGFPGLPGREGIAGPQGPKGDRGSRGEKGDPGKDGVGQPGLP . .::: ::. : : ::. :::: :: :: :: : : ::: CCDS35 IIMSSLPGPKGNPGYPGPPGIQGLPGPTGIPGPIGPPG----------P-------PGLM 170 180 190 200 710 720 730 740 750 760 pF1KB7 GPPGPPGPVVYVSEQDGSVLSVPGPEGRPGFAGFPGPAGPKGNLGSKGERGSPGPKGEKG ::::::: .:::.: :. .: :: : ::. : .: : :: .:. CCDS35 GPPGPPG--------------LPGPKGNMGL-NFQGPKGEKGEQGLQGPPGPPGQISEQK 210 220 230 240 770 780 790 800 810 820 pF1KB7 EPGSIFSPDGGALGPAQKGAKGEPGFRGPPGPYGRPGYKGEIGFPGRPGRPGMNGLKGEK .: .. ::: .: :: :::::: :: .: :: :: : .: :::. CCDS35 RPIDV---------EFQKGDQGLPGDRGPPGP---PGIRGP---PGPPG--GEKGEKGEQ 250 260 270 280 290 830 840 850 860 870 880 pF1KB7 GEPGDASLGFGMRGMPGPPGPPGPPGPPGTPVYDSNVFAESSRPGPPGLPGNQGPPGPKG :::: :: :: : : :: :: : : :: ::. : : :: CCDS35 GEPGK-------RGKPGKDGENGQPGIPGLP-------------GDPGYPGEPGRDGEKG 300 310 320 330 890 900 910 920 930 940 pF1KB7 AKGEVGPPGPPGQ-FPFDFLQLEAEMKGEKGDRGDAGQKGERGEPGGGGFFGSSLPGPPG ::..::::::: .: . ::. : : :.::::: :: : : :::::: CCDS35 QKGDTGPPGPPGLVIPRPGTGITIGEKGNIGLPGLPGEKGERGFPGIQGPPG--LPGPPG 340 350 360 370 380 950 960 970 980 990 pF1KB7 PP--GPRGYPGIPGPKGESIRGQPGPPGPQ--GPPGIGYEGRQGPPGPPGPPGPPSFPGP :: : ::.:: .:. .:. :::: . ::::. .:. : :: :::::: CCDS35 AAVMGPPGPPGFPGERGQ--KGDEGPPGISIPGPPGL--DGQPGAPGLPGPPGPAGPHIP 390 400 410 420 430 440 1000 1010 1020 1030 1040 1050 pF1KB7 HRQTISVPGPPGPPGPPGPPGTMGASSGVRLWATRQAMLGQVHEVPEGWLIFVAEQEELY CCDS35 PSDEICEPGPPGPPGSPGDKGLQGEQGVKGDKGDTCFNCIGTGISGPPGQPGLPGLPGPP 450 460 470 480 490 500 >>CCDS780.2 COL11A1 gene_id:1301|Hs108|chr1 (1690 aa) initn: 1246 init1: 529 opt: 1875 Z-score: 739.8 bits: 149.6 E(32554): 6.9e-35 Smith-Waterman score: 2062; 36.9% identity (50.2% similar) in 1197 aa overlap (194-1279:368-1511) 170 180 190 200 210 220 pF1KB7 HLALSVAGGFVALYVDCEEFQRMPLARSSRGLELEPGAGLFVAQAGGADPDKFQGVIA-E :: ::. :.. :.: .: . : : CCDS78 GPPGLQGPTGPPGDPGDRGPPGRPGLPGADGLPGPPGTMLMLPFRYGGDGSKGPTISAQE 340 350 360 370 380 390 230 240 250 260 270 pF1KB7 LKVRRDPQVSPMHCLDEEGDDSDGASGDSGSGLGDARELLREETG--AALKPR-LPAPPP ... : . . : : .: : : . . :.: . :: . .:: CCDS78 AQAQAILQQARIALRGPPGPM--GLTGRPGPVGGPGSSGAKGESGDPGPQGPRGVQGPPG 400 410 420 430 440 450 280 290 300 310 320 330 pF1KB7 VTTPPLAGGSSTEDSRSEEVEEQTTVASLGAQTLPG--SDSVSTWDGSVRTPGGRVKEGG : : : :. : . .. : . ::: .:. . . . : : . : CCDS78 PTGKPGKRGRPGADGGRGMPGEPGAKGDRGFDGLPGLPGDKGHRGERGPQGPPGPPGDDG 460 470 480 490 500 510 340 350 360 370 380 390 pF1KB7 LKGQKGE------PGVPGPPGRAGPPGSPCLPGPPGLPCPVSPLGPAGPALQTVPGPQGP ..:. :: :: :: : :: :.: :: ::. .: :: : :::: CCDS78 MRGEDGEIGPRGLPGEAGPRGLLGPRGTPGAPGQPGMAGVDGPPGPKGNM-----GPQGE 520 530 540 550 560 570 400 410 420 430 440 pF1KB7 PGPPGRDGTPGRDGEPGDPGEDGKPGDTGPQGFPGT---PGDVGPKGDKGDPG-VGERG- :::::..:.:: .: :: : : ::. :::: :: :: :: : : : ::.: CCDS78 PGPPGQQGNPGPQGLPGPQGPIGPPGEKGPQGKPGLAGLPGADGPPGHPGKEGQSGEKGA 580 590 600 610 620 630 450 460 470 480 pF1KB7 --PPGPQGPPGPPGPSFRHDKLTFIDMEGS----------GFGGD--LEALRG------P ::::::: : ::: . ..:: :: :: :.. :: : CCDS78 LGPPGPQGPIGYPGPRGVKGADGVRGLKGSKGEKGEDGFPGFKGDMGLKGDRGEVGQIGP 640 650 660 670 680 690 490 500 510 520 530 pF1KB7 RGFPGPPGPPGVPGLPGEPGRFG-VNSSDVPGPAGLPGVPGREGPPG---FPGLPGP--- :: :: :: : : :.:: : .. . : :::: :::.:: : :::.:: CCDS78 RGEDGPEGPKGRAGPTGDPGPSGQAGEKGKLGVPGLPGYPGRQGPKGSTGFPGFPGANGE 700 710 720 730 740 750 540 550 560 570 580 590 pF1KB7 ---------PGPPGREGPPGRTGQKGSLGEAGAPGHKGSKGAPGPAGARGESGLAGAPGP ::: :..:: : :..:. : .: :: ::..:. :: : :: : : :: CCDS78 KGARGVAGKPGPRGQRGPTGPRGSRGARGPTGKPGPKGTSGGDGPPGPPGERGPQGPQGP 760 770 780 790 800 810 600 610 620 630 640 pF1KB7 AGPPGPPGPPGPPG----PGLPA--GFDDMEGSGGPFWSTARSADGPQGPPGL--P-GLK .: ::: ::::::: :: :. : ..:. :: .. ::::: : : : . CCDS78 VGFPGPKGPPGPPGKDGLPGHPGQRGETGFQGKTGP--PGPGGVVGPQGPTGETGPIGER 820 830 840 850 860 650 660 670 680 690 700 pF1KB7 GDPGVPGLPGAKGEVGADGVPGFPGLPGREGIAGPQGPKGDRGSRGEKGDPGKDGV-GQP : :: :: :: .: :: : : : :: .::.: .:: : :: ::.: :: .:. : CCDS78 GHPGPPGPPGEQGLPGAAGKEGAKGDPGPQGISGKDGPAGLRGFPGERGLPGAQGAPGLK 870 880 890 900 910 920 710 720 730 740 750 pF1KB7 GLPGPPGPPGPVVYVSEQDGSVLSVP-GPEGRPGFAGFPGPAGPKGNLGSKGERGS---- : :: :::::: .:. .. . : : :::: : ::::: :: : :: .: CCDS78 GGEGPQGPPGPVGSPGERGSAGTAGPIGLPGRPGPQGPPGPAGEKGAPGEKGPQGPAGRD 930 940 950 960 970 980 760 770 780 790 800 pF1KB7 --------PGPKGEKGEPGSIFSPDGGALG-PAQKGAKGEPGFRGPPGPYGRPGYKGEIG ::: : : :: . : : .: :.:::.::. : ::::: : : : : CCDS78 GVQGPVGLPGPAGPAGSPGE--DGDKGEIGEPGQKGSKGDKGENGPPGPPGLQGPVGAPG 990 1000 1010 1020 1030 1040 810 820 830 840 850 860 pF1KB7 FPGRPGRPGMNGLKGEKGEPGDASLGFGMRGMPGPPGP---PGPPGPPGTPVYDSNVFAE . : :.:: : .: :. :: : ::.:::::: : ::::: ...: CCDS78 IAGGDGEPGPRGQQGMFGQKGDE----GARGFPGPPGPIGLQGLPGPPGEKGENGDV--- 1050 1060 1070 1080 1090 870 880 890 900 910 920 pF1KB7 SSRPGPPGLPGNQGPPGPKGAKGEVGPPGPPGQFPFDFLQLEAEMKGEKGDRGDAGQKGE . :::: :: .:: ::.:: : :::: :. . : :. : :.:: : CCDS78 -GPMGPPGPPGPRGPQGPNGADGPQGPPGSVGSVGGVGEKGEPGEAGNPGPPGEAGVGGP 1100 1110 1120 1130 1140 1150 930 940 950 960 970 980 pF1KB7 RGEPGGGGFFGSSLPGPPGPPGPRGYPGIPGPKGE-SIRGQPGPPGPQGPPG-IGYEGRQ .:: : : : :: :::: .: :: ::::. . : :: ::: : :: : .: CCDS78 KGERGEKGEAGP--PGAAGPPGAKGPPGDDGPKGNPGPVGFPGDPGPPGEPGPAGQDGVG 1160 1170 1180 1190 1200 1210 990 1000 1010 1020 1030 1040 pF1KB7 GPPGPPGPPGPPSFPGPHRQTISVPGPPGPPGPPGPPGTMGASSGVRLWATRQAMLGQVH : : : :: :. ::: .. ::::::: ::::. :: . ::. : CCDS78 GDKGEDGDPGQPGPPGPSGEA----GPPGPPGKRGPPGAAGAEG-------RQGEKGAKG 1220 1230 1240 1250 1260 1050 1060 1070 1080 1090 pF1KB7 EV----PEGWLIFVAEQEELYVRVQNGFRKVQL---EARTPLPRGTDNEVAALQPPVVQL :. : : :. : .:.: . : : : :. . . :: . CCDS78 EAGAEGPPGKTGPVGPQGPAGKPGPEGLRGIPGPVGEQGLPGAAGQDGPPGPMGPPGLPG 1270 1280 1290 1300 1310 1320 1100 1110 1120 1130 1140 pF1KB7 HDSNPYPRREHPHP-----TARPWRADDILASPPRLPEPQPYPGAPHHSSYVHLRPARPT ..: . :. :: . : . . . :: : ::: .. :: : CCDS78 LKGDPGSKGEKGHPGLIGLIGPPGEQGE--KGDRGLPGTQGSPGAKGDGGIPG--PAGPL 1330 1340 1350 1360 1370 1380 1150 1160 1170 1180 1190 1200 pF1KB7 SPPAHSHRDFQPVLHLVALNSPLSGGMRGIRGADFQCFQQARAVGLAGTFRA-------- .::. : : : : .: .:. :.. . :: : . CCDS78 GPPG------PPGL-------PGPQGPKGNKGSTGPAGQKGDS-GLPGPPGSPGPPGEVI 1390 1400 1410 1420 1210 1220 1230 1240 1250 pF1KB7 ----FLSSRLQDLYSIVRRADRAAVPIVNLKD--ELLFPSWEALFSGSEGPLKP-GARIF .:::. .. .:: : :.. .: : .: : ..: . : : :.. CCDS78 QPLPILSSKKTRRHTEGMQAD-ADDNILDYSDGMEEIFGSLNSLKQDIEHMKFPMGTQTN 1430 1440 1450 1460 1470 1480 1260 1270 1280 1290 1300 1310 pF1KB7 -SFDGKDV-LRHPTWPQKSVWHGSDPNGRRLTESYCETWRTEAPSATGQASSLLGGRLLG . ::. : :: .:. : ::: CCDS78 PARTCKDLQLSHPDFPDGEYW--IDPNQGCSGDSFKVYCNFTSGGETCIYPDKKSEGVRI 1490 1500 1510 1520 1530 1540 >>CCDS53348.1 COL11A1 gene_id:1301|Hs108|chr1 (1767 aa) initn: 1246 init1: 529 opt: 1875 Z-score: 739.6 bits: 149.6 E(32554): 7.1e-35 Smith-Waterman score: 2062; 36.9% identity (50.2% similar) in 1197 aa overlap (194-1279:445-1588) 170 180 190 200 210 220 pF1KB7 HLALSVAGGFVALYVDCEEFQRMPLARSSRGLELEPGAGLFVAQAGGADPDKFQGVIA-E :: ::. :.. :.: .: . : : CCDS53 GPPGLQGPTGPPGDPGDRGPPGRPGLPGADGLPGPPGTMLMLPFRYGGDGSKGPTISAQE 420 430 440 450 460 470 230 240 250 260 270 pF1KB7 LKVRRDPQVSPMHCLDEEGDDSDGASGDSGSGLGDARELLREETG--AALKPR-LPAPPP ... : . . : : .: : : . . :.: . :: . .:: CCDS53 AQAQAILQQARIALRGPPGPM--GLTGRPGPVGGPGSSGAKGESGDPGPQGPRGVQGPPG 480 490 500 510 520 530 280 290 300 310 320 330 pF1KB7 VTTPPLAGGSSTEDSRSEEVEEQTTVASLGAQTLPG--SDSVSTWDGSVRTPGGRVKEGG : : : :. : . .. : . ::: .:. . . . : : . : CCDS53 PTGKPGKRGRPGADGGRGMPGEPGAKGDRGFDGLPGLPGDKGHRGERGPQGPPGPPGDDG 540 550 560 570 580 590 340 350 360 370 380 390 pF1KB7 LKGQKGE------PGVPGPPGRAGPPGSPCLPGPPGLPCPVSPLGPAGPALQTVPGPQGP ..:. :: :: :: : :: :.: :: ::. .: :: : :::: CCDS53 MRGEDGEIGPRGLPGEAGPRGLLGPRGTPGAPGQPGMAGVDGPPGPKGNM-----GPQGE 600 610 620 630 640 400 410 420 430 440 pF1KB7 PGPPGRDGTPGRDGEPGDPGEDGKPGDTGPQGFPGT---PGDVGPKGDKGDPG-VGERG- :::::..:.:: .: :: : : ::. :::: :: :: :: : : : ::.: CCDS53 PGPPGQQGNPGPQGLPGPQGPIGPPGEKGPQGKPGLAGLPGADGPPGHPGKEGQSGEKGA 650 660 670 680 690 700 450 460 470 480 pF1KB7 --PPGPQGPPGPPGPSFRHDKLTFIDMEGS----------GFGGD--LEALRG------P ::::::: : ::: . ..:: :: :: :.. :: : CCDS53 LGPPGPQGPIGYPGPRGVKGADGVRGLKGSKGEKGEDGFPGFKGDMGLKGDRGEVGQIGP 710 720 730 740 750 760 490 500 510 520 530 pF1KB7 RGFPGPPGPPGVPGLPGEPGRFG-VNSSDVPGPAGLPGVPGREGPPG---FPGLPGP--- :: :: :: : : :.:: : .. . : :::: :::.:: : :::.:: CCDS53 RGEDGPEGPKGRAGPTGDPGPSGQAGEKGKLGVPGLPGYPGRQGPKGSTGFPGFPGANGE 770 780 790 800 810 820 540 550 560 570 580 590 pF1KB7 ---------PGPPGREGPPGRTGQKGSLGEAGAPGHKGSKGAPGPAGARGESGLAGAPGP ::: :..:: : :..:. : .: :: ::..:. :: : :: : : :: CCDS53 KGARGVAGKPGPRGQRGPTGPRGSRGARGPTGKPGPKGTSGGDGPPGPPGERGPQGPQGP 830 840 850 860 870 880 600 610 620 630 640 pF1KB7 AGPPGPPGPPGPPG----PGLPA--GFDDMEGSGGPFWSTARSADGPQGPPGL--P-GLK .: ::: ::::::: :: :. : ..:. :: .. ::::: : : : . CCDS53 VGFPGPKGPPGPPGKDGLPGHPGQRGETGFQGKTGP--PGPGGVVGPQGPTGETGPIGER 890 900 910 920 930 940 650 660 670 680 690 700 pF1KB7 GDPGVPGLPGAKGEVGADGVPGFPGLPGREGIAGPQGPKGDRGSRGEKGDPGKDGV-GQP : :: :: :: .: :: : : : :: .::.: .:: : :: ::.: :: .:. : CCDS53 GHPGPPGPPGEQGLPGAAGKEGAKGDPGPQGISGKDGPAGLRGFPGERGLPGAQGAPGLK 950 960 970 980 990 1000 710 720 730 740 750 pF1KB7 GLPGPPGPPGPVVYVSEQDGSVLSVP-GPEGRPGFAGFPGPAGPKGNLGSKGERGS---- : :: :::::: .:. .. . : : :::: : ::::: :: : :: .: CCDS53 GGEGPQGPPGPVGSPGERGSAGTAGPIGLPGRPGPQGPPGPAGEKGAPGEKGPQGPAGRD 1010 1020 1030 1040 1050 1060 760 770 780 790 800 pF1KB7 --------PGPKGEKGEPGSIFSPDGGALG-PAQKGAKGEPGFRGPPGPYGRPGYKGEIG ::: : : :: . : : .: :.:::.::. : ::::: : : : : CCDS53 GVQGPVGLPGPAGPAGSPGE--DGDKGEIGEPGQKGSKGDKGENGPPGPPGLQGPVGAPG 1070 1080 1090 1100 1110 1120 810 820 830 840 850 860 pF1KB7 FPGRPGRPGMNGLKGEKGEPGDASLGFGMRGMPGPPGP---PGPPGPPGTPVYDSNVFAE . : :.:: : .: :. :: : ::.:::::: : ::::: ...: CCDS53 IAGGDGEPGPRGQQGMFGQKGDE----GARGFPGPPGPIGLQGLPGPPGEKGENGDV--- 1130 1140 1150 1160 1170 870 880 890 900 910 920 pF1KB7 SSRPGPPGLPGNQGPPGPKGAKGEVGPPGPPGQFPFDFLQLEAEMKGEKGDRGDAGQKGE . :::: :: .:: ::.:: : :::: :. . : :. : :.:: : CCDS53 -GPMGPPGPPGPRGPQGPNGADGPQGPPGSVGSVGGVGEKGEPGEAGNPGPPGEAGVGGP 1180 1190 1200 1210 1220 1230 930 940 950 960 970 980 pF1KB7 RGEPGGGGFFGSSLPGPPGPPGPRGYPGIPGPKGE-SIRGQPGPPGPQGPPG-IGYEGRQ .:: : : : :: :::: .: :: ::::. . : :: ::: : :: : .: CCDS53 KGERGEKGEAGP--PGAAGPPGAKGPPGDDGPKGNPGPVGFPGDPGPPGEPGPAGQDGVG 1240 1250 1260 1270 1280 1290 990 1000 1010 1020 1030 1040 pF1KB7 GPPGPPGPPGPPSFPGPHRQTISVPGPPGPPGPPGPPGTMGASSGVRLWATRQAMLGQVH : : : :: :. ::: .. ::::::: ::::. :: . ::. : CCDS53 GDKGEDGDPGQPGPPGPSGEA----GPPGPPGKRGPPGAAGAEG-------RQGEKGAKG 1300 1310 1320 1330 1340 1050 1060 1070 1080 1090 pF1KB7 EV----PEGWLIFVAEQEELYVRVQNGFRKVQL---EARTPLPRGTDNEVAALQPPVVQL :. : : :. : .:.: . : : : :. . . :: . CCDS53 EAGAEGPPGKTGPVGPQGPAGKPGPEGLRGIPGPVGEQGLPGAAGQDGPPGPMGPPGLPG 1350 1360 1370 1380 1390 1400 1100 1110 1120 1130 1140 pF1KB7 HDSNPYPRREHPHP-----TARPWRADDILASPPRLPEPQPYPGAPHHSSYVHLRPARPT ..: . :. :: . : . . . :: : ::: .. :: : CCDS53 LKGDPGSKGEKGHPGLIGLIGPPGEQGE--KGDRGLPGTQGSPGAKGDGGIPG--PAGPL 1410 1420 1430 1440 1450 1150 1160 1170 1180 1190 1200 pF1KB7 SPPAHSHRDFQPVLHLVALNSPLSGGMRGIRGADFQCFQQARAVGLAGTFRA-------- .::. : : : : .: .:. :.. . :: : . CCDS53 GPPG------PPGL-------PGPQGPKGNKGSTGPAGQKGDS-GLPGPPGSPGPPGEVI 1460 1470 1480 1490 1500 1210 1220 1230 1240 1250 pF1KB7 ----FLSSRLQDLYSIVRRADRAAVPIVNLKD--ELLFPSWEALFSGSEGPLKP-GARIF .:::. .. .:: : :.. .: : .: : ..: . : : :.. CCDS53 QPLPILSSKKTRRHTEGMQAD-ADDNILDYSDGMEEIFGSLNSLKQDIEHMKFPMGTQTN 1510 1520 1530 1540 1550 1560 1260 1270 1280 1290 1300 1310 pF1KB7 -SFDGKDV-LRHPTWPQKSVWHGSDPNGRRLTESYCETWRTEAPSATGQASSLLGGRLLG . ::. : :: .:. : ::: CCDS53 PARTCKDLQLSHPDFPDGEYW--IDPNQGCSGDSFKVYCNFTSGGETCIYPDKKSEGVRI 1570 1580 1590 1600 1610 1620 >>CCDS778.1 COL11A1 gene_id:1301|Hs108|chr1 (1806 aa) initn: 1246 init1: 529 opt: 1875 Z-score: 739.5 bits: 149.6 E(32554): 7.2e-35 Smith-Waterman score: 2062; 36.9% identity (50.2% similar) in 1197 aa overlap (194-1279:484-1627) 170 180 190 200 210 220 pF1KB7 HLALSVAGGFVALYVDCEEFQRMPLARSSRGLELEPGAGLFVAQAGGADPDKFQGVIA-E :: ::. :.. :.: .: . : : CCDS77 GPPGLQGPTGPPGDPGDRGPPGRPGLPGADGLPGPPGTMLMLPFRYGGDGSKGPTISAQE 460 470 480 490 500 510 230 240 250 260 270 pF1KB7 LKVRRDPQVSPMHCLDEEGDDSDGASGDSGSGLGDARELLREETG--AALKPR-LPAPPP ... : . . : : .: : : . . :.: . :: . .:: CCDS77 AQAQAILQQARIALRGPPGPM--GLTGRPGPVGGPGSSGAKGESGDPGPQGPRGVQGPPG 520 530 540 550 560 570 280 290 300 310 320 330 pF1KB7 VTTPPLAGGSSTEDSRSEEVEEQTTVASLGAQTLPG--SDSVSTWDGSVRTPGGRVKEGG : : : :. : . .. : . ::: .:. . . . : : . : CCDS77 PTGKPGKRGRPGADGGRGMPGEPGAKGDRGFDGLPGLPGDKGHRGERGPQGPPGPPGDDG 580 590 600 610 620 630 340 350 360 370 380 390 pF1KB7 LKGQKGE------PGVPGPPGRAGPPGSPCLPGPPGLPCPVSPLGPAGPALQTVPGPQGP ..:. :: :: :: : :: :.: :: ::. .: :: : :::: CCDS77 MRGEDGEIGPRGLPGEAGPRGLLGPRGTPGAPGQPGMAGVDGPPGPKGNM-----GPQGE 640 650 660 670 680 400 410 420 430 440 pF1KB7 PGPPGRDGTPGRDGEPGDPGEDGKPGDTGPQGFPGT---PGDVGPKGDKGDPG-VGERG- :::::..:.:: .: :: : : ::. :::: :: :: :: : : : ::.: CCDS77 PGPPGQQGNPGPQGLPGPQGPIGPPGEKGPQGKPGLAGLPGADGPPGHPGKEGQSGEKGA 690 700 710 720 730 740 450 460 470 480 pF1KB7 --PPGPQGPPGPPGPSFRHDKLTFIDMEGS----------GFGGD--LEALRG------P ::::::: : ::: . ..:: :: :: :.. :: : CCDS77 LGPPGPQGPIGYPGPRGVKGADGVRGLKGSKGEKGEDGFPGFKGDMGLKGDRGEVGQIGP 750 760 770 780 790 800 490 500 510 520 530 pF1KB7 RGFPGPPGPPGVPGLPGEPGRFG-VNSSDVPGPAGLPGVPGREGPPG---FPGLPGP--- :: :: :: : : :.:: : .. . : :::: :::.:: : :::.:: CCDS77 RGEDGPEGPKGRAGPTGDPGPSGQAGEKGKLGVPGLPGYPGRQGPKGSTGFPGFPGANGE 810 820 830 840 850 860 540 550 560 570 580 590 pF1KB7 ---------PGPPGREGPPGRTGQKGSLGEAGAPGHKGSKGAPGPAGARGESGLAGAPGP ::: :..:: : :..:. : .: :: ::..:. :: : :: : : :: CCDS77 KGARGVAGKPGPRGQRGPTGPRGSRGARGPTGKPGPKGTSGGDGPPGPPGERGPQGPQGP 870 880 890 900 910 920 600 610 620 630 640 pF1KB7 AGPPGPPGPPGPPG----PGLPA--GFDDMEGSGGPFWSTARSADGPQGPPGL--P-GLK .: ::: ::::::: :: :. : ..:. :: .. ::::: : : : . CCDS77 VGFPGPKGPPGPPGKDGLPGHPGQRGETGFQGKTGP--PGPGGVVGPQGPTGETGPIGER 930 940 950 960 970 980 650 660 670 680 690 700 pF1KB7 GDPGVPGLPGAKGEVGADGVPGFPGLPGREGIAGPQGPKGDRGSRGEKGDPGKDGV-GQP : :: :: :: .: :: : : : :: .::.: .:: : :: ::.: :: .:. : CCDS77 GHPGPPGPPGEQGLPGAAGKEGAKGDPGPQGISGKDGPAGLRGFPGERGLPGAQGAPGLK 990 1000 1010 1020 1030 1040 710 720 730 740 750 pF1KB7 GLPGPPGPPGPVVYVSEQDGSVLSVP-GPEGRPGFAGFPGPAGPKGNLGSKGERGS---- : :: :::::: .:. .. . : : :::: : ::::: :: : :: .: CCDS77 GGEGPQGPPGPVGSPGERGSAGTAGPIGLPGRPGPQGPPGPAGEKGAPGEKGPQGPAGRD 1050 1060 1070 1080 1090 1100 760 770 780 790 800 pF1KB7 --------PGPKGEKGEPGSIFSPDGGALG-PAQKGAKGEPGFRGPPGPYGRPGYKGEIG ::: : : :: . : : .: :.:::.::. : ::::: : : : : CCDS77 GVQGPVGLPGPAGPAGSPGE--DGDKGEIGEPGQKGSKGDKGENGPPGPPGLQGPVGAPG 1110 1120 1130 1140 1150 1160 810 820 830 840 850 860 pF1KB7 FPGRPGRPGMNGLKGEKGEPGDASLGFGMRGMPGPPGP---PGPPGPPGTPVYDSNVFAE . : :.:: : .: :. :: : ::.:::::: : ::::: ...: CCDS77 IAGGDGEPGPRGQQGMFGQKGDE----GARGFPGPPGPIGLQGLPGPPGEKGENGDV--- 1170 1180 1190 1200 1210 870 880 890 900 910 920 pF1KB7 SSRPGPPGLPGNQGPPGPKGAKGEVGPPGPPGQFPFDFLQLEAEMKGEKGDRGDAGQKGE . :::: :: .:: ::.:: : :::: :. . : :. : :.:: : CCDS77 -GPMGPPGPPGPRGPQGPNGADGPQGPPGSVGSVGGVGEKGEPGEAGNPGPPGEAGVGGP 1220 1230 1240 1250 1260 1270 930 940 950 960 970 980 pF1KB7 RGEPGGGGFFGSSLPGPPGPPGPRGYPGIPGPKGE-SIRGQPGPPGPQGPPG-IGYEGRQ .:: : : : :: :::: .: :: ::::. . : :: ::: : :: : .: CCDS77 KGERGEKGEAGP--PGAAGPPGAKGPPGDDGPKGNPGPVGFPGDPGPPGEPGPAGQDGVG 1280 1290 1300 1310 1320 1330 990 1000 1010 1020 1030 1040 pF1KB7 GPPGPPGPPGPPSFPGPHRQTISVPGPPGPPGPPGPPGTMGASSGVRLWATRQAMLGQVH : : : :: :. ::: .. ::::::: ::::. :: . ::. : CCDS77 GDKGEDGDPGQPGPPGPSGEA----GPPGPPGKRGPPGAAGAEG-------RQGEKGAKG 1340 1350 1360 1370 1380 1050 1060 1070 1080 1090 pF1KB7 EV----PEGWLIFVAEQEELYVRVQNGFRKVQL---EARTPLPRGTDNEVAALQPPVVQL :. : : :. : .:.: . : : : :. . . :: . CCDS77 EAGAEGPPGKTGPVGPQGPAGKPGPEGLRGIPGPVGEQGLPGAAGQDGPPGPMGPPGLPG 1390 1400 1410 1420 1430 1440 1100 1110 1120 1130 1140 pF1KB7 HDSNPYPRREHPHP-----TARPWRADDILASPPRLPEPQPYPGAPHHSSYVHLRPARPT ..: . :. :: . : . . . :: : ::: .. :: : CCDS77 LKGDPGSKGEKGHPGLIGLIGPPGEQGE--KGDRGLPGTQGSPGAKGDGGIPG--PAGPL 1450 1460 1470 1480 1490 1150 1160 1170 1180 1190 1200 pF1KB7 SPPAHSHRDFQPVLHLVALNSPLSGGMRGIRGADFQCFQQARAVGLAGTFRA-------- .::. : : : : .: .:. :.. . :: : . CCDS77 GPPG------PPGL-------PGPQGPKGNKGSTGPAGQKGDS-GLPGPPGSPGPPGEVI 1500 1510 1520 1530 1540 1210 1220 1230 1240 1250 pF1KB7 ----FLSSRLQDLYSIVRRADRAAVPIVNLKD--ELLFPSWEALFSGSEGPLKP-GARIF .:::. .. .:: : :.. .: : .: : ..: . : : :.. CCDS77 QPLPILSSKKTRRHTEGMQAD-ADDNILDYSDGMEEIFGSLNSLKQDIEHMKFPMGTQTN 1550 1560 1570 1580 1590 1600 1260 1270 1280 1290 1300 1310 pF1KB7 -SFDGKDV-LRHPTWPQKSVWHGSDPNGRRLTESYCETWRTEAPSATGQASSLLGGRLLG . ::. : :: .:. : ::: CCDS77 PARTCKDLQLSHPDFPDGEYW--IDPNQGCSGDSFKVYCNFTSGGETCIYPDKKSEGVRI 1610 1620 1630 1640 1650 1660 1336 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 20:46:34 2016 done: Sun Nov 6 20:46:35 2016 Total Scan time: 7.060 Total Display time: 0.690 Function used was FASTA [36.3.4 Apr, 2011]