FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB8523, 937 aa 1>>>pF1KB8523 937 - 937 aa - 937 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.4933+/-0.0017; mu= 11.0340+/- 0.100 mean_var=198.7361+/-38.845, 0's: 0 Z-trim(103.4): 224 B-trim: 0 in 0/50 Lambda= 0.090978 statistics sampled from 7183 (7416) to 7183 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.558), E-opt: 0.2 (0.228), width: 16 Scan time: 3.610 The best scores are: opt bits E(32554) CCDS55265.1 MATN2 gene_id:4147|Hs108|chr8 ( 937) 6497 867.3 0 CCDS55264.1 MATN2 gene_id:4147|Hs108|chr8 ( 956) 5990 800.7 0 CCDS83309.1 MATN2 gene_id:4147|Hs108|chr8 ( 915) 3818 515.6 1.7e-145 CCDS46226.1 MATN3 gene_id:4148|Hs108|chr2 ( 486) 1239 176.9 8.6e-44 CCDS13348.1 MATN4 gene_id:8785|Hs108|chr20 ( 581) 1194 171.0 5.8e-42 CCDS14048.1 SCUBE1 gene_id:80274|Hs108|chr22 ( 988) 1046 151.8 5.9e-36 CCDS46607.1 MATN4 gene_id:8785|Hs108|chr20 ( 540) 1012 147.1 8.6e-35 CCDS4800.1 SCUBE3 gene_id:222663|Hs108|chr6 ( 993) 984 143.7 1.7e-33 CCDS41237.1 MEGF6 gene_id:1953|Hs108|chr1 (1541) 898 132.6 5.6e-30 CCDS336.1 MATN1 gene_id:4146|Hs108|chr1 ( 496) 805 119.9 1.2e-26 CCDS54345.1 LTBP1 gene_id:4052|Hs108|chr2 (1342) 799 119.6 4.2e-26 CCDS33178.2 LTBP1 gene_id:4052|Hs108|chr2 (1395) 799 119.6 4.3e-26 CCDS33177.2 LTBP1 gene_id:4052|Hs108|chr2 (1721) 799 119.7 4.9e-26 CCDS12196.1 FBN3 gene_id:84467|Hs108|chr19 (2809) 747 113.1 7.8e-24 CCDS74370.1 LTBP4 gene_id:8425|Hs108|chr19 (1557) 742 112.2 8.2e-24 CCDS74368.1 LTBP4 gene_id:8425|Hs108|chr19 (1587) 742 112.2 8.3e-24 CCDS74369.1 LTBP4 gene_id:8425|Hs108|chr19 (1624) 742 112.2 8.5e-24 CCDS54344.1 LTBP1 gene_id:4052|Hs108|chr2 (1353) 714 108.4 9.5e-23 CCDS32232.1 FBN1 gene_id:2200|Hs108|chr15 (2871) 694 106.1 9.8e-22 CCDS34222.1 FBN2 gene_id:2201|Hs108|chr5 (2912) 662 101.9 1.8e-20 CCDS43028.1 FBLN1 gene_id:2192|Hs108|chr22 ( 566) 636 97.8 6.4e-20 CCDS14069.1 FBLN1 gene_id:2192|Hs108|chr22 ( 683) 637 98.0 6.6e-20 CCDS14068.1 FBLN1 gene_id:2192|Hs108|chr22 ( 601) 636 97.8 6.6e-20 CCDS14067.1 FBLN1 gene_id:2192|Hs108|chr22 ( 703) 636 97.9 7.4e-20 CCDS8103.1 LTBP3 gene_id:4054|Hs108|chr11 (1256) 630 97.4 1.9e-19 CCDS44647.1 LTBP3 gene_id:4054|Hs108|chr11 (1303) 628 97.1 2.3e-19 CCDS53599.1 SCUBE2 gene_id:57758|Hs108|chr11 ( 807) 579 90.5 1.4e-17 CCDS7797.2 SCUBE2 gene_id:57758|Hs108|chr11 ( 971) 579 90.5 1.6e-17 CCDS81553.1 SCUBE2 gene_id:57758|Hs108|chr11 ( 999) 579 90.6 1.7e-17 CCDS8116.1 EFEMP2 gene_id:30008|Hs108|chr11 ( 443) 520 82.4 2.1e-15 CCDS6376.1 COL22A1 gene_id:169044|Hs108|chr8 (1626) 521 83.2 4.6e-15 CCDS9831.1 LTBP2 gene_id:4053|Hs108|chr14 (1821) 517 82.7 7.1e-15 CCDS9898.1 FBLN5 gene_id:10516|Hs108|chr14 ( 448) 505 80.5 8.1e-15 CCDS34420.1 NOTCH4 gene_id:4855|Hs108|chr6 (2003) 497 80.1 4.7e-14 CCDS83099.1 COL21A1 gene_id:81578|Hs108|chr6 ( 954) 484 78.1 9.2e-14 CCDS55025.1 COL21A1 gene_id:81578|Hs108|chr6 ( 957) 484 78.1 9.2e-14 CCDS54784.1 NPNT gene_id:255743|Hs108|chr4 ( 582) 479 77.2 1e-13 CCDS13149.1 CD93 gene_id:22918|Hs108|chr20 ( 652) 471 76.2 2.3e-13 CCDS54787.1 NPNT gene_id:255743|Hs108|chr4 ( 536) 469 75.8 2.4e-13 CCDS34046.1 NPNT gene_id:255743|Hs108|chr4 ( 565) 469 75.9 2.5e-13 CCDS14155.1 EGFL6 gene_id:25975|Hs108|chrX ( 553) 465 75.3 3.6e-13 CCDS55370.1 EGFL6 gene_id:25975|Hs108|chrX ( 554) 465 75.3 3.6e-13 CCDS73268.1 NELL1 gene_id:4745|Hs108|chr11 ( 753) 461 74.9 6.3e-13 CCDS7855.1 NELL1 gene_id:4745|Hs108|chr11 ( 810) 461 75.0 6.7e-13 CCDS73267.1 NELL1 gene_id:4745|Hs108|chr11 ( 838) 461 75.0 6.8e-13 CCDS13148.1 THBD gene_id:7056|Hs108|chr20 ( 575) 452 73.6 1.2e-12 CCDS908.1 NOTCH2 gene_id:4853|Hs108|chr1 (2471) 461 75.5 1.4e-12 CCDS45072.1 GAS6 gene_id:2621|Hs108|chr13 ( 678) 447 73.1 2.1e-12 CCDS34938.1 COL14A1 gene_id:7373|Hs108|chr8 (1796) 444 73.1 5.4e-12 CCDS43481.1 COL12A1 gene_id:1303|Hs108|chr6 (1899) 439 72.5 8.8e-12 >>CCDS55265.1 MATN2 gene_id:4147|Hs108|chr8 (937 aa) initn: 6497 init1: 6497 opt: 6497 Z-score: 4626.0 bits: 867.3 E(32554): 0 Smith-Waterman score: 6497; 99.9% identity (100.0% similar) in 937 aa overlap (1-937:1-937) 10 20 30 40 50 60 pF1KB8 MEKMLAGCFLLILGQIVLLPAEARERSRGRSISRGRHARTHPQTALLESSCENKRADLVF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 MEKMLAGCFLLILGQIVLLPAEARERSRGRSISRGRHARTHPQTALLESSCENKRADLVF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 IIDSSRSVNTHDYAKVKEFIVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 IIDSSRSVNTHDYAKVKEFIVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 ERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLRENVPRVIMIVTDGRPQDSVAEV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 ERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLRENVPRVIMIVTDGRPQDSVAEV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 AAKARDTGILIFAIGVGQVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQKKLCTAH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 AAKARDTGILIFAIGVGQVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQKKLCTAH 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB8 MCSTLEHNCAHFCINIPGSYVCRCKQGYILNSDQTTCRIQDLCAMEDHNCEQLCVNVPGS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 MCSTLEHNCAHFCINIPGSYVCRCKQGYILNSDQTTCRIQDLCAMEDHNCEQLCVNVPGS 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB8 FVCQCYSGYALAEDGKRCVAVDYCASENHGCEHECVNADGSYLCQCHEGFALNPDEKTCT :::::::::::::::::::::::::::::::::::::::::::::::::::::::.:::: CCDS55 FVCQCYSGYALAEDGKRCVAVDYCASENHGCEHECVNADGSYLCQCHEGFALNPDKKTCT 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB8 KIDYCASSNHGCQHECVNTDDSYSCHCLKGFTLNPDKKTCRRINYCALNKPGCEHECVNM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 KIDYCASSNHGCQHECVNTDDSYSCHCLKGFTLNPDKKTCRRINYCALNKPGCEHECVNM 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB8 EESYYCRCHRGYTLDPNGKTCSRVDHCAQQDHGCEQLCLNTEDSFVCQCSEGFLINEDLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 EESYYCRCHRGYTLDPNGKTCSRVDHCAQQDHGCEQLCLNTEDSFVCQCSEGFLINEDLK 430 440 450 460 470 480 490 500 510 520 530 540 pF1KB8 TCSRVDYCLLSDHGCEYSCVNMDRSFACQCPEGHVLRSDGKTCAKLDSCALGDHGCEHSC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 TCSRVDYCLLSDHGCEYSCVNMDRSFACQCPEGHVLRSDGKTCAKLDSCALGDHGCEHSC 490 500 510 520 530 540 550 560 570 580 590 600 pF1KB8 VSSEDSFVCQCFEGYILREDGKTCRRKDVCQAIDHGCEHICVNSDDSYTCECLEGFRLAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 VSSEDSFVCQCFEGYILREDGKTCRRKDVCQAIDHGCEHICVNSDDSYTCECLEGFRLAE 550 560 570 580 590 600 610 620 630 640 650 660 pF1KB8 DGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGPIDLVFVI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 DGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGPIDLVFVI 610 620 630 640 650 660 670 680 690 700 710 720 pF1KB8 DGSKSLGEENFEVVKQFVTGIIDSLTISPKAARVGLLQYSTQVHTEFTLRNFNSAKDMKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 DGSKSLGEENFEVVKQFVTGIIDSLTISPKAARVGLLQYSTQVHTEFTLRNFNSAKDMKK 670 680 690 700 710 720 730 740 750 760 770 780 pF1KB8 AVAHMKYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVPRAAIVFTDGRAQDDVSEWAS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 AVAHMKYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVPRAAIVFTDGRAQDDVSEWAS 730 740 750 760 770 780 790 800 810 820 830 840 pF1KB8 KAKANGITMYAVGVGKAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKGICEALED :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 KAKANGITMYAVGVGKAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKGICEALED 790 800 810 820 830 840 850 860 870 880 890 900 pF1KB8 SDGRQDSPAGELPKTVQQPTVQHRYLFEEDNLLRSTQKLSHSTKPSGSPLEEKHDQCKCE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 SDGRQDSPAGELPKTVQQPTVQHRYLFEEDNLLRSTQKLSHSTKPSGSPLEEKHDQCKCE 850 860 870 880 890 900 910 920 930 pF1KB8 NLIMFQNLANEEVRKLTQRLEEMTQRMEALENRLRYR ::::::::::::::::::::::::::::::::::::: CCDS55 NLIMFQNLANEEVRKLTQRLEEMTQRMEALENRLRYR 910 920 930 >>CCDS55264.1 MATN2 gene_id:4147|Hs108|chr8 (956 aa) initn: 6483 init1: 5990 opt: 5990 Z-score: 4266.2 bits: 800.7 E(32554): 0 Smith-Waterman score: 6449; 97.9% identity (98.0% similar) in 956 aa overlap (1-937:1-956) 10 20 30 40 50 60 pF1KB8 MEKMLAGCFLLILGQIVLLPAEARERSRGRSISRGRHARTHPQTALLESSCENKRADLVF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 MEKMLAGCFLLILGQIVLLPAEARERSRGRSISRGRHARTHPQTALLESSCENKRADLVF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 IIDSSRSVNTHDYAKVKEFIVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 IIDSSRSVNTHDYAKVKEFIVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 ERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLRENVPRVIMIVTDGRPQDSVAEV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 ERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLRENVPRVIMIVTDGRPQDSVAEV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 AAKARDTGILIFAIGVGQVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQKKLCTAH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 AAKARDTGILIFAIGVGQVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQKKLCTAH 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB8 MCSTLEHNCAHFCINIPGSYVCRCKQGYILNSDQTTCRIQDLCAMEDHNCEQLCVNVPGS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 MCSTLEHNCAHFCINIPGSYVCRCKQGYILNSDQTTCRIQDLCAMEDHNCEQLCVNVPGS 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB8 FVCQCYSGYALAEDGKRCVAVDYCASENHGCEHECVNADGSYLCQCHEGFALNPDEKTCT :::::::::::::::::::::::::::::::::::::::::::::::::::::::.:::: CCDS55 FVCQCYSGYALAEDGKRCVAVDYCASENHGCEHECVNADGSYLCQCHEGFALNPDKKTCT 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB8 KIDYCASSNHGCQHECVNTDDSYSCHCLKGFTLNPDKKTCRRINYCALNKPGCEHECVNM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 KIDYCASSNHGCQHECVNTDDSYSCHCLKGFTLNPDKKTCRRINYCALNKPGCEHECVNM 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB8 EESYYCRCHRGYTLDPNGKTCSRVDHCAQQDHGCEQLCLNTEDSFVCQCSEGFLINEDLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 EESYYCRCHRGYTLDPNGKTCSRVDHCAQQDHGCEQLCLNTEDSFVCQCSEGFLINEDLK 430 440 450 460 470 480 490 500 510 520 530 540 pF1KB8 TCSRVDYCLLSDHGCEYSCVNMDRSFACQCPEGHVLRSDGKTCAKLDSCALGDHGCEHSC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 TCSRVDYCLLSDHGCEYSCVNMDRSFACQCPEGHVLRSDGKTCAKLDSCALGDHGCEHSC 490 500 510 520 530 540 550 560 570 580 590 600 pF1KB8 VSSEDSFVCQCFEGYILREDGKTCRRKDVCQAIDHGCEHICVNSDDSYTCECLEGFRLAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 VSSEDSFVCQCFEGYILREDGKTCRRKDVCQAIDHGCEHICVNSDDSYTCECLEGFRLAE 550 560 570 580 590 600 610 620 630 640 650 660 pF1KB8 DGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGPIDLVFVI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 DGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGPIDLVFVI 610 620 630 640 650 660 670 680 690 700 710 720 pF1KB8 DGSKSLGEENFEVVKQFVTGIIDSLTISPKAARVGLLQYSTQVHTEFTLRNFNSAKDMKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 DGSKSLGEENFEVVKQFVTGIIDSLTISPKAARVGLLQYSTQVHTEFTLRNFNSAKDMKK 670 680 690 700 710 720 730 740 750 760 770 780 pF1KB8 AVAHMKYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVPRAAIVFTDGRAQDDVSEWAS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 AVAHMKYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVPRAAIVFTDGRAQDDVSEWAS 730 740 750 760 770 780 790 800 810 820 830 840 pF1KB8 KAKANGITMYAVGVGKAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKGICEALED :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 KAKANGITMYAVGVGKAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKGICEALED 790 800 810 820 830 840 850 860 870 880 pF1KB8 SDGRQDSPAGELPKTVQQPT-------------------VQHRYLFEEDNLLRSTQKLSH :::::::::::::::::::: ::::::::::::::::::::: CCDS55 SDGRQDSPAGELPKTVQQPTESEPVTINIQDLLSCSNFAVQHRYLFEEDNLLRSTQKLSH 850 860 870 880 890 900 890 900 910 920 930 pF1KB8 STKPSGSPLEEKHDQCKCENLIMFQNLANEEVRKLTQRLEEMTQRMEALENRLRYR :::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 STKPSGSPLEEKHDQCKCENLIMFQNLANEEVRKLTQRLEEMTQRMEALENRLRYR 910 920 930 940 950 >>CCDS83309.1 MATN2 gene_id:4147|Hs108|chr8 (915 aa) initn: 4311 init1: 3818 opt: 3818 Z-score: 2725.8 bits: 515.6 E(32554): 1.7e-145 Smith-Waterman score: 6042; 93.6% identity (93.7% similar) in 956 aa overlap (1-937:1-915) 10 20 30 40 50 60 pF1KB8 MEKMLAGCFLLILGQIVLLPAEARERSRGRSISRGRHARTHPQTALLESSCENKRADLVF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 MEKMLAGCFLLILGQIVLLPAEARERSRGRSISRGRHARTHPQTALLESSCENKRADLVF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 IIDSSRSVNTHDYAKVKEFIVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 IIDSSRSVNTHDYAKVKEFIVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 ERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLRENVPRVIMIVTDGRPQDSVAEV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 ERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLRENVPRVIMIVTDGRPQDSVAEV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 AAKARDTGILIFAIGVGQVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQKKLCTAH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 AAKARDTGILIFAIGVGQVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQKKLCTAH 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB8 MCSTLEHNCAHFCINIPGSYVCRCKQGYILNSDQTTCRIQDLCAMEDHNCEQLCVNVPGS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 MCSTLEHNCAHFCINIPGSYVCRCKQGYILNSDQTTCRIQDLCAMEDHNCEQLCVNVPGS 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB8 FVCQCYSGYALAEDGKRCVAVDYCASENHGCEHECVNADGSYLCQCHEGFALNPDEKTCT :::::::::::::::::::::::::::::::::::::::::::::::::::::::.:::: CCDS83 FVCQCYSGYALAEDGKRCVAVDYCASENHGCEHECVNADGSYLCQCHEGFALNPDKKTCT 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB8 KIDYCASSNHGCQHECVNTDDSYSCHCLKGFTLNPDKKTCRRINYCALNKPGCEHECVNM ::::::::::::::::::: CCDS83 -----------------------------------------RINYCALNKPGCEHECVNM 370 430 440 450 460 470 480 pF1KB8 EESYYCRCHRGYTLDPNGKTCSRVDHCAQQDHGCEQLCLNTEDSFVCQCSEGFLINEDLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 EESYYCRCHRGYTLDPNGKTCSRVDHCAQQDHGCEQLCLNTEDSFVCQCSEGFLINEDLK 380 390 400 410 420 430 490 500 510 520 530 540 pF1KB8 TCSRVDYCLLSDHGCEYSCVNMDRSFACQCPEGHVLRSDGKTCAKLDSCALGDHGCEHSC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 TCSRVDYCLLSDHGCEYSCVNMDRSFACQCPEGHVLRSDGKTCAKLDSCALGDHGCEHSC 440 450 460 470 480 490 550 560 570 580 590 600 pF1KB8 VSSEDSFVCQCFEGYILREDGKTCRRKDVCQAIDHGCEHICVNSDDSYTCECLEGFRLAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 VSSEDSFVCQCFEGYILREDGKTCRRKDVCQAIDHGCEHICVNSDDSYTCECLEGFRLAE 500 510 520 530 540 550 610 620 630 640 650 660 pF1KB8 DGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGPIDLVFVI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 DGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGPIDLVFVI 560 570 580 590 600 610 670 680 690 700 710 720 pF1KB8 DGSKSLGEENFEVVKQFVTGIIDSLTISPKAARVGLLQYSTQVHTEFTLRNFNSAKDMKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 DGSKSLGEENFEVVKQFVTGIIDSLTISPKAARVGLLQYSTQVHTEFTLRNFNSAKDMKK 620 630 640 650 660 670 730 740 750 760 770 780 pF1KB8 AVAHMKYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVPRAAIVFTDGRAQDDVSEWAS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 AVAHMKYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVPRAAIVFTDGRAQDDVSEWAS 680 690 700 710 720 730 790 800 810 820 830 840 pF1KB8 KAKANGITMYAVGVGKAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKGICEALED :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 KAKANGITMYAVGVGKAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKGICEALED 740 750 760 770 780 790 850 860 870 880 pF1KB8 SDGRQDSPAGELPKTVQQPT-------------------VQHRYLFEEDNLLRSTQKLSH :::::::::::::::::::: ::::::::::::::::::::: CCDS83 SDGRQDSPAGELPKTVQQPTESEPVTINIQDLLSCSNFAVQHRYLFEEDNLLRSTQKLSH 800 810 820 830 840 850 890 900 910 920 930 pF1KB8 STKPSGSPLEEKHDQCKCENLIMFQNLANEEVRKLTQRLEEMTQRMEALENRLRYR :::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 STKPSGSPLEEKHDQCKCENLIMFQNLANEEVRKLTQRLEEMTQRMEALENRLRYR 860 870 880 890 900 910 >>CCDS46226.1 MATN3 gene_id:4148|Hs108|chr2 (486 aa) initn: 984 init1: 649 opt: 1239 Z-score: 899.7 bits: 176.9 E(32554): 8.6e-44 Smith-Waterman score: 1249; 45.5% identity (67.3% similar) in 444 aa overlap (3-414:8-449) 10 20 30 pF1KB8 MEKMLAGCFLLILGQIVLLPAEA----------RERSRGRSISRGRH-------- . : : .::.: ..:::. : : ..:: . : ::. CCDS46 MPRPAPARRLPG-LLLLLWPLLLLPSAAPDPVARPGFRRLETRGPGGSPGRRPSPAAPDG 10 20 30 40 50 40 50 60 70 80 90 pF1KB8 ----ARTHPQTALLESSCENKRADLVFIIDSSRSVNTHDYAKVKEFIVDILQFLDIGPDV . ..: : . :... :::::::::::: ...::: :. :.. ::::: CCDS46 APASGTSEPGRARGAGVCKSRPLDLVFIIDSSRSVRPLEFTKVKTFVSRIIDTLDIGPAD 60 70 80 90 100 110 100 110 120 130 140 150 pF1KB8 TRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEG :::....:.:::: ::.:... :. ...:: :. ::::::.::::: :.. ::. : CCDS46 TRVAVVNYASTVKIEFQLQAYTDKQSLKQAVGRITPLSTGTMSGLAIQTAMDEAFTVEAG 120 130 140 150 160 170 160 170 180 190 200 210 pF1KB8 ARPLRENVPRVIMIVTDGRPQDSVAEVAAKARDTGILIFAIGVGQVDFNTLKSIGSEPHE :: :.:.: .:::::::::.: ::::.:. .:: ..:.:: ..:. .:: ..::: : CCDS46 AREPSSNIPKVAIIVTDGRPQDQVNEVAARAQASGIELYAVGVDRADMASLKMMASEPLE 180 190 200 210 220 230 220 230 240 250 260 270 pF1KB8 DHVFLVANFSQIETLTSVFQKKLCTAHMCSTLEHNCAHFCI-NIPGSYVCRCKQGYILNS .::: : ... :: :.: ::. .:. : :.: : :: . :.. :.:.::: ::. CCDS46 EHVFYVETYGVIEKLSSRFQETFCALDPCVLGTHQCQHVCISDGEGKHHCECSQGYTLNA 240 250 260 270 280 290 280 290 300 310 320 330 pF1KB8 DQTTCRIQDLCAMEDHNCEQLCVN-VPGSFVCQCYSGYALAEDGKRCVAVDYCASENHGC :. :: : ::.. :.::..::: ::. :.:: ::.: :: : : : : :: .::: CCDS46 DKKTCSALDRCALNTHGCEHICVNDRSGSYHCECYEGYTLNEDRKTCSAQDKCALGTHGC 300 310 320 330 340 350 340 350 360 370 380 pF1KB8 EHECVNA-DGSYLCQCHEGFALNPDEKTCTKIDYCASSNHGCQHECVNTDD-SYSCHCLK .: ::: ::. :.:.::..:: :.:::. : :: ..::::: ::. :: : : CCDS46 QHICVNDRTGSHHCECYEGYTLNADKKTCSVRDKCALGSHGCQHICVSDGAASYHCDCYP 360 370 380 390 400 410 390 400 410 420 430 440 pF1KB8 GFTLNPDKKTC------RRINYCALNKPGCEHECVNMEESYYCRCHRGYTLDPNGKTCSR :.::: ::::: ::. . . ::: CCDS46 GYTLNEDKKTCSATEEARRL-VSTEDACGCEATLAFQDKVSSYLQRLNTKLDDILEKLKI 420 430 440 450 460 470 450 460 470 480 490 500 pF1KB8 VDHCAQQDHGCEQLCLNTEDSFVCQCSEGFLINEDLKTCSRVDYCLLSDHGCEYSCVNMD CCDS46 NEYGQIHR 480 >>CCDS13348.1 MATN4 gene_id:8785|Hs108|chr20 (581 aa) initn: 1182 init1: 1182 opt: 1194 Z-score: 866.8 bits: 171.0 E(32554): 5.8e-42 Smith-Waterman score: 1350; 33.1% identity (50.3% similar) in 892 aa overlap (43-934:20-577) 20 30 40 50 60 70 pF1KB8 LGQIVLLPAEARERSRGRSISRGRHARTHPQTALLESSCENKRADLVFIIDSSRSVNTHD : : :.. ::::.::::::: . CCDS13 MRGLLCWPVLLLLLQPWETQLQLTGPRCHTGPLDLVFVIDSSRSVRPFE 10 20 30 40 80 90 100 110 120 130 pF1KB8 YAKVKEFIVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLST . ...:.. .:. :..::..::::..::.: :.. : :..:.:. ..:::.. . :. CCDS13 FETMRQFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQ 50 60 70 80 90 100 140 150 160 170 180 190 pF1KB8 GTMTGLAIQYALNIAFSEAEGARPLRENVPRVIMIVTDGRPQDSVAEVAAKARDTGILIF :::::::::::.:.::: :::::: .: :::: .::::::::: ::::::.:: :: :. CCDS13 GTMTGLAIQYAMNVAFSVAEGARPPEERVPRVAVIVTDGRPQDRVAEVAAQARARGIEIY 110 120 130 140 150 160 200 210 220 230 240 250 pF1KB8 AIGVGQVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQKKLCTAHMCSTLEHNCAHF :.:: ..: ..:....: : ..::::: .:. :. . ::..:: CCDS13 AVGVQRADVGSLRAMASPPLDEHVFLVESFDLIQEFGLQFQSRLC--------------- 170 180 190 200 210 260 270 280 290 300 310 pF1KB8 CINIPGSYVCRCKQGYILNSDQTTCRIQDLCAMEDHNCEQLCVNVPGSFVCQCYSGYALA CCDS13 ------------------------------------------------------------ 320 330 340 350 360 370 pF1KB8 EDGKRCVAVDYCASENHGCEHECVNADGSYLCQCHEGFALNPDEKTCTKIDYCASSNHGC :.: :: .:::::.:::. :::.:.:. ::.:. :...: ::::. .::.: CCDS13 -------AIDLCAEGTHGCEHHCVNSPGSYFCHCQVGFVLQQDQRSCRAIDYCSFGNHSC 220 230 240 250 260 380 390 400 410 420 430 pF1KB8 QHECVNTDDSYSCHCLKGFTLNPDKKTCRRINYCALNKPGCEHECVNMEESYYCRCHRGY :::::.: :: ::: CCDS13 QHECVST-------------------------------PGGP------------RCH--- 270 280 440 450 460 470 480 490 pF1KB8 TLDPNGKTCSRVDHCAQQDHGCEQLCLNTEDSFVCQCSEGFLINEDLKTCSRVDYCLLSD CCDS13 ------------------------------------------------------------ 500 510 520 530 540 550 pF1KB8 HGCEYSCVNMDRSFACQCPEGHVLRSDGKTCAKLDSCALGDHGCEHSCVSSEDSFVCQCF : ::: : CCDS13 -----------------CREGH------------------------------D------- 560 570 580 590 600 610 pF1KB8 EGYILREDGKTCRRKDVCQAIDHGCEHICVNSDDSYTCECLEGFRLAEDGKRCRRKDVCK :. ::..:. .:.:...::::: :. CCDS13 ----LQPDGRSCQVRDLCNGVDHGCE-----------------FQ--------------- 290 300 310 620 630 640 650 660 670 pF1KB8 STHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGPIDLVFVIDGSKSLGEENFE ::..: :: : : :: : ::. :..: :: .:::...:::::. .::: CCDS13 ---------CVSEGLSYRCLCPEGRQLQADGKSCNRCREGHVDLVLLVDGSKSVRPQNFE 320 330 340 350 360 680 690 700 710 720 730 pF1KB8 VVKQFVTGIIDSLTISPKAARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKYMGKGS .::.::. :.: : .::...::::.:.:..:.::: : ...: ..:.:: ..:: .:. CCDS13 LVKRFVNQIVDFLDVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGT 370 380 390 400 410 420 740 750 760 770 780 790 pF1KB8 MTGLALKHMFERSFTQGEGARPLSTRVPRAAIVFTDGRAQDDVSEWASKAKANGITMYAV ::::::.:: :.::....:::: . :::...::::::.:::.: ::..:: .::.:::: CCDS13 MTGLALRHMVEHSFSEAQGARPRALNVPRVGLVFTDGRSQDDISVWAARAKEEGIVMYAV 430 440 450 460 470 480 800 810 820 830 840 850 pF1KB8 GVGKAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKGICEALEDSDGRQDSPAGEL :::::.: ::.::::::.. :. :: ::.:: .. :.:. .:: : : . CCDS13 GVGKAVEAELREIASEPAELHVSYAPDFGTMTHLLENLRGSICPEEGISAGTE------- 490 500 510 520 530 860 870 880 890 900 910 pF1KB8 PKTVQQPTVQHRYLFEEDNLLRSTQKLSHSTKPSGSPLEEKHDQCKCENLIMFQNLANEE ::: : :.::.:. ::. . CCDS13 --------------------LRS---------P-----------CECESLVEFQGRTLGA 540 550 920 930 pF1KB8 VRKLTQRLEEMTQRMEALENRLRYR ...:: : ..: :.: :::.: CCDS13 LESLTLNLAQLTARLEDLENQLANQK 560 570 580 >>CCDS14048.1 SCUBE1 gene_id:80274|Hs108|chr22 (988 aa) initn: 647 init1: 363 opt: 1046 Z-score: 759.0 bits: 151.8 E(32554): 5.9e-36 Smith-Waterman score: 1046; 36.2% identity (58.5% similar) in 417 aa overlap (242-642:37-443) 220 230 240 250 260 pF1KB8 HEDHVFLVANFSQIETLTSVFQKKLCTAHMCSTLEHNCAHF---CINIPGSYVCRCKQGY :: .: :. : : : :: : :: :: CCDS14 RWHLCVLLALGTRGRLAGGSGLPGSVDVDECSEGTDDC-HIDAICQNTPKSYKCLCKPGY 10 20 30 40 50 60 270 280 290 300 310 320 pF1KB8 ILNSDQTTCRIQDLCAMEDHN--CEQLCVNVPGSFVCQCYSGYALAEDGKRCVAVDYCAS .. : :. : : . .: : . :.:.::.. : :..:. ::.::. :. :: : . CCDS14 KGEGKQ--CEDIDECENDYYNGGCVHECINIPGNYRCTCFDGFMLAHDGHNCLDVDECQD 70 80 90 100 110 120 330 340 350 360 370 380 pF1KB8 ENHGCEHECVNADGSYLCQCHEGFALNPDEKTCTKIDY----CASSNHGCQHECVNTDDS .: ::.. :::: ::: :::: :: :. ...:: . . : ...::: : : .: . CCDS14 NNGGCQQICVNAMGSYECQCHSGFFLSDNQHTCIHRSNEGMNCMNKDHGCAHICRETPKG 130 140 150 160 170 180 390 400 410 420 430 440 pF1KB8 -YSCHCLKGFTLNPDKKTCRRINYCALNKPGCEHECVNMEESYYCRCHRGYTLDPNGKTC .: : :: : ..: : : .. ::.: : . . . : ::. :.: .:.:: CCDS14 GVACDCRPGFDLAQNQKDCTLT--CNYGNGGCQHSCEDTDTGPTCGCHQKYALHSDGRTC 190 200 210 220 230 240 450 460 470 480 490 500 pF1KB8 SRVDHCAQQDHGCEQLCLNTEDSFVCQCSEGFLINEDLKTCSRVDYCLLSDHGCEYSCVN .. :: .. ::.. : .: . :.: :: .. : :::. .. ::... ::.. : : CCDS14 --IETCAVNNGGCDRTCKDTATGVRCSCPVGFTLQPDGKTCKDINECLVNNGGCDHFCRN 250 260 270 280 290 510 520 530 540 550 560 pF1KB8 MDRSFACQCPEGHVLRSDGKTCAKLDSCALGDHGCEHSCVSSEDSFVCQCFEGYILREDG :: : : .:. : .: .:: .: :.. .. :.: :..: :: : : .:::: : CCDS14 TVGSFECGCRKGYKLLTDERTCQDIDECSF-ERTCDHICINSPGSFQCLCHRGYILY--G 300 310 320 330 340 350 570 580 590 600 610 pF1KB8 KT-CRRKDVCQAIDHGCEHICVNSDDSYTCECLEGFRLAEDGKRCRRKDVCKS---THHG : : : :. . .:.. :::. :: : : : :: .:: : . : : : CCDS14 TTHCGDVDECSMSNGSCDQGCVNTKGSYECVCPPGRRLHWNGKDCVETGKCLSRAKTSPR 360 370 380 390 400 410 620 630 640 650 660 670 pF1KB8 CEHICVNNGNSYIC--KCSEGFVLAEDGRRCKKCTEGPIDLVFVIDGSKSLGEENFEVVK . : . :. : .: ... : CCDS14 AQLSCSKAGGVESCFLSCPAHTLFVPDSENSYVLSCGVPGPQGKALQKRNGTSSGLGPSC 420 430 440 450 460 470 >>CCDS46607.1 MATN4 gene_id:8785|Hs108|chr20 (540 aa) initn: 1916 init1: 994 opt: 1012 Z-score: 738.1 bits: 147.1 E(32554): 8.6e-35 Smith-Waterman score: 1073; 30.6% identity (46.6% similar) in 892 aa overlap (43-934:20-536) 20 30 40 50 60 70 pF1KB8 LGQIVLLPAEARERSRGRSISRGRHARTHPQTALLESSCENKRADLVFIIDSSRSVNTHD : : :.. ::::.::::::: . CCDS46 MRGLLCWPVLLLLLQPWETQLQLTGPRCHTGPLDLVFVIDSSRSVRPFE 10 20 30 40 80 90 100 110 120 130 pF1KB8 YAKVKEFIVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLST . ...:.. .:. :..::..::::..::.: :.. : :..:.:. ..:::.. . :. CCDS46 FETMRQFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQ 50 60 70 80 90 100 140 150 160 170 180 190 pF1KB8 GTMTGLAIQYALNIAFSEAEGARPLRENVPRVIMIVTDGRPQDSVAEVAAKARDTGILIF :::::::::::.:.::: :::::: .: :::: .::::::::: ::::::.:: :: :. CCDS46 GTMTGLAIQYAMNVAFSVAEGARPPEERVPRVAVIVTDGRPQDRVAEVAAQARARGIEIY 110 120 130 140 150 160 200 210 220 230 240 250 pF1KB8 AIGVGQVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQKKLCTAHMCSTLEHNCAHF :.:: ..: ..:....: : ..::::: .:. :. . ::..::. CCDS46 AVGVQRADVGSLRAMASPPLDEHVFLVESFDLIQEFGLQFQSRLCA-------------- 170 180 190 200 210 260 270 280 290 300 310 pF1KB8 CINIPGSYVCRCKQGYILNSDQTTCRIQDLCAMEDHNCEQLCVNVPGSFVCQCYSGYALA CCDS46 ------------------------------------------------------------ 320 330 340 350 360 370 pF1KB8 EDGKRCVAVDYCASENHGCEHECVNADGSYLCQCHEGFALNPDEKTCTKIDYCASSNHGC ::::. .::.: CCDS46 -------------------------------------------------IDYCSFGNHSC 220 380 390 400 410 420 430 pF1KB8 QHECVNTDDSYSCHCLKGFTLNPDKKTCRRINYCALNKPGCEHECVNMEESYYCRCHRGY :::::.: :: ::: CCDS46 QHECVST-------------------------------PGGP------------RCH--- 230 240 440 450 460 470 480 490 pF1KB8 TLDPNGKTCSRVDHCAQQDHGCEQLCLNTEDSFVCQCSEGFLINEDLKTCSRVDYCLLSD CCDS46 ------------------------------------------------------------ 500 510 520 530 540 550 pF1KB8 HGCEYSCVNMDRSFACQCPEGHVLRSDGKTCAKLDSCALGDHGCEHSCVSSEDSFVCQCF : ::: : CCDS46 -----------------CREGH------------------------------D------- 560 570 580 590 600 610 pF1KB8 EGYILREDGKTCRRKDVCQAIDHGCEHICVNSDDSYTCECLEGFRLAEDGKRCRRKDVCK :. ::..:. .:.:...::::: :. CCDS46 ----LQPDGRSCQVRDLCNGVDHGCE-----------------FQ--------------- 250 260 270 620 630 640 650 660 670 pF1KB8 STHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGPIDLVFVIDGSKSLGEENFE ::..: :: : : :: : ::. :..: :: .:::...:::::. .::: CCDS46 ---------CVSEGLSYRCLCPEGRQLQADGKSCNRCREGHVDLVLLVDGSKSVRPQNFE 280 290 300 310 320 680 690 700 710 720 730 pF1KB8 VVKQFVTGIIDSLTISPKAARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKYMGKGS .::.::. :.: : .::...::::.:.:..:.::: : ...: ..:.:: ..:: .:. CCDS46 LVKRFVNQIVDFLDVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGT 330 340 350 360 370 380 740 750 760 770 780 790 pF1KB8 MTGLALKHMFERSFTQGEGARPLSTRVPRAAIVFTDGRAQDDVSEWASKAKANGITMYAV ::::::.:: :.::....:::: . :::...::::::.:::.: ::..:: .::.:::: CCDS46 MTGLALRHMVEHSFSEAQGARPRALNVPRVGLVFTDGRSQDDISVWAARAKEEGIVMYAV 390 400 410 420 430 440 800 810 820 830 840 850 pF1KB8 GVGKAIEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKGICEALEDSDGRQDSPAGEL :::::.: ::.::::::.. :. :: ::.:: .. :.:. .:: : : . CCDS46 GVGKAVEAELREIASEPAELHVSYAPDFGTMTHLLENLRGSICPEEGISAGTE------- 450 460 470 480 490 860 870 880 890 900 910 pF1KB8 PKTVQQPTVQHRYLFEEDNLLRSTQKLSHSTKPSGSPLEEKHDQCKCENLIMFQNLANEE ::: : :.::.:. ::. . CCDS46 --------------------LRS---------P-----------CECESLVEFQGRTLGA 500 510 920 930 pF1KB8 VRKLTQRLEEMTQRMEALENRLRYR ...:: : ..: :.: :::.: CCDS46 LESLTLNLAQLTARLEDLENQLANQK 520 530 540 >>CCDS4800.1 SCUBE3 gene_id:222663|Hs108|chr6 (993 aa) initn: 549 init1: 311 opt: 984 Z-score: 715.0 bits: 143.7 E(32554): 1.7e-33 Smith-Waterman score: 1040; 36.7% identity (59.8% similar) in 403 aa overlap (242-631:33-425) 220 230 240 250 260 pF1KB8 HEDHVFLVANFSQIETLTSVFQKKLCTAHMCSTLEHNCAHF---CINIPGSYVCRCKQGY : :: :. : : : :: : ::.:: CCDS48 SGRVPGLCLLVLLVHARAAQYSKAAQDVDECVEGTDNC-HIDAICQNTPRSYKCICKSGY 10 20 30 40 50 60 270 280 290 300 310 320 pF1KB8 ILNSDQTTCRIQDLCAMEDH-NCEQLCVNVPGSFVCQCYSGYALAEDGKRCVAVDYCASE ..: :. : : ::. .: . :::.::.. : ::.:. ::.::. :. :: :: CCDS48 --TGDGKHCKDVDECEREDNAGCVHDCVNIPGNYRCTCYDGFHLAHDGHNCLDVDECAEG 70 80 90 100 110 330 340 350 360 370 380 pF1KB8 NHGCEHECVNADGSYLCQCHEGFALNPDEKTCTKIDY----CASSNHGCQHECVNTDDS- : ::.. ::: ::: :.:.::: :. ...:: . : ..:::: : : .: . CCDS48 NGGCQQSCVNMMGSYECHCREGFFLSDNQHTCIQRPEEGMNCMNKNHGCAHICRETPKGG 120 130 140 150 160 170 390 400 410 420 430 440 pF1KB8 YSCHCLKGFTLNPDKKTCRRINYCALNKPGCEHECVNMEESYYCRCHRGYTLDPNGKTCS .:.: :: :. ... :. : .. ::.: : . :.. : :: ..: .:::: CCDS48 IACECRPGFELTKNQRDCKLT--CNYGNGGCQHTCDDTEQGPRCGCHIKFVLHTDGKTC- 180 190 200 210 220 230 450 460 470 480 490 500 pF1KB8 RVDHCAQQDHGCEQLCLNTEDSFVCQCSEGFLINEDLKTCSRVDYCLLSDHGCEYSCVNM .. :: .. ::.. : .. . : : ::... : :::. .: : :.. ::.. : : CCDS48 -IETCAVNNGGCDSKCHDAATGVHCTCPVGFMLQPDRKTCKDIDECRLNNGGCDHICRNT 240 250 260 270 280 290 510 520 530 540 550 560 pF1KB8 DRSFACQCPEGHVLRSDGKTCAKLDSCALGDHGCEHSCVSSEDSFVCQCFEGYILREDGK :: :.: .:. : . ..: .: :.. :. :.: ::.. :: : : .::.: : CCDS48 VGSFECSCKKGYKLLINERNCQDIDECSF-DRTCDHICVNTPGSFQCLCHRGYLLY--GI 300 310 320 330 340 350 570 580 590 600 610 620 pF1KB8 T-CRRKDVCQAIDHGCEHICVNSDDSYTCECLEGF-RLAEDGKRCRRKDVCKSTHHGCEH : : : :. ::. :.:. :: : : : :: .:: : . :... . . CCDS48 THCGDVDECSINRGGCRFGCINTPGSYQCTCPAGQGRLHWNGKDCTEPLKCQGSPGASKA 360 370 380 390 400 410 630 640 650 660 670 pF1KB8 I--CVNNGNSYICKCSEGFVLAEDGRRCKKCTEGPIDLVFVIDGSKSLGEENFEVVKQFV . : .:.. : CCDS48 MLSCNRSGKKDTCALTCPSRARFLPESENGFTVSCGTPSPRAAPARAGHNGNSTNSNHCH 420 430 440 450 460 470 >>CCDS41237.1 MEGF6 gene_id:1953|Hs108|chr1 (1541 aa) initn: 858 init1: 345 opt: 898 Z-score: 651.7 bits: 132.6 E(32554): 5.6e-30 Smith-Waterman score: 939; 35.8% identity (57.8% similar) in 377 aa overlap (237-605:83-451) 210 220 230 240 250 260 pF1KB8 IGSEPHEDHVFLVANFSQIETLTSVFQKKLCTAHMCSTLEHNCAHFCINIPGSYVCRCKQ :..: :. . . . . : :: . CCDS41 LTLVGRRQPCVQALSHTVPVWKAGCGWQAWCVGHERRTVYYMGYRQVYTTEARTVLRCCR 60 70 80 90 100 110 270 280 290 300 310 320 pF1KB8 GYILNSDQTTCRIQDLCAMEDHNCEQLCVNVPGSFV-CQCYSGYALAEDGKRC-VAVDYC :.. . :. : .. :. : . :::: :.: :. .: :: :: : CCDS41 GWMQQPDEEGC-LSAECSASL--CFHGGRCVPGSAQPCHCPPGF----QGPRCQYDVDEC 120 130 140 150 160 330 340 350 360 370 380 pF1KB8 ASENHGCEHECVNADGSYLCQCHEGFALNPDEKTCTKIDYCASSNHGCQHECVN-TDDSY ..: ::.:.:::. :::::.:. :: :. : .:: :. :: .: ::::.::. : . CCDS41 RTHNGGCQHRCVNTPGSYLCECKPGFRLHTDSRTCLAINSCALGNGGCQHHCVQLTITRH 170 180 190 200 210 220 390 400 410 420 430 440 pF1KB8 SCHCLKGFTLNPDKKTCRRINYCALNKPGCEHECVNMEESYYCRCHRGYTLDPNGKTCSR :.: :: :. : . : : . :: . .: :.: .. :.:: :: : .::.: CCDS41 RCQCRPGFQLQEDGRHCVRRSPCANRNGSCMHRCQVVRGLARCECHVGYQLAADGKACED 230 240 250 260 270 280 450 460 470 480 490 pF1KB8 VDHCAQQDHGCEQLCLNTEDSFVCQCSEGFLINEDLKTCSRVDY-----CLLSDHGCEYS ::.:: : . ::::. :: : : :. .. : . : :... : .. :: .. CCDS41 VDECAAGLAQCAHGCLNTQGSFKCVCHAGYELGADGRQCYRIEMEIVNSCEANNGGCSHG 290 300 310 320 330 340 500 510 520 530 540 550 pF1KB8 CVNMDRSFACQCPEGHVLRSDGKTCAKLDSCALGDHGCEHSCVSSEDSFVCQCFEGYILR : . . . : ::.:. : .: .:: .:.:: . :.. :... .. : :. :: : CCDS41 CSHTSAGPLCTCPRGYELDTDQRTCIDVDDCA-DSPCCQQVCTNNPGGYECGCYAGYRLS 350 360 370 380 390 400 560 570 580 590 600 610 pF1KB8 EDGKTCRRKDVCQAIDHGCEHICVNSDDSYTCECLEGFRLAEDGKRCRRKDVCKSTHHGC :: :. : : . :::: :.: :. : : :.:: :: . : CCDS41 ADGCGCEDVDECASSRGGCEHHCTNLAGSFQCSCEAGYRLHEDRRGCSPLEEPMVDLDGE 410 420 430 440 450 460 620 630 640 650 660 670 pF1KB8 EHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGPIDLVFVIDGSKSLGEENFEVVKQFV CCDS41 LPFVRPLPHIAVLQDELPQLFQDDDVGADEEEAELRGEHTLTEKFVCLDDSFGHDCSLTC 470 480 490 500 510 520 >>CCDS336.1 MATN1 gene_id:4146|Hs108|chr1 (496 aa) initn: 1318 init1: 646 opt: 805 Z-score: 591.7 bits: 119.9 E(32554): 1.2e-26 Smith-Waterman score: 813; 51.1% identity (77.4% similar) in 235 aa overlap (605-835:221-452) 580 590 600 610 620 630 pF1KB8 HGCEHICVNSDDSYTCECLEGFRLAEDGKRCRRKDVCKSTHHGCEHICVNNGNSYICKCS : .:.: . : ::..:... .:: : : CCDS33 IASEPQDEHVDYVESYSVIEKLSRKFQEAFCVVSDLCATGDHDCEQVCISSPGSYTCACH 200 210 220 230 240 250 640 650 660 670 680 690 pF1KB8 EGFVLAEDGRRCKKCTEG----PIDLVFVIDGSKSLGEENFEVVKQFVTGIIDSLTISPK :::.: ::. :. :. : ::::.::::::. ::::.::.:.. :.:.: .: : CCDS33 EGFTLNSDGKTCNVCSGGGGSSATDLVFLIDGSKSVRPENFELVKKFISQIVDTLDVSDK 260 270 280 290 300 310 700 710 720 730 740 750 pF1KB8 AARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGE :.:::.:::..:. :: : :.. ::.: :: .:.:: ::.::: :::.... ::: . CCDS33 LAQVGLVQYSSSVRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSS 320 330 340 350 360 370 760 770 780 790 800 810 pF1KB8 GARPLSTRVPRAAIVFTDGRAQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEPT :::: . .: .:::::::.:: ... :.::: :. :.:::::.:.:.::.::::::. CCDS33 GARPGAQKV---GIVFTDGRSQDYINDAAKKAKDLGFKMFAVGVGNAVEDELREIASEPV 380 390 400 410 420 820 830 840 850 860 870 pF1KB8 NKHLFYAEDFSTMDEISEKLKKGICEALEDSDGRQDSPAGELPKTVQQPTVQHRYLFEED .: ::. ::.:...:..::.: :: CCDS33 AEHYFYTADFKTINQIGKKLQKKICVEEDPCACESLVKFQAKVEGLLQALTRKLEAVSKR 430 440 450 460 470 480 >-- initn: 668 init1: 620 opt: 633 Z-score: 469.7 bits: 97.3 E(32554): 7.6e-20 Smith-Waterman score: 634; 46.7% identity (73.6% similar) in 227 aa overlap (8-234:12-218) 10 20 30 40 50 pF1KB8 MEKMLAGCFLLILGQIVLLPAEARERSRGRSISRGRHARTHPQTALLESSCENKRA : ::.: : . :. : . :::. ::.: . CCDS33 MRVLSGTSLMLCSLLLLLQALCSPGLAPQ-------SRGHLCRTRP-------------T 10 20 30 40 60 70 80 90 100 110 pF1KB8 DLVFIIDSSRSVNTHDYAKVKEFIVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLKTFKR ::::..:::::: .. ::: :. .... ::.::..::::...:.::::.::::.. CCDS33 DLVFVVDSSRSVRPVEFEKVKVFLSQVIESLDVGPNATRVGMVNYASTVKQEFSLRAHVS 50 60 70 80 90 100 120 130 140 150 160 170 pF1KB8 KSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAEGARPLRENVPRVIMIVTDGRPQDS :. . .::.:.. ::::::::::::.:.. ::..:::.: .. .:...::::::::: CCDS33 KAALLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGRSRSPDISKVVIVVTDGRPQDS 110 120 130 140 150 160 180 190 200 210 220 230 pF1KB8 VAEVAAKARDTGILIFAIGVGQVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQKKL : .:.:.:: .:. .::::::.:: ::..:.:::...:: : ..: :: :. ::. CCDS33 VQDVSARARASGVELFAIGVGSVDKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQEAF 170 180 190 200 210 220 240 250 260 270 280 290 pF1KB8 CTAHMCSTLEHNCAHFCINIPGSYVCRCKQGYILNSDQTTCRIQDLCAMEDHNCEQLCVN CCDS33 CVVSDLCATGDHDCEQVCISSPGSYTCACHEGFTLNSDGKTCNVCSGGGGSSATDLVFLI 230 240 250 260 270 280 937 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 16:34:03 2016 done: Sat Nov 5 16:34:03 2016 Total Scan time: 3.610 Total Display time: 0.220 Function used was FASTA [36.3.4 Apr, 2011]