FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1179, 689 aa 1>>>pF1KE1179 689 - 689 aa - 689 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 10.3679+/-0.00134; mu= 4.3417+/- 0.081 mean_var=594.2117+/-125.621, 0's: 0 Z-trim(115.1): 218 B-trim: 564 in 2/50 Lambda= 0.052614 statistics sampled from 15402 (15622) to 15402 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.723), E-opt: 0.2 (0.48), width: 16 Scan time: 3.750 The best scores are: opt bits E(32554) CCDS450.1 COL9A2 gene_id:1298|Hs108|chr1 ( 689) 5139 405.4 1.5e-112 CCDS47447.1 COL9A1 gene_id:1297|Hs108|chr6 ( 678) 2740 223.2 9.5e-58 CCDS4971.1 COL9A1 gene_id:1297|Hs108|chr6 ( 921) 2723 222.2 2.7e-57 CCDS13505.1 COL9A3 gene_id:1299|Hs108|chr20 ( 684) 2418 198.8 2.2e-50 CCDS6376.1 COL22A1 gene_id:169044|Hs108|chr8 (1626) 1998 167.5 1.4e-40 CCDS2297.1 COL3A1 gene_id:1281|Hs108|chr2 (1466) 1929 162.2 4.9e-39 CCDS33350.1 COL5A2 gene_id:1290|Hs108|chr2 (1499) 1907 160.5 1.6e-38 CCDS75932.1 COL5A1 gene_id:1289|Hs108|chr9 (1838) 1876 158.3 9e-38 CCDS6982.1 COL5A1 gene_id:1289|Hs108|chr9 (1838) 1876 158.3 9e-38 CCDS780.2 COL11A1 gene_id:1301|Hs108|chr1 (1690) 1839 155.5 6e-37 CCDS53348.1 COL11A1 gene_id:1301|Hs108|chr1 (1767) 1839 155.5 6.2e-37 CCDS778.1 COL11A1 gene_id:1301|Hs108|chr1 (1806) 1839 155.5 6.3e-37 CCDS11561.1 COL1A1 gene_id:1277|Hs108|chr17 (1464) 1826 154.4 1.1e-36 CCDS9511.1 COL4A1 gene_id:1282|Hs108|chr13 (1669) 1792 151.9 7.1e-36 CCDS6802.1 COL27A1 gene_id:85301|Hs108|chr9 (1860) 1771 150.4 2.3e-35 CCDS34682.1 COL1A2 gene_id:1278|Hs108|chr7 (1366) 1763 149.5 2.9e-35 CCDS42829.1 COL4A3 gene_id:1285|Hs108|chr2 (1670) 1747 148.5 7.6e-35 CCDS2773.1 COL7A1 gene_id:1294|Hs108|chr3 (2944) 1750 149.1 8.8e-35 CCDS43452.1 COL11A2 gene_id:1302|Hs108|chr6 (1650) 1707 145.4 6.2e-34 CCDS42828.1 COL4A4 gene_id:1286|Hs108|chr2 (1690) 1704 145.2 7.4e-34 CCDS8759.1 COL2A1 gene_id:1280|Hs108|chr12 (1418) 1666 142.2 4.9e-33 CCDS41778.1 COL2A1 gene_id:1280|Hs108|chr12 (1487) 1666 142.2 5.1e-33 CCDS14542.1 COL4A6 gene_id:1288|Hs108|chrX (1690) 1665 142.2 5.7e-33 CCDS14541.1 COL4A6 gene_id:1288|Hs108|chrX (1691) 1665 142.2 5.7e-33 CCDS76008.1 COL4A6 gene_id:1288|Hs108|chrX (1633) 1654 141.4 1e-32 CCDS76009.1 COL4A6 gene_id:1288|Hs108|chrX (1666) 1654 141.4 1e-32 CCDS41353.1 COL24A1 gene_id:255631|Hs108|chr1 (1714) 1651 141.2 1.2e-32 CCDS12222.1 COL5A3 gene_id:50509|Hs108|chr19 (1745) 1626 139.3 4.5e-32 CCDS76010.1 COL4A6 gene_id:1288|Hs108|chrX (1707) 1616 138.5 7.6e-32 CCDS41297.1 COL16A1 gene_id:1307|Hs108|chr1 (1604) 1597 137.1 2e-31 CCDS41907.1 COL4A2 gene_id:1284|Hs108|chr13 (1712) 1557 134.1 1.7e-30 CCDS42971.1 COL18A1 gene_id:80781|Hs108|chr21 (1339) 1550 133.4 2.1e-30 CCDS42972.1 COL18A1 gene_id:80781|Hs108|chr21 (1519) 1550 133.4 2.3e-30 CCDS77643.1 COL18A1 gene_id:80781|Hs108|chr21 (1754) 1550 133.5 2.5e-30 CCDS55025.1 COL21A1 gene_id:81578|Hs108|chr6 ( 957) 1482 128.0 6.4e-29 CCDS83099.1 COL21A1 gene_id:81578|Hs108|chr6 ( 954) 1477 127.6 8.3e-29 CCDS72756.1 COL8A2 gene_id:1296|Hs108|chr1 ( 638) 1434 124.1 6.4e-28 CCDS403.1 COL8A2 gene_id:1296|Hs108|chr1 ( 703) 1434 124.1 6.7e-28 CCDS5105.1 COL10A1 gene_id:1300|Hs108|chr6 ( 680) 1431 123.9 7.7e-28 CCDS58922.1 COL25A1 gene_id:84570|Hs108|chr4 ( 645) 1359 118.4 3.3e-26 CCDS76649.1 COL4A1 gene_id:1282|Hs108|chr13 ( 519) 1338 116.7 8.9e-26 CCDS44419.1 COL13A1 gene_id:1305|Hs108|chr10 ( 717) 1316 115.2 3.4e-25 CCDS44424.2 COL13A1 gene_id:1305|Hs108|chr10 ( 695) 1313 114.9 3.9e-25 CCDS44427.2 COL13A1 gene_id:1305|Hs108|chr10 ( 645) 1269 111.6 3.8e-24 CCDS44425.2 COL13A1 gene_id:1305|Hs108|chr10 ( 686) 1257 110.7 7.4e-24 CCDS43258.1 COL25A1 gene_id:84570|Hs108|chr4 ( 654) 1232 108.8 2.7e-23 CCDS43259.1 COL25A1 gene_id:84570|Hs108|chr4 ( 642) 1171 104.1 6.6e-22 CCDS44423.2 COL13A1 gene_id:1305|Hs108|chr10 ( 668) 1145 102.2 2.6e-21 CCDS2934.1 COL8A1 gene_id:1295|Hs108|chr3 ( 744) 1126 100.8 7.6e-21 CCDS7554.1 COL17A1 gene_id:1308|Hs108|chr10 (1497) 1130 101.6 9e-21 >>CCDS450.1 COL9A2 gene_id:1298|Hs108|chr1 (689 aa) initn: 5139 init1: 5139 opt: 5139 Z-score: 2134.3 bits: 405.4 E(32554): 1.5e-112 Smith-Waterman score: 5139; 99.9% identity (100.0% similar) in 689 aa overlap (1-689:1-689) 10 20 30 40 50 60 pF1KE1 MAAATASPRSLLVLLQVVVLALAQIRGPPGERGPPGPPGPPGVPGSDGIDGDNGPPGKAG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 MAAATASPRSLLVLLQVVVLALAQIRGPPGERGPPGPPGPPGVPGSDGIDGDNGPPGKAG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 PPGPKGEPGKAGPDGPDGKPGIDGLTGAKGEPGPMGIPGVKGQPGLPGPPGLPGPGFAGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 PPGPKGEPGKAGPDGPDGKPGIDGLTGAKGEPGPMGIPGVKGQPGLPGPPGLPGPGFAGP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 PGPPGPVGLPGEIGIRGPKGDPGPDGPSGPPGPPGKPGRPGTIQGLEGSADFLCPTNCPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 PGPPGPVGLPGEIGIRGPKGDPGPDGPSGPPGPPGKPGRPGTIQGLEGSADFLCPTNCPP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 GMKGPPGLQGVKGHAGKRGILGDPGHQGKPGPKGDVGASGEQGIPGPPGPQGIRGYPGMA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 GMKGPPGLQGVKGHAGKRGILGDPGHQGKPGPKGDVGASGEQGIPGPPGPQGIRGYPGMA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 GPKGETGPHGYKGMVGAIGATGPPGEEGPRGPPGRAGEKGDEGSPGIRGPQGITGPKGAT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 GPKGETGPHGYKGMVGAIGATGPPGEEGPRGPPGRAGEKGDEGSPGIRGPQGITGPKGAT 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 GPPGINGKDGTPGTPGMKGSAGQAGQPGSPGHQGVAGVPGQPGTKGGPGDQGEPGPQGLP ::::::::::::::::::::::::::::::::::.::::::::::::::::::::::::: CCDS45 GPPGINGKDGTPGTPGMKGSAGQAGQPGSPGHQGLAGVPGQPGTKGGPGDQGEPGPQGLP 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE1 GFSGPPGKEGEPGPRGEIGPQGIMGQKGDQGERGPVGQPGPQGRQGPKGEQGPPGIPGPQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 GFSGPPGKEGEPGPRGEIGPQGIMGQKGDQGERGPVGQPGPQGRQGPKGEQGPPGIPGPQ 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE1 GLPGVKGDKGSPGKTGPRGKVGDPGVAGLPGEKGEKGESGEPGPKGQQGVRGEPGYPGPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 GLPGVKGDKGSPGKTGPRGKVGDPGVAGLPGEKGEKGESGEPGPKGQQGVRGEPGYPGPS 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE1 GDAGAPGVQGYPGPPGPRGLAGNRGVPGQPGRQGVEGRDATDQHIVDVALKMLQEQLAEV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 GDAGAPGVQGYPGPPGPRGLAGNRGVPGQPGRQGVEGRDATDQHIVDVALKMLQEQLAEV 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE1 AVSAKREALGAVGMMGPPGPPGPPGYPGKQGPHGHPGPRGVPGIVGAVGQIGNTGPKGKR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 AVSAKREALGAVGMMGPPGPPGPPGYPGKQGPHGHPGPRGVPGIVGAVGQIGNTGPKGKR 550 560 570 580 590 600 610 620 630 640 650 660 pF1KE1 GEKGDPGEVGRGHPGMPGPPGIPGLPGRPGQAINGKDGDRGSPGAPGEAGRPGLPGPVGL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 GEKGDPGEVGRGHPGMPGPPGIPGLPGRPGQAINGKDGDRGSPGAPGEAGRPGLPGPVGL 610 620 630 640 650 660 670 680 pF1KE1 PGFCEPAACLGASAYASARLTEPGSIKGP ::::::::::::::::::::::::::::: CCDS45 PGFCEPAACLGASAYASARLTEPGSIKGP 670 680 >>CCDS47447.1 COL9A1 gene_id:1297|Hs108|chr6 (678 aa) initn: 4352 init1: 2004 opt: 2740 Z-score: 1150.3 bits: 223.2 E(32554): 9.5e-58 Smith-Waterman score: 2740; 55.4% identity (69.1% similar) in 682 aa overlap (3-683:2-678) 10 20 30 40 50 60 pF1KE1 MAAATASPRSLLVLLQVVVLALAQIRGPPGERGPPGPPGPPGVPGSDGIDGDNGPPGKAG : :: :. : :: . . : ::::::.::::::::::::: :::::: :: : : CCDS47 MAWTARDRGALGLLLLGLCLCAAQRGPPGEQGPPGPPGPPGVPGIDGIDGDRGPKGPPG 10 20 30 40 50 70 80 90 100 110 120 pF1KE1 PPGPKGEPGKAGPDGPDGKPGIDGLTGAKGEPGPMGIPGVKGQPGLPGPPGLPGPGFAGP :::: ::::: : : : :: ::::: : :: .: : ::.::.:: :.:: :. :: CCDS47 PPGPAGEPGKPGAPGKPGTPGADGLTGPDGSPGSIGSKGQKGEPGVPGSRGFPGRGIPGP 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE1 PGPPGPVGLPGEIGIRGPKGDPGPDGPSGPPGPPGKPGRPGTIQGLEGSADFLCPTNCPP ::::: .:::::.: :: :::: :: ::::::: : : .: : :::. ::: CCDS47 PGPPGTAGLPGELGRVGPVGDPGRRGPPGPPGPPGPRGTIGFHDG-----DPLCPNACPP 120 130 140 150 160 170 190 200 210 220 230 240 pF1KE1 GMKGPPGLQGVKGHAGKRGILGDPGHQGKPGPKGDVGASGEQGIPGPPGPQGIRGYPGMA : .: ::: :..:: : .: .:.::.::. : .:: : :: : :::: ::.:: :.. CCDS47 GRSGYPGLPGMRGHKGAKGEIGEPGRQGHKGEEGDQGELGEVGAQGPPGAQGLRGITGIV 180 190 200 210 220 230 250 260 270 280 290 300 pF1KE1 GPKGETGPHGYKGMVGAIGATGPPGEEGPRGPPGRAGEKGDEGSPGIRGPQGITGPKGAT : ::: : .: : : : : ::..: :::::.:: :::.:. : :: :. :::: : CCDS47 GDKGEKGARGLDGEPGPQGLPGAPGDQGQRGPPGEAGPKGDRGAEGARGIPGLPGPKGDT 240 250 260 270 280 290 310 320 330 340 350 360 pF1KE1 GPPGINGKDGTPGTPGMKGSAGQAGQPGSPGHQGVAGVPGQPGTKGGPGDQGEPGPQGLP : ::..:.:: :: :: :: :. : ::. : ::. :::: ::.:: :..: : : : CCDS47 GLPGVDGRDGIPGMPGTKGEPGKPGPPGDAGLQGLPGVPGIPGAKGVAGEKGSTGAPGKP 300 310 320 330 340 350 370 380 390 400 410 420 pF1KE1 GFSGPPGKEGEPGPRGEIGPQGIMGQKGDQGERGPVGQPGPQGRQGPKGEQGPPGIPGPQ : : :: :. :: ::.::.: .: :..:: ::::.:: :. : : : ::.::: CCDS47 GQMGNSGKPGQQGPPGEVGPRGPQGLPGSRGELGPVGSPGLPGKLGSLGSPGLPGLPGPP 360 370 380 390 400 410 430 440 450 460 470 480 pF1KE1 GLPGVKGDKGSPGKTGPRGKVGDPGVAGLPGEKGEKGESGEPGPKGQQGVRGEPGYPGPS ::::.:::.: :. ::.:. : : : ::.:: :. : :::::. : :::: :: CCDS47 GLPGMKGDRGVVGEPGPKGEQGASGEEGEAGERGELGDIGLPGPKGSAGNPGEPGLRGPE 420 430 440 450 460 470 490 500 510 520 530 540 pF1KE1 GDAGAPGVQGYPGPPGPRGLAGNRGVPGQPGRQGVEGRDATDQHIVDVALKMLQEQLAEV :. : :::.: :::::::. :..:. : :: :: :: ::::: .: ....::..::. CCDS47 GSRGLPGVEGPRGPPGPRGVQGEQGATGLPGVQGPPGRAPTDQHIKQVCMRVIQEHFAEM 480 490 500 510 520 530 550 560 570 580 590 600 pF1KE1 AVSAKREALGAVGMMGPPGPPGPPGYPGKQGPHGHPGPRGVPGIVGAVGQIGNTGPKGKR :.: :: ::.:. : :::::::: ::..: :. : ::.::: : : .: :::: CCDS47 AASLKRPDSGATGLPGRPGPPGPPGPPGENGFPGQMGIRGLPGIKGPPGALGLRGPKGDL 540 550 560 570 580 590 610 620 630 640 650 660 pF1KE1 GEKGDPGEVGRGHPGMPGPPGIPGLPGRPGQAINGKDGDRGSPGAPGEAGRPGLPGPVGL ::::. : ::: :.:: :.:: :: . . ::.::.:: ::. : : :: ::: :: CCDS47 GEKGERGPPGRGPNGLPGAIGLPGDPGPASYGRNGRDGERGPPGVAGIPGVPGPPGPPGL 600 610 620 630 640 650 670 680 pF1KE1 PGFCEPAAC-LGASAYASARLTEPGSIKGP :::::::.: . :. : . .: CCDS47 PGFCEPASCTMQAGQRAFNKGPDP 660 670 >>CCDS4971.1 COL9A1 gene_id:1297|Hs108|chr6 (921 aa) initn: 4352 init1: 2004 opt: 2723 Z-score: 1142.0 bits: 222.2 E(32554): 2.7e-57 Smith-Waterman score: 2723; 56.1% identity (69.8% similar) in 659 aa overlap (26-683:268-921) 10 20 30 40 50 pF1KE1 MAAATASPRSLLVLLQVVVLALAQIRGPPGERGPPGPPGPPGVPGSDGIDGDNGP ::::::.::::::::::::: :::::: :: CCDS49 MLIHCDPLRPRRETCHELPARITPSQTTDERGPPGEQGPPGPPGPPGVPGIDGIDGDRGP 240 250 260 270 280 290 60 70 80 90 100 110 pF1KE1 PGKAGPPGPKGEPGKAGPDGPDGKPGIDGLTGAKGEPGPMGIPGVKGQPGLPGPPGLPGP : ::::: ::::: : : : :: ::::: : :: .: : ::.::.:: :.:: CCDS49 KGPPGPPGPAGEPGKPGAPGKPGTPGADGLTGPDGSPGSIGSKGQKGEPGVPGSRGFPGR 300 310 320 330 340 350 120 130 140 150 160 170 pF1KE1 GFAGPPGPPGPVGLPGEIGIRGPKGDPGPDGPSGPPGPPGKPGRPGTIQGLEGSADFLCP :. ::::::: .:::::.: :: :::: :: ::::::: : : .: : ::: CCDS49 GIPGPPGPPGTAGLPGELGRVGPVGDPGRRGPPGPPGPPGPRGTIGFHDG-----DPLCP 360 370 380 390 400 410 180 190 200 210 220 230 pF1KE1 TNCPPGMKGPPGLQGVKGHAGKRGILGDPGHQGKPGPKGDVGASGEQGIPGPPGPQGIRG . :::: .: ::: :..:: : .: .:.::.::. : .:: : :: : :::: ::.:: CCDS49 NACPPGRSGYPGLPGMRGHKGAKGEIGEPGRQGHKGEEGDQGELGEVGAQGPPGAQGLRG 420 430 440 450 460 470 240 250 260 270 280 290 pF1KE1 YPGMAGPKGETGPHGYKGMVGAIGATGPPGEEGPRGPPGRAGEKGDEGSPGIRGPQGITG :..: ::: : .: : : : : ::..: :::::.:: :::.:. : :: :. : CCDS49 ITGIVGDKGEKGARGLDGEPGPQGLPGAPGDQGQRGPPGEAGPKGDRGAEGARGIPGLPG 480 490 500 510 520 530 300 310 320 330 340 350 pF1KE1 PKGATGPPGINGKDGTPGTPGMKGSAGQAGQPGSPGHQGVAGVPGQPGTKGGPGDQGEPG ::: :: ::..:.:: :: :: :: :. : ::. : ::. :::: ::.:: :..: : CCDS49 PKGDTGLPGVDGRDGIPGMPGTKGEPGKPGPPGDAGLQGLPGVPGIPGAKGVAGEKGSTG 540 550 560 570 580 590 360 370 380 390 400 410 pF1KE1 PQGLPGFSGPPGKEGEPGPRGEIGPQGIMGQKGDQGERGPVGQPGPQGRQGPKGEQGPPG : :: : :: :. :: ::.::.: .: :..:: ::::.:: :. : : : :: CCDS49 APGKPGQMGNSGKPGQQGPPGEVGPRGPQGLPGSRGELGPVGSPGLPGKLGSLGSPGLPG 600 610 620 630 640 650 420 430 440 450 460 470 pF1KE1 IPGPQGLPGVKGDKGSPGKTGPRGKVGDPGVAGLPGEKGEKGESGEPGPKGQQGVRGEPG .::: ::::.:::.: :. ::.:. : : : ::.:: :. : :::::. : :::: CCDS49 LPGPPGLPGMKGDRGVVGEPGPKGEQGASGEEGEAGERGELGDIGLPGPKGSAGNPGEPG 660 670 680 690 700 710 480 490 500 510 520 530 pF1KE1 YPGPSGDAGAPGVQGYPGPPGPRGLAGNRGVPGQPGRQGVEGRDATDQHIVDVALKMLQE :: :. : :::.: :::::::. :..:. : :: :: :: ::::: .: ....:: CCDS49 LRGPEGSRGLPGVEGPRGPPGPRGVQGEQGATGLPGVQGPPGRAPTDQHIKQVCMRVIQE 720 730 740 750 760 770 540 550 560 570 580 590 pF1KE1 QLAEVAVSAKREALGAVGMMGPPGPPGPPGYPGKQGPHGHPGPRGVPGIVGAVGQIGNTG ..::.:.: :: ::.:. : :::::::: ::..: :. : ::.::: : : .: : CCDS49 HFAEMAASLKRPDSGATGLPGRPGPPGPPGPPGENGFPGQMGIRGLPGIKGPPGALGLRG 780 790 800 810 820 830 600 610 620 630 640 650 pF1KE1 PKGKRGEKGDPGEVGRGHPGMPGPPGIPGLPGRPGQAINGKDGDRGSPGAPGEAGRPGLP ::: ::::. : ::: :.:: :.:: :: . . ::.::.:: ::. : : :: : CCDS49 PKGDLGEKGERGPPGRGPNGLPGAIGLPGDPGPASYGRNGRDGERGPPGVAGIPGVPGPP 840 850 860 870 880 890 660 670 680 pF1KE1 GPVGLPGFCEPAAC-LGASAYASARLTEPGSIKGP :: :::::::::.: . :. : . .: CCDS49 GPPGLPGFCEPASCTMQAGQRAFNKGPDP 900 910 920 >>CCDS13505.1 COL9A3 gene_id:1299|Hs108|chr20 (684 aa) initn: 2360 init1: 1313 opt: 2418 Z-score: 1018.1 bits: 198.8 E(32554): 2.2e-50 Smith-Waterman score: 2418; 50.8% identity (65.8% similar) in 681 aa overlap (1-672:1-670) 10 20 30 40 50 pF1KE1 MAAATA-SPRSLLVLL-QVVVLALAQIRGPPGERGPPGPPGPPGVPGSDGIDGDNGPPGK ::. : .: ::.:: .... : :: : :: ::::::::: ::.:::::. :::: CCDS13 MAGPRACAPLLLLLLLGELLAAAGAQRVGLPG---PPGPPGPPGKPGQDGIDGEAGPPGL 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 AGPPGPKGEPGKAGPDGP---DGKPGIDGLTGAKGEPGPMGIPGVKGQPGLPGPPGLPGP ::::::: ::: : : : ::.::::: : ::: : :: .:. : :::::: : CCDS13 PGPPGPKGAPGKPGKPGEAGLPGLPGVDGLTGRDGPPGPKGAPGERGSLGPPGPPGLGGK 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 GFAGPPGPPGPVGLPGEIGIRGPKGDPGPDGPSGPPGPPGKPGRPGTIQGLEGSADFLCP :. :::: : : :: ::.::: : : : ::::::: ::.::.. ::..:. :: CCDS13 GLPGPPGEAGVSGPPGGIGLRGPPGPSGLPGLPGPPGPPGPPGHPGVLP--EGATDLQCP 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE1 TNCPPGMKGPPGLQGVKGHAGKRGILGDPGHQGKPGPKGDVGASGEQGIPGP---PGPQG . :::: ::::. : :: .: .: . :. :: : ::: : : :.:: ::.: CCDS13 SICPPGPPGPPGMPGFKGPTGYKG---EQGEVGKDGEKGDPGPPGPAGLPGSVGLQGPRG 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE1 IRGYPGMAGPKGETGPHGYKGMVGAIGATGPPGEEGPRGPPGRAGEKGDEGSPGIRGPQG .:: :: :: :. :: :..: : :: : :..: ::: : : ::: : :: .: : CCDS13 LRGLPGPLGPPGDRGPIGFRGPPGIPGAPGKAGDRGERGPEGFRGPKGDLGRPGPKGTPG 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE1 ITGPKGATGPPGINGKDGTPGTPGMKGSAGQAGQPGSPGHQGVAGVPGQPGTKGGPGDQG ..::.: : :: .:..:.:: :.:: ::. : :: : .:. :.::. :.:: :..: CCDS13 VAGPSGEPGMPGKDGQNGVPGLDGQKGEAGRNGAPGEKGPNGLPGLPGRAGSKGEKGERG 300 310 320 330 340 350 360 370 380 390 400 410 pF1KE1 EPGPQGLPGFSGPPGKEGEPGPRGEIGPQGIMGQKGDQGERGPVGQPGPQGRQGPKGEQG . : : : :: :: :. : :: : : :. : : .:: : :: .: :: :: .: CCDS13 RAGELGEAGPSGEPGVPGDAGMPGERGEAGHRGSAGALGPQGPPGAPGVRGFQGQKGSMG 360 370 380 390 400 410 420 430 440 450 460 470 pF1KE1 PPGIPGPQGLPGVKGDKGSPGKTGPRGKVGDPGVAGLPGEKGEKGESGEPGPKGQQGVRG ::.:::::: : ::.: : .::.: : : ::::.::: : :: ::::..: :: CCDS13 DPGLPGPQGLRGDVGDRGPGGAAGPKGDQGIAGSDGLPGDKGELGPSGLVGPKGESGSRG 420 430 440 450 460 470 480 490 500 510 520 530 pF1KE1 EPGYPGPSGDAGAPGVQGYPGPPGPRGLAGNRGVPGQPGRQGVEGRDATDQHIVDVALKM : : : .: :. :::: :::::: :: : :::: :. :: :..:..:.: .. : CCDS13 ELGPKGTQGPNGTSGVQGVPGPPGPLGLQGVPGVPGITGKPGVPGKEASEQRIRELCGGM 480 490 500 510 520 530 540 550 560 570 580 590 pF1KE1 LQEQLAEVAVSAKRE-ALGAVGMMGPPGPPGPPGYPGKQGPHGHPGPRGVPGIVGAVGQI ..::.:..:. .. : :..: :: ::::::: ::. : ::: :: :: : .:.. CCDS13 ISEQIAQLAAHLRKPLAPGSIGRPGPAGPPGPPGPPGSIG---HPGARGPPGYRGPTGEL 540 550 560 570 580 600 610 620 630 640 650 pF1KE1 GNTGPKGKRGEKGDPGEVGRGHPGMPGPPGIPGLPGRPGQAINGKDGDRGSPGAPGEAGR :. ::.:..:..:: : .: : : : : : : :: . .:.:: : :: ::. : CCDS13 GDPGPRGNQGDRGDKGAAGAGLDGPEGDQGPQGPQGVPGTSKDGQDGAPGEPGPPGDPGL 590 600 610 620 630 640 660 670 680 pF1KE1 PGLPGPVGLPGFCEPAACLGASAYASARLTEPGSIKGP :: : : ::.:. .:: :: CCDS13 PGAIGAQGTPGICDTSACQGAVLGGVGEKSGSRSS 650 660 670 680 >>CCDS6376.1 COL22A1 gene_id:169044|Hs108|chr8 (1626 aa) initn: 5122 init1: 1092 opt: 1998 Z-score: 842.1 bits: 167.5 E(32554): 1.4e-40 Smith-Waterman score: 2066; 45.4% identity (58.5% similar) in 716 aa overlap (27-689:925-1626) 10 20 30 40 50 pF1KE1 MAAATASPRSLLVLLQVVVLALAQIRGPPGERGPPGPPGPPGVPGSDGIDGDNGPP : :: :::: : ::. :. : ::. : CCDS63 GPTGPPGAKGQEGAHGAPGAAGNPGAPGHVGAPGPSGPPGSVGAPGLRGTPGKDGERGEK 900 910 920 930 940 950 60 70 80 90 pF1KE1 GKAG------PPGPKGEPGKAGPDGPDGK--PGIDGLTGAKGEPGPMG------------ : :: : ::.:.:: : :: :: : :: :. : :::.: CCDS63 GAAGEEGSPGPVGPRGDPGAPGLPGPPGKGKDGEPGLRGSPGLPGPLGTKAACGKVRGSE 960 970 980 990 1000 1010 100 110 120 130 140 pF1KE1 --------IPGVKGQPGLPGPPGL---PGPGFAGPPGPPGPVGLPGEIGIRGPKGDPGPD . : .: ::.:: :: :: : :::::: :: : : : :: : :::. CCDS63 NCALGGQCVKGDRGAPGIPGSPGSRGDPGIGVAGPPGPSGPPGDKGSPGSRGLPGFPGPQ 1020 1030 1040 1050 1060 1070 150 160 170 180 190 pF1KE1 GPSGPPGPPGKPGRPGTIQGLEGSADFLCP-----------TNCPPGMKGPPGLQGVKGH ::.: : ::.::. : : : ...: : ..:::: :::: :. : CCDS63 GPAGRDGAPGNPGERGP-PGKPGLSSLLSPGDINLLAKDVCNDCPPG---PPGLPGLPGF 1080 1090 1100 1110 1120 1130 200 210 220 230 240 250 pF1KE1 AGKRGILGDPGHQGKPGPKGDVGASGEQGIPGPPGPQGIRGYPGMAGPKGETGPHGYKGM : .:. : ::..: : ::..: : : :: :::: .: : : :. : .:. :. CCDS63 KGDKGVPGKPGREGTEGKKGEAGPPGLPGPPGIAGPQGSQGERGADGEVGQKGDQGHPGV 1140 1150 1160 1170 1180 1190 260 270 280 290 300 310 pF1KE1 VGAIGATGPPGEEGPRGPPGRAGEKGDEGSPGIRGPQGITGPKGATGPPGINGKDGTPGT : .: : :: : : : :: : .:::: .:: : ::.: : :: .::.: : CCDS63 PGFMGPPGNPGPPGADGIAGAAGPPGIQGSPGKEGPPGPQGPSGLPGIPGEEGKEGRDGK 1200 1210 1220 1230 1240 1250 320 330 340 350 360 370 pF1KE1 PGMKGSAGQAGQPGSPGHQGVAGVPG---QPGTKGGPGDQGEPGPQGLPGFSGPPGKEGE :: : :.::.:: :: .:. : :: . : .:.:: .:: : .:::: : :::.:. CCDS63 PGPPGEPGKAGEPGLPGPEGARGPPGFKGHTGDSGAPGPRGESGAMGLPGQEGLPGKDGD 1260 1270 1280 1290 1300 1310 380 390 400 410 420 pF1KE1 PGPRGEIGPQGIMGQKGDQGERGPVGQPGPQG---RQGPKGEQGPPGIPG---PQGLPGV :: : :::: : : .: : :.:::.: ..: :::.: ::.:: :.: :: CCDS63 TGPTGPQGPQGPRGPPGKNGSPGSPGEPGPSGTPGQKGSKGENGSPGLPGFLGPRGPPGE 1320 1330 1340 1350 1360 1370 430 440 450 460 470 480 pF1KE1 KGDKGSPGKTGPRGKVGDPGVAGLPGEKGEKGESGEPGPKGQQGVRGEPGYPGPSGDAGA :.:: ::: : :: :.:: : :. : ::..: :: ::: : .:: :: .: .: CCDS63 PGEKGVPGKEGVPGKPGEPGFKGERGDPGIKGDKGPPGGKGQPG---DPGIPGHKGHTGL 1380 1390 1400 1410 1420 490 500 510 520 530 540 pF1KE1 PGVQGYPGPPGPRGLAGNRGVPGQPGRQGVEGRDATDQHIVDVAL-KMLQEQLAEVAVSA : :: :: :: : : : :: :: .: : ..... : :.:. .:: . .. CCDS63 MGPQGLPGENGPVGPPGPPGQPGFPGLRGESPSMETLRRLIQEELGKQLETRLAYLLAQM 1430 1440 1450 1460 1470 1480 550 560 570 580 590 600 pF1KE1 KREAL-GAVGMMGPPGPPGPPGYPGKQGPHGHPGPRGVPGIVGAVGQIGNTGPKGKRGEK . .. : ::::::: : ::. :: :.:: : :. : : :: :::.:: : CCDS63 PPAYMKSSQGRPGPPGPPGKDGLPGRAGPMGEPGRPGQGGLEGPSGPIG---PKGERGAK 1490 1500 1510 1520 1530 1540 610 620 630 640 650 660 pF1KE1 GDPGEVGRGHPGMPGPPGIPGLPGRPGQAINGKDGDRGSPGAPGEAGRPGLPGPVGLPGF :::: : : : ::::::: ::.:: : .: : : : : ::.:::::: : :: CCDS63 GDPGAPGVGLRGEMGPPGIPGQPGEPGYAKDGLPGIPGPQGETGPAGHPGLPGPPGPPGQ 1550 1560 1570 1580 1590 1600 670 680 pF1KE1 CEPAACLGASAYASARLTEPGSIKGP :.:. : :: .. ..::..::: CCDS63 CDPSQC----AYFASLAARPGNVKGP 1610 1620 >-- initn: 701 init1: 701 opt: 1182 Z-score: 507.4 bits: 105.6 E(32554): 6.1e-22 Smith-Waterman score: 1397; 45.4% identity (58.1% similar) in 511 aa overlap (28-509:451-923) 10 20 30 40 50 pF1KE1 MAAATASPRSLLVLLQVVVLALAQIRGPPGERGPPGPPGPPGVPGSDGID------- :: :: :: : .:::. : CCDS63 IVIYCDSRHAELETCCDIPSGPCQVTVVTEPP----PPPPPQRPPTPGSEQIGFLKTINC 430 440 450 460 470 60 70 80 90 100 pF1KE1 ----GDNGPPGKAGP---PGPKGEPGKAGPDGPDGKPGIDGLTGAKGEPGPMGIPGVKGQ :..: : ::: :::::. : :: : : : : .: : : .: : ::. CCDS63 SCPAGEKGEMGVAGPMGLPGPKGDIGAIGPVGAPGPKGEKGDVGI-G-PFGQGEKGEKGS 480 490 500 510 520 530 110 120 130 140 150 160 pF1KE1 PGLPGPPGLPGP-GFAGPPGPPGPVGLPGEIGIRGPKGDPGPDGPSGPPGPPGKPGRPGT ::::::: : :. : :: : :::::.:.:::.: : : :::::. : :: CCDS63 LGLPGPPGRDGSKGMRGEPGELGEPGLPGEVGMRGPQG------PPGLPGPPGRVGAPG- 540 550 560 570 580 170 180 190 200 210 220 pF1KE1 IQGLEGSADFLCPTNCPPGMKGPPGLQGVKGHAGKRGILGDPGHQGKPGPKGDVGASGEQ .:: .: : .: : .:. : :: : : :.::.:::.: .: .::. CCDS63 LQGERGEK----------GTRGEKGERGLDGFPGKPG---DTGQQGRPGPSGVAGPQGEK 590 600 610 620 630 230 240 250 260 270 pF1KE1 GIPGPPGPQGIRG----YPGMAGPKGETGPHGYKGMVGAIGATGPPGEEGPRGPPGRAG- : :: :: :. : :. : .: ::.:..: : :: :: : :: :::: : CCDS63 GDVGPAGPPGVPGSVVQQEGLKGEQGAPGPRGHQGAPGPPGARGPIGPEGRDGPPGLQGL 640 650 660 670 680 690 280 290 300 310 320 330 pF1KE1 --EKGDEGSPGIRGPQGITGPKGATGPPGINGKDGTPGTPGMKGSAGQAGQPGSPGHQGV .::: : ::: : :. :: : :::. : : :.::. : : :.:: :: : CCDS63 RGKKGDMGPPGIPGLLGLQGPPG---PPGVPGPPGPGGSPGLPGEIGFPGKPGPPGPTGP 700 710 720 730 740 750 340 350 360 370 380 390 pF1KE1 AGVPGQPGTKGGPGDQGEPGPQGLPGFSGPPGKEGEPGPRGEIGPQGIMGQKGDQGERGP : : : : :: .:::: .: :. ::: :: ::::: ::. :. :..:: : CCDS63 PGKDGPNGPPGPPGTKGEPGERGEDGL---PGK---PGLRGEIGEQGLAGRPGEKGEAGL 760 770 780 790 800 400 410 420 430 440 pF1KE1 VGQPGPQGRQGPKGEQGPPGIPGPQGLPGVKGDKGSPGKTGPRGKVGDPGVAGL------ : :: : .: ::.:: : ::::.:::.: :..:: : : ::...: CCDS63 PGAPGFPGVRGEKGDQGEKG---ELGLPGLKGDRGEKGEAGPAGPPGLPGTTSLFTPHPR 810 820 830 840 850 860 450 460 470 480 490 500 pF1KE1 -PGEKGEKGESGEPGPKGQQGVRGEPGYPGPSGDAGAPGVQGYPGPPGPRGLAGNRGVPG :::.: :::.:.:: :. :..:.:: ::.: .: ::..: : : : ::: :.:: CCDS63 MPGEQGPKGEKGDPGLPGEPGLQGRPGELGPQGPTGPPGAKGQEGAHGAPGAAGNPGAPG 870 880 890 900 910 920 510 520 530 540 550 560 pF1KE1 QPGRQGVEGRDATDQHIVDVALKMLQEQLAEVAVSAKREALGAVGMMGPPGPPGPPGYPG . CCDS63 HVGAPGPSGPPGSVGAPGLRGTPGKDGERGEKGAAGEEGSPGPVGPRGDPGAPGLPGPPG 930 940 950 960 970 980 >>CCDS2297.1 COL3A1 gene_id:1281|Hs108|chr2 (1466 aa) initn: 1245 init1: 1245 opt: 1929 Z-score: 814.3 bits: 162.2 E(32554): 4.9e-39 Smith-Waterman score: 1968; 44.7% identity (56.6% similar) in 732 aa overlap (27-689:372-1090) 10 20 30 40 50 pF1KE1 MAAATASPRSLLVLLQVVVLALAQIRGPPGERGPPGPPGPPGVPGSDGIDGDNGPP :: :. : :::::::. :: : :. :: CCDS22 GFPGSPGAKGEVGPAGSPGSNGAPGQRGEPGPQGHAGAQGPPGPPGINGSPGGKGEMGPA 350 360 370 380 390 400 60 70 80 90 100 pF1KE1 GKAGPPGPKGEPGKAGPDGPDGKPGI------DGLTGAKGEPGPMG---------IPGVK : : :: : : :: : .: ::. : .:::::::: : .::.: CCDS22 GIPGAPGLMGARGPPGPAGANGAPGLRGGAGEPGKNGAKGEPGPRGERGEAGIPGVPGAK 410 420 430 440 450 460 110 120 130 140 150 pF1KE1 GQPGLPGPPGLPG----PGFAGP---PG---PPGPVGLPGEIGIRGPKGDPGPDGPSGPP :. : : :: :: :: :: :: : :: :.::: : : .: ::: :: : CCDS22 GEDGKDGSPGEPGANGLPGAAGERGAPGFRGPAGPNGIPGEKGPAGERGAPGPAGPRGAA 470 480 490 500 510 520 160 170 180 190 200 pF1KE1 GPPGKPGRPGT--IQGLEGSADFLCPTNCPPGMKGPPGLQGVKGHAGK---RGILGDPGH : ::. : :: ..:. :: : . : :::: :: .:. : : :.:: CCDS22 GEPGRDGVPGGPGMRGMPGSPG--GPGS--DGKPGPPGSQGESGRPGPPGPSGPRGQPGV 530 540 550 560 570 210 220 230 240 250 pF1KE1 QGKPGPKGDVGA---SGEQGIPGPPGPQGI---------RGYPGMAGP---KGETGPHGY .: :::::. :: .::.: :: ::::: .: :: .:: ::.::: : CCDS22 MGFPGPKGNDGAPGKNGERGGPGGPGPQGPPGKNGETGPQGPPGPTGPGGDKGDTGPPGP 580 590 600 610 620 630 260 270 280 290 300 pF1KE1 KGMVGAIGATGPPGEEGPRGPPGRAGEKGDEGSPGIRGPQGITG---PKGATGPPGINGK .:. : :. :::::.: : :: :. : :.:: .: : : : : .: ::. : CCDS22 QGLQGLPGTGGPPGENGKPGEPGPKGDAGAPGAPGGKGDAGAPGERGPPGLAGAPGLRGG 640 650 660 670 680 690 310 320 330 340 350 360 pF1KE1 DGTPGTPGMKGSAGQAGQPGSPGHQGVAGVPGQPGTKGGPGDQGEPGPQGLPGFSGPPGK : :: : ::.:: : ::. : :. :.::. : :.:: .:. : : :: .: ::: CCDS22 AGPPGPEGGKGAAGPPGPPGAAGTPGLQGMPGERGGLGSPGPKGDKGEPGGPGADGVPGK 700 710 720 730 740 750 370 380 390 400 410 420 pF1KE1 EGEPGPRGEIGPQGIMGQKGDQGERGPVGQPGPQGRQGPKGEQGPPGIPGPQGLPGVKGD .: :: : ::: : :: ::.:: : : :: : .: ::.: : ::: :.::. :. CCDS22 DGPRGPTGPIGPPGPAGQPGDKGEGGAPGLPGIAGPRGSPGERGETGPPGPAGFPGAPGQ 760 770 780 790 800 810 430 440 450 460 470 480 pF1KE1 KGSPGKTGPRGKVGDPGVAGLPGEKGEKGESGEPGPKGQQGVRGEPGYPGPSGDAGAPGV .: :: : :: :. : .: :: : : :: :: : :::.:: : :: : :: ::. CCDS22 NGEPGGKGERGAPGEKGEGGPPGVAGPPGGSGPAGPPGPQGVKGERGSPGGPGAAGFPGA 820 830 840 850 860 870 490 500 510 520 530 540 pF1KE1 QGYPGPPGPRGLAGNRGVPGQPGRQGVEGRDATDQHIVDVALKMLQEQLAEVAVSAKREA .: ::::: . : :: :: .: :.:. ... . . ... CCDS22 RGLPGPPG------SNGNPGPPGPSGSPGKDGPPGPAGNTGAPGSPGVSGPKGDAGQPGE 880 890 900 910 920 930 550 560 570 580 590 600 pF1KE1 LGAVGMMGPPGPPGPPGYPGKQG------PHGHPGPRGVPGIVGAVGQIGNTGPKGKRGE :. : .:::: ::: : : : : : ::::: :: :. :. :. : .: :: CCDS22 KGSPGAQGPPGAPGPLGIAGITGARGLAGPPGMPGPRGSPGPQGVKGESGKPGANGLSGE 940 950 960 970 980 990 610 620 630 640 650 pF1KE1 KGDPGEVG-------RGHPGMPGPPGIPGLPGRPGQ-AINGKDGDRGSPGAPGEAGRPGL .: :: : :.:: : :: ::::: :. . .: :. ::::::: :.:: CCDS22 RGPPGPQGLPGLAGTAGEPGRDGNPGSDGLPGRDGSPGGKGDRGENGSPGAPGAPGHPGP 1000 1010 1020 1030 1040 1050 660 670 680 pF1KE1 PGPVGLP-------GFCEPAACLGASAYASARLTEPGSIKGP ::::: : : ::. :: . :..: :: .:: CCDS22 PGPVG-PAGKSGDRGESGPAGPAGAPGPAGSR-GAPGP-QGPRGDKGETGERGAAGIKGH 1060 1070 1080 1090 1100 CCDS22 RGFPGNPGAPGSPGPAGQQGAIGSPGPAGPRGPVGPSGPPGKDGTSGHPGPIGPPGPRGN 1110 1120 1130 1140 1150 1160 >-- initn: 1279 init1: 695 opt: 839 Z-score: 367.1 bits: 79.5 E(32554): 4e-14 Smith-Waterman score: 861; 44.1% identity (55.2% similar) in 315 aa overlap (28-339:98-371) 10 20 30 40 50 pF1KE1 MAAATASPRSLLVLLQVVVLALAQIRGPPGERGPPGPPGPPGVPGSDGIDGDNGPPG ::. .:: :: : :: :: : .:: : :: CCDS22 DDQELDCPNPEIPFGECCAVCPQPPTAPTRPPNGQGPQGPKGDPGPPGIPGRNGDPGIPG 70 80 90 100 110 120 60 70 80 90 100 110 pF1KE1 KAGPPGPKGEPG--KAGPDGPDG-KPGIDGLTGAKGEPGPMGIPGVKGQPGLPGPPGLPG . : :: : :: .. : ::.. .: :. .: ... :. : :: :::: :: CCDS22 QPGSPGSPGPPGICESCPTGPQNYSPQYDSYDVKSG----VAVGGLAGYPGPAGPPGPPG 130 140 150 160 170 180 120 130 140 150 160 170 pF1KE1 PGFAGPPGPPGPVGLPGEIGIRGPKGDPGPDGPSGPPGPPGKPGRPGTIQGLEGSADFLC ::: : : :: : .:: :.:: :::::::::: : CCDS22 -----PPGTSGHPGSPGSPGYQGPPGEPGQAGPSGPPGPPGAIG---------------- 190 200 210 220 180 190 200 210 220 230 pF1KE1 PTNCPPGMKGPPGLQGVKGHAGKRGILGDPGHQGKPGPKGDVGASGEQGIPGPPGPQGIR :. :: : .: .:. :. ::..: ::: : .: :::: :: .: : CCDS22 PS-------GPAGKDGESGRPGR------PGERGLPGPPG---IKGPAGIPGFPGMKGHR 230 240 250 260 240 250 260 270 280 290 pF1KE1 GYPGMAGPKGETGPHGYKGMVGAIGATGPPGEEGPRGPPGRAGEKGDEGSPGIRGPQGIT :. : : ::::: : :: : : .: :: :::: ::. :. : :. : :: .: CCDS22 GFDGRNGEKGETGAPGLKGENGLPGENGAPGPMGPRGAPGERGRPGLPGAAGARGNDGAR 270 280 290 300 310 320 300 310 320 330 340 350 pF1KE1 GPKGATGPPGINGKDGTPGTPGMKGSAGQAGQPGSPGHQGVAGVPGQPGTKGGPGDQGEP : : :::: : : ::.:: :: .: ::.::: : : : : CCDS22 GSDGQPGPPGPPGTAGFPGSPGAKGEVGPAGSPGSNGAPGQRGEPGPQGHAGAQGPPGPP 330 340 350 360 370 380 360 370 380 390 400 410 pF1KE1 GPQGLPGFSGPPGKEGEPGPRGEIGPQGIMGQKGDQGERGPVGQPGPQGRQGPKGEQGPP CCDS22 GINGSPGGKGEMGPAGIPGAPGLMGARGPPGPAGANGAPGLRGGAGEPGKNGAKGEPGPR 390 400 410 420 430 440 >>CCDS33350.1 COL5A2 gene_id:1290|Hs108|chr2 (1499 aa) initn: 4605 init1: 1193 opt: 1907 Z-score: 805.2 bits: 160.5 E(32554): 1.6e-38 Smith-Waterman score: 1927; 44.7% identity (57.2% similar) in 698 aa overlap (27-689:282-940) 10 20 30 40 50 pF1KE1 MAAATASPRSLLVLLQVVVLALAQIRGPPGERGPPGPPGPPGVPGSDGIDGDNGPP : :: :: :: :: ::. : : : .:: CCDS33 GSRGPEGPPGKPGEDGEPGRNGNPGEVGFAGSPGARGFPGAPGLPGLKGHRGHKGLEGPK 260 270 280 290 300 310 60 70 80 90 100 110 pF1KE1 GKAGPPGPKGEPGKAGPDGPDGKPGIDGLTGAKGEPGPMGIPGVKGQPGLPGPPGLPGP- :..: :: ::: : .:: : : : :. : .:. ::.: :: .: :.:: :: :: CCDS33 GEVGAPGSKGEAGPTGPMGAMGPLGPRGMPGERGRLGPQGAPGQRGAHGMPGKPGPMGPL 320 330 340 350 360 370 120 130 140 150 160 170 pF1KE1 ---GFAGPPGPPGPVGLPGEIGIRGPKGDPGPDGPSGPPGPPGKPGRPGTIQGLEGSADF : .: :: :: : : : :::.: : : .::::: :.:: ::.: : .:. CCDS33 GIPGSSGFPGNPGMKGEAGPTGARGPEGPQGQRGETGPPGPVGSPGLPGAI-GTDGTPGA 380 390 400 410 420 430 180 190 200 210 220 pF1KE1 LCPTNCP-----PGMKGPPGLQGVKGHAGKRGILGDPGHQGKPGPKGDVGASGE---QGI ::. : :: :::: : .: .: .:: :.:: : :: ::..: .:: .:: CCDS33 KGPTGSPGTSGPPGSAGPPGSPGPQGSTGPQGIRGQPGDPGVPGFKGEAGPKGEPGPHGI 440 450 460 470 480 490 230 240 250 260 270 pF1KE1 PGPPGP------QGIRGYPGMAGPKG---ETGPHGYKGMVGAIGATGPPGEEGPRGPPGR :: :: .: :: :: .:: : : : : .:. :. : :: : .: ::: : CCDS33 QGPIGPPGEEGKRGPRGDPGTVGPPGPVGERGAPGNRGFPGSDGLPGPKGAQGERGPVGS 500 510 520 530 540 550 280 290 300 310 320 330 pF1KE1 AGEKGDEGSPGIRGPQGITGPKGATGPPGINGKDGTPGTPGMKGSAGQAGQPGSPG---H .: ::..:.:: : :. : .: :: ::..: .: : : : :. : ::: : . CCDS33 SGPKGSQGDPGRPGEPGLPGARGLTGNPGVQGPEGKLGPLGAPGEDGRPGPPGSIGIRGQ 560 570 580 590 600 610 340 350 360 370 380 390 pF1KE1 QGVAGVPGQPGTKGGPGDQGEPGPQGLPGFSGPPGKEGEPGPRGEIGPQGIMGQKGDQGE : :.:: :..: :: :: : :.:: : :::.:: :: : .:: :. :..:.:: CCDS33 PGSMGLPGPKGSSGDPGKPGEAGNAGVPGQRGAPGKDGEVGPSGPVGPPGLAGERGEQGP 620 630 640 650 660 670 400 410 420 430 440 450 pF1KE1 RGPVGQPGPQGRQGPKGEQGPPGIPGPQGLPGVKGDKGSPGKTGPRGKVGDPGVAGLPGE ::.: : : :: :: : :: : : ::. : : :. : :. :.::..::::: CCDS33 PGPTGFQGLPGPPGPPGEGGKPGDQGVPGDPGAVGPLGPRGERGNPGERGEPGITGLPGE 680 690 700 710 720 730 460 470 480 490 500 510 pF1KE1 KGEKGESGEPGPKGQQGVRGEPGYPGPSGDAGAPGVQGYPGPPGPRGLAGNRGVPGQPGR :: : : ::::. : : :: :: : : :: .: : :::. :.:: :. : CCDS33 KGMAGGHGPDGPKGSPGPSGTPGDTGPPGLQGMPGERGIAGTPGPK---GDRGGIGEKGA 740 750 760 770 780 520 530 540 550 560 570 pF1KE1 QGVEGRDATDQHIVDVALKMLQEQLAEVAVSAKREALGAVGMMGPPGPPGPPGYPGKQGP .:. : :. : : : .::::: :: : :. :: CCDS33 EGTAGNDG---------------------------ARGLPGPLGPPGPAGPTGEKGEPGP 790 800 810 820 580 590 600 610 620 pF1KE1 HG---------HPGPRGVPGIVGAVGQIGNTGPKGKRGEKGDPGEVG-RGHPGMPGPPGI .: .:: :: : .:::: : :: :. : ::.::: : .: : ::: :. CCDS33 RGLVGPPGSRGNPGSRGENGPTGAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGL 830 840 850 860 870 880 630 640 650 660 670 680 pF1KE1 PGLPGRPG-QAINGKDGDRGSPGAPGEAGRPGLPGPVGLPGFCEPAACLGASAYASARLT : :: : ... : : ::. : :: .: :: : :: :: :: :: . :. : CCDS33 AGSPGPHGPNGVPGLKGGRGTQGPPGATGFPGSAGRVGPPG---PA---GAPGPAGP-LG 890 900 910 920 930 pF1KE1 EPGSIKGP :::. .:: CCDS33 EPGK-EGPPGLRGDPGSHGRVGDRGPAGPPGGPGDKGDPGEDGQPGPDGPPGPAGTTGQR 940 950 960 970 980 990 >-- initn: 867 init1: 867 opt: 930 Z-score: 404.4 bits: 86.4 E(32554): 3.3e-16 Smith-Waterman score: 950; 45.1% identity (56.3% similar) in 350 aa overlap (38-373:941-1267) 10 20 30 40 50 60 pF1KE1 PRSLLVLLQVVVLALAQIRGPPGERGPPGPPGPPGVPGSDGIDGDNGPPGKAGPPGPKGE :: : ::: : :: :: : : :: ::. CCDS33 PGSAGRVGPPGPAGAPGPAGPLGEPGKEGPPGLRGDPGSHGRVGDRGPAGPPGGPGDKGD 920 930 940 950 960 970 70 80 90 100 110 120 pF1KE1 PGKAGPDGPDGKPGIDGLTGAKGEPGPMGIPGVKGQPGLPGPPGLPG---P-GFAGPPGP ::. : :::: :: : :: .: : : : .:.:::::: : :: : : .: :: CCDS33 PGEDGQPGPDGPPGPAGTTGQRGIVGMPGQRGERGMPGLPGPAGTPGKVGPTGATGDKGP 980 990 1000 1010 1020 1030 130 140 150 160 170 180 pF1KE1 PGPVGLPGEIGIRGPKGDPGPDGPSGPPGPPGKPGRPGTIQGLEGSADFLCPTNCPPGMK ::::: :: :: :.:::.::.: : ::. : : .: .:. :: CCDS33 PGPVGPPGS---NGPVGEPGPEGPAGNDGTPGRDGAVGE-RGDRGD----------PGPA 1040 1050 1060 1070 190 200 210 220 230 240 pF1KE1 GPPGLQGVKGHAGKRGILGDPGHQGKPGPKGDVGASGEQGIPGPPGPQGIRGYPGMAGPK : :: ::. : : : :: :..: :: .: .: ::: : :: :: ::. CCDS33 GLPGSQGAPGTPGPVGAPGDAGQRGDPGSRGPIG---------PPGRAGKRGLPGPQGPR 1080 1090 1100 1110 1120 250 260 270 280 290 300 pF1KE1 GETGPHGYKGMVGAIGATGPPGEEGPRGPPGRAGEKGDEGSPGIRGPQGITGPKGATGPP :. : :: .: : : : : .: :::: ::.:. : :: ::.: :: : .: CCDS33 GDKGDHGDRGDRGQKGHRGFTGLQGLPGPPGPNGEQGSAGIPGPFGPRGPPGPVGPSGKE 1130 1140 1150 1160 1170 1180 310 320 330 340 350 pF1KE1 GINGKDGTPGTPGMKGSAGQAGQPGSPGHQGVAGVPGQPG-TKGGPGD-QG---EPGPQG : : : : ::..::.:.:: : ::. : : :: :: .. :: .: : :. CCDS33 GNPGPLGPIGPPGVRGSVGEAGPEGPPGEPGPPGPPGPPGHLTAALGDIMGHYDESMPDP 1190 1200 1210 1220 1230 1240 360 370 380 390 400 410 pF1KE1 LPGFS----GPPGK-EGEPGPRGEIGPQGIMGQKGDQGERGPVGQPGPQGRQGPKGEQGP :: :. .: : . .:: CCDS33 LPEFTEDQAAPDDKNKTDPGVHATLKSLSSQIETMRSPDGSKKHPARTCDDLKLCHSAKQ 1250 1260 1270 1280 1290 1300 >>CCDS75932.1 COL5A1 gene_id:1289|Hs108|chr9 (1838 aa) initn: 5784 init1: 1264 opt: 1876 Z-score: 791.6 bits: 158.3 E(32554): 9e-38 Smith-Waterman score: 1971; 46.7% identity (56.9% similar) in 670 aa overlap (27-650:916-1558) 10 20 30 40 50 pF1KE1 MAAATASPRSLLVLLQVVVLALAQIRGPPGERGPPGPPGPPGVPGSDGIDGDNGPP :: ::::: : : :: :..: :: ::: CCDS75 GFPGFPGANGEKGGRGTPGKPGPRGQRGPTGPRGERGPRGITGKPGPKGNSGGDGPAGPP 890 900 910 920 930 940 60 70 80 90 100 110 pF1KE1 GKAGPPGPKGEPGKAGPDGPDGKPGIDGLTGAKGEPGPMGIPGVKGQPGLPG------PP :. :: ::.: : :: :: : :: ::: : :. : :. : : :: :: : CCDS75 GERGPNGPQGPTGFPGPKGPPGPPGKDGLPGHPGQRGETGFQGKTGPPGPPGVVGPQGPT 950 960 970 980 990 1000 120 130 140 150 160 pF1KE1 GLPGP----GFAGPPGPPGPVGLPGEIGIRGPKGDPGPDGPSGPPGPPGKPGRPGTIQGL : :: : ::::::: :::: : .: :::::: : : :::: : :: .:: CCDS75 GETGPMGERGHPGPPGPPGEQGLPGLAGKEGTKGDPGPAGLPGKDGPPGLRGFPGD-RGL 1010 1020 1030 1040 1050 1060 170 180 190 200 210 220 pF1KE1 EG--SADFLCPTNCPPGMKGP---PGLQGVKGHAGKRGILGDPGHQGKPGPKGDVGASGE : .: : .. ::: :: :: .: : :: :: : :: :: ::: :. :: :: CCDS75 PGPVGALGLKGNEGPPGPPGPAGSPGERGPAGAAGPIGIPGRPGPQGPPGPAGEKGAPGE 1070 1080 1090 1100 1110 1120 230 240 250 260 pF1KE1 QGIPGP------------PGPQGIRGYPGMAGPKGETGPHGYKGMVGAIGATGPPGEEGP .: :: ::: : : :: : ::: : : :: : : :::: :: CCDS75 KGPQGPAGRDGLQGPVGLPGPAGPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGPPGPTGP 1130 1140 1150 1160 1170 1180 270 280 290 300 310 320 pF1KE1 RGPPGRAGEKGDEGSPGIRGPQGI------TGPKGATGPPGINGKDGTPGTPGMKGSAGQ .:: :. : .: .: :: :: ::. ::.: :::: : .: :: :: :: .:. CCDS75 QGPIGQPGPSGADGEPGPRGQQGLFGQKGDEGPRGFPGPPGPVGLQGLPGPPGEKGETGD 1190 1200 1210 1220 1230 1240 330 340 350 360 370 380 pF1KE1 AGQ---PGSPGHQGVAGVPGQPGTKGGPGDQGEPGPQGLPGFSGPPGKEGEPGPRGEIGP .:: :: :: .: .:.:: : .: :: :.:: : .: ::. :::: :: :: CCDS75 VGQMGPPGPPGPRGPSGAPGADGPQGPPGGIGNPGAVGE---KGEPGEAGEPGLPGEGGP 1250 1260 1270 1280 1290 1300 390 400 410 420 430 440 pF1KE1 QGIMGQKGDQGERGPVGQPGPQGRQGPKGEQGPPGIPGPQGLPGVKGDKGSPGKTGPRGK : :..:..:: :: : :: : .:: :..:: : ::: :.:: : : :: .: : CCDS75 PGPKGERGEKGESGPSGAAGPPGPKGPPGDDGPKGSPGPVGFPGDPGPPGEPGPAGQDGP 1310 1320 1330 1340 1350 1360 450 460 470 480 490 pF1KE1 VGDPGVAGLPGEKGEKGESGEPGPKGQQGVRGEPGYPGP---SGDAGAPGVQGYPGPPGP :: : : ::. : : .:::::.: : :: :: :: .:. :: : : :::: CCDS75 PGDKGDDGEPGQTGSPGPTGEPGPSGPPGKRGPPGPAGPEGRQGEKGAKGEAGLEGPPGK 1370 1380 1390 1400 1410 1420 500 510 520 530 540 550 pF1KE1 RGLAGNRGVPGQPGRQGVEGRDATDQHIVDVALKMLQEQLAEVAVSAKREALGAVGMMGP : : .:.::.:: .:..: . ..: .. :. : :: CCDS75 TGPIGPQGAPGKPGPDGLRG---------------IPGPVGEQGLP------GSPGPDGP 1430 1440 1450 1460 560 570 580 590 600 610 pF1KE1 PGPPGPPGYPGKQGPHGHPGPRGVPGIVGAVGQIGNTGPKGKRGEKGDPGEVG-RGHPGM ::: :::: :: .: : : .: ::..: .: :. : :: :: : : : .:. :. CCDS75 PGPMGPPGLPGLKGDSGPKGEKGHPGLIGLIGPPGEQGEKGDRGLPGPQGSSGPKGEQGI 1470 1480 1490 1500 1510 1520 620 630 640 650 660 670 pF1KE1 --P----GPPGIPGLPGRPGQAINGKDGDRGSPGAPGEAGRPGLPGPVGLPGFCEPAACL : :::: ::::: :: .: :. : : :::: CCDS75 TGPSGPIGPPGPPGLPGPPGP--KGAKGSSGPTGPKGEAGHPGPPGPPGPPGEVIQPLPI 1530 1540 1550 1560 1570 680 pF1KE1 GASAYASARLTEPGSIKGP CCDS75 QASRTRRNIDASQLLDDGNGENYVDYADGMEEIFGSLNSLKLEIEQMKRPLGTQQNPART 1580 1590 1600 1610 1620 1630 >-- initn: 3776 init1: 1012 opt: 1370 Z-score: 584.0 bits: 119.9 E(32554): 3.3e-26 Smith-Waterman score: 1393; 41.3% identity (56.4% similar) in 550 aa overlap (21-517:372-913) 10 20 30 40 50 pF1KE1 MAAATASPRSLLVLLQVVVLALAQIRGPPGERGPPGPPGPPGVPGSDGID : :.: .. . . :.:: :.: .. CCDS75 PSEDYYTPSPYDDLTYGEGEENPDQPTDPGAGAEIPTSTADTSNSSNPAPPPGEGADDLE 350 360 370 380 390 400 60 70 80 90 100 pF1KE1 GD-------NGPPGKAGPP-GPKGEPGKAGPDGPDGKPGI-DGLTGAKGEPGPMGIPGVK :. : . : : . :.. :: : .. : .:. : .:: : : :.. CCDS75 GEFTEETIRNLDENYYDPYYDPTSSPSEIGPGMPANQDTIYEGIGGPRGEKGQKGEPAII 410 420 430 440 450 460 110 120 130 140 150 pF1KE1 GQPGL--PGPPGLPGP-GFAGPPGPPGPVGLPGEIGIRGPKGDPGPDGPSGPPGPPGK-- .::. :::: :: :. :::: ::.: :. : ::: : :: : .: ::::: CCDS75 -EPGMLIEGPPGPEGPAGLPGPPGTMGPTGQVGDPGERGPPGRPGLPGADGLPGPPGTML 470 480 490 500 510 520 160 170 180 190 200 pF1KE1 --PGRPGTIQGLEGSADFLCPTNCPPG----------MKGPPGLQGVKGHAGKRGILGDP : : : : :: . .. . ..:: : .:. .:. : .: : CCDS75 MLPFRFGG-GGDAGSKGPMVSAQESQAQAILQQARLALRGPAGPMGL---TGRPGPVGPP 530 540 550 560 570 210 220 230 240 250 pF1KE1 GHQGKPGPKGDVGASGEQGIPGPPGP------------QGIRGYPGMAGPKGETGPHGYK : : : :::: .: .:. ::::: .: ::.::..::::. : : CCDS75 GSGGLKGEPGDVGPQGPRGVQGPPGPAGKPGRRGRAGSDGARGMPGQTGPKGDRGFDGLA 580 590 600 610 620 630 260 270 280 290 300 pF1KE1 GMVGAIGATGPPGEEGPRGPPGRAGEKGDEGS------PGIRGPQGITGPKGATGPPGIN :. : : : :: :: :::: ::.::.: :: ::.:. :::: :::: CCDS75 GLPGEKGHRGDPGPSGPPGPPGDDGERGDDGEVGPRGLPGEPGPRGLLGPKGPPGPPGPP 640 650 660 670 680 690 310 320 330 340 350 360 pF1KE1 GKDGTPGTPGMKGSAGQAGQPGSPGHQGVAGVPGQPGTKGGPGDQGEPGPQGLPGFSGPP : : : :: ::..: :.:: ::.:: :. : :: .:. : :: :: : ::. : : CCDS75 GVTGMDGQPGPKGNVGPQGEPGPPGQQGNPGAQGLPGPQGAIGPPGEKGPLGKPGLPGMP 700 710 720 730 740 750 370 380 390 400 410 pF1KE1 GKEGEPGPRGEIGPQGIMGQKGDQGERGPVGQPGPQGRQGP---------KGEQGPPGIP : .: :: :. :: : : .: : .::.: :::.: .: :::.: :.: CCDS75 GADGPPGHPGKEGPPGEKGGQGPPGPQGPIGYPGPRGVKGADGIRGLKGTKGEKGEDGFP 760 770 780 790 800 810 420 430 440 450 460 470 pF1KE1 GPQGLPGVKGDKGSPGKTGPRGKVGDPGVAGLPGEKGEKGESGEPGPKGQQGVRGEPGYP : .: :.:::.: : ::::. : : : : .:. : : :: ::. :: : :::: CCDS75 GFKGDMGIKGDRGEIGPPGPRGEDGPEGPKGRGGPNGDPGPLGPPGEKGKLGVPGLPGYP 820 830 840 850 860 870 480 490 500 510 520 530 pF1KE1 GPSGDAGAPGVQGYPGPPGPRGLAGNRGVPGQPGRQGVEGRDATDQHIVDVALKMLQEQL : .: :. :.:: :: : :.::.::.:: .: .: CCDS75 GRQGPKGSI---GFPGFPGANGEKGGRGTPGKPGPRGQRGPTGPRGERGPRGITGKPGPK 880 890 900 910 920 930 540 550 560 570 580 590 pF1KE1 AEVAVSAKREALGAVGMMGPPGPPGPPGYPGKQGPHGHPGPRGVPGIVGAVGQIGNTGPK CCDS75 GNSGGDGPAGPPGERGPNGPQGPTGFPGPKGPPGPPGKDGLPGHPGQRGETGFQGKTGPP 940 950 960 970 980 990 >>CCDS6982.1 COL5A1 gene_id:1289|Hs108|chr9 (1838 aa) initn: 5784 init1: 1264 opt: 1876 Z-score: 791.6 bits: 158.3 E(32554): 9e-38 Smith-Waterman score: 1971; 46.7% identity (56.9% similar) in 670 aa overlap (27-650:916-1558) 10 20 30 40 50 pF1KE1 MAAATASPRSLLVLLQVVVLALAQIRGPPGERGPPGPPGPPGVPGSDGIDGDNGPP :: ::::: : : :: :..: :: ::: CCDS69 GFPGFPGANGEKGGRGTPGKPGPRGQRGPTGPRGERGPRGITGKPGPKGNSGGDGPAGPP 890 900 910 920 930 940 60 70 80 90 100 110 pF1KE1 GKAGPPGPKGEPGKAGPDGPDGKPGIDGLTGAKGEPGPMGIPGVKGQPGLPG------PP :. :: ::.: : :: :: : :: ::: : :. : :. : : :: :: : CCDS69 GERGPNGPQGPTGFPGPKGPPGPPGKDGLPGHPGQRGETGFQGKTGPPGPPGVVGPQGPT 950 960 970 980 990 1000 120 130 140 150 160 pF1KE1 GLPGP----GFAGPPGPPGPVGLPGEIGIRGPKGDPGPDGPSGPPGPPGKPGRPGTIQGL : :: : ::::::: :::: : .: :::::: : : :::: : :: .:: CCDS69 GETGPMGERGHPGPPGPPGEQGLPGLAGKEGTKGDPGPAGLPGKDGPPGLRGFPGD-RGL 1010 1020 1030 1040 1050 1060 170 180 190 200 210 220 pF1KE1 EG--SADFLCPTNCPPGMKGP---PGLQGVKGHAGKRGILGDPGHQGKPGPKGDVGASGE : .: : .. ::: :: :: .: : :: :: : :: :: ::: :. :: :: CCDS69 PGPVGALGLKGNEGPPGPPGPAGSPGERGPAGAAGPIGIPGRPGPQGPPGPAGEKGAPGE 1070 1080 1090 1100 1110 1120 230 240 250 260 pF1KE1 QGIPGP------------PGPQGIRGYPGMAGPKGETGPHGYKGMVGAIGATGPPGEEGP .: :: ::: : : :: : ::: : : :: : : :::: :: CCDS69 KGPQGPAGRDGLQGPVGLPGPAGPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGPPGPTGP 1130 1140 1150 1160 1170 1180 270 280 290 300 310 320 pF1KE1 RGPPGRAGEKGDEGSPGIRGPQGI------TGPKGATGPPGINGKDGTPGTPGMKGSAGQ .:: :. : .: .: :: :: ::. ::.: :::: : .: :: :: :: .:. CCDS69 QGPIGQPGPSGADGEPGPRGQQGLFGQKGDEGPRGFPGPPGPVGLQGLPGPPGEKGETGD 1190 1200 1210 1220 1230 1240 330 340 350 360 370 380 pF1KE1 AGQ---PGSPGHQGVAGVPGQPGTKGGPGDQGEPGPQGLPGFSGPPGKEGEPGPRGEIGP .:: :: :: .: .:.:: : .: :: :.:: : .: ::. :::: :: :: CCDS69 VGQMGPPGPPGPRGPSGAPGADGPQGPPGGIGNPGAVGE---KGEPGEAGEPGLPGEGGP 1250 1260 1270 1280 1290 1300 390 400 410 420 430 440 pF1KE1 QGIMGQKGDQGERGPVGQPGPQGRQGPKGEQGPPGIPGPQGLPGVKGDKGSPGKTGPRGK : :..:..:: :: : :: : .:: :..:: : ::: :.:: : : :: .: : CCDS69 PGPKGERGEKGESGPSGAAGPPGPKGPPGDDGPKGSPGPVGFPGDPGPPGEPGPAGQDGP 1310 1320 1330 1340 1350 1360 450 460 470 480 490 pF1KE1 VGDPGVAGLPGEKGEKGESGEPGPKGQQGVRGEPGYPGP---SGDAGAPGVQGYPGPPGP :: : : ::. : : .:::::.: : :: :: :: .:. :: : : :::: CCDS69 PGDKGDDGEPGQTGSPGPTGEPGPSGPPGKRGPPGPAGPEGRQGEKGAKGEAGLEGPPGK 1370 1380 1390 1400 1410 1420 500 510 520 530 540 550 pF1KE1 RGLAGNRGVPGQPGRQGVEGRDATDQHIVDVALKMLQEQLAEVAVSAKREALGAVGMMGP : : .:.::.:: .:..: . ..: .. :. : :: CCDS69 TGPIGPQGAPGKPGPDGLRG---------------IPGPVGEQGLP------GSPGPDGP 1430 1440 1450 1460 560 570 580 590 600 610 pF1KE1 PGPPGPPGYPGKQGPHGHPGPRGVPGIVGAVGQIGNTGPKGKRGEKGDPGEVG-RGHPGM ::: :::: :: .: : : .: ::..: .: :. : :: :: : : : .:. :. CCDS69 PGPMGPPGLPGLKGDSGPKGEKGHPGLIGLIGPPGEQGEKGDRGLPGPQGSSGPKGEQGI 1470 1480 1490 1500 1510 1520 620 630 640 650 660 670 pF1KE1 --P----GPPGIPGLPGRPGQAINGKDGDRGSPGAPGEAGRPGLPGPVGLPGFCEPAACL : :::: ::::: :: .: :. : : :::: CCDS69 TGPSGPIGPPGPPGLPGPPGP--KGAKGSSGPTGPKGEAGHPGPPGPPGPPGEVIQPLPI 1530 1540 1550 1560 1570 680 pF1KE1 GASAYASARLTEPGSIKGP CCDS69 QASRTRRNIDASQLLDDGNGENYVDYADGMEEIFGSLNSLKLEIEQMKRPLGTQQNPART 1580 1590 1600 1610 1620 1630 >-- initn: 3776 init1: 1012 opt: 1370 Z-score: 584.0 bits: 119.9 E(32554): 3.3e-26 Smith-Waterman score: 1393; 41.3% identity (56.4% similar) in 550 aa overlap (21-517:372-913) 10 20 30 40 50 pF1KE1 MAAATASPRSLLVLLQVVVLALAQIRGPPGERGPPGPPGPPGVPGSDGID : :.: .. . . :.:: :.: .. CCDS69 PSEDYYTPSPYDDLTYGEGEENPDQPTDPGAGAEIPTSTADTSNSSNPAPPPGEGADDLE 350 360 370 380 390 400 60 70 80 90 100 pF1KE1 GD-------NGPPGKAGPP-GPKGEPGKAGPDGPDGKPGI-DGLTGAKGEPGPMGIPGVK :. : . : : . :.. :: : .. : .:. : .:: : : :.. CCDS69 GEFTEETIRNLDENYYDPYYDPTSSPSEIGPGMPANQDTIYEGIGGPRGEKGQKGEPAII 410 420 430 440 450 460 110 120 130 140 150 pF1KE1 GQPGL--PGPPGLPGP-GFAGPPGPPGPVGLPGEIGIRGPKGDPGPDGPSGPPGPPGK-- .::. :::: :: :. :::: ::.: :. : ::: : :: : .: ::::: CCDS69 -EPGMLIEGPPGPEGPAGLPGPPGTMGPTGQVGDPGERGPPGRPGLPGADGLPGPPGTML 470 480 490 500 510 520 160 170 180 190 200 pF1KE1 --PGRPGTIQGLEGSADFLCPTNCPPG----------MKGPPGLQGVKGHAGKRGILGDP : : : : :: . .. . ..:: : .:. .:. : .: : CCDS69 MLPFRFGG-GGDAGSKGPMVSAQESQAQAILQQARLALRGPAGPMGL---TGRPGPVGPP 530 540 550 560 570 210 220 230 240 250 pF1KE1 GHQGKPGPKGDVGASGEQGIPGPPGP------------QGIRGYPGMAGPKGETGPHGYK : : : :::: .: .:. ::::: .: ::.::..::::. : : CCDS69 GSGGLKGEPGDVGPQGPRGVQGPPGPAGKPGRRGRAGSDGARGMPGQTGPKGDRGFDGLA 580 590 600 610 620 630 260 270 280 290 300 pF1KE1 GMVGAIGATGPPGEEGPRGPPGRAGEKGDEGS------PGIRGPQGITGPKGATGPPGIN :. : : : :: :: :::: ::.::.: :: ::.:. :::: :::: CCDS69 GLPGEKGHRGDPGPSGPPGPPGDDGERGDDGEVGPRGLPGEPGPRGLLGPKGPPGPPGPP 640 650 660 670 680 690 310 320 330 340 350 360 pF1KE1 GKDGTPGTPGMKGSAGQAGQPGSPGHQGVAGVPGQPGTKGGPGDQGEPGPQGLPGFSGPP : : : :: ::..: :.:: ::.:: :. : :: .:. : :: :: : ::. : : CCDS69 GVTGMDGQPGPKGNVGPQGEPGPPGQQGNPGAQGLPGPQGAIGPPGEKGPLGKPGLPGMP 700 710 720 730 740 750 370 380 390 400 410 pF1KE1 GKEGEPGPRGEIGPQGIMGQKGDQGERGPVGQPGPQGRQGP---------KGEQGPPGIP : .: :: :. :: : : .: : .::.: :::.: .: :::.: :.: CCDS69 GADGPPGHPGKEGPPGEKGGQGPPGPQGPIGYPGPRGVKGADGIRGLKGTKGEKGEDGFP 760 770 780 790 800 810 420 430 440 450 460 470 pF1KE1 GPQGLPGVKGDKGSPGKTGPRGKVGDPGVAGLPGEKGEKGESGEPGPKGQQGVRGEPGYP : .: :.:::.: : ::::. : : : : .:. : : :: ::. :: : :::: CCDS69 GFKGDMGIKGDRGEIGPPGPRGEDGPEGPKGRGGPNGDPGPLGPPGEKGKLGVPGLPGYP 820 830 840 850 860 870 480 490 500 510 520 530 pF1KE1 GPSGDAGAPGVQGYPGPPGPRGLAGNRGVPGQPGRQGVEGRDATDQHIVDVALKMLQEQL : .: :. :.:: :: : :.::.::.:: .: .: CCDS69 GRQGPKGSI---GFPGFPGANGEKGGRGTPGKPGPRGQRGPTGPRGERGPRGITGKPGPK 880 890 900 910 920 930 540 550 560 570 580 590 pF1KE1 AEVAVSAKREALGAVGMMGPPGPPGPPGYPGKQGPHGHPGPRGVPGIVGAVGQIGNTGPK CCDS69 GNSGGDGPAGPPGERGPNGPQGPTGFPGPKGPPGPPGKDGLPGHPGQRGETGFQGKTGPP 940 950 960 970 980 990 >>CCDS780.2 COL11A1 gene_id:1301|Hs108|chr1 (1690 aa) initn: 4540 init1: 1219 opt: 1839 Z-score: 776.7 bits: 155.5 E(32554): 6e-37 Smith-Waterman score: 1974; 46.3% identity (58.1% similar) in 669 aa overlap (27-662:770-1421) 10 20 30 40 50 pF1KE1 MAAATASPRSLLVLLQVVVLALAQIRGPPGERGPPGPPGPPGVPGSDGIDGDNGPP :: : :: :: : :: :..: :: ::: CCDS78 GFPGFPGANGEKGARGVAGKPGPRGQRGPTGPRGSRGARGPTGKPGPKGTSGGDGPPGPP 740 750 760 770 780 790 60 70 80 90 100 pF1KE1 GKAGPPGPKGEPGKAGPDGPDGKPGIDGLTGAKGEPGPMGIPGVKGQPGLPG-------P :. :: ::.: : :: :: : :: ::: : :. : :. : : :: :: : CCDS78 GERGPQGPQGPVGFPGPKGPPGPPGKDGLPGHPGQRGETGFQGKTGPPG-PGGVVGPQGP 800 810 820 830 840 850 110 120 130 140 150 160 pF1KE1 PGLPGP----GFAGPPGPPGPVGLPGEIGIRGPKGDPGPDGPSGPPGPPGKPGRPGTIQG : :: : ::::::: :::: : .: ::::::.: :: :: : : :: .: CCDS78 TGETGPIGERGHPGPPGPPGEQGLPGAAGKEGAKGDPGPQGISGKDGPAGLRGFPGE-RG 860 870 880 890 900 910 170 180 190 200 210 220 pF1KE1 LEGSADFLCPTNCPPGMKGPPGLQGVKGHAGKRGILGDPGHQGKPGPKGDVGASGEQGIP : :: .: :::.: .: : : .:.::..:. : : .: :. : CCDS78 L-------------PGAQGAPGLKGGEGPQGPPGPVGSPGERGSAGTAGPIGLPGRPGPQ 920 930 940 950 960 230 240 250 260 270 280 pF1KE1 GPPGPQGIRGYPGMAGPKGETGPHGYKGMVGAIGATGP---PGEEGPRGPPGRAGEKGDE ::::: : .: :: ::.: .: : .: :: : .:: :::.: .: :. :.::.. CCDS78 GPPGPAGEKGAPGEKGPQGPAGRDGVQGPVGLPGPAGPAGSPGEDGDKGEIGEPGQKGSK 970 980 990 1000 1010 1020 290 300 310 320 330 pF1KE1 GSPGIRGPQGITGPKGATGPPGINGKDGTPGTPGMKGSAGQAGQ------PGSPGHQGVA :. : :: : : .: .: ::: : :: :: :..: :: :. :: :: :. CCDS78 GDKGENGPPGPPGLQGPVGAPGIAGGDGEPGPRGQQGMFGQKGDEGARGFPGPPGPIGLQ 1030 1040 1050 1060 1070 1080 340 350 360 370 380 390 pF1KE1 GVPGQPGTKGGPGDQGEPGPQGLPGFSGPPGKEGEPGPRGEIGPQGIMGQKGDQGERGPV :.:: :: :: :: : :: : :: :: : .: ::.: : : .: :..:: : . CCDS78 GLPGPPGEKGENGDVGPMGPPGPPGPRGPQGPNGADGPQGPPGSVGSVGGVGEKGEPGEA 1090 1100 1110 1120 1130 1140 400 410 420 430 440 450 pF1KE1 GQPGPQGRQG---PKGEQGPPGIPGPQGLPGVKGDKGSPGKTGPRGK---VGDPGVAGLP :.::: :. : ::::.: : :: : : : :: :: ::.:. :: :: : : CCDS78 GNPGPPGEAGVGGPKGERGEKGEAGPPGAAGPPGAKGPPGDDGPKGNPGPVGFPGDPGPP 1150 1160 1170 1180 1190 1200 460 470 480 490 500 510 pF1KE1 GEKGEKGESGEPGPKGQQGVRGEPGYPGPSGDAGAPGVQGYPGPPGPRGLAGNRGVPGQP :: : :..: : ::..: :.:: :::::.:: :: : :::: : : .: : CCDS78 GEPGPAGQDGVGGDKGEDGDPGQPGPPGPSGEAGPPGPPGKRGPPGAAGAEGRQGEKGAK 1210 1220 1230 1240 1250 1260 520 530 540 550 560 pF1KE1 GRQGVEGRDATDQHI--VDVALKMLQEQLAEVAVSAKREAL-GAVGMMGPPGPPGPPGYP :. :.:: . . : : : : . . ...: ::.:. ::::: :::: : CCDS78 GEAGAEGPPGKTGPVGPQGPAGKPGPEGLRGIPGPVGEQGLPGAAGQDGPPGPMGPPGLP 1270 1280 1290 1300 1310 1320 570 580 590 600 610 620 pF1KE1 GKQGPHGHPGPRGVPGIVGAVGQIGNTGPKGKRG---EKGDPGEVGRGH-PGMPGPPGIP : .: : : .: ::..: .: :. : :: :: .:.:: : : :: :: : : CCDS78 GLKGDPGSKGEKGHPGLIGLIGPPGEQGEKGDRGLPGTQGSPGAKGDGGIPGPAGPLGPP 1330 1340 1350 1360 1370 1380 630 640 650 660 670 680 pF1KE1 GLPGRPGQAINGKDGDRGSPGAPGEAGRPGLPGPVGLPGFCEPAACLGASAYASARLTEP : :: :: .: :..:: : :. : ::::: : :: CCDS78 GPPGLPGP--QGPKGNKGSTGPAGQKGDSGLPGPPGSPGPPGEVIQPLPILSSKKTRRHT 1390 1400 1410 1420 1430 1440 pF1KE1 GSIKGP CCDS78 EGMQADADDNILDYSDGMEEIFGSLNSLKQDIEHMKFPMGTQTNPARTCKDLQLSHPDFP 1450 1460 1470 1480 1490 1500 >-- initn: 1968 init1: 997 opt: 1213 Z-score: 519.9 bits: 107.9 E(32554): 1.2e-22 Smith-Waterman score: 1348; 43.1% identity (56.2% similar) in 511 aa overlap (20-517:295-767) 10 20 30 40 pF1KE1 MAAATASPRSLLVLLQVVVLALAQIRGPPGERGPPGPPGPPGV--PGSD .: ..: : : : : : :.: :: CCDS78 DIIEYDYEYGEAEYKEAESVTEGPTVTEETIAQTEINGH-GAYGEKGQKGEPAVVEPGML 270 280 290 300 310 320 50 60 70 80 90 100 pF1KE1 GIDGDNGPPGKAGPPGPKGEPGKAGPDGPDGKPGIDGLTGAKGEPGPMGIPGVKGQPGLP .. :::: ::: : : :: :: :: : :: . :: : ::. : ::: CCDS78 -VE---GPPGPAGPAGIMGPPGLQGPTGPPGDPG---------DRGPPGRPGLPGADGLP 330 340 350 360 370 110 120 130 140 150 160 pF1KE1 GPPG--LPGPGFAGPPGPPGPVGLPGEIGIRGPKGDPGPDGPSGPPGPPGKPGRPGTIQG :::: : : : : ::. : .. . . . ::::: : :::: . : CCDS78 GPPGTMLMLPFRYGGDGSKGPTISAQEAQAQAIL-QQARIALRGPPGPMGLTGRPGPVGG 380 390 400 410 420 170 180 190 200 210 220 pF1KE1 LEGSADFLCPTNCPPGMKGPPGLQGVKGHAGKRGILGDPGHQGKPGPKGDVGASGEQGIP :: .: : .: : : ::. : :: :::: .: ::.: .:.: CCDS78 --------------PGSSGAKGESGDPGPQGPRGVQGPPGPTGKPGKRGRPGADGGRGMP 430 440 450 460 470 230 240 250 260 270 280 pF1KE1 GPPGPQGIRGYPGMAGPKGETGPHGYKGMVGAIGATGPPGEEGPRGPPGRAGEKGDEGSP : :: .: ::. :. : :. .:..: : : ::::..: :: . :: : .: : CCDS78 GEPGAKGDRGFDGLPGLPGD---KGHRGERGPQGPPGPPGDDGMRG---EDGEIGPRGLP 480 490 500 510 520 290 300 310 320 330 340 pF1KE1 GIRGPQGITGPKGATGPPGINGKDGTPGTPGMKGSAGQAGQPGSPGHQGVAGVPGQPGTK : ::.:. ::.:. : :: : :. : :: ::. : :.:: ::.:: : : :: . CCDS78 GEAGPRGLLGPRGTPGAPGQPGMAGVDGPPGPKGNMGPQGEPGPPGQQGNPGPQGLPGPQ 530 540 550 560 570 580 350 360 370 380 390 400 pF1KE1 GGPGDQGEPGPQGLPGFSGPPGKEGEPGPRGEIGPQGIMGQKGDQGERGPVGQPGPQGRQ : : :: :::: ::..: :: .: :: :. : .: : : : .::.: :::.: . CCDS78 GPIGPPGEKGPQGKPGLAGLPGADGPPGHPGKEGQSGEKGALGPPGPQGPIGYPGPRGVK 590 600 610 620 630 640 410 420 430 440 450 pF1KE1 GP---------KGEQGPPGIPGPQGLPGVKGDKGSPGKTGPRGKVGDPGVAGLPGEKGEK : :::.: :.:: .: :.:::.: :. ::::. : : : : :. CCDS78 GADGVRGLKGSKGEKGEDGFPGFKGDMGLKGDRGEVGQIGPRGEDGPEGPKGRAGPTGDP 650 660 670 680 690 700 460 470 480 490 500 510 pF1KE1 GESGEPGPKGQQGVRGEPGYPGPSGDAGAPGVQGYPGPPGPRGLAGNRGVPGQPGRQGVE : ::. : ::. :: : ::::: .: :. :.:: :: : : ::: :.:: .: . CCDS78 GPSGQAGEKGKLGVPGLPGYPGRQGPKGST---GFPGFPGANGEKGARGVAGKPGPRGQR 710 720 730 740 750 760 520 530 540 550 560 570 pF1KE1 GRDATDQHIVDVALKMLQEQLAEVAVSAKREALGAVGMMGPPGPPGPPGYPGKQGPHGHP : CCDS78 GPTGPRGSRGARGPTGKPGPKGTSGGDGPPGPPGERGPQGPQGPVGFPGPKGPPGPPGKD 770 780 790 800 810 820 689 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 19:21:37 2016 done: Sun Nov 6 19:21:38 2016 Total Scan time: 3.750 Total Display time: 0.230 Function used was FASTA [36.3.4 Apr, 2011]