FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6605, 271 aa 1>>>pF1KE6605 271 - 271 aa - 271 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.3633+/-0.00082; mu= 12.0278+/- 0.050 mean_var=137.3587+/-28.481, 0's: 0 Z-trim(112.4): 172 B-trim: 637 in 1/50 Lambda= 0.109432 statistics sampled from 13017 (13198) to 13017 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.755), E-opt: 0.2 (0.405), width: 16 Scan time: 2.230 The best scores are: opt bits E(32554) CCDS1649.1 COLEC11 gene_id:78989|Hs108|chr2 ( 271) 1862 304.7 4.4e-83 CCDS58689.1 COLEC11 gene_id:78989|Hs108|chr2 ( 285) 1862 304.8 4.5e-83 CCDS58691.1 COLEC11 gene_id:78989|Hs108|chr2 ( 245) 1590 261.8 3.5e-70 CCDS58693.1 COLEC11 gene_id:78989|Hs108|chr2 ( 221) 1416 234.2 6e-62 CCDS1650.1 COLEC11 gene_id:78989|Hs108|chr2 ( 268) 1414 234.0 8.5e-62 CCDS58692.1 COLEC11 gene_id:78989|Hs108|chr2 ( 221) 1355 224.6 4.7e-59 CCDS58690.1 COLEC11 gene_id:78989|Hs108|chr2 ( 247) 1355 224.7 5.1e-59 CCDS58694.1 COLEC11 gene_id:78989|Hs108|chr2 ( 197) 1239 206.2 1.4e-53 CCDS6327.1 COLEC10 gene_id:10584|Hs108|chr8 ( 277) 938 158.9 3.7e-39 CCDS7247.1 MBL2 gene_id:4153|Hs108|chr10 ( 248) 385 71.5 6.5e-13 >>CCDS1649.1 COLEC11 gene_id:78989|Hs108|chr2 (271 aa) initn: 1862 init1: 1862 opt: 1862 Z-score: 1605.1 bits: 304.7 E(32554): 4.4e-83 Smith-Waterman score: 1862; 100.0% identity (100.0% similar) in 271 aa overlap (1-271:1-271) 10 20 30 40 50 60 pF1KE6 MRGNLALVGVLISLAFLSLLPSGHPQPAGDDACSVQILVPGLKGDAGEKGDKGAPGRPGR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS16 MRGNLALVGVLISLAFLSLLPSGHPQPAGDDACSVQILVPGLKGDAGEKGDKGAPGRPGR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 VGPTGEKGDMGDKGQKGSVGRHGKIGPIGSKGEKGDSGDIGPPGPNGEPGLPCECSQLRK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS16 VGPTGEKGDMGDKGQKGSVGRHGKIGPIGSKGEKGDSGDIGPPGPNGEPGLPCECSQLRK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 AIGEMDNQVSQLTSELKFIKNAVAGVRETESKIYLLVKEEKRYADAQLSCQGRGGTLSMP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS16 AIGEMDNQVSQLTSELKFIKNAVAGVRETESKIYLLVKEEKRYADAQLSCQGRGGTLSMP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 KDEAANGLMAAYLAQAGLARVFIGINDLEKEGAFVYSDHSPMRTFNKWRSGEPNNAYDEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS16 KDEAANGLMAAYLAQAGLARVFIGINDLEKEGAFVYSDHSPMRTFNKWRSGEPNNAYDEE 190 200 210 220 230 240 250 260 270 pF1KE6 DCVEMVASGGWNDVACHTTMYFMCEFDKENM ::::::::::::::::::::::::::::::: CCDS16 DCVEMVASGGWNDVACHTTMYFMCEFDKENM 250 260 270 >>CCDS58689.1 COLEC11 gene_id:78989|Hs108|chr2 (285 aa) initn: 1862 init1: 1862 opt: 1862 Z-score: 1604.8 bits: 304.8 E(32554): 4.5e-83 Smith-Waterman score: 1862; 100.0% identity (100.0% similar) in 271 aa overlap (1-271:15-285) 10 20 30 40 pF1KE6 MRGNLALVGVLISLAFLSLLPSGHPQPAGDDACSVQILVPGLKGDA :::::::::::::::::::::::::::::::::::::::::::::: CCDS58 MKKQRGVGVLPALRMRGNLALVGVLISLAFLSLLPSGHPQPAGDDACSVQILVPGLKGDA 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE6 GEKGDKGAPGRPGRVGPTGEKGDMGDKGQKGSVGRHGKIGPIGSKGEKGDSGDIGPPGPN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 GEKGDKGAPGRPGRVGPTGEKGDMGDKGQKGSVGRHGKIGPIGSKGEKGDSGDIGPPGPN 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE6 GEPGLPCECSQLRKAIGEMDNQVSQLTSELKFIKNAVAGVRETESKIYLLVKEEKRYADA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 GEPGLPCECSQLRKAIGEMDNQVSQLTSELKFIKNAVAGVRETESKIYLLVKEEKRYADA 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE6 QLSCQGRGGTLSMPKDEAANGLMAAYLAQAGLARVFIGINDLEKEGAFVYSDHSPMRTFN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 QLSCQGRGGTLSMPKDEAANGLMAAYLAQAGLARVFIGINDLEKEGAFVYSDHSPMRTFN 190 200 210 220 230 240 230 240 250 260 270 pF1KE6 KWRSGEPNNAYDEEDCVEMVASGGWNDVACHTTMYFMCEFDKENM ::::::::::::::::::::::::::::::::::::::::::::: CCDS58 KWRSGEPNNAYDEEDCVEMVASGGWNDVACHTTMYFMCEFDKENM 250 260 270 280 >>CCDS58691.1 COLEC11 gene_id:78989|Hs108|chr2 (245 aa) initn: 1583 init1: 1583 opt: 1590 Z-score: 1373.6 bits: 261.8 E(32554): 3.5e-70 Smith-Waterman score: 1590; 94.3% identity (95.1% similar) in 247 aa overlap (25-271:6-245) 10 20 30 40 50 60 pF1KE6 MRGNLALVGVLISLAFLSLLPSGHPQPAGDDACSVQILVPGLKGDAGEKGDKGAPGRPGR :.: : :. : ::::::::::::::::: CCDS58 MWWVPPSPYGCLPCA-------LPGDAGEKGDKGAPGRPGR 10 20 30 70 80 90 100 110 120 pF1KE6 VGPTGEKGDMGDKGQKGSVGRHGKIGPIGSKGEKGDSGDIGPPGPNGEPGLPCECSQLRK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 VGPTGEKGDMGDKGQKGSVGRHGKIGPIGSKGEKGDSGDIGPPGPNGEPGLPCECSQLRK 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE6 AIGEMDNQVSQLTSELKFIKNAVAGVRETESKIYLLVKEEKRYADAQLSCQGRGGTLSMP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 AIGEMDNQVSQLTSELKFIKNAVAGVRETESKIYLLVKEEKRYADAQLSCQGRGGTLSMP 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE6 KDEAANGLMAAYLAQAGLARVFIGINDLEKEGAFVYSDHSPMRTFNKWRSGEPNNAYDEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 KDEAANGLMAAYLAQAGLARVFIGINDLEKEGAFVYSDHSPMRTFNKWRSGEPNNAYDEE 160 170 180 190 200 210 250 260 270 pF1KE6 DCVEMVASGGWNDVACHTTMYFMCEFDKENM ::::::::::::::::::::::::::::::: CCDS58 DCVEMVASGGWNDVACHTTMYFMCEFDKENM 220 230 240 >>CCDS58693.1 COLEC11 gene_id:78989|Hs108|chr2 (221 aa) initn: 1405 init1: 1405 opt: 1416 Z-score: 1225.7 bits: 234.2 E(32554): 6e-62 Smith-Waterman score: 1416; 95.4% identity (95.9% similar) in 217 aa overlap (55-271:5-221) 30 40 50 60 70 80 pF1KE6 PQPAGDDACSVQILVPGLKGDAGEKGDKGAPGRPGRVGPTGEKGDMGDKGQKGSVGRHGK : : : . ::::::::::::::::: CCDS58 MWWVPPSPYGCLPCALPGDMGDKGQKGSVGRHGK 10 20 30 90 100 110 120 130 140 pF1KE6 IGPIGSKGEKGDSGDIGPPGPNGEPGLPCECSQLRKAIGEMDNQVSQLTSELKFIKNAVA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 IGPIGSKGEKGDSGDIGPPGPNGEPGLPCECSQLRKAIGEMDNQVSQLTSELKFIKNAVA 40 50 60 70 80 90 150 160 170 180 190 200 pF1KE6 GVRETESKIYLLVKEEKRYADAQLSCQGRGGTLSMPKDEAANGLMAAYLAQAGLARVFIG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 GVRETESKIYLLVKEEKRYADAQLSCQGRGGTLSMPKDEAANGLMAAYLAQAGLARVFIG 100 110 120 130 140 150 210 220 230 240 250 260 pF1KE6 INDLEKEGAFVYSDHSPMRTFNKWRSGEPNNAYDEEDCVEMVASGGWNDVACHTTMYFMC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 INDLEKEGAFVYSDHSPMRTFNKWRSGEPNNAYDEEDCVEMVASGGWNDVACHTTMYFMC 160 170 180 190 200 210 270 pF1KE6 EFDKENM ::::::: CCDS58 EFDKENM 220 >>CCDS1650.1 COLEC11 gene_id:78989|Hs108|chr2 (268 aa) initn: 1414 init1: 1414 opt: 1414 Z-score: 1222.9 bits: 234.0 E(32554): 8.5e-62 Smith-Waterman score: 1414; 99.0% identity (99.5% similar) in 207 aa overlap (65-271:62-268) 40 50 60 70 80 90 pF1KE6 VQILVPGLKGDAGEKGDKGAPGRPGRVGPTGEKGDMGDKGQKGSVGRHGKIGPIGSKGEK : .::::::::::::::::::::::::::: CCDS16 SAPREKKQSQPVVTASDISKRKCTSSFVEMGSQGDMGDKGQKGSVGRHGKIGPIGSKGEK 40 50 60 70 80 90 100 110 120 130 140 150 pF1KE6 GDSGDIGPPGPNGEPGLPCECSQLRKAIGEMDNQVSQLTSELKFIKNAVAGVRETESKIY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS16 GDSGDIGPPGPNGEPGLPCECSQLRKAIGEMDNQVSQLTSELKFIKNAVAGVRETESKIY 100 110 120 130 140 150 160 170 180 190 200 210 pF1KE6 LLVKEEKRYADAQLSCQGRGGTLSMPKDEAANGLMAAYLAQAGLARVFIGINDLEKEGAF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS16 LLVKEEKRYADAQLSCQGRGGTLSMPKDEAANGLMAAYLAQAGLARVFIGINDLEKEGAF 160 170 180 190 200 210 220 230 240 250 260 270 pF1KE6 VYSDHSPMRTFNKWRSGEPNNAYDEEDCVEMVASGGWNDVACHTTMYFMCEFDKENM ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS16 VYSDHSPMRTFNKWRSGEPNNAYDEEDCVEMVASGGWNDVACHTTMYFMCEFDKENM 220 230 240 250 260 >>CCDS58692.1 COLEC11 gene_id:78989|Hs108|chr2 (221 aa) initn: 1344 init1: 1344 opt: 1355 Z-score: 1173.6 bits: 224.6 E(32554): 4.7e-59 Smith-Waterman score: 1362; 84.6% identity (85.4% similar) in 247 aa overlap (25-271:6-221) 10 20 30 40 50 60 pF1KE6 MRGNLALVGVLISLAFLSLLPSGHPQPAGDDACSVQILVPGLKGDAGEKGDKGAPGRPGR :.: : :. : ::::::::::::::::: CCDS58 MWWVPPSPYGCLPCA-------LPGDAGEKGDKGAPGRPGR 10 20 30 70 80 90 100 110 120 pF1KE6 VGPTGEKGDMGDKGQKGSVGRHGKIGPIGSKGEKGDSGDIGPPGPNGEPGLPCECSQLRK :::::::: :::::::::::::::::::::::::::: CCDS58 VGPTGEKG------------------------EKGDSGDIGPPGPNGEPGLPCECSQLRK 40 50 60 70 130 140 150 160 170 180 pF1KE6 AIGEMDNQVSQLTSELKFIKNAVAGVRETESKIYLLVKEEKRYADAQLSCQGRGGTLSMP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 AIGEMDNQVSQLTSELKFIKNAVAGVRETESKIYLLVKEEKRYADAQLSCQGRGGTLSMP 80 90 100 110 120 130 190 200 210 220 230 240 pF1KE6 KDEAANGLMAAYLAQAGLARVFIGINDLEKEGAFVYSDHSPMRTFNKWRSGEPNNAYDEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 KDEAANGLMAAYLAQAGLARVFIGINDLEKEGAFVYSDHSPMRTFNKWRSGEPNNAYDEE 140 150 160 170 180 190 250 260 270 pF1KE6 DCVEMVASGGWNDVACHTTMYFMCEFDKENM ::::::::::::::::::::::::::::::: CCDS58 DCVEMVASGGWNDVACHTTMYFMCEFDKENM 200 210 220 >>CCDS58690.1 COLEC11 gene_id:78989|Hs108|chr2 (247 aa) initn: 1355 init1: 1355 opt: 1355 Z-score: 1173.0 bits: 224.7 E(32554): 5.1e-59 Smith-Waterman score: 1634; 91.1% identity (91.1% similar) in 271 aa overlap (1-271:1-247) 10 20 30 40 50 60 pF1KE6 MRGNLALVGVLISLAFLSLLPSGHPQPAGDDACSVQILVPGLKGDAGEKGDKGAPGRPGR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 MRGNLALVGVLISLAFLSLLPSGHPQPAGDDACSVQILVPGLKGDAGEKGDKGAPGRPGR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 VGPTGEKGDMGDKGQKGSVGRHGKIGPIGSKGEKGDSGDIGPPGPNGEPGLPCECSQLRK :::::::: :::::::::::::::::::::::::::: CCDS58 VGPTGEKG------------------------EKGDSGDIGPPGPNGEPGLPCECSQLRK 70 80 90 130 140 150 160 170 180 pF1KE6 AIGEMDNQVSQLTSELKFIKNAVAGVRETESKIYLLVKEEKRYADAQLSCQGRGGTLSMP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 AIGEMDNQVSQLTSELKFIKNAVAGVRETESKIYLLVKEEKRYADAQLSCQGRGGTLSMP 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE6 KDEAANGLMAAYLAQAGLARVFIGINDLEKEGAFVYSDHSPMRTFNKWRSGEPNNAYDEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 KDEAANGLMAAYLAQAGLARVFIGINDLEKEGAFVYSDHSPMRTFNKWRSGEPNNAYDEE 160 170 180 190 200 210 250 260 270 pF1KE6 DCVEMVASGGWNDVACHTTMYFMCEFDKENM ::::::::::::::::::::::::::::::: CCDS58 DCVEMVASGGWNDVACHTTMYFMCEFDKENM 220 230 240 >>CCDS58694.1 COLEC11 gene_id:78989|Hs108|chr2 (197 aa) initn: 1235 init1: 1235 opt: 1239 Z-score: 1075.3 bits: 206.2 E(32554): 1.4e-53 Smith-Waterman score: 1239; 97.8% identity (98.4% similar) in 185 aa overlap (87-271:13-197) 60 70 80 90 100 110 pF1KE6 RPGRVGPTGEKGDMGDKGQKGSVGRHGKIGPIGSKGEKGDSGDIGPPGPNGEPGLPCECS : . ::::::::::::::::::::::::: CCDS58 MWWVPPSPYGCLPCALPGEKGDSGDIGPPGPNGEPGLPCECS 10 20 30 40 120 130 140 150 160 170 pF1KE6 QLRKAIGEMDNQVSQLTSELKFIKNAVAGVRETESKIYLLVKEEKRYADAQLSCQGRGGT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 QLRKAIGEMDNQVSQLTSELKFIKNAVAGVRETESKIYLLVKEEKRYADAQLSCQGRGGT 50 60 70 80 90 100 180 190 200 210 220 230 pF1KE6 LSMPKDEAANGLMAAYLAQAGLARVFIGINDLEKEGAFVYSDHSPMRTFNKWRSGEPNNA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 LSMPKDEAANGLMAAYLAQAGLARVFIGINDLEKEGAFVYSDHSPMRTFNKWRSGEPNNA 110 120 130 140 150 160 240 250 260 270 pF1KE6 YDEEDCVEMVASGGWNDVACHTTMYFMCEFDKENM ::::::::::::::::::::::::::::::::::: CCDS58 YDEEDCVEMVASGGWNDVACHTTMYFMCEFDKENM 170 180 190 >>CCDS6327.1 COLEC10 gene_id:10584|Hs108|chr8 (277 aa) initn: 1251 init1: 905 opt: 938 Z-score: 816.6 bits: 158.9 E(32554): 3.7e-39 Smith-Waterman score: 938; 47.4% identity (76.8% similar) in 272 aa overlap (1-269:8-275) 10 20 30 40 50 pF1KE6 MRGNLALVGVLISLAFLSLLPSGHPQPAGDDACSVQILVPGLKGDAGEKGDKG .: : .. ::. : . :: . .:... .:... . :: ::: ::::: : CCDS63 MNGFASLLRRNQFILLVLFLLQIQSLGLDIDSRPTAE-VCATHTISPGPKGDDGEKGDPG 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 APGRPGRVG---PTGEKGDMGDKGQKGSVGRHGKIGPIGSKGEKGDSGDIGPPGPNGEPG :. :.:: : : ::..:: :..:..: : ::::.::.::..: .: :: .:. : CCDS63 EEGKHGKVGRMGPKGIKGELGDMGDQGNIG---KTGPIGKKGDKGEKGLLGIPGEKGKAG 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 LPCECSQLRKAIGEMDNQVSQLTSELKFIKNAVAGVRETESKIYLLVKEEKRYADAQLSC :.:.. :: .:..: ....: . .::.::..::.:::: :.: .:.::: : .. : CCDS63 TVCDCGRYRKFVGQLDISIARLKTSMKFVKNVIAGIRETEEKFYYIVQEEKNYRESLTHC 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 QGRGGTLSMPKDEAANGLMAAYLAQAGLARVFIGINDLEKEGAFVYSDHSPMRTFNKWRS . ::: :.:::::::: :.: :.:..:. :::::.::::.:: ....:..:......: CCDS63 RIRGGMLAMPKDEAANTLIADYVAKSGFFRVFIGVNDLEREGQYMFTDNTPLQNYSNWNE 180 190 200 210 220 230 240 250 260 270 pF1KE6 GEPNNAYDEEDCVEMVASGGWNDVACHTTMYFMCEFDKENM :::.. : .::::::..:: :::. :: ::::.::: :. CCDS63 GEPSDPYGHEDCVEMLSSGRWNDTECHLTMYFVCEFIKKKK 240 250 260 270 >>CCDS7247.1 MBL2 gene_id:4153|Hs108|chr10 (248 aa) initn: 438 init1: 179 opt: 385 Z-score: 345.3 bits: 71.5 E(32554): 6.5e-13 Smith-Waterman score: 399; 31.4% identity (58.9% similar) in 258 aa overlap (12-266:7-246) 10 20 30 40 50 pF1KE6 MRGNLALVGVLISLAFLSLLPSGHPQPAGDDAC-SVQILVPGLKGDAGEKGDKGAPGRPG . : .::.. ... . . .: ..: :.. . . : .: ::. : CCDS72 MSLFPSLPLLLLSMVAASYSETV---TCEDAQKTCPAVIA-CSSPGINGFPGKDG 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 RVGPTGEKGDMGDKGQKGSVGRHGKIGPIGSKGEKGDSGDIGPPGPNGEPGL-PCECSQL : : ::::. : .: .: : ::.:: :. : ::. :: : .:.:: : :.: CCDS72 RDGTKGEKGEPG-QGLRGLQGPPGKLGP---PGNPGPSGSPGPKGQKGDPGKSPDGDSSL 60 70 80 90 100 120 130 140 150 160 170 pF1KE6 RKAIGEMDNQVSQLTSELKFIKNAVAGVRETESKIYLLVKEEKRYADAQLSCQGRGGTLS : .: .... :.. ... ... .:..: : . .. : .... CCDS72 --AASERKALQTEMARIKKWLTFSLG--KQVGNKFFLTNGEIMTFEKVKALCVKFQASVA 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE6 MPKDEAANGLMAAYLAQAGLARVFIGINDLEKEGAFVYSDHSPMR-TFNKWRSGEPNNAY :.. : :: . . . ..:.::.: . :: :: : . : :...: :::::: CCDS72 TPRNAAENGAIQNLIKE----EAFLGITDEKTEGQFV--DLTGNRLTYTNWNEGEPNNAG 170 180 190 200 210 240 250 260 270 pF1KE6 DEEDCVEMVASGGWNDVACHTTMYFMCEFDKENM ..:::: .. .: :::: : :. .::: CCDS72 SDEDCVLLLKNGQWNDVPCSTSHLAVCEFPI 220 230 240 271 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 14:42:21 2016 done: Tue Nov 8 14:42:22 2016 Total Scan time: 2.230 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]