FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1907, 442 aa 1>>>pF1KE1907 442 - 442 aa - 442 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.5885+/-0.000854; mu= 16.1411+/- 0.051 mean_var=80.8874+/-16.353, 0's: 0 Z-trim(107.4): 84 B-trim: 68 in 1/48 Lambda= 0.142605 statistics sampled from 9478 (9563) to 9478 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.663), E-opt: 0.2 (0.294), width: 16 Scan time: 3.130 The best scores are: opt bits E(32554) CCDS43437.1 F gene_id:3134|Hs108|chr6 ( 442) 3030 633.2 1.5e-181 CCDS43438.1 F gene_id:3134|Hs108|chr6 ( 346) 2353 493.9 1.1e-139 CCDS34373.1 A gene_id:3105|Hs108|chr6 ( 365) 1930 406.9 1.7e-113 CCDS34394.1 B gene_id:3106|Hs108|chr6 ( 362) 1886 397.8 9.2e-111 CCDS4668.1 G gene_id:3135|Hs108|chr6 ( 338) 1847 389.8 2.3e-108 CCDS34393.1 C gene_id:3107|Hs108|chr6 ( 366) 1832 386.7 2.1e-107 CCDS34379.1 E gene_id:3133|Hs108|chr6 ( 358) 1732 366.1 3.1e-101 CCDS43439.1 F gene_id:3134|Hs108|chr6 ( 254) 1380 293.6 1.5e-79 CCDS1342.1 MR1 gene_id:3140|Hs108|chr1 ( 341) 773 168.8 7.5e-42 CCDS4578.1 HFE gene_id:3077|Hs108|chr6 ( 348) 715 156.9 3e-38 CCDS75412.1 HFE gene_id:3077|Hs108|chr6 ( 337) 707 155.2 9.1e-38 CCDS47387.1 HFE gene_id:3077|Hs108|chr6 ( 325) 668 147.2 2.3e-35 CCDS5680.1 AZGP1 gene_id:563|Hs108|chr7 ( 298) 652 143.9 2.1e-34 CCDS4580.1 HFE gene_id:3077|Hs108|chr6 ( 260) 572 127.4 1.7e-29 CCDS12770.1 FCGRT gene_id:2217|Hs108|chr19 ( 365) 505 113.7 3.1e-25 CCDS56412.1 MICA gene_id:100507436|Hs108|chr6 ( 332) 501 112.8 5.1e-25 CCDS53440.1 MR1 gene_id:3140|Hs108|chr1 ( 249) 480 108.4 8.2e-24 CCDS53441.1 MR1 gene_id:3140|Hs108|chr1 ( 214) 474 107.2 1.7e-23 CCDS43449.1 MICB gene_id:4277|Hs108|chr6 ( 383) 476 107.7 2e-23 CCDS75423.1 MICB gene_id:4277|Hs108|chr6 ( 351) 442 100.7 2.4e-21 CCDS53442.1 MR1 gene_id:3140|Hs108|chr1 ( 296) 439 100.1 3.2e-21 CCDS4581.1 HFE gene_id:3077|Hs108|chr6 ( 168) 398 91.5 7.1e-19 CCDS4579.1 HFE gene_id:3077|Hs108|chr6 ( 256) 395 91.0 1.5e-18 CCDS75421.1 MICA gene_id:100507436|Hs108|chr6 ( 235) 389 89.7 3.4e-18 CCDS47386.1 HFE gene_id:3077|Hs108|chr6 ( 242) 365 84.8 1.1e-16 CCDS54975.1 HFE gene_id:3077|Hs108|chr6 ( 246) 363 84.4 1.4e-16 CCDS54974.1 HFE gene_id:3077|Hs108|chr6 ( 334) 363 84.5 1.8e-16 CCDS75422.1 MICB gene_id:4277|Hs108|chr6 ( 340) 319 75.4 9.8e-14 CCDS1174.1 CD1A gene_id:909|Hs108|chr1 ( 327) 301 71.7 1.2e-12 >>CCDS43437.1 F gene_id:3134|Hs108|chr6 (442 aa) initn: 3030 init1: 3030 opt: 3030 Z-score: 3372.8 bits: 633.2 E(32554): 1.5e-181 Smith-Waterman score: 3030; 99.8% identity (99.8% similar) in 442 aa overlap (1-442:1-442) 10 20 30 40 50 60 pF1KE1 MAPRSLLLLLSGALALTDTWAGSHSLRYFSTAVSRPGRGEPRYIAVEYVDDTQFLRFDSD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 MAPRSLLLLLSGALALTDTWAGSHSLRYFSTAVSRPGRGEPRYIAVEYVDDTQFLRFDSD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 AAIPRMEPREPWVEQEGPQYWEWTTGYAKANAQTDRVALRNLLRRYNQSEAGSHTLQGMN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 AAIPRMEPREPWVEQEGPQYWEWTTGYAKANAQTDRVALRNLLRRYNQSEAGSHTLQGMN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 GCDMGPDGRLLRGYHQHAYDGKDYISLNEDLRSWTAADTVAQITQRFYEAEEYAEEFRTY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 GCDMGPDGRLLRGYHQHAYDGKDYISLNEDLRSWTAADTVAQITQRFYEAEEYAEEFRTY 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 LEGECLELLRRYLENGKETLQRADPPKAHVAHHPISDHEATLRCWALGFYPAEITLTWQR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 LEGECLELLRRYLENGKETLQRADPPKAHVAHHPISDHEATLRCWALGFYPAEITLTWQR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 DGEEQTQDTELVETRPAGDGTFQKWAAVVVPSGEEQRYTCHVQHEGLPQPLILRWEQSPQ ::::::::::::::::::::::::::::::: :::::::::::::::::::::::::::: CCDS43 DGEEQTQDTELVETRPAGDGTFQKWAAVVVPPGEEQRYTCHVQHEGLPQPLILRWEQSPQ 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 PTIPIVGIVAGLVVLGAVVTGAVVAAVMWRKKSSDRNRGSYSQAAAYSVVSGNLMITWWS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 PTIPIVGIVAGLVVLGAVVTGAVVAAVMWRKKSSDRNRGSYSQAAAYSVVSGNLMITWWS 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE1 SLFLLGVLFQGYLGCLRSHSVLGRRKVGDMWILFFLWLWTSFNTAFLALQSLRFGFGFRR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 SLFLLGVLFQGYLGCLRSHSVLGRRKVGDMWILFFLWLWTSFNTAFLALQSLRFGFGFRR 370 380 390 400 410 420 430 440 pF1KE1 GRSFLLRSWHHLMKRVQIKIFD :::::::::::::::::::::: CCDS43 GRSFLLRSWHHLMKRVQIKIFD 430 440 >>CCDS43438.1 F gene_id:3134|Hs108|chr6 (346 aa) initn: 2353 init1: 2353 opt: 2353 Z-score: 2621.5 bits: 493.9 E(32554): 1.1e-139 Smith-Waterman score: 2353; 99.7% identity (99.7% similar) in 345 aa overlap (1-345:1-345) 10 20 30 40 50 60 pF1KE1 MAPRSLLLLLSGALALTDTWAGSHSLRYFSTAVSRPGRGEPRYIAVEYVDDTQFLRFDSD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 MAPRSLLLLLSGALALTDTWAGSHSLRYFSTAVSRPGRGEPRYIAVEYVDDTQFLRFDSD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 AAIPRMEPREPWVEQEGPQYWEWTTGYAKANAQTDRVALRNLLRRYNQSEAGSHTLQGMN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 AAIPRMEPREPWVEQEGPQYWEWTTGYAKANAQTDRVALRNLLRRYNQSEAGSHTLQGMN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 GCDMGPDGRLLRGYHQHAYDGKDYISLNEDLRSWTAADTVAQITQRFYEAEEYAEEFRTY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 GCDMGPDGRLLRGYHQHAYDGKDYISLNEDLRSWTAADTVAQITQRFYEAEEYAEEFRTY 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 LEGECLELLRRYLENGKETLQRADPPKAHVAHHPISDHEATLRCWALGFYPAEITLTWQR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 LEGECLELLRRYLENGKETLQRADPPKAHVAHHPISDHEATLRCWALGFYPAEITLTWQR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 DGEEQTQDTELVETRPAGDGTFQKWAAVVVPSGEEQRYTCHVQHEGLPQPLILRWEQSPQ ::::::::::::::::::::::::::::::: :::::::::::::::::::::::::::: CCDS43 DGEEQTQDTELVETRPAGDGTFQKWAAVVVPPGEEQRYTCHVQHEGLPQPLILRWEQSPQ 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 PTIPIVGIVAGLVVLGAVVTGAVVAAVMWRKKSSDRNRGSYSQAAAYSVVSGNLMITWWS ::::::::::::::::::::::::::::::::::::::::::::: CCDS43 PTIPIVGIVAGLVVLGAVVTGAVVAAVMWRKKSSDRNRGSYSQAAV 310 320 330 340 370 380 390 400 410 420 pF1KE1 SLFLLGVLFQGYLGCLRSHSVLGRRKVGDMWILFFLWLWTSFNTAFLALQSLRFGFGFRR >>CCDS34373.1 A gene_id:3105|Hs108|chr6 (365 aa) initn: 1956 init1: 1930 opt: 1930 Z-score: 2150.9 bits: 406.9 E(32554): 1.7e-113 Smith-Waterman score: 1930; 79.0% identity (91.5% similar) in 353 aa overlap (1-353:4-356) 10 20 30 40 50 pF1KE1 MAPRSLLLLLSGALALTDTWAGSHSLRYFSTAVSRPGRGEPRYIAVEYVDDTQFLRF ::::.::::::::::::.:::::::.::: :.::::::::::.::: :::::::.:: CCDS34 MAVMAPRTLLLLLSGALALTQTWAGSHSMRYFFTSVSRPGRGEPRFIAVGYVDDTQFVRF 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE1 DSDAAIPRMEPREPWVEQEGPQYWEWTTGYAKANAQTDRVALRNLLRRYNQSEAGSHTLQ ::::: ::::: ::.:::::.::. : .::..::::: : .: ::::::::::.: CCDS34 DSDAASQRMEPRAPWIEQEGPEYWDQETRNVKAQSQTDRVDLGTLRGYYNQSEAGSHTIQ 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE1 GMNGCDMGPDGRLLRGYHQHAYDGKDYISLNEDLRSWTAADTVAQITQRFYEAEEYAEEF : :::.: :::.::::.: ::::::::.:::::::::::: .::::.: .:: . ::.. CCDS34 IMYGCDVGSDGRFLRGYRQDAYDGKDYIALNEDLRSWTAADMAAQITKRKWEAAHEAEQL 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE1 RTYLEGECLELLRRYLENGKETLQRADPPKAHVAHHPISDHEATLRCWALGFYPAEITLT :.::.: :.: ::::::::::::::.::::.:..:::::::::::::::::::::::::: CCDS34 RAYLDGTCVEWLRRYLENGKETLQRTDPPKTHMTHHPISDHEATLRCWALGFYPAEITLT 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE1 WQRDGEEQTQDTELVETRPAGDGTFQKWAAVVVPSGEEQRYTCHVQHEGLPQPLILRWEQ ::::::.::::::::::::::::::::::::::::::::::::::::::::.:: :::: CCDS34 WQRDGEDQTQDTELVETRPAGDGTFQKWAAVVVPSGEEQRYTCHVQHEGLPKPLTLRWEL 250 260 270 280 290 300 300 310 320 330 340 350 pF1KE1 SPQPTIPIVGIVAGLVVLGAVVTGAVVAAVMWRKKSSDRNRGSYSQAAAYSVVSGNLMIT : :::::::::.::::.::::.:::::::::::.:::::. :::.:::. . ..:. CCDS34 SSQPTIPIVGIIAGLVLLGAVITGAVVAAVMWRRKSSDRKGGSYTQAASSDSAQGSDVSL 310 320 330 340 350 360 360 370 380 390 400 410 pF1KE1 WWSSLFLLGVLFQGYLGCLRSHSVLGRRKVGDMWILFFLWLWTSFNTAFLALQSLRFGFG CCDS34 TACKV >>CCDS34394.1 B gene_id:3106|Hs108|chr6 (362 aa) initn: 1882 init1: 1882 opt: 1886 Z-score: 2102.0 bits: 397.8 E(32554): 9.2e-111 Smith-Waterman score: 1886; 78.5% identity (90.4% similar) in 353 aa overlap (1-353:4-356) 10 20 30 40 50 pF1KE1 MAPRSLLLLLSGALALTDTWAGSHSLRYFSTAVSRPGRGEPRYIAVEYVDDTQFLRF ::::..:::::.:::::.:::::::.::: :.::::::::::.:.: :::::::.:: CCDS34 MLVMAPRTVLLLLSAALALTETWAGSHSMRYFYTSVSRPGRGEPRFISVGYVDDTQFVRF 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE1 DSDAAIPRMEPREPWVEQEGPQYWEWTTGYAKANAQTDRVALRNLLRRYNQSEAGSHTLQ ::::: :: ::: ::.:::::.::. .: ::.::::: .:::: :::::::::::: CCDS34 DSDAASPREEPRAPWIEQEGPEYWDRNTQIYKAQAQTDRESLRNLRGYYNQSEAGSHTLQ 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE1 GMNGCDMGPDGRLLRGYHQHAYDGKDYISLNEDLRSWTAADTVAQITQRFYEAEEYAEEF .: :::.:::::::::. :.::::::::.:::::::::::::.:::::: .:: . ::. CCDS34 SMYGCDVGPDGRLLRGHDQYAYDGKDYIALNEDLRSWTAADTAAQITQRKWEAAREAEQR 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE1 RTYLEGECLELLRRYLENGKETLQRADPPKAHVAHHPISDHEATLRCWALGFYPAEITLT :.::::::.: :::::::::. :.::::::.::.:::::::::::::::::::::::::: CCDS34 RAYLEGECVEWLRRYLENGKDKLERADPPKTHVTHHPISDHEATLRCWALGFYPAEITLT 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE1 WQRDGEEQTQDTELVETRPAGDGTFQKWAAVVVPSGEEQRYTCHVQHEGLPQPLILRWEQ ::::::.::::::::::::::: ::::::::::::::::::::::::::::.:: :::: CCDS34 WQRDGEDQTQDTELVETRPAGDRTFQKWAAVVVPSGEEQRYTCHVQHEGLPKPLTLRWEP 250 260 270 280 290 300 300 310 320 330 340 350 pF1KE1 SPQPTIPIVGIVAGLVVLGAVVTGAVVAAVMWRKKSSDRNRGSYSQAAAYSVVSGNLMIT : : :.:::::::::.::..:: :::::::: :.::: . ::::::: . ..:. CCDS34 SSQSTVPIVGIVAGLAVLAVVVIGAVVAAVMCRRKSSGGKGGSYSQAACSDSAQGSDVSL 310 320 330 340 350 360 360 370 380 390 400 410 pF1KE1 WWSSLFLLGVLFQGYLGCLRSHSVLGRRKVGDMWILFFLWLWTSFNTAFLALQSLRFGFG CCDS34 TA >>CCDS4668.1 G gene_id:3135|Hs108|chr6 (338 aa) initn: 1847 init1: 1847 opt: 1847 Z-score: 2059.1 bits: 389.8 E(32554): 2.3e-108 Smith-Waterman score: 1847; 79.4% identity (92.2% similar) in 335 aa overlap (1-335:4-338) 10 20 30 40 50 pF1KE1 MAPRSLLLLLSGALALTDTWAGSHSLRYFSTAVSRPGRGEPRYIAVEYVDDTQFLRF ::::.:.:::::::.::.:::::::.::::.:::::::::::.::. :::::::.:: CCDS46 MVVMAPRTLFLLLSGALTLTETWAGSHSMRYFSAAVSRPGRGEPRFIAMGYVDDTQFVRF 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE1 DSDAAIPRMEPREPWVEQEGPQYWEWTTGYAKANAQTDRVALRNLLRRYNQSEAGSHTLQ :::.: :::::: ::::::::.::: : .::.:::::. :..: ::::::.::::: CCDS46 DSDSACPRMEPRAPWVEQEGPEYWEEETRNTKAHAQTDRMNLQTLRGYYNQSEASSHTLQ 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE1 GMNGCDMGPDGRLLRGYHQHAYDGKDYISLNEDLRSWTAADTVAQITQRFYEAEEYAEEF : :::.: ::::::::.:.:::::::..:::::::::::::.:::..: :: . ::. CCDS46 WMIGCDLGSDGRLLRGYEQYAYDGKDYLALNEDLRSWTAADTAAQISKRKCEAANVAEQR 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE1 RTYLEGECLELLRRYLENGKETLQRADPPKAHVAHHPISDHEATLRCWALGFYPAEITLT :.:::: :.: :.:::::::: ::::::::.::.:::. :.:::::::::::::::: :: CCDS46 RAYLEGTCVEWLHRYLENGKEMLQRADPPKTHVTHHPVFDYEATLRCWALGFYPAEIILT 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE1 WQRDGEEQTQDTELVETRPAGDGTFQKWAAVVVPSGEEQRYTCHVQHEGLPQPLILRWEQ ::::::.::::.:::::::::::::::::::::::::::::::::::::::.::.:::.: CCDS46 WQRDGEDQTQDVELVETRPAGDGTFQKWAAVVVPSGEEQRYTCHVQHEGLPEPLMLRWKQ 250 260 270 280 290 300 300 310 320 330 340 350 pF1KE1 SPQPTIPIVGIVAGLVVLGAVVTGAVVAAVMWRKKSSDRNRGSYSQAAAYSVVSGNLMIT : :::::.:::::::::.::::::.::::.::::::: CCDS46 SSLPTIPIMGIVAGLVVLAAVVTGAAVAAVLWRKKSSD 310 320 330 360 370 380 390 400 410 pF1KE1 WWSSLFLLGVLFQGYLGCLRSHSVLGRRKVGDMWILFFLWLWTSFNTAFLALQSLRFGFG >>CCDS34393.1 C gene_id:3107|Hs108|chr6 (366 aa) initn: 1949 init1: 1783 opt: 1832 Z-score: 2041.9 bits: 386.7 E(32554): 2.1e-107 Smith-Waterman score: 1832; 76.8% identity (88.7% similar) in 354 aa overlap (1-353:4-357) 10 20 30 40 50 pF1KE1 MAPRSLLLLLSGALALTDTWAGSHSLRYFSTAVSRPGRGEPRYIAVEYVDDTQFLRF ::::.:::::::.::::.::: :::.:::.::::::::::::.:.: :::::::.:: CCDS34 MRVMAPRALLLLLSGGLALTETWACSHSMRYFDTAVSRPGRGEPRFISVGYVDDTQFVRF 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE1 DSDAAIPRMEPREPWVEQEGPQYWEWTTGYAKANAQTDRVALRNLLRRYNQSEAGSHTLQ ::::: :: ::: ::::::::.::. : : .::.:::.:::: ::::: :::::: CCDS34 DSDAASPRGEPRAPWVEQEGPEYWDRETQKYKRQAQADRVSLRNLRGYYNQSEDGSHTLQ 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE1 GMNGCDMGPDGRLLRGYHQHAYDGKDYISLNEDLRSWTAADTVAQITQRFYEAEEYAEEF :.:::.:::::::::: : ::::::::.:::::::::::::.:::::: :: . ::.. CCDS34 RMSGCDLGPDGRLLRGYDQSAYDGKDYIALNEDLRSWTAADTAAQITQRKLEAARAAEQL 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE1 RTYLEGECLELLRRYLENGKETLQRADPPKAHVAHHPISDHEATLRCWALGFYPAEITLT :.:::: :.: :::::::::::::::.:::.::.:::.:::::::::::::::::::::: CCDS34 RAYLEGTCVEWLRRYLENGKETLQRAEPPKTHVTHHPLSDHEATLRCWALGFYPAEITLT 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE1 WQRDGEEQTQDTELVETRPAGDGTFQKWAAVVVPSGEEQRYTCHVQHEGLPQPLILRWEQ ::::::.:::::::::::::::::::::::::::::.:::::::.::::: .:: : :: CCDS34 WQRDGEDQTQDTELVETRPAGDGTFQKWAAVVVPSGQEQRYTCHMQHEGLQEPLTLSWEP 250 260 270 280 290 300 300 310 320 330 340 350 pF1KE1 SPQPTIPIVGIVAGLVVLGAV-VTGAVVAAVMWRKKSSDRNRGSYSQAAAYSVVSGNLMI : ::::::.::::::.:: .. : ::::.:.: :.::: . :: :::: . ..:. CCDS34 SSQPTIPIMGIVAGLAVLVVLAVLGAVVTAMMCRRKSSGGKGGSCSQAACSNSAQGSDES 310 320 330 340 350 360 360 370 380 390 400 410 pF1KE1 TWWSSLFLLGVLFQGYLGCLRSHSVLGRRKVGDMWILFFLWLWTSFNTAFLALQSLRFGF CCDS34 LITCKA >>CCDS34379.1 E gene_id:3133|Hs108|chr6 (358 aa) initn: 1790 init1: 1724 opt: 1732 Z-score: 1930.8 bits: 366.1 E(32554): 3.1e-101 Smith-Waterman score: 1732; 72.3% identity (87.6% similar) in 354 aa overlap (1-353:1-353) 10 20 30 40 50 60 pF1KE1 MAPRSLLLLLSGALALTDTWAGSHSLRYFSTAVSRPGRGEPRYIAVEYVDDTQFLRFDSD :. .:::::: :::::.::::::::.:: :.::::::::::.:.: :::::::.:::.: CCDS34 MVDGTLLLLLSEALALTQTWAGSHSLKYFHTSVSRPGRGEPRFISVGYVDDTQFVRFDND 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 AAIPRMEPREPWVEQEGPQYWEWTTGYAKANAQTDRVALRNLLRRYNQSEAGSHTLQGMN :: ::: :: ::.:::: .::. : :. .:: :: ::.: :::::::::::: :. CCDS34 AASPRMVPRAPWMEQEGSEYWDRETRSARDTAQIFRVNLRTLRGYYNQSEAGSHTLQWMH 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 GCDMGPDGRLLRGYHQHAYDGKDYISLNEDLRSWTAADTVAQIT-QRFYEAEEYAEEFRT ::..:::::.::::.: :::::::..::::::::::.::.:::. :. .: : ::. :. CCDS34 GCELGPDGRFLRGYEQFAYDGKDYLTLNEDLRSWTAVDTAAQISEQKSNDASE-AEHQRA 130 140 150 160 170 180 190 200 210 220 230 pF1KE1 YLEGECLELLRRYLENGKETLQRADPPKAHVAHHPISDHEATLRCWALGFYPAEITLTWQ ::: :.: :..:::.::::: . .:::.::.:::::::::::::::::::::::::::: CCDS34 YLEDTCVEWLHKYLEKGKETLLHLEPPKTHVTHHPISDHEATLRCWALGFYPAEITLTWQ 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE1 RDGEEQTQDTELVETRPAGDGTFQKWAAVVVPSGEEQRYTCHVQHEGLPQPLILRWEQSP .::: .:::::::::::::::::::::::::::::::::::::::::::.:. :::. . CCDS34 QDGEGHTQDTELVETRPAGDGTFQKWAAVVVPSGEEQRYTCHVQHEGLPEPVTLRWKPAS 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE1 QPTIPIVGIVAGLVVLGAVVTGAVVAAVMWRKKSSDRNRGSYSQAAAYSVVSGNLMITWW :::::::::.::::.::.::.:::::::.:::::: . ::::.: . ..:. CCDS34 QPTIPIVGIIAGLVLLGSVVSGAVVAAVIWRKKSSGGKGGSYSKAEWSDSAQGSESHSL 300 310 320 330 340 350 360 370 380 390 400 410 pF1KE1 SSLFLLGVLFQGYLGCLRSHSVLGRRKVGDMWILFFLWLWTSFNTAFLALQSLRFGFGFR >>CCDS43439.1 F gene_id:3134|Hs108|chr6 (254 aa) initn: 1380 init1: 1380 opt: 1380 Z-score: 1541.6 bits: 293.6 E(32554): 1.5e-79 Smith-Waterman score: 1499; 73.3% identity (73.3% similar) in 345 aa overlap (1-345:1-253) 10 20 30 40 50 60 pF1KE1 MAPRSLLLLLSGALALTDTWAGSHSLRYFSTAVSRPGRGEPRYIAVEYVDDTQFLRFDSD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 MAPRSLLLLLSGALALTDTWAGSHSLRYFSTAVSRPGRGEPRYIAVEYVDDTQFLRFDSD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 AAIPRMEPREPWVEQEGPQYWEWTTGYAKANAQTDRVALRNLLRRYNQSEAGSHTLQGMN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 AAIPRMEPREPWVEQEGPQYWEWTTGYAKANAQTDRVALRNLLRRYNQSEAGSHTLQGMN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 GCDMGPDGRLLRGYHQHAYDGKDYISLNEDLRSWTAADTVAQITQRFYEAEEYAEEFRTY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 GCDMGPDGRLLRGYHQHAYDGKDYISLNEDLRSWTAADTVAQITQRFYEAEEYAEEFRTY 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 LEGECLELLRRYLENGKETLQRADPPKAHVAHHPISDHEATLRCWALGFYPAEITLTWQR ::::::::::::::::::::::: CCDS43 LEGECLELLRRYLENGKETLQRA------------------------------------- 190 200 250 260 270 280 290 300 pF1KE1 DGEEQTQDTELVETRPAGDGTFQKWAAVVVPSGEEQRYTCHVQHEGLPQPLILRWEQSPQ ::::: CCDS43 -------------------------------------------------------EQSPQ 310 320 330 340 350 360 pF1KE1 PTIPIVGIVAGLVVLGAVVTGAVVAAVMWRKKSSDRNRGSYSQAAAYSVVSGNLMITWWS ::::::::::::::::::::::::::::::::::::::::::::: CCDS43 PTIPIVGIVAGLVVLGAVVTGAVVAAVMWRKKSSDRNRGSYSQAAV 210 220 230 240 250 370 380 390 400 410 420 pF1KE1 SLFLLGVLFQGYLGCLRSHSVLGRRKVGDMWILFFLWLWTSFNTAFLALQSLRFGFGFRR >>CCDS1342.1 MR1 gene_id:3140|Hs108|chr1 (341 aa) initn: 609 init1: 368 opt: 773 Z-score: 864.9 bits: 168.8 E(32554): 7.5e-42 Smith-Waterman score: 773; 38.4% identity (66.9% similar) in 341 aa overlap (5-341:6-335) 10 20 30 40 50 pF1KE1 MAPRSLLLLLSGALALTDTWAGSHSLRYFSTAVSRPGRGEPRYIAVEYVDDTQFLRFDS ..:: : .: . . . .:::::: .:: : .: :..:.: :::. . .:: CCDS13 MGELMAFLLPLIIVLMVKHSDSRTHSLRYFRLGVSDPIHGVPEFISVGYVDSHPITTYDS 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE1 DAAIPRM-EPREPWV-EQEGPQYWEWTTGYAKANAQTDRVALRNLLRRYNQSEAGSHTLQ . :. ::: ::. :. .:..:: : .. : .: :. : :.::.: :::: : CCDS13 ---VTRQKEPRAPWMAENLAPDHWERYTQLLRGWQQMFKVELKRLQRHYNHS--GSHTYQ 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 GMNGCDMGPDGRLLRGYHQHAYDGKDYISLNEDLRSWTAADTVAQITQRFYEAEEYAEEF : ::.. :: :. :.::::.:.. .:.: :: :.:.::. .. .::... . CCDS13 RMIGCELLEDGSTT-GFLQYAYDGQDFLIFNKDTLSWLAVDNVAHTIKQAWEANQHELLY 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE1 -RTYLEGECLELLRRYLENGKETLQRADPPKAHVAHHPISDHEATLRCWALGFYPAEITL ...:: ::. :.:.:: ::.::::..:: ..: .. ..: : : :::: :: . CCDS13 QKNWLEEECIAWLKRFLEYGKDTLQRTEPPLVRVNRKETFPGVTALFCKAHGFYPPEIYM 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE1 TWQRDGEEQTQDTELVETRPAGDGTFQKWAAVVVPSGEEQRYTCHVQHEGLPQPLILRWE ::...::: .:. . . :.::::.: ::.. . . :.:::.: :. ..:. CCDS13 TWMKNGEEIVQEIDYGDILPSGDGTYQAWASIELDPQSSNLYSCHVEHCGV--HMVLQVP 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE1 QSPQPTIPIV-GIVAGLVVLGAVVTGAVVAAVMWRKKSSDRNRGSYSQAAAYSVVSGNLM : . :::.: :.: .:: :..: :....::.. ..: . : CCDS13 QESE-TIPLVMKAVSGSIVLVIVLAG--VGVLVWRRRPREQNGAIYLPTPDR 300 310 320 330 340 360 370 380 390 400 410 pF1KE1 ITWWSSLFLLGVLFQGYLGCLRSHSVLGRRKVGDMWILFFLWLWTSFNTAFLALQSLRFG >>CCDS4578.1 HFE gene_id:3077|Hs108|chr6 (348 aa) initn: 499 init1: 293 opt: 715 Z-score: 800.2 bits: 156.9 E(32554): 3e-38 Smith-Waterman score: 715; 35.4% identity (65.2% similar) in 353 aa overlap (1-344:1-345) 10 20 30 40 50 pF1KE1 MAPRS----LLLLLSGALALTDTWAGSHSLRYFSTAVSRPGRGEPRYIAVEYVDDTQFLR :.::. :::.: . .: ::::.:. ..:. : . :. :::: :. CCDS45 MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVF 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE1 FDSDAAIPRMEPREPWVEQE-GPQYWEWTTGYAKANAQTDRVALRNLLRRYNQSEAGSHT .: .. :.::: ::: .. . :.: . :. . : . .... .:.:. ::: CCDS45 YDHESR--RVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHNHSKE-SHT 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 LQGMNGCDMGPDGRLLRGYHQHAYDGKDYISLNEDLRSWTAADTVAQITQRFYEAEEY-A :: . ::.: :. .:: ...:::.:.. . : .: ::. : :. .: .. : CCDS45 LQVILGCEMQEDNST-EGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPTKLEWERHKIRA 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE1 EEFRTYLEGECLELLRRYLENGKETLQRADPPKAHVAHHPISDHEATLRCWALGFYPAEI .. :.::: .: :.. :: :. .:.. :: ..:.:: ... .:::: ::..:: .: CCDS45 RQNRAYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHH-VTSSVTTLRCRALNYYPQNI 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE1 TLTWQRDGEEQTQDTELVETR---PAGDGTFQKWAAVVVPSGEEQRYTCHVQHEGLPQPL :. : .: .: .:.. : . : ::::.: : ...:: ::::::::.:.: :: ::: CCDS45 TMKWLKD--KQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGEEQRYTCQVEHPGLDQPL 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE1 ILRWEQSPQPTIPIVGIVAGLVVLGAVVTGAVVAAVMWRKKSSDRNRGSYSQAAAYSVVS :. :: ::. :. ..:...:..:. ... ... .. ....: : : : CCDS45 IVIWEPSPSGTL-VIGVISGIAVFVVILFIGILFIILRKRQGSRGAMGHYVLAERE 300 310 320 330 340 360 370 380 390 400 410 pF1KE1 GNLMITWWSSLFLLGVLFQGYLGCLRSHSVLGRRKVGDMWILFFLWLWTSFNTAFLALQS 442 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 10:57:23 2016 done: Sun Nov 6 10:57:24 2016 Total Scan time: 3.130 Total Display time: 0.070 Function used was FASTA [36.3.4 Apr, 2011]