FASTA searches a protein or DNA sequence data bank
36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6505, 200 aa
1>>>pF1KE6505 200 - 200 aa - 200 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 4.8360+/-0.000617; mu= 15.7928+/- 0.037
mean_var=58.2615+/-11.532, 0's: 0 Z-trim(110.8): 11 B-trim: 0 in 0/50
Lambda= 0.168029
statistics sampled from 11901 (11908) to 11901 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.756), E-opt: 0.2 (0.366), width: 16
Scan time: 1.550

The best scores are:                                opt  bits E(32554)
CCDS72877.1 HFE2 gene_id:148738|Hs108|chr1  ( 200) 1323 328.2    2e-90
CCDS72878.1 HFE2 gene_id:148738|Hs108|chr1  ( 313) 1323 328.4  2.9e-90
CCDS72879.1 HFE2 gene_id:148738|Hs108|chr1  ( 426) 1323 328.4  3.7e-90
CCDS53973.1 RGMA gene_id:56963|Hs108|chr15  ( 434)  366  96.5  2.6e-20
CCDS45357.1 RGMA gene_id:56963|Hs108|chr15  ( 450)  366  96.5  2.7e-20
CCDS53974.1 RGMA gene_id:56963|Hs108|chr15  ( 458)  366  96.5  2.7e-20
CCDS47251.1 RGMB gene_id:285704|Hs108|chr5  ( 478)  256  69.8    3e-12

>>CCDS72877.1 HFE2 gene_id:148738|Hs108|chr1 (200 aa)
initn: 1323 init1: 1323 opt: 1323 Z-score: 1736.9 bits: 328.2 E(32554): 2e-90
Smith-Waterman score: 1323; 100.0% identity (100.0% similar) in 200 aa overlap (1-200:1-200)

10 20 30 40 50 60
pF1KE6 MQECIDQKVYQAEVDNLPVAFEDGSINGGDRPGGSSLSIQTANPGNHVEIQAAYIGTTII
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 MQECIDQKVYQAEVDNLPVAFEDGSINGGDRPGGSSLSIQTANPGNHVEIQAAYIGTTII
10 20 30 40 50 60

70 80 90 100 110 120
pF1KE6 IRQTAGQLSFSIKVAEDVAMAFSAEQDLQLCVGGCPPSQRLSRSERNRRGAITIDTARRL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 IRQTAGQLSFSIKVAEDVAMAFSAEQDLQLCVGGCPPSQRLSRSERNRRGAITIDTARRL
70 80 90 100 110 120

130 140 150 160 170 180
pF1KE6 CKEGLPVEDAYFHSCVFDVLISGDPNFTVAAQAALEDARAFLPDLEKLHLFPSDAGVPLS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 CKEGLPVEDAYFHSCVFDVLISGDPNFTVAAQAALEDARAFLPDLEKLHLFPSDAGVPLS
130 140 150 160 170 180

190 200
pF1KE6 SATLLAPLLSGLFVLWLCIQ
::::::::::::::::::::
CCDS72 SATLLAPLLSGLFVLWLCIQ
190 200

>>CCDS72878.1 HFE2 gene_id:148738|Hs108|chr1 (313 aa)
initn: 1323 init1: 1323 opt: 1323 Z-score: 1734.0 bits: 328.4 E(32554): 2.9e-90
Smith-Waterman score: 1323; 100.0% identity (100.0% similar) in 200 aa overlap (1-200:114-313)

10 20 30
pF1KE6 MQECIDQKVYQAEVDNLPVAFEDGSINGGD
::::::::::::::::::::::::::::::
CCDS72 DFLFVQATSSPMALGANATATRKLTIIFKNMQECIDQKVYQAEVDNLPVAFEDGSINGGD
90 100 110 120 130 140

40 50 60 70 80 90
pF1KE6 RPGGSSLSIQTANPGNHVEIQAAYIGTTIIIRQTAGQLSFSIKVAEDVAMAFSAEQDLQL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 RPGGSSLSIQTANPGNHVEIQAAYIGTTIIIRQTAGQLSFSIKVAEDVAMAFSAEQDLQL
150 160 170 180 190 200

100 110 120 130 140 150
pF1KE6 CVGGCPPSQRLSRSERNRRGAITIDTARRLCKEGLPVEDAYFHSCVFDVLISGDPNFTVA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 CVGGCPPSQRLSRSERNRRGAITIDTARRLCKEGLPVEDAYFHSCVFDVLISGDPNFTVA
210 220 230 240 250 260

160 170 180 190 200
pF1KE6 AQAALEDARAFLPDLEKLHLFPSDAGVPLSSATLLAPLLSGLFVLWLCIQ
::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 AQAALEDARAFLPDLEKLHLFPSDAGVPLSSATLLAPLLSGLFVLWLCIQ
270 280 290 300 310

>>CCDS72879.1 HFE2 gene_id:148738|Hs108|chr1 (426 aa)
initn: 1323 init1: 1323 opt: 1323 Z-score: 1732.1 bits: 328.4 E(32554): 3.7e-90
Smith-Waterman score: 1323; 100.0% identity (100.0% similar) in 200 aa overlap (1-200:227-426)

10 20 30
pF1KE6 MQECIDQKVYQAEVDNLPVAFEDGSINGGD
::::::::::::::::::::::::::::::
CCDS72 DFLFVQATSSPMALGANATATRKLTIIFKNMQECIDQKVYQAEVDNLPVAFEDGSINGGD
200 210 220 230 240 250

40 50 60 70 80 90
pF1KE6 RPGGSSLSIQTANPGNHVEIQAAYIGTTIIIRQTAGQLSFSIKVAEDVAMAFSAEQDLQL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 RPGGSSLSIQTANPGNHVEIQAAYIGTTIIIRQTAGQLSFSIKVAEDVAMAFSAEQDLQL
260 270 280 290 300 310

100 110 120 130 140 150
pF1KE6 CVGGCPPSQRLSRSERNRRGAITIDTARRLCKEGLPVEDAYFHSCVFDVLISGDPNFTVA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 CVGGCPPSQRLSRSERNRRGAITIDTARRLCKEGLPVEDAYFHSCVFDVLISGDPNFTVA
320 330 340 350 360 370

160 170 180 190 200
pF1KE6 AQAALEDARAFLPDLEKLHLFPSDAGVPLSSATLLAPLLSGLFVLWLCIQ
::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 AQAALEDARAFLPDLEKLHLFPSDAGVPLSSATLLAPLLSGLFVLWLCIQ
380 390 400 410 420

>>CCDS53973.1 RGMA gene_id:56963|Hs108|chr15 (434 aa)
initn: 591 init1: 321 opt: 366 Z-score: 478.2 bits: 96.5 E(32554): 2.6e-20
Smith-Waterman score: 543; 44.4% identity (65.5% similar) in 223 aa overlap (2-195:208-430)

10 20 30
pF1KE6 MQECIDQKVYQAEVDNLPVAFEDGSINGGDR
:::.::::::::.:.::.:: ::: ::::.
CCDS53 YLNVQVTNTPVLPGSAATATSKLTIIFKNFQECVDQKVYQAEMDELPAAFVDGSKNGGDK
180 190 200 210 220 230

40 50 60 70 80
pF1KE6 PGGSSLSIQTANPGNHVEIQAAYIGTTIIIRQTAGQLSFSIKVAEDVAMAFSA--EQDLQ
:..::.: :.:::::: ::::::..::.. :.:.... :.:. : : :
CCDS53 HGANSLKITEKVSGQHVEIQAKYIGTTIVVRQVGRYLTFAVRMPEEVVNAVEDWDSQGLY
240 250 260 270 280 290

90 100 110 120 130
pF1KE6 LCVGGCPPSQRL--------SRSERNRRGAIT-----------IDTARRLCKEGLPVEDA
::. ::: .:.. ... :: : . .:: ::: :::::
CCDS53 LCLRGCPLNQQIDFQAFHTNAEGTGARRLAAASPAPTAPETFPYETAVAKCKEKLPVEDL
300 310 320 330 340 350

140 150 160 170 180
pF1KE6 YFHSCVFDVLISGDPNFTVAAQAALEDARAFLPDLEKLHLF------P--SDAGVPLSSA
:...::::.: .:: :::.:: ::::.. . . .::::. : . ::.::.
CCDS53 YYQACVFDLLTTGDVNFTLAAYYALEDVKMLHSNKDKLHLYERTRDLPGRAAAGLPLAPR
360 370 380 390 400 410

190 200
pF1KE6 TLLAPLLSGLFVLWLCIQ
::. :. : .:
CCDS53 PLLGALVPLLALLPVFC
420 430

>>CCDS45357.1 RGMA gene_id:56963|Hs108|chr15 (450 aa)
initn: 591 init1: 321 opt: 366 Z-score: 477.9 bits: 96.5 E(32554): 2.7e-20
Smith-Waterman score: 543; 44.4% identity (65.5% similar) in 223 aa overlap (2-195:224-446)

10 20 30
pF1KE6 MQECIDQKVYQAEVDNLPVAFEDGSINGGDR
:::.::::::::.:.::.:: ::: ::::.
CCDS45 YLNVQVTNTPVLPGSAATATSKLTIIFKNFQECVDQKVYQAEMDELPAAFVDGSKNGGDK
200 210 220 230 240 250

40 50 60 70 80
pF1KE6 PGGSSLSIQTANPGNHVEIQAAYIGTTIIIRQTAGQLSFSIKVAEDVAMAFSA--EQDLQ
:..::.: :.:::::: ::::::..::.. :.:.... :.:. : : :
CCDS45 HGANSLKITEKVSGQHVEIQAKYIGTTIVVRQVGRYLTFAVRMPEEVVNAVEDWDSQGLY
260 270 280 290 300 310

90 100 110 120 130
pF1KE6 LCVGGCPPSQRL--------SRSERNRRGAIT-----------IDTARRLCKEGLPVEDA
::. ::: .:.. ... :: : . .:: ::: :::::
CCDS45 LCLRGCPLNQQIDFQAFHTNAEGTGARRLAAASPAPTAPETFPYETAVAKCKEKLPVEDL
320 330 340 350 360 370

140 150 160 170 180
pF1KE6 YFHSCVFDVLISGDPNFTVAAQAALEDARAFLPDLEKLHLF------P--SDAGVPLSSA
:...::::.: .:: :::.:: ::::.. . . .::::. : . ::.::.
CCDS45 YYQACVFDLLTTGDVNFTLAAYYALEDVKMLHSNKDKLHLYERTRDLPGRAAAGLPLAPR
380 390 400 410 420 430

190 200
pF1KE6 TLLAPLLSGLFVLWLCIQ
::. :. : .:
CCDS45 PLLGALVPLLALLPVFC
440 450

>>CCDS53974.1 RGMA gene_id:56963|Hs108|chr15 (458 aa)
initn: 591 init1: 321 opt: 366 Z-score: 477.8 bits: 96.5 E(32554): 2.7e-20
Smith-Waterman score: 543; 44.4% identity (65.5% similar) in 223 aa overlap (2-195:232-454)

10 20 30
pF1KE6 MQECIDQKVYQAEVDNLPVAFEDGSINGGDR
:::.::::::::.:.::.:: ::: ::::.
CCDS53 YLNVQVTNTPVLPGSAATATSKLTIIFKNFQECVDQKVYQAEMDELPAAFVDGSKNGGDK
210 220 230 240 250 260

40 50 60 70 80
pF1KE6 PGGSSLSIQTANPGNHVEIQAAYIGTTIIIRQTAGQLSFSIKVAEDVAMAFSA--EQDLQ
:..::.: :.:::::: ::::::..::.. :.:.... :.:. : : :
CCDS53 HGANSLKITEKVSGQHVEIQAKYIGTTIVVRQVGRYLTFAVRMPEEVVNAVEDWDSQGLY
270 280 290 300 310 320

90 100 110 120 130
pF1KE6 LCVGGCPPSQRL--------SRSERNRRGAIT-----------IDTARRLCKEGLPVEDA
::. ::: .:.. ... :: : . .:: ::: :::::
CCDS53 LCLRGCPLNQQIDFQAFHTNAEGTGARRLAAASPAPTAPETFPYETAVAKCKEKLPVEDL
330 340 350 360 370 380

140 150 160 170 180
pF1KE6 YFHSCVFDVLISGDPNFTVAAQAALEDARAFLPDLEKLHLF------P--SDAGVPLSSA
:...::::.: .:: :::.:: ::::.. . . .::::. : . ::.::.
CCDS53 YYQACVFDLLTTGDVNFTLAAYYALEDVKMLHSNKDKLHLYERTRDLPGRAAAGLPLAPR
390 400 410 420 430 440

190 200
pF1KE6 TLLAPLLSGLFVLWLCIQ
::. :. : .:
CCDS53 PLLGALVPLLALLPVFC
450

>>CCDS47251.1 RGMB gene_id:285704|Hs108|chr5 (478 aa)
initn: 451 init1: 242 opt: 256 Z-score: 333.5 bits: 69.8 E(32554): 3e-12
Smith-Waterman score: 557; 43.7% identity (71.2% similar) in 215 aa overlap (2-199:265-476)

10 20 30
pF1KE6 MQECIDQKVYQAEVDNLPVAFEDGSINGGDR
.:: ::::::: .:.::.:: ::. .:::
CCDS47 YLSVQVTNVPVVPGSSATATNKITIIFKAHHECTDQKVYQAVTDDLPAAFVDGTTSGGDS
240 250 260 270 280 290

40 50 60 70 80 90
pF1KE6 PGGSSLSIQTANPGNHVEIQAAYIGTTIIIRQTAGQLSFSIKVAEDVAMAFSAEQDLQLC
. .:: : . :..::..: :::::...::.. :...:.. ::.::.. ::::::
CCDS47 DA-KSLRIVERESGHYVEMHARYIGTTVFVRQVGRYLTLAIRMPEDLAMSYEESQDLQLC
300 310 320 330 340 350

100 110 120 130
pF1KE6 VGGCPPSQRLSRSERN----------RRGAI------TIDTARRLCKEGLPVEDAYFHSC
:.::: :.:.. .. . : . . :..:: :.: .::.: ::.::
CCDS47 VNGCPLSERIDDGQGQVSAILGHSLPRTSLVQAWPGYTLETANTQCHEKMPVKDIYFQSC
360 370 380 390 400 410

140 150 160 170 180 190
pF1KE6 VFDVLISGDPNFTVAAQAALEDARAFLPDLEKLHLFPSDA-GVPLSSATLLAPLLSGLFV
:::.: .:: :::.::..::::..:. : :. :.:::.. :.: ... : . : ::
CCDS47 VFDLLTTGDANFTAAAHSALEDVEALHPRKERWHIFPSSGNGTPRGGSDLSVSL--GLTC
420 430 440 450 460 470

200
pF1KE6 LWLCIQ
: : .
CCDS47 LILIVFL

200 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 13:55:30 2016 done: Tue Nov 8 13:55:30 2016
Total Scan time: 1.550 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]