FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6129, 150 aa
  1>>>pF1KE6129 150 - 150 aa - 150 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.2598+/-0.000319; mu= 12.6812+/- 0.020
 mean_var=65.0095+/-12.795, 0's: 0 Z-trim(116.2): 20 B-trim: 101 in 1/53
 Lambda= 0.159069
 statistics sampled from 27254 (27274) to 27254 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.708), E-opt: 0.2 (0.32), width: 16
 Scan time: 4.350

The best scores are:                                      opt  bits E(85289)
NP_002090 (OMIM: 111300,611162) glycophorin-A isof ( 150)  925 220.3 9.6e-58
XP_016863624 (OMIM: 111300,611162) PREDICTED: glyc ( 124)  788 188.8 2.4e-48
XP_016863623 (OMIM: 111300,611162) PREDICTED: glyc ( 145)  725 174.3 6.1e-44
NP_001295119 (OMIM: 111300,611162) glycophorin-A i ( 117)  679 163.7 7.7e-41
XP_016863625 (OMIM: 111300,611162) PREDICTED: glyc ( 112)  479 117.8 4.8e-27
NP_001295116 (OMIM: 111300,611162) glycophorin-A i ( 137)  459 113.3 1.4e-25
NP_002091 (OMIM: 111740,611162) glycophorin-B isof (  91)  247  64.5 4.3e-11
XP_011530205 (OMIM: 111740,611162) PREDICTED: glyc (  96)  247  64.6 4.5e-11
NP_941391 (OMIM: 138590) glycophorin-E precursor [ (  78)  231  60.8 4.9e-10
XP_016863628 (OMIM: 138590) PREDICTED: glycophorin (  78)  231  60.8 4.9e-10
NP_002093 (OMIM: 138590) glycophorin-E precursor [ (  78)  231  60.8 4.9e-10
XP_016863629 (OMIM: 138590) PREDICTED: glycophorin (  78)  231  60.8 4.9e-10
XP_016863626 (OMIM: 111740,611162) PREDICTED: glyc (  65)  178  48.6 1.9e-06
NP_001291311 (OMIM: 111740,611162) glycophorin-B i (  65)  176  48.2 2.6e-06
XP_011530207 (OMIM: 111740,611162) PREDICTED: glyc (  82)  176  48.2 3.2e-06
XP_011530206 (OMIM: 111740,611162) PREDICTED: glyc (  87)  176  48.2 3.4e-06

>>NP_002090 (OMIM: 111300,611162) glycophorin-A isoform (150 aa)
 initn: 925 init1: 925 opt: 925 Z-score: 1157.7 bits: 220.3 E(85289): 9.6e-58
Smith-Waterman score: 925; 99.3% identity (99.3% similar) in 150 aa overlap (1-150:1-150)

               10        20        30        40        50        60
pF1KE6 MYGKIIFVLLLSAIVSISALSTTEVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAH
       :::::::::::: :::::::::::::::::::::::::::::::::::::::::::::::
NP_002 MYGKIIFVLLLSEIVSISALSTTEVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAH
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 EVSEISVRTVYPPEEETGERVQLAHHFSEPEITLIIFGVMAGVIGTILLISYGIRRLIKK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 EVSEISVRTVYPPEEETGERVQLAHHFSEPEITLIIFGVMAGVIGTILLISYGIRRLIKK
               70        80        90       100       110       120

              130       140       150
pF1KE6 SPSDVKPLPSPDTDVPLSSVEIENPETSDQ
       ::::::::::::::::::::::::::::::
NP_002 SPSDVKPLPSPDTDVPLSSVEIENPETSDQ
              130       140       150

>>XP_016863624 (OMIM: 111300,611162) PREDICTED: glycopho (124 aa)
 initn: 788 init1: 788 opt: 788 Z-score: 989.0 bits: 188.8 E(85289): 2.4e-48
Smith-Waterman score: 788; 100.0% identity (100.0% similar) in 124 aa overlap (27-150:1-124)

               10        20        30        40        50        60
pF1KE6 MYGKIIFVLLLSAIVSISALSTTEVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAH
                                 ::::::::::::::::::::::::::::::::::
XP_016                           MHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAH
                                         10        20        30

               70        80        90       100       110       120
pF1KE6 EVSEISVRTVYPPEEETGERVQLAHHFSEPEITLIIFGVMAGVIGTILLISYGIRRLIKK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 EVSEISVRTVYPPEEETGERVQLAHHFSEPEITLIIFGVMAGVIGTILLISYGIRRLIKK
           40        50        60        70        80        90

              130       140       150
pF1KE6 SPSDVKPLPSPDTDVPLSSVEIENPETSDQ
       ::::::::::::::::::::::::::::::
XP_016 SPSDVKPLPSPDTDVPLSSVEIENPETSDQ
          100       110       120

>>XP_016863623 (OMIM: 111300,611162) PREDICTED: glycopho (145 aa)
 initn: 738 init1: 722 opt: 725 Z-score: 909.9 bits: 174.3 E(85289): 6.1e-44
Smith-Waterman score: 725; 88.8% identity (95.5% similar) in 134 aa overlap (1-134:1-134)

               10        20        30        40        50        60
pF1KE6 MYGKIIFVLLLSAIVSISALSTTEVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAH
       :::::::::::: :::::::::::::::::::::::::::::::::::::::::::::::
XP_016 MYGKIIFVLLLSEIVSISALSTTEVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAH
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 EVSEISVRTVYPPEEETGERVQLAHHFSEPEITLIIFGVMAGVIGTILLISYGIRRLIKK
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::.
XP_016 EVSEISVRTVYPPEEETGERVQLAHHFSEPEITLIIFGVMAGVIGTILLISYGIRRLIKR
               70        80        90       100       110       120

              130       140       150
pF1KE6 SPSDVKPLPSPDTDVPLSSVEIENPETSDQ
       .  . . . .:...
XP_016 QVINENLFTKPNVERTQRRHKTSVK
              130       140

>>NP_001295119 (OMIM: 111300,611162) glycophorin-A isofo (117 aa)
 initn: 679 init1: 679 opt: 679 Z-score: 854.2 bits: 163.7 E(85289): 7.7e-41
Smith-Waterman score: 679; 99.1% identity (100.0% similar) in 106 aa overlap (45-150:12-117)

           20        30        40        50        60        70
pF1KE6 VSISALSTTEVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAHEVSEISVRTVYPPE
                                     .:::::::::::::::::::::::::::::
NP_001                    MYGKIIFVLLLSDTHKRDTYAATPRAHEVSEISVRTVYPPE
                                  10        20        30        40

           80        90       100       110       120       130
pF1KE6 EETGERVQLAHHFSEPEITLIIFGVMAGVIGTILLISYGIRRLIKKSPSDVKPLPSPDTD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 EETGERVQLAHHFSEPEITLIIFGVMAGVIGTILLISYGIRRLIKKSPSDVKPLPSPDTD
              50        60        70        80        90       100

          140       150
pF1KE6 VPLSSVEIENPETSDQ
       ::::::::::::::::
NP_001 VPLSSVEIENPETSDQ
             110

>>XP_016863625 (OMIM: 111300,611162) PREDICTED: glycopho (112 aa)
 initn: 549 init1: 475 opt: 479 Z-score: 606.5 bits: 117.8 E(85289): 4.8e-27
Smith-Waterman score: 479; 83.3% identity (94.4% similar) in 90 aa overlap (45-134:12-101)

           20        30        40        50        60        70
pF1KE6 VSISALSTTEVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAHEVSEISVRTVYPPE
                                     .:::::::::::::::::::::::::::::
XP_016                    MYGKIIFVLLLSDTHKRDTYAATPRAHEVSEISVRTVYPPE
                                  10        20        30        40

           80        90       100       110       120       130
pF1KE6 EETGERVQLAHHFSEPEITLIIFGVMAGVIGTILLISYGIRRLIKKSPSDVKPLPSPDTD
       :::::::::::::::::::::::::::::::::::::::::::::..  . . . .:...
XP_016 EETGERVQLAHHFSEPEITLIIFGVMAGVIGTILLISYGIRRLIKRQVINENLFTKPNVE
              50        60        70        80        90       100

          140       150
pF1KE6 VPLSSVEIENPETSDQ

XP_016 RTQRRHKTSVK
             110

>>NP_001295116 (OMIM: 111300,611162) glycophorin-A isofo (137 aa)
 initn: 515 init1: 456 opt: 459 Z-score: 580.3 bits: 113.3 E(85289): 1.4e-25
Smith-Waterman score: 797; 90.7% identity (90.7% similar) in 150 aa overlap (1-150:1-137)

               10        20        30        40        50        60
pF1KE6 MYGKIIFVLLLSAIVSISALSTTEVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAH
       :::::::::::: :::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MYGKIIFVLLLSEIVSISALSTTEVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAH
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 EVSEISVRTVYPPEEETGERVQLAHHFSEPEITLIIFGVMAGVIGTILLISYGIRRLIKK
       :::::::::::::::::             ::::::::::::::::::::::::::::::
NP_001 EVSEISVRTVYPPEEET-------------EITLIIFGVMAGVIGTILLISYGIRRLIKK
               70                     80        90       100

              130       140       150
pF1KE6 SPSDVKPLPSPDTDVPLSSVEIENPETSDQ
       ::::::::::::::::::::::::::::::
NP_001 SPSDVKPLPSPDTDVPLSSVEIENPETSDQ
       110       120       130

>>NP_002091 (OMIM: 111740,611162) glycophorin-B isoform (91 aa)
 initn: 365 init1: 247 opt: 247 Z-score: 320.1 bits: 64.5 E(85289): 4.3e-11
Smith-Waterman score: 334; 60.7% identity (64.8% similar) in 122 aa overlap (1-119:1-90)

               10        20        30        40        50        60
pF1KE6 MYGKIIFVLLLSAIVSISALSTTEVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAH
       :::::::::::: ::::::::::::::::::::::::::::::::
NP_002 MYGKIIFVLLLSEIVSISALSTTEVAMHTSTSSSVTKSYISSQTN---------------
               10        20        30        40

               70        80        90          100       110
pF1KE6 EVSEISVRTVYPPEEETGERVQLAHHFSEPE---ITLIIFGVMAGVIGTILLISYGIRRL
                        ::  ::.:.:. :    : :::. ::::.::::::::: ::::
NP_002 -----------------GETGQLVHRFTVPAPVVIILIILCVMAGIIGTILLISYTIRRL
                           50        60        70        80

       120       130       140       150
pF1KE6 IKKSPSDVKPLPSPDTDVPLSSVEIENPETSDQ
       ::
NP_002 IKA
       90

>>XP_011530205 (OMIM: 111740,611162) PREDICTED: glycopho (96 aa)
 initn: 367 init1: 247 opt: 247 Z-score: 319.7 bits: 64.6 E(85289): 4.5e-11
Smith-Waterman score: 338; 58.6% identity (64.8% similar) in 128 aa overlap (1-125:1-96)

               10        20        30        40        50        60
pF1KE6 MYGKIIFVLLLSAIVSISALSTTEVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAH
       :::::::::::: ::::::::::::::::::::::::::::::::
XP_011 MYGKIIFVLLLSEIVSISALSTTEVAMHTSTSSSVTKSYISSQTN---------------
               10        20        30        40

               70        80        90          100       110
pF1KE6 EVSEISVRTVYPPEEETGERVQLAHHFSEPE---ITLIIFGVMAGVIGTILLISYGIRRL
                        ::  ::.:.:. :    : :::. ::::.:::::::::.::::
XP_011 -----------------GETGQLVHRFTVPAPVVIILIILCVMAGIIGTILLISYSIRRL
                           50        60        70        80

       120       130       140       150
pF1KE6 IKKSPSDVKPLPSPDTDVPLSSVEIENPETSDQ
       :: .  .:
XP_011 IKVTALNV
       90

>>NP_941391 (OMIM: 138590) glycophorin-E precursor [Homo (78 aa)
 initn: 292 init1: 231 opt: 231 Z-score: 301.2 bits: 60.8 E(85289): 4.9e-10
Smith-Waterman score: 231; 93.3% identity (95.6% similar) in 45 aa overlap (1-45:1-45)

               10        20        30        40        50        60
pF1KE6 MYGKIIFVLLLSAIVSISALSTTEVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAH
       ::::::::::::.:::::: ::: :::::::::::::::::::::
NP_941 MYGKIIFVLLLSGIVSISASSTTGVAMHTSTSSSVTKSYISSQTNGITLINWWAMARVIF
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 EVSEISVRTVYPPEEETGERVQLAHHFSEPEITLIIFGVMAGVIGTILLISYGIRRLIKK

NP_941 EVMLVVVGMIILISYCIR
               70

>>XP_016863628 (OMIM: 138590) PREDICTED: glycophorin-E i (78 aa)
 initn: 292 init1: 231 opt: 231 Z-score: 301.2 bits: 60.8 E(85289): 4.9e-10
Smith-Waterman score: 231; 93.3% identity (95.6% similar) in 45 aa overlap (1-45:1-45)

               10        20        30        40        50        60
pF1KE6 MYGKIIFVLLLSAIVSISALSTTEVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAH
       ::::::::::::.:::::: ::: :::::::::::::::::::::
XP_016 MYGKIIFVLLLSGIVSISASSTTGVAMHTSTSSSVTKSYISSQTNGITLINWWAMARVIF
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 EVSEISVRTVYPPEEETGERVQLAHHFSEPEITLIIFGVMAGVIGTILLISYGIRRLIKK

XP_016 EVMLVVVGMIILISYCIR
               70

150 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 09:42:20 2016 done: Tue Nov 8 09:42:21 2016
 Total Scan time: 4.350 Total Display time: -0.030

Function used was FASTA [36.3.4 Apr, 2011]