FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6129, 150 aa 1>>>pF1KE6129 150 - 150 aa - 150 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.1339+/-0.000697; mu= 13.2325+/- 0.042 mean_var=61.4035+/-12.012, 0's: 0 Z-trim(109.2): 13 B-trim: 21 in 1/51 Lambda= 0.163673 statistics sampled from 10731 (10738) to 10731 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.716), E-opt: 0.2 (0.33), width: 16 Scan time: 1.550 The best scores are: opt bits E(32554) CCDS34069.1 GYPA gene_id:2993|Hs108|chr4 ( 150) 925 226.1 6.3e-60 CCDS82959.1 GYPA gene_id:2993|Hs108|chr4 ( 117) 679 168.0 1.6e-42 CCDS77965.1 GYPA gene_id:2993|Hs108|chr4 ( 137) 459 116.1 7.8e-27 CCDS54809.1 GYPB gene_id:2994|Hs108|chr4 ( 91) 247 65.9 6.5e-12 CCDS47138.1 GYPE gene_id:2996|Hs108|chr4 ( 78) 231 62.1 7.8e-11 >>CCDS34069.1 GYPA gene_id:2993|Hs108|chr4 (150 aa) initn: 925 init1: 925 opt: 925 Z-score: 1189.4 bits: 226.1 E(32554): 6.3e-60 Smith-Waterman score: 925; 99.3% identity (99.3% similar) in 150 aa overlap (1-150:1-150) 10 20 30 40 50 60 pF1KE6 MYGKIIFVLLLSAIVSISALSTTEVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAH :::::::::::: ::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 MYGKIIFVLLLSEIVSISALSTTEVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAH 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 EVSEISVRTVYPPEEETGERVQLAHHFSEPEITLIIFGVMAGVIGTILLISYGIRRLIKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 EVSEISVRTVYPPEEETGERVQLAHHFSEPEITLIIFGVMAGVIGTILLISYGIRRLIKK 70 80 90 100 110 120 130 140 150 pF1KE6 SPSDVKPLPSPDTDVPLSSVEIENPETSDQ :::::::::::::::::::::::::::::: CCDS34 SPSDVKPLPSPDTDVPLSSVEIENPETSDQ 130 140 150 >>CCDS82959.1 GYPA gene_id:2993|Hs108|chr4 (117 aa) initn: 679 init1: 679 opt: 679 Z-score: 877.1 bits: 168.0 E(32554): 1.6e-42 Smith-Waterman score: 679; 99.1% identity (100.0% similar) in 106 aa overlap (45-150:12-117) 20 30 40 50 60 70 pF1KE6 VSISALSTTEVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAHEVSEISVRTVYPPE .::::::::::::::::::::::::::::: CCDS82 MYGKIIFVLLLSDTHKRDTYAATPRAHEVSEISVRTVYPPE 10 20 30 40 80 90 100 110 120 130 pF1KE6 EETGERVQLAHHFSEPEITLIIFGVMAGVIGTILLISYGIRRLIKKSPSDVKPLPSPDTD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 EETGERVQLAHHFSEPEITLIIFGVMAGVIGTILLISYGIRRLIKKSPSDVKPLPSPDTD 50 60 70 80 90 100 140 150 pF1KE6 VPLSSVEIENPETSDQ :::::::::::::::: CCDS82 VPLSSVEIENPETSDQ 110 >>CCDS77965.1 GYPA gene_id:2993|Hs108|chr4 (137 aa) initn: 515 init1: 456 opt: 459 Z-score: 595.3 bits: 116.1 E(32554): 7.8e-27 Smith-Waterman score: 797; 90.7% identity (90.7% similar) in 150 aa overlap (1-150:1-137) 10 20 30 40 50 60 pF1KE6 MYGKIIFVLLLSAIVSISALSTTEVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAH :::::::::::: ::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 MYGKIIFVLLLSEIVSISALSTTEVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAH 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 EVSEISVRTVYPPEEETGERVQLAHHFSEPEITLIIFGVMAGVIGTILLISYGIRRLIKK ::::::::::::::::: :::::::::::::::::::::::::::::: CCDS77 EVSEISVRTVYPPEEET-------------EITLIIFGVMAGVIGTILLISYGIRRLIKK 70 80 90 100 130 140 150 pF1KE6 SPSDVKPLPSPDTDVPLSSVEIENPETSDQ :::::::::::::::::::::::::::::: CCDS77 SPSDVKPLPSPDTDVPLSSVEIENPETSDQ 110 120 130 >>CCDS54809.1 GYPB gene_id:2994|Hs108|chr4 (91 aa) initn: 367 init1: 247 opt: 247 Z-score: 327.4 bits: 65.9 E(32554): 6.5e-12 Smith-Waterman score: 336; 60.7% identity (65.6% similar) in 122 aa overlap (1-119:1-90) 10 20 30 40 50 60 pF1KE6 MYGKIIFVLLLSAIVSISALSTTEVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAH :::::::::::: :::::::::::::::::::::::::::::::: CCDS54 MYGKIIFVLLLSEIVSISALSTTEVAMHTSTSSSVTKSYISSQTN--------------- 10 20 30 40 70 80 90 100 110 pF1KE6 EVSEISVRTVYPPEEETGERVQLAHHFSEPE---ITLIIFGVMAGVIGTILLISYGIRRL :: ::.:.:. : : :::. ::::.:::::::::.:::: CCDS54 -----------------GETGQLVHRFTVPAPVVIILIILCVMAGIIGTILLISYSIRRL 50 60 70 80 120 130 140 150 pF1KE6 IKKSPSDVKPLPSPDTDVPLSSVEIENPETSDQ :: CCDS54 IKA 90 >>CCDS47138.1 GYPE gene_id:2996|Hs108|chr4 (78 aa) initn: 292 init1: 231 opt: 231 Z-score: 308.0 bits: 62.1 E(32554): 7.8e-11 Smith-Waterman score: 231; 93.3% identity (95.6% similar) in 45 aa overlap (1-45:1-45) 10 20 30 40 50 60 pF1KE6 MYGKIIFVLLLSAIVSISALSTTEVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAH ::::::::::::.:::::: ::: ::::::::::::::::::::: CCDS47 MYGKIIFVLLLSGIVSISASSTTGVAMHTSTSSSVTKSYISSQTNGITLINWWAMARVIF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 EVSEISVRTVYPPEEETGERVQLAHHFSEPEITLIIFGVMAGVIGTILLISYGIRRLIKK CCDS47 EVMLVVVGMIILISYCIR 70 150 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 09:42:19 2016 done: Tue Nov 8 09:42:20 2016 Total Scan time: 1.550 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]