FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6154, 78 aa 1>>>pF1KE6154 78 - 78 aa - 78 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.5994+/-0.00083; mu= 11.2271+/- 0.050 mean_var=48.8793+/- 9.772, 0's: 0 Z-trim(104.7): 9 B-trim: 0 in 0/49 Lambda= 0.183447 statistics sampled from 8014 (8020) to 8014 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.612), E-opt: 0.2 (0.246), width: 16 Scan time: 0.770 The best scores are: opt bits E(32554) CCDS47138.1 GYPE gene_id:2996|Hs108|chr4 ( 78) 476 133.1 1.7e-32 CCDS54809.1 GYPB gene_id:2994|Hs108|chr4 ( 91) 276 80.2 1.6e-16 CCDS77965.1 GYPA gene_id:2993|Hs108|chr4 ( 137) 228 67.6 1.5e-12 CCDS34069.1 GYPA gene_id:2993|Hs108|chr4 ( 150) 228 67.6 1.7e-12 >>CCDS47138.1 GYPE gene_id:2996|Hs108|chr4 (78 aa) initn: 476 init1: 476 opt: 476 Z-score: 697.0 bits: 133.1 E(32554): 1.7e-32 Smith-Waterman score: 476; 100.0% identity (100.0% similar) in 78 aa overlap (1-78:1-78) 10 20 30 40 50 60 pF1KE6 MYGKIIFVLLLSGIVSISASSTTGVAMHTSTSSSVTKSYISSQTNGITLINWWAMARVIF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 MYGKIIFVLLLSGIVSISASSTTGVAMHTSTSSSVTKSYISSQTNGITLINWWAMARVIF 10 20 30 40 50 60 70 pF1KE6 EVMLVVVGMIILISYCIR :::::::::::::::::: CCDS47 EVMLVVVGMIILISYCIR 70 >>CCDS54809.1 GYPB gene_id:2994|Hs108|chr4 (91 aa) initn: 298 init1: 237 opt: 276 Z-score: 410.0 bits: 80.2 E(32554): 1.6e-16 Smith-Waterman score: 276; 65.1% identity (76.7% similar) in 86 aa overlap (1-78:1-86) 10 20 30 40 50 pF1KE6 MYGKIIFVLLLSGIVSISASSTTGVAMHTSTSSSVTKSYISSQTNGIT--LINWWAMAR- :::::::::::: :::::: ::: :::::::::::::::::::::: : :.. ... CCDS54 MYGKIIFVLLLSEIVSISALSTTEVAMHTSTSSSVTKSYISSQTNGETGQLVHRFTVPAP 10 20 30 40 50 60 60 70 pF1KE6 -----VIFEVMLVVVGMIILISYCIR .:. :: ..: :.:::: :: CCDS54 VVIILIILCVMAGIIGTILLISYSIRRLIKA 70 80 90 >>CCDS77965.1 GYPA gene_id:2993|Hs108|chr4 (137 aa) initn: 289 init1: 228 opt: 228 Z-score: 338.6 bits: 67.6 E(32554): 1.5e-12 Smith-Waterman score: 239; 54.9% identity (61.8% similar) in 102 aa overlap (1-78:1-102) 10 20 30 40 50 pF1KE6 MYGKIIFVLLLSGIVSISASSTTGVAMHTSTSSSVTKSYISSQTNGITLINWWA------ :::::::::::: :::::: ::: ::::::::::::::::::::: . .: CCDS77 MYGKIIFVLLLSEIVSISALSTTEVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAH 10 20 30 40 50 60 60 70 pF1KE6 ------------------MARVIFEVMLVVVGMIILISYCIR .. .:: :: :.: :.:::: :: CCDS77 EVSEISVRTVYPPEEETEITLIIFGVMAGVIGTILLISYGIRRLIKKSPSDVKPLPSPDT 70 80 90 100 110 120 CCDS77 DVPLSSVEIENPETSDQ 130 >>CCDS34069.1 GYPA gene_id:2993|Hs108|chr4 (150 aa) initn: 289 init1: 228 opt: 228 Z-score: 338.0 bits: 67.6 E(32554): 1.7e-12 Smith-Waterman score: 228; 93.3% identity (93.3% similar) in 45 aa overlap (1-45:1-45) 10 20 30 40 50 60 pF1KE6 MYGKIIFVLLLSGIVSISASSTTGVAMHTSTSSSVTKSYISSQTNGITLINWWAMARVIF :::::::::::: :::::: ::: ::::::::::::::::::::: CCDS34 MYGKIIFVLLLSEIVSISALSTTEVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAH 10 20 30 40 50 60 70 pF1KE6 EVMLVVVGMIILISYCIR CCDS34 EVSEISVRTVYPPEEETGERVQLAHHFSEPEITLIIFGVMAGVIGTILLISYGIRRLIKK 70 80 90 100 110 120 78 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 09:55:19 2016 done: Tue Nov 8 09:55:19 2016 Total Scan time: 0.770 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]