FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6274, 190 aa 1>>>pF1KE6274 190 - 190 aa - 190 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.0969+/-0.0006; mu= 14.0453+/- 0.036 mean_var=55.7160+/-11.072, 0's: 0 Z-trim(110.5): 17 B-trim: 5 in 1/50 Lambda= 0.171824 statistics sampled from 11649 (11662) to 11649 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.741), E-opt: 0.2 (0.358), width: 16 Scan time: 1.560 The best scores are: opt bits E(32554) CCDS11746.1 CYGB gene_id:114757|Hs108|chr17 ( 190) 1253 317.9 2.3e-87 CCDS7756.1 HBE1 gene_id:3046|Hs108|chr11 ( 147) 253 70.0 7.6e-13 CCDS13917.1 MB gene_id:4151|Hs108|chr22 ( 154) 251 69.5 1.1e-12 CCDS7755.1 HBG2 gene_id:3048|Hs108|chr11 ( 147) 245 68.0 3e-12 CCDS7754.1 HBG1 gene_id:3047|Hs108|chr11 ( 147) 243 67.5 4.3e-12 >>CCDS11746.1 CYGB gene_id:114757|Hs108|chr17 (190 aa) initn: 1253 init1: 1253 opt: 1253 Z-score: 1682.0 bits: 317.9 E(32554): 2.3e-87 Smith-Waterman score: 1253; 99.5% identity (100.0% similar) in 190 aa overlap (1-190:1-190) 10 20 30 40 50 60 pF1KE6 MEKVPGEMEIERRERSEELSEAERKAVQAMWARLYASCEDVGVAILVRFFVNFPSAKQYF ::::::::::::::::::::::::::::::::::::.::::::::::::::::::::::: CCDS11 MEKVPGEMEIERRERSEELSEAERKAVQAMWARLYANCEDVGVAILVRFFVNFPSAKQYF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 SQFKHMEDPLEMERSPQLRKHACRVMGALNTVVENLHDPDKVSSVLALVGKAHALKHKVE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 SQFKHMEDPLEMERSPQLRKHACRVMGALNTVVENLHDPDKVSSVLALVGKAHALKHKVE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 PVYFKILSGVILEVVAEEFASDFPPETQRAWAKLRGLIYSHVTAAYKEVGWVQQVPNATT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 PVYFKILSGVILEVVAEEFASDFPPETQRAWAKLRGLIYSHVTAAYKEVGWVQQVPNATT 130 140 150 160 170 180 190 pF1KE6 PPATLPSSGP :::::::::: CCDS11 PPATLPSSGP 190 >>CCDS7756.1 HBE1 gene_id:3046|Hs108|chr11 (147 aa) initn: 116 init1: 102 opt: 253 Z-score: 344.1 bits: 70.0 E(32554): 7.6e-13 Smith-Waterman score: 253; 27.2% identity (71.3% similar) in 136 aa overlap (19-154:4-134) 10 20 30 40 50 60 pF1KE6 MEKVPGEMEIERRERSEELSEAERKAVQAMWARLYASCEDVGVAILVRFFVNFPSAKQYF .. :. :: ..:... . :..: : :..: .: ....: CCDS77 MVHFTAEEKAAVTSLWSKM--NVEEAGGEALGRLLVVYPWTQRFF 10 20 30 40 70 80 90 100 110 120 pF1KE6 SQFKHMEDPLEMERSPQLRKHACRVMGALNTVVENLHDPDKVSSVLALVGKAHALKHKVE ..: .. .: . .:... :. .:. ... ...:. :... ..: ... : : .:. CCDS77 DSFGNLSSPSAILGNPKVKAHGKKVLTSFGDAIKNM---DNLKPAFAKLSELHCDKLHVD 50 60 70 80 90 100 130 140 150 160 170 180 pF1KE6 PVYFKILSGVILEVVAEEFASDFPPETQRAWAKLRGLIYSHVTAAYKEVGWVQQVPNATT : ::.:..:.. ..: .:...: ::.: :: :: CCDS77 PENFKLLGNVMVIILATHFGKEFTPEVQAAWQKLVSAVAIALAHKYH 110 120 130 140 190 pF1KE6 PPATLPSSGP >>CCDS13917.1 MB gene_id:4151|Hs108|chr22 (154 aa) initn: 238 init1: 126 opt: 251 Z-score: 341.1 bits: 69.5 E(32554): 1.1e-12 Smith-Waterman score: 251; 29.4% identity (63.4% similar) in 153 aa overlap (19-171:3-152) 10 20 30 40 50 60 pF1KE6 MEKVPGEMEIERRERSEELSEAERKAVQAMWARLYASCEDVGVAILVRFFVNFPSAKQYF ::..: . : .:... :. : .:.:.: . : . . : CCDS13 MGLSDGEWQLVLNVWGKVEADIPGHGQEVLIRLFKGHPETLEKF 10 20 30 40 70 80 90 100 110 120 pF1KE6 SQFKHMEDPLEMERSPQLRKHACRVMGALNTVVENLHDPDKVSSVLALVGKAHALKHKVE ..:::... ::. : .:.::. :. ::. .... . . :: ..:: :::. CCDS13 DKFKHLKSEDEMKASEDLKKHGATVLTALGGILKKKGHHEAEIKPLA---QSHATKHKIP 50 60 70 80 90 100 130 140 150 160 170 180 pF1KE6 PVYFKILSGVILEVVAEEFASDFPPETQRAWAKLRGLIYSHVTAAYKEVGWVQQVPNATT :....: :..:. . .:: ..: : : :. . ... :::.:. CCDS13 VKYLEFISECIIQVLQSKHPGDFGADAQGAMNKALELFRKDMASNYKELGFQG 110 120 130 140 150 190 pF1KE6 PPATLPSSGP >>CCDS7755.1 HBG2 gene_id:3048|Hs108|chr11 (147 aa) initn: 111 init1: 97 opt: 245 Z-score: 333.3 bits: 68.0 E(32554): 3e-12 Smith-Waterman score: 245; 25.0% identity (69.6% similar) in 148 aa overlap (19-166:4-146) 10 20 30 40 50 60 pF1KE6 MEKVPGEMEIERRERSEELSEAERKAVQAMWARLYASCEDVGVAILVRFFVNFPSAKQYF ..: .. .. ..:... . ::.: : :..: .: ....: CCDS77 MGHFTEEDKATITSLWGKV--NVEDAGGETLGRLLVVYPWTQRFF 10 20 30 40 70 80 90 100 110 120 pF1KE6 SQFKHMEDPLEMERSPQLRKHACRVMGALNTVVENLHDPDKVSSVLALVGKAHALKHKVE ..: .. . . .:... :. .:. .:. ....: : .....: ... : : .:. CCDS77 DSFGNLSSASAIMGNPKVKAHGKKVLTSLGDAIKHLDD---LKGTFAQLSELHCDKLHVD 50 60 70 80 90 100 130 140 150 160 170 180 pF1KE6 PVYFKILSGVILEVVAEEFASDFPPETQRAWAKLRGLIYSHVTAAYKEVGWVQQVPNATT : ::.:..:.. :.: .:...: ::.: .: :. . : ... : CCDS77 PENFKLLGNVLVTVLAIHFGKEFTPEVQASWQKMVTGVASALSSRYH 110 120 130 140 190 pF1KE6 PPATLPSSGP >>CCDS7754.1 HBG1 gene_id:3047|Hs108|chr11 (147 aa) initn: 118 init1: 97 opt: 243 Z-score: 330.7 bits: 67.5 E(32554): 4.3e-12 Smith-Waterman score: 243; 25.0% identity (69.6% similar) in 148 aa overlap (19-166:4-146) 10 20 30 40 50 60 pF1KE6 MEKVPGEMEIERRERSEELSEAERKAVQAMWARLYASCEDVGVAILVRFFVNFPSAKQYF ..: .. .. ..:... . ::.: : :..: .: ....: CCDS77 MGHFTEEDKATITSLWGKV--NVEDAGGETLGRLLVVYPWTQRFF 10 20 30 40 70 80 90 100 110 120 pF1KE6 SQFKHMEDPLEMERSPQLRKHACRVMGALNTVVENLHDPDKVSSVLALVGKAHALKHKVE ..: .. . . .:... :. .:. .:. ....: : .....: ... : : .:. CCDS77 DSFGNLSSASAIMGNPKVKAHGKKVLTSLGDATKHLDD---LKGTFAQLSELHCDKLHVD 50 60 70 80 90 100 130 140 150 160 170 180 pF1KE6 PVYFKILSGVILEVVAEEFASDFPPETQRAWAKLRGLIYSHVTAAYKEVGWVQQVPNATT : ::.:..:.. :.: .:...: ::.: .: :. . : ... : CCDS77 PENFKLLGNVLVTVLAIHFGKEFTPEVQASWQKMVTAVASALSSRYH 110 120 130 140 190 pF1KE6 PPATLPSSGP 190 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 11:41:29 2016 done: Tue Nov 8 11:41:29 2016 Total Scan time: 1.560 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]