FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5102, 147 aa 1>>>pF1KE5102 147 - 147 aa - 147 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.1960+/-0.000669; mu= 11.2905+/- 0.040 mean_var=53.6295+/-10.673, 0's: 0 Z-trim(108.8): 13 B-trim: 0 in 0/50 Lambda= 0.175135 statistics sampled from 10427 (10439) to 10427 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.716), E-opt: 0.2 (0.321), width: 16 Scan time: 1.240 The best scores are: opt bits E(32554) CCDS7753.1 HBB gene_id:3043|Hs108|chr11 ( 147) 991 257.9 1.7e-69 CCDS31376.1 HBD gene_id:3045|Hs108|chr11 ( 147) 929 242.2 8.7e-65 CCDS7756.1 HBE1 gene_id:3046|Hs108|chr11 ( 147) 774 203.0 5.4e-53 CCDS7755.1 HBG2 gene_id:3048|Hs108|chr11 ( 147) 756 198.5 1.3e-51 CCDS7754.1 HBG1 gene_id:3047|Hs108|chr11 ( 147) 745 195.7 8.6e-51 CCDS10398.1 HBA2 gene_id:3040|Hs108|chr16 ( 142) 381 103.7 4e-23 CCDS10399.1 HBA1 gene_id:3039|Hs108|chr16 ( 142) 381 103.7 4e-23 CCDS10397.1 HBZ gene_id:3050|Hs108|chr16 ( 142) 321 88.6 1.5e-18 CCDS10400.1 HBQ1 gene_id:3049|Hs108|chr16 ( 142) 316 87.3 3.6e-18 CCDS32347.1 HBM gene_id:3042|Hs108|chr16 ( 141) 281 78.5 1.6e-15 >>CCDS7753.1 HBB gene_id:3043|Hs108|chr11 (147 aa) initn: 991 init1: 991 opt: 991 Z-score: 1361.3 bits: 257.9 E(32554): 1.7e-69 Smith-Waterman score: 991; 100.0% identity (100.0% similar) in 147 aa overlap (1-147:1-147) 10 20 30 40 50 60 pF1KE5 MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 VKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 VKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFG 70 80 90 100 110 120 130 140 pF1KE5 KEFTPPVQAAYQKVVAGVANALAHKYH ::::::::::::::::::::::::::: CCDS77 KEFTPPVQAAYQKVVAGVANALAHKYH 130 140 >>CCDS31376.1 HBD gene_id:3045|Hs108|chr11 (147 aa) initn: 929 init1: 929 opt: 929 Z-score: 1276.6 bits: 242.2 E(32554): 8.7e-65 Smith-Waterman score: 929; 93.2% identity (98.0% similar) in 147 aa overlap (1-147:1-147) 10 20 30 40 50 60 pF1KE5 MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPK :::::::::.::.::::::::: :::::::::::::::::::::::::::.::::::::: CCDS31 MVHLTPEEKTAVNALWGKVNVDAVGGEALGRLLVVYPWTQRFFESFGDLSSPDAVMGNPK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 VKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFG ::::::::::::::::::::::::::. ::::::::::::::::::::::::::::..:: CCDS31 VKAHGKKVLGAFSDGLAHLDNLKGTFSQLSELHCDKLHVDPENFRLLGNVLVCVLARNFG 70 80 90 100 110 120 130 140 pF1KE5 KEFTPPVQAAYQKVVAGVANALAHKYH ::::: .:::::::::::::::::::: CCDS31 KEFTPQMQAAYQKVVAGVANALAHKYH 130 140 >>CCDS7756.1 HBE1 gene_id:3046|Hs108|chr11 (147 aa) initn: 774 init1: 774 opt: 774 Z-score: 1065.0 bits: 203.0 E(32554): 5.4e-53 Smith-Waterman score: 774; 75.5% identity (93.9% similar) in 147 aa overlap (1-147:1-147) 10 20 30 40 50 60 pF1KE5 MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPK :::.: :::.:::.::.:.::.:.:::::::::::::::::::.:::.::.:.:..:::: CCDS77 MVHFTAEEKAAVTSLWSKMNVEEAGGEALGRLLVVYPWTQRFFDSFGNLSSPSAILGNPK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 VKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFG ::::::::: .:.:.. ..:::: .:: ::::::::::::::::.:::::.: .:: ::: CCDS77 VKAHGKKVLTSFGDAIKNMDNLKPAFAKLSELHCDKLHVDPENFKLLGNVMVIILATHFG 70 80 90 100 110 120 130 140 pF1KE5 KEFTPPVQAAYQKVVAGVANALAHKYH ::::: ::::.::.:..:: ::::::: CCDS77 KEFTPEVQAAWQKLVSAVAIALAHKYH 130 140 >>CCDS7755.1 HBG2 gene_id:3048|Hs108|chr11 (147 aa) initn: 756 init1: 756 opt: 756 Z-score: 1040.4 bits: 198.5 E(32554): 1.3e-51 Smith-Waterman score: 756; 73.5% identity (93.2% similar) in 147 aa overlap (1-147:1-147) 10 20 30 40 50 60 pF1KE5 MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPK : :.: :.:...:.:::::::...:::.:::::::::::::::.:::.::. .:.::::: CCDS77 MGHFTEEDKATITSLWGKVNVEDAGGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGNPK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 VKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFG ::::::::: ...:.. :::.:::::: ::::::::::::::::.::::::: ::: ::: CCDS77 VKAHGKKVLTSLGDAIKHLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVLVTVLAIHFG 70 80 90 100 110 120 130 140 pF1KE5 KEFTPPVQAAYQKVVAGVANALAHKYH ::::: :::..::.:.:::.::. .:: CCDS77 KEFTPEVQASWQKMVTGVASALSSRYH 130 140 >>CCDS7754.1 HBG1 gene_id:3047|Hs108|chr11 (147 aa) initn: 745 init1: 745 opt: 745 Z-score: 1025.4 bits: 195.7 E(32554): 8.6e-51 Smith-Waterman score: 745; 72.8% identity (92.5% similar) in 147 aa overlap (1-147:1-147) 10 20 30 40 50 60 pF1KE5 MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPK : :.: :.:...:.:::::::...:::.:::::::::::::::.:::.::. .:.::::: CCDS77 MGHFTEEDKATITSLWGKVNVEDAGGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGNPK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 VKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFG ::::::::: ...:. :::.:::::: ::::::::::::::::.::::::: ::: ::: CCDS77 VKAHGKKVLTSLGDATKHLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVLVTVLAIHFG 70 80 90 100 110 120 130 140 pF1KE5 KEFTPPVQAAYQKVVAGVANALAHKYH ::::: :::..::.:..::.::. .:: CCDS77 KEFTPEVQASWQKMVTAVASALSSRYH 130 140 >>CCDS10398.1 HBA2 gene_id:3040|Hs108|chr16 (142 aa) initn: 325 init1: 273 opt: 381 Z-score: 528.6 bits: 103.7 E(32554): 4e-23 Smith-Waterman score: 381; 43.4% identity (74.5% similar) in 145 aa overlap (4-146:3-141) 10 20 30 40 50 pF1KE5 MVHLTPEEKSAVTALWGKV--NVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGN :.: .:. : : :::: .. : :.::: :... .: :. .: : ::: :. CCDS10 MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLS-----HGS 10 20 30 40 50 60 70 80 90 100 110 pF1KE5 PKVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHH .::.::::: :.....::.:.. .....::.:: ::.::: ::.::.. :. .:: : CCDS10 AQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAH 60 70 80 90 100 110 120 130 140 pF1KE5 FGKEFTPPVQAAYQKVVAGVANALAHKYH . :::: :.:. .: .:.:...:. :: CCDS10 LPAEFTPAVHASLDKFLASVSTVLTSKYR 120 130 140 >>CCDS10399.1 HBA1 gene_id:3039|Hs108|chr16 (142 aa) initn: 325 init1: 273 opt: 381 Z-score: 528.6 bits: 103.7 E(32554): 4e-23 Smith-Waterman score: 381; 43.4% identity (74.5% similar) in 145 aa overlap (4-146:3-141) 10 20 30 40 50 pF1KE5 MVHLTPEEKSAVTALWGKV--NVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGN :.: .:. : : :::: .. : :.::: :... .: :. .: : ::: :. CCDS10 MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLS-----HGS 10 20 30 40 50 60 70 80 90 100 110 pF1KE5 PKVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHH .::.::::: :.....::.:.. .....::.:: ::.::: ::.::.. :. .:: : CCDS10 AQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAH 60 70 80 90 100 110 120 130 140 pF1KE5 FGKEFTPPVQAAYQKVVAGVANALAHKYH . :::: :.:. .: .:.:...:. :: CCDS10 LPAEFTPAVHASLDKFLASVSTVLTSKYR 120 130 140 >>CCDS10397.1 HBZ gene_id:3050|Hs108|chr16 (142 aa) initn: 231 init1: 231 opt: 321 Z-score: 446.6 bits: 88.6 E(32554): 1.5e-18 Smith-Waterman score: 321; 37.2% identity (73.1% similar) in 145 aa overlap (4-146:3-141) 10 20 30 40 50 pF1KE5 MVHLTPEEKSAVTALWGKVNV--DEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGN :: :.. ....:.:... : .: :.: ::.. .: :. .: : :: : :. CCDS10 MSLTKTERTIIVSMWAKISTQADTIGTETLERLFLSHPQTKTYFPHF-DLH-P----GS 10 20 30 40 50 60 70 80 90 100 110 pF1KE5 PKVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHH ...:::.::..: .:.. .:.. :... ::::: :.::: ::.::.. :. .:: . CCDS10 AQLRAHGSKVVAAVGDAVKSIDDIGGALSKLSELHAYILRVDPVNFKLLSHCLLVTLAAR 60 70 80 90 100 110 120 130 140 pF1KE5 FGKEFTPPVQAAYQKVVAGVANALAHKYH : .:: ..::..: .. :...:..:: CCDS10 FPADFTAEAHAAWDKFLSVVSSVLTEKYR 120 130 140 >>CCDS10400.1 HBQ1 gene_id:3049|Hs108|chr16 (142 aa) initn: 294 init1: 223 opt: 316 Z-score: 439.8 bits: 87.3 E(32554): 3.6e-18 Smith-Waterman score: 316; 40.0% identity (70.3% similar) in 145 aa overlap (4-146:3-141) 10 20 30 40 50 pF1KE5 MVHLTPEEKSAVTALWGKV--NVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGN :. :... : ::: :. :: ::: : ....: :. .: : ::: : :. CCDS10 MALSAEDRALVRALWKKLGSNVGVYTTEALERTFLAFPATKTYF-SHLDLS-P----GS 10 20 30 40 50 60 70 80 90 100 110 pF1KE5 PKVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHH .:.:::.:: :.: .. .::.: ....::.:: .:.::: .:.:::. :. .::.: CCDS10 SQVRAHGQKVADALSLAVERLDDLPHALSALSHLHACQLRVDPASFQLLGHCLLVTLARH 60 70 80 90 100 110 120 130 140 pF1KE5 FGKEFTPPVQAAYQKVVAGVANALAHKYH . .:.: .::. .: .. : .::. .: CCDS10 YPGDFSPALQASLDKFLSHVISALVSEYR 120 130 140 >>CCDS32347.1 HBM gene_id:3042|Hs108|chr16 (141 aa) initn: 254 init1: 220 opt: 281 Z-score: 392.0 bits: 78.5 E(32554): 1.6e-15 Smith-Waterman score: 281; 35.9% identity (69.0% similar) in 145 aa overlap (4-146:2-140) 10 20 30 40 50 pF1KE5 MVHLTPEEKSAVTALWGKVNVDEV--GGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGN :. .:.. .. .: . :. :.: : ::..::: :. .: .. . ::. CCDS32 MLSAQERAQIAQVWDLIAGHEAQFGAELLLRLFTVYPSTKVYFPHLS--ACQDAT--- 10 20 30 40 50 60 70 80 90 100 110 pF1KE5 PKVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHH .. .::...:.: . .. :.:::..... :..:: :.::: :: :: . . ::: : CCDS32 -QLLSHGQRMLAAVGAAVQHVDNLRAALSPLADLHALVLRVDPANFPLLIQCFHVVLASH 60 70 80 90 100 110 120 130 140 pF1KE5 FGKEFTPPVQAAYQKVVAGVANALAHKYH . ::: .:::..: ..::: .:..:: CCDS32 LQDEFTVQMQAAWDKFLTGVAVVLTEKYR 120 130 140 147 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 21:05:38 2016 done: Mon Nov 7 21:05:38 2016 Total Scan time: 1.240 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]