FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6115, 147 aa 1>>>pF1KE6115 147 - 147 aa - 147 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.3495+/-0.00079; mu= 10.6884+/- 0.047 mean_var=58.3530+/-11.822, 0's: 0 Z-trim(107.2): 15 B-trim: 62 in 1/49 Lambda= 0.167897 statistics sampled from 9436 (9450) to 9436 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.695), E-opt: 0.2 (0.29), width: 16 Scan time: 1.360 The best scores are: opt bits E(32554) CCDS7754.1 HBG1 gene_id:3047|Hs108|chr11 ( 147) 968 242.4 7.4e-65 CCDS7755.1 HBG2 gene_id:3048|Hs108|chr11 ( 147) 957 239.8 4.7e-64 CCDS7756.1 HBE1 gene_id:3046|Hs108|chr11 ( 147) 809 203.9 2.9e-53 CCDS7753.1 HBB gene_id:3043|Hs108|chr11 ( 147) 745 188.4 1.3e-48 CCDS31376.1 HBD gene_id:3045|Hs108|chr11 ( 147) 732 185.3 1.2e-47 CCDS10398.1 HBA2 gene_id:3040|Hs108|chr16 ( 142) 363 95.9 9.3e-21 CCDS10399.1 HBA1 gene_id:3039|Hs108|chr16 ( 142) 363 95.9 9.3e-21 CCDS10397.1 HBZ gene_id:3050|Hs108|chr16 ( 142) 362 95.6 1.1e-20 CCDS10400.1 HBQ1 gene_id:3049|Hs108|chr16 ( 142) 318 85.0 1.8e-17 CCDS32347.1 HBM gene_id:3042|Hs108|chr16 ( 141) 306 82.1 1.3e-16 CCDS11746.1 CYGB gene_id:114757|Hs108|chr17 ( 190) 249 68.3 2.5e-12 >>CCDS7754.1 HBG1 gene_id:3047|Hs108|chr11 (147 aa) initn: 968 init1: 968 opt: 968 Z-score: 1277.9 bits: 242.4 E(32554): 7.4e-65 Smith-Waterman score: 968; 100.0% identity (100.0% similar) in 147 aa overlap (1-147:1-147) 10 20 30 40 50 60 pF1KE6 MGHFTEEDKATITSLWGKVNVEDAGGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGNPK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 MGHFTEEDKATITSLWGKVNVEDAGGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGNPK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 VKAHGKKVLTSLGDATKHLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVLVTVLAIHFG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 VKAHGKKVLTSLGDATKHLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVLVTVLAIHFG 70 80 90 100 110 120 130 140 pF1KE6 KEFTPEVQASWQKMVTAVASALSSRYH ::::::::::::::::::::::::::: CCDS77 KEFTPEVQASWQKMVTAVASALSSRYH 130 140 >>CCDS7755.1 HBG2 gene_id:3048|Hs108|chr11 (147 aa) initn: 957 init1: 957 opt: 957 Z-score: 1263.5 bits: 239.8 E(32554): 4.7e-64 Smith-Waterman score: 957; 98.6% identity (99.3% similar) in 147 aa overlap (1-147:1-147) 10 20 30 40 50 60 pF1KE6 MGHFTEEDKATITSLWGKVNVEDAGGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGNPK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 MGHFTEEDKATITSLWGKVNVEDAGGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGNPK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 VKAHGKKVLTSLGDATKHLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVLVTVLAIHFG ::::::::::::::: :::::::::::::::::::::::::::::::::::::::::::: CCDS77 VKAHGKKVLTSLGDAIKHLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVLVTVLAIHFG 70 80 90 100 110 120 130 140 pF1KE6 KEFTPEVQASWQKMVTAVASALSSRYH ::::::::::::::::.:::::::::: CCDS77 KEFTPEVQASWQKMVTGVASALSSRYH 130 140 >>CCDS7756.1 HBE1 gene_id:3046|Hs108|chr11 (147 aa) initn: 809 init1: 809 opt: 809 Z-score: 1069.8 bits: 203.9 E(32554): 2.9e-53 Smith-Waterman score: 809; 79.6% identity (93.9% similar) in 147 aa overlap (1-147:1-147) 10 20 30 40 50 60 pF1KE6 MGHFTEEDKATITSLWGKVNVEDAGGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGNPK : ::: :.::..::::.:.:::.::::.::::::::::::::::::::::: :::.:::: CCDS77 MVHFTAEEKAAVTSLWSKMNVEEAGGEALGRLLVVYPWTQRFFDSFGNLSSPSAILGNPK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 VKAHGKKVLTSLGDATKHLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVLVTVLAIHFG :::::::::::.::: :..:.:: .::.::::::::::::::::::::::.: .:: ::: CCDS77 VKAHGKKVLTSFGDAIKNMDNLKPAFAKLSELHCDKLHVDPENFKLLGNVMVIILATHFG 70 80 90 100 110 120 130 140 pF1KE6 KEFTPEVQASWQKMVTAVASALSSRYH :::::::::.:::.:.::: ::. .:: CCDS77 KEFTPEVQAAWQKLVSAVAIALAHKYH 130 140 >>CCDS7753.1 HBB gene_id:3043|Hs108|chr11 (147 aa) initn: 745 init1: 745 opt: 745 Z-score: 986.0 bits: 188.4 E(32554): 1.3e-48 Smith-Waterman score: 745; 72.8% identity (92.5% similar) in 147 aa overlap (1-147:1-147) 10 20 30 40 50 60 pF1KE6 MGHFTEEDKATITSLWGKVNVEDAGGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGNPK : :.: :.:...:.:::::::...:::.:::::::::::::::.:::.::. .:.::::: CCDS77 MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 VKAHGKKVLTSLGDATKHLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVLVTVLAIHFG ::::::::: ...:. :::.:::::: ::::::::::::::::.::::::: ::: ::: CCDS77 VKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFG 70 80 90 100 110 120 130 140 pF1KE6 KEFTPEVQASWQKMVTAVASALSSRYH ::::: :::..::.:..::.::. .:: CCDS77 KEFTPPVQAAYQKVVAGVANALAHKYH 130 140 >>CCDS31376.1 HBD gene_id:3045|Hs108|chr11 (147 aa) initn: 732 init1: 732 opt: 732 Z-score: 969.0 bits: 185.3 E(32554): 1.2e-47 Smith-Waterman score: 732; 71.4% identity (93.2% similar) in 147 aa overlap (1-147:1-147) 10 20 30 40 50 60 pF1KE6 MGHFTEEDKATITSLWGKVNVEDAGGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGNPK : :.: :.:.....:::::::. .:::.:::::::::::::::.:::.::: .:.::::: CCDS31 MVHLTPEEKTAVNALWGKVNVDAVGGEALGRLLVVYPWTQRFFESFGDLSSPDAVMGNPK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 VKAHGKKVLTSLGDATKHLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVLVTVLAIHFG ::::::::: ...:. :::.:::::.:::::::::::::::::.::::::: ::: .:: CCDS31 VKAHGKKVLGAFSDGLAHLDNLKGTFSQLSELHCDKLHVDPENFRLLGNVLVCVLARNFG 70 80 90 100 110 120 130 140 pF1KE6 KEFTPEVQASWQKMVTAVASALSSRYH :::::..::..::.:..::.::. .:: CCDS31 KEFTPQMQAAYQKVVAGVANALAHKYH 130 140 >>CCDS10398.1 HBA2 gene_id:3040|Hs108|chr16 (142 aa) initn: 311 init1: 274 opt: 363 Z-score: 486.1 bits: 95.9 E(32554): 9.3e-21 Smith-Waterman score: 363; 41.4% identity (75.9% similar) in 145 aa overlap (4-146:3-141) 10 20 30 40 50 pF1KE6 MGHFTEEDKATITSLWGKVNVE--DAGGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGN .. ::... . ::::... . :.:.: :... .: :. .: : .:: .:: CCDS10 MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSHGSA---- 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 PKVKAHGKKVLTSLGDATKHLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVLVTVLAIH .::.::::: .: .:. :.::. .... ::.:: ::.::: :::::.. :...:: : CCDS10 -QVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAH 60 70 80 90 100 110 120 130 140 pF1KE6 FGKEFTPEVQASWQKMVTAVASALSSRYH . :::: :.:: .:....:...:.:.: CCDS10 LPAEFTPAVHASLDKFLASVSTVLTSKYR 120 130 140 >>CCDS10399.1 HBA1 gene_id:3039|Hs108|chr16 (142 aa) initn: 311 init1: 274 opt: 363 Z-score: 486.1 bits: 95.9 E(32554): 9.3e-21 Smith-Waterman score: 363; 41.4% identity (75.9% similar) in 145 aa overlap (4-146:3-141) 10 20 30 40 50 pF1KE6 MGHFTEEDKATITSLWGKVNVE--DAGGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGN .. ::... . ::::... . :.:.: :... .: :. .: : .:: .:: CCDS10 MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSHGSA---- 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 PKVKAHGKKVLTSLGDATKHLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVLVTVLAIH .::.::::: .: .:. :.::. .... ::.:: ::.::: :::::.. :...:: : CCDS10 -QVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAH 60 70 80 90 100 110 120 130 140 pF1KE6 FGKEFTPEVQASWQKMVTAVASALSSRYH . :::: :.:: .:....:...:.:.: CCDS10 LPAEFTPAVHASLDKFLASVSTVLTSKYR 120 130 140 >>CCDS10397.1 HBZ gene_id:3050|Hs108|chr16 (142 aa) initn: 324 init1: 272 opt: 362 Z-score: 484.8 bits: 95.6 E(32554): 1.1e-20 Smith-Waterman score: 362; 40.7% identity (78.6% similar) in 145 aa overlap (4-146:3-141) 10 20 30 40 50 pF1KE6 MGHFTEEDKATITSLWGKVNVE-DA-GGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGN .:. ... :.:.:.:.... :. : ::: ::.. .: :. .: : .: .:: CCDS10 MSLTKTERTIIVSMWAKISTQADTIGTETLERLFLSHPQTKTYFPHF-DLHPGSA---- 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 PKVKAHGKKVLTSLGDATKHLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVLVTVLAIH ...:::.::....:::.: .::. :....::::: :.::: :::::.. :...:: . CCDS10 -QLRAHGSKVVAAVGDAVKSIDDIGGALSKLSELHAYILRVDPVNFKLLSHCLLVTLAAR 60 70 80 90 100 110 120 130 140 pF1KE6 FGKEFTPEVQASWQKMVTAVASALSSRYH : .:: :..:.:.:....:.:.:. .: CCDS10 FPADFTAEAHAAWDKFLSVVSSVLTEKYR 120 130 140 >>CCDS10400.1 HBQ1 gene_id:3049|Hs108|chr16 (142 aa) initn: 275 init1: 239 opt: 318 Z-score: 427.2 bits: 85.0 E(32554): 1.8e-17 Smith-Waterman score: 318; 40.0% identity (70.3% similar) in 145 aa overlap (4-146:3-141) 10 20 30 40 50 pF1KE6 MGHFTEEDKATITSLWGKV--NVEDAGGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGN .. ::.: . .:: :. :: :.: : ....: :. .: : .:: :. CCDS10 MALSAEDRALVRALWKKLGSNVGVYTTEALERTFLAFPATKTYF-SHLDLSP-----GS 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 PKVKAHGKKVLTSLGDATKHLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVLVTVLAIH .:.:::.:: .:. :...:::: ... ::.:: .:.::: .:.:::. :...:: : CCDS10 SQVRAHGQKVADALSLAVERLDDLPHALSALSHLHACQLRVDPASFQLLGHCLLVTLARH 60 70 80 90 100 110 120 130 140 pF1KE6 FGKEFTPEVQASWQKMVTAVASALSSRYH . .:.: .::: .:... : ::: :.: CCDS10 YPGDFSPALQASLDKFLSHVISALVSEYR 120 130 140 >>CCDS32347.1 HBM gene_id:3042|Hs108|chr16 (141 aa) initn: 284 init1: 242 opt: 306 Z-score: 411.6 bits: 82.1 E(32554): 1.3e-16 Smith-Waterman score: 306; 34.5% identity (73.1% similar) in 145 aa overlap (4-146:2-140) 10 20 30 40 50 pF1KE6 MGHFTEEDKATITSLWGKVNVEDA--GGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGN .. ...: :...: . ..: :.: : ::..::: :. .: .. ..:. .. CCDS32 MLSAQERAQIAQVWDLIAGHEAQFGAELLLRLFTVYPSTKVYFPHLSACQDATQLL-- 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 PKVKAHGKKVLTSLGDATKHLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVLVTVLAIH .::...:...: :..:.:.:..... :..:: :.::: :: :: . . .::: : CCDS32 ----SHGQRMLAAVGAAVQHVDNLRAALSPLADLHALVLRVDPANFPLLIQCFHVVLASH 60 70 80 90 100 110 120 130 140 pF1KE6 FGKEFTPEVQASWQKMVTAVASALSSRYH . ::: ..::.:.:..:.:: .:. .: CCDS32 LQDEFTVQMQAAWDKFLTGVAVVLTEKYR 120 130 140 147 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 09:34:39 2016 done: Tue Nov 8 09:34:39 2016 Total Scan time: 1.360 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]