FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6184, 142 aa
  1>>>pF1KE6184 142 - 142 aa - 142 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.5371+/-0.00067; mu= 9.7281+/- 0.040
 mean_var=57.2607+/-11.534, 0's: 0 Z-trim(109.3): 16 B-trim: 0 in 0/51
 Lambda= 0.169491
 statistics sampled from 10778 (10793) to 10778 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.728), E-opt: 0.2 (0.332), width: 16
 Scan time: 1.280

The best scores are:                                       opt  bits E(32554)
CCDS10400.1 HBQ1 gene_id:3049|Hs108|chr16          ( 142)  918 232.1 8.8e-62
CCDS10398.1 HBA2 gene_id:3040|Hs108|chr16          ( 142)  588 151.4 1.7e-37
CCDS10399.1 HBA1 gene_id:3039|Hs108|chr16          ( 142)  588 151.4 1.7e-37
CCDS10397.1 HBZ gene_id:3050|Hs108|chr16           ( 142)  485 126.2 6.6e-30
CCDS32347.1 HBM gene_id:3042|Hs108|chr16           ( 141)  364  96.6 5.3e-21
CCDS7755.1 HBG2 gene_id:3048|Hs108|chr11           ( 147)  322  86.4 6.8e-18
CCDS31376.1 HBD gene_id:3045|Hs108|chr11           ( 147)  321  86.1   8e-18
CCDS7754.1 HBG1 gene_id:3047|Hs108|chr11           ( 147)  318  85.4 1.3e-17
CCDS7756.1 HBE1 gene_id:3046|Hs108|chr11           ( 147)  318  85.4 1.3e-17
CCDS7753.1 HBB gene_id:3043|Hs108|chr11            ( 147)  316  84.9 1.9e-17

>>CCDS10400.1 HBQ1 gene_id:3049|Hs108|chr16 (142 aa)
 initn: 918 init1: 918 opt: 918 Z-score: 1222.6 bits: 232.1 E(32554): 8.8e-62
Smith-Waterman score: 918; 100.0% identity (100.0% similar) in 142 aa overlap (1-142:1-142)

               10        20        30        40        50        60
pF1KE6 MALSAEDRALVRALWKKLGSNVGVYTTEALERTFLAFPATKTYFSHLDLSPGSSQVRAHG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MALSAEDRALVRALWKKLGSNVGVYTTEALERTFLAFPATKTYFSHLDLSPGSSQVRAHG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 QKVADALSLAVERLDDLPHALSALSHLHACQLRVDPASFQLLGHCLLVTLARHYPGDFSP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 QKVADALSLAVERLDDLPHALSALSHLHACQLRVDPASFQLLGHCLLVTLARHYPGDFSP
               70        80        90       100       110       120

              130       140
pF1KE6 ALQASLDKFLSHVISALVSEYR
       ::::::::::::::::::::::
CCDS10 ALQASLDKFLSHVISALVSEYR
              130       140

>>CCDS10398.1 HBA2 gene_id:3040|Hs108|chr16 (142 aa)
 initn: 588 init1: 588 opt: 588 Z-score: 786.5 bits: 151.4 E(32554): 1.7e-37
Smith-Waterman score: 588; 62.0% identity (87.3% similar) in 142 aa overlap (1-142:1-142)

               10        20        30        40        50        60
pF1KE6 MALSAEDRALVRALWKKLGSNVGVYTTEALERTFLAFPATKTYFSHLDLSPGSSQVRAHG
       :.::  :.. :.: : :.:...: : .::::: ::.::.::::: :.::: ::.::..::
CCDS10 MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 QKVADALSLAVERLDDLPHALSALSHLHACQLRVDPASFQLLGHCLLVTLARHYPGDFSP
       .::::::. :: ..::.:.:::::: ::: .:::::..:.::.:::::::: : :..:.:
CCDS10 KKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTP
               70        80        90       100       110       120

              130       140
pF1KE6 ALQASLDKFLSHVISALVSEYR
       :..:::::::. : ..:.:.::
CCDS10 AVHASLDKFLASVSTVLTSKYR
              130       140

>>CCDS10399.1 HBA1 gene_id:3039|Hs108|chr16 (142 aa)
 initn: 588 init1: 588 opt: 588 Z-score: 786.5 bits: 151.4 E(32554): 1.7e-37
Smith-Waterman score: 588; 62.0% identity (87.3% similar) in 142 aa overlap (1-142:1-142)

               10        20        30        40        50        60
pF1KE6 MALSAEDRALVRALWKKLGSNVGVYTTEALERTFLAFPATKTYFSHLDLSPGSSQVRAHG
       :.::  :.. :.: : :.:...: : .::::: ::.::.::::: :.::: ::.::..::
CCDS10 MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 QKVADALSLAVERLDDLPHALSALSHLHACQLRVDPASFQLLGHCLLVTLARHYPGDFSP
       .::::::. :: ..::.:.:::::: ::: .:::::..:.::.:::::::: : :..:.:
CCDS10 KKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTP
               70        80        90       100       110       120

              130       140
pF1KE6 ALQASLDKFLSHVISALVSEYR
       :..:::::::. : ..:.:.::
CCDS10 AVHASLDKFLASVSTVLTSKYR
              130       140

>>CCDS10397.1 HBZ gene_id:3050|Hs108|chr16 (142 aa)
 initn: 485 init1: 485 opt: 485 Z-score: 650.4 bits: 126.2 E(32554): 6.6e-30
Smith-Waterman score: 485; 52.1% identity (80.3% similar) in 142 aa overlap (1-142:1-142)

               10        20        30        40        50        60
pF1KE6 MALSAEDRALVRALWKKLGSNVGVYTTEALERTFLAFPATKTYFSHLDLSPGSSQVRAHG
       :.:.  .:... ..: :..... .  ::.::: ::. : ::::: :.:: :::.:.::::
CCDS10 MSLTKTERTIIVSMWAKISTQADTIGTETLERLFLSHPQTKTYFPHFDLHPGSAQLRAHG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 QKVADALSLAVERLDDLPHALSALSHLHACQLRVDPASFQLLGHCLLVTLARHYPGDFSP
       .::. :.. ::. .::.  ::: ::.:::  :::::..:.::.:::::::: ..:.::.
CCDS10 SKVVAAVGDAVKSIDDIGGALSKLSELHAYILRVDPVNFKLLSHCLLVTLAARFPADFTA
               70        80        90       100       110       120

              130       140
pF1KE6 ALQASLDKFLSHVISALVSEYR
         .:. ::::: : :.:. .::
CCDS10 EAHAAWDKFLSVVSSVLTEKYR
              130       140

>>CCDS32347.1 HBM gene_id:3042|Hs108|chr16 (141 aa)
 initn: 364 init1: 364 opt: 364 Z-score: 490.6 bits: 96.6 E(32554): 5.3e-21
Smith-Waterman score: 364; 41.4% identity (70.7% similar) in 140 aa overlap (3-142:2-141)

               10        20        30        40        50        60
pF1KE6 MALSAEDRALVRALWKKLGSNVGVYTTEALERTFLAFPATKTYFSHLDLSPGSSQVRAHG
         :::..:: .  .:  .... . . .: : : : ..:.::.:: ::.    ..:. .::
CCDS32  MLSAQERAQIAQVWDLIAGHEAQFGAELLLRLFTVYPSTKVYFPHLSACQDATQLLSHG
                10        20        30        40        50

               70        80        90       100       110       120
pF1KE6 QKVADALSLAVERLDDLPHALSALSHLHACQLRVDPASFQLLGHCLLVTLARHYPGDFSP
       :..  :.. ::...:.:  ::: :. :::  ::::::.: :: .:. :.:: :   .:.
CCDS32 QRMLAAVGAAVQHVDNLRAALSPLADLHALVLRVDPANFPLLIQCFHVVLASHLQDEFTV
      60        70        80        90       100       110

              130       140
pF1KE6 ALQASLDKFLSHVISALVSEYR
        .::. ::::. :  .:. .::
CCDS32 QMQAAWDKFLTGVAVVLTEKYR
     120       130       140

>>CCDS7755.1 HBG2 gene_id:3048|Hs108|chr11 (147 aa)
 initn: 279 init1: 243 opt: 322 Z-score: 434.8 bits: 86.4 E(32554): 6.8e-18
Smith-Waterman score: 322; 39.3% identity (71.0% similar) in 145 aa overlap (3-141:4-146)

                10        20        30        40         50
pF1KE6  MALSAEDRALVRALWKKLGSNVGVYTTEALERTFLAFPATKTYFSHL-DLSP-----GS
          .. ::.: . .:: :.  ::     :.: : ....: :. .:. . .::      :.
CCDS77 MGHFTEEDKATITSLWGKV--NVEDAGGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGN
               10          20        30        40        50

            60        70        80        90       100       110
pF1KE6 SQVRAHGQKVADALSLAVERLDDLPHALSALSHLHACQLRVDPASFQLLGHCLLVTLARH
        .:.:::.::  .:. :...::::  ... ::.::  .:.::: .:.:::. :...:: :
CCDS77 PKVKAHGKKVLTSLGDAIKHLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVLVTVLAIH
       60        70        80        90       100       110

           120       130       140
pF1KE6 YPGDFSPALQASLDKFLSHVISALVSEYR
       .  .:.: .::: .:... : ::: :.:
CCDS77 FGKEFTPEVQASWQKMVTGVASALSSRYH
      120       130       140

>>CCDS31376.1 HBD gene_id:3045|Hs108|chr11 (147 aa)
 initn: 316 init1: 226 opt: 321 Z-score: 433.4 bits: 86.1 E(32554): 8e-18
Smith-Waterman score: 321; 40.7% identity (70.3% similar) in 145 aa overlap (3-141:4-146)

                10        20        30        40         50
pF1KE6  MALSAEDRALVRALWKKLGSNVGVYTTEALERTFLAFPATKTYF-SHLDLS-P----GS
          :. :... : ::: :.  :: .   ::: : ....: :. .: :  ::: :    :.
CCDS31 MVHLTPEEKTAVNALWGKV--NVDAVGGEALGRLLVVYPWTQRFFESFGDLSSPDAVMGN
               10          20        30        40        50

            60        70        80        90       100       110
pF1KE6 SQVRAHGQKVADALSLAVERLDDLPHALSALSHLHACQLRVDPASFQLLGHCLLVTLARH
        .:.:::.::  :.: .. .::.:  ..: ::.::  .:.::: .:.:::. :. .:::.
CCDS31 PKVKAHGKKVLGAFSDGLAHLDNLKGTFSQLSELHCDKLHVDPENFRLLGNVLVCVLARN
       60        70        80        90       100       110

           120       130       140
pF1KE6 YPGDFSPALQASLDKFLSHVISALVSEYR
       .  .:.: .::. .: .. : .::. .:
CCDS31 FGKEFTPQMQAAYQKVVAGVANALAHKYH
      120       130       140

>>CCDS7754.1 HBG1 gene_id:3047|Hs108|chr11 (147 aa)
 initn: 275 init1: 239 opt: 318 Z-score: 429.5 bits: 85.4 E(32554): 1.3e-17
Smith-Waterman score: 318; 39.3% identity (71.0% similar) in 145 aa overlap (3-141:4-146)

                10        20        30        40         50
pF1KE6  MALSAEDRALVRALWKKLGSNVGVYTTEALERTFLAFPATKTYFSHL-DLSP-----GS
          .. ::.: . .:: :.  ::     :.: : ....: :. .:. . .::      :.
CCDS77 MGHFTEEDKATITSLWGKV--NVEDAGGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGN
               10          20        30        40        50

            60        70        80        90       100       110
pF1KE6 SQVRAHGQKVADALSLAVERLDDLPHALSALSHLHACQLRVDPASFQLLGHCLLVTLARH
        .:.:::.::  .:. :...::::  ... ::.::  .:.::: .:.:::. :...:: :
CCDS77 PKVKAHGKKVLTSLGDATKHLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVLVTVLAIH
       60        70        80        90       100       110

           120       130       140
pF1KE6 YPGDFSPALQASLDKFLSHVISALVSEYR
       .  .:.: .::: .:... : ::: :.:
CCDS77 FGKEFTPEVQASWQKMVTAVASALSSRYH
      120       130       140

>>CCDS7756.1 HBE1 gene_id:3046|Hs108|chr11 (147 aa)
 initn: 306 init1: 228 opt: 318 Z-score: 429.5 bits: 85.4 E(32554): 1.3e-17
Smith-Waterman score: 318; 37.9% identity (71.0% similar) in 145 aa overlap (3-141:4-146)

                10        20        30        40         50
pF1KE6  MALSAEDRALVRALWKKLGSNVGVYTTEALERTFLAFPATKTYFSHL-DLS-P----GS
          ..::..: : .::.:.  ::     ::: : ....: :. .:. . .:: :    :.
CCDS77 MVHFTAEEKAAVTSLWSKM--NVEEAGGEALGRLLVVYPWTQRFFDSFGNLSSPSAILGN
               10          20        30        40        50

            60        70        80        90       100       110
pF1KE6 SQVRAHGQKVADALSLAVERLDDLPHALSALSHLHACQLRVDPASFQLLGHCLLVTLARH
        .:.:::.::  ... :.. .:.:  :.. ::.::  .:.::: .:.:::. ... :: :
CCDS77 PKVKAHGKKVLTSFGDAIKNMDNLKPAFAKLSELHCDKLHVDPENFKLLGNVMVIILATH
       60        70        80        90       100       110

           120       130       140
pF1KE6 YPGDFSPALQASLDKFLSHVISALVSEYR
       .  .:.: .::. .:..: :  ::. .:
CCDS77 FGKEFTPEVQAAWQKLVSAVAIALAHKYH
      120       130       140

>>CCDS7753.1 HBB gene_id:3043|Hs108|chr11 (147 aa)
 initn: 291 init1: 223 opt: 316 Z-score: 426.8 bits: 84.9 E(32554): 1.9e-17
Smith-Waterman score: 316; 40.0% identity (70.3% similar) in 145 aa overlap (3-141:4-146)

                10        20        30        40         50
pF1KE6  MALSAEDRALVRALWKKLGSNVGVYTTEALERTFLAFPATKTYF-SHLDLS-P----GS
          :. :... : ::: :.  ::     ::: : ....: :. .: :  ::: :    :.
CCDS77 MVHLTPEEKSAVTALWGKV--NVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGN
               10          20        30        40        50

            60        70        80        90       100       110
pF1KE6 SQVRAHGQKVADALSLAVERLDDLPHALSALSHLHACQLRVDPASFQLLGHCLLVTLARH
        .:.:::.::  :.: .. .::.:  ....::.::  .:.::: .:.:::. :. .::.:
CCDS77 PKVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHH
       60        70        80        90       100       110

           120       130       140
pF1KE6 YPGDFSPALQASLDKFLSHVISALVSEYR
       .  .:.: .::. .: .. : .::. .:
CCDS77 FGKEFTPPVQAAYQKVVAGVANALAHKYH
      120       130       140

142 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 10:09:56 2016 done: Tue Nov 8 10:09:57 2016
 Total Scan time: 1.280 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]