FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1112, 142 aa 1>>>pF1KE1112 142 - 142 aa - 142 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.5777+/-0.000838; mu= 9.1984+/- 0.050 mean_var=56.7323+/-11.320, 0's: 0 Z-trim(106.0): 14 B-trim: 0 in 0/52 Lambda= 0.170278 statistics sampled from 8729 (8743) to 8729 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.678), E-opt: 0.2 (0.269), width: 16 Scan time: 1.710 The best scores are: opt bits E(32554) CCDS10398.1 HBA2 gene_id:3040|Hs108|chr16 ( 142) 925 235.2 1e-62 CCDS10399.1 HBA1 gene_id:3039|Hs108|chr16 ( 142) 925 235.2 1e-62 CCDS10400.1 HBQ1 gene_id:3049|Hs108|chr16 ( 142) 588 152.4 8.6e-38 CCDS10397.1 HBZ gene_id:3050|Hs108|chr16 ( 142) 563 146.3 6e-36 CCDS32347.1 HBM gene_id:3042|Hs108|chr16 ( 141) 430 113.6 4.1e-26 CCDS7753.1 HBB gene_id:3043|Hs108|chr11 ( 147) 381 101.6 1.8e-22 CCDS31376.1 HBD gene_id:3045|Hs108|chr11 ( 147) 373 99.6 7e-22 CCDS7755.1 HBG2 gene_id:3048|Hs108|chr11 ( 147) 366 97.9 2.3e-21 CCDS7754.1 HBG1 gene_id:3047|Hs108|chr11 ( 147) 363 97.2 3.9e-21 CCDS7756.1 HBE1 gene_id:3046|Hs108|chr11 ( 147) 332 89.5 7.5e-19 >>CCDS10398.1 HBA2 gene_id:3040|Hs108|chr16 (142 aa) initn: 925 init1: 925 opt: 925 Z-score: 1239.4 bits: 235.2 E(32554): 1e-62 Smith-Waterman score: 925; 100.0% identity (100.0% similar) in 142 aa overlap (1-142:1-142) 10 20 30 40 50 60 pF1KE1 MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 KKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 KKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTP 70 80 90 100 110 120 130 140 pF1KE1 AVHASLDKFLASVSTVLTSKYR :::::::::::::::::::::: CCDS10 AVHASLDKFLASVSTVLTSKYR 130 140 >>CCDS10399.1 HBA1 gene_id:3039|Hs108|chr16 (142 aa) initn: 925 init1: 925 opt: 925 Z-score: 1239.4 bits: 235.2 E(32554): 1e-62 Smith-Waterman score: 925; 100.0% identity (100.0% similar) in 142 aa overlap (1-142:1-142) 10 20 30 40 50 60 pF1KE1 MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 KKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 KKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTP 70 80 90 100 110 120 130 140 pF1KE1 AVHASLDKFLASVSTVLTSKYR :::::::::::::::::::::: CCDS10 AVHASLDKFLASVSTVLTSKYR 130 140 >>CCDS10400.1 HBQ1 gene_id:3049|Hs108|chr16 (142 aa) initn: 588 init1: 588 opt: 588 Z-score: 792.0 bits: 152.4 E(32554): 8.6e-38 Smith-Waterman score: 588; 62.0% identity (87.3% similar) in 142 aa overlap (1-142:1-142) 10 20 30 40 50 60 pF1KE1 MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHG :.:: :.. :.: : :.:...: : .::::: ::.::.::::: :.::: ::.::..:: CCDS10 MALSAEDRALVRALWKKLGSNVGVYTTEALERTFLAFPATKTYFSHLDLSPGSSQVRAHG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 KKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTP .::::::. :: ..::.:.:::::: ::: .:::::..:.::.:::::::: : :..:.: CCDS10 QKVADALSLAVERLDDLPHALSALSHLHACQLRVDPASFQLLGHCLLVTLARHYPGDFSP 70 80 90 100 110 120 130 140 pF1KE1 AVHASLDKFLASVSTVLTSKYR :..:::::::. : ..:.:.:: CCDS10 ALQASLDKFLSHVISALVSEYR 130 140 >>CCDS10397.1 HBZ gene_id:3050|Hs108|chr16 (142 aa) initn: 563 init1: 563 opt: 563 Z-score: 758.8 bits: 146.3 E(32554): 6e-36 Smith-Waterman score: 563; 59.9% identity (83.1% similar) in 142 aa overlap (1-142:1-142) 10 20 30 40 50 60 pF1KE1 MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHG : :. ...: . . :.:....: :.:.:::.::: : :::::::::: ::::...:: CCDS10 MSLTKTERTIIVSMWAKISTQADTIGTETLERLFLSHPQTKTYFPHFDLHPGSAQLRAHG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 KKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTP .::. :. .:: .::. .::: ::.:::. :::::::::::::::::::::..::.:: CCDS10 SKVVAAVGDAVKSIDDIGGALSKLSELHAYILRVDPVNFKLLSHCLLVTLAARFPADFTA 70 80 90 100 110 120 130 140 pF1KE1 AVHASLDKFLASVSTVLTSKYR .::. ::::. ::.::: ::: CCDS10 EAHAAWDKFLSVVSSVLTEKYR 130 140 >>CCDS32347.1 HBM gene_id:3042|Hs108|chr16 (141 aa) initn: 430 init1: 430 opt: 430 Z-score: 582.3 bits: 113.6 E(32554): 4.1e-26 Smith-Waterman score: 430; 45.4% identity (75.9% similar) in 141 aa overlap (2-142:1-141) 10 20 30 40 50 60 pF1KE1 MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHG .:: ..... .: ...: ...::: : :.: .:.::.::::.. . ..:. .:: CCDS32 MLSAQERAQIAQVWDLIAGHEAQFGAELLLRLFTVYPSTKVYFPHLSACQDATQLLSHG 10 20 30 40 50 70 80 90 100 110 120 pF1KE1 KKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTP ... :. :: :::.. ::: :.:::: :::::.:: :: .:. :.::.:: ::: CCDS32 QRMLAAVGAAVQHVDNLRAALSPLADLHALVLRVDPANFPLLIQCFHVVLASHLQDEFTV 60 70 80 90 100 110 130 140 pF1KE1 AVHASLDKFLASVSTVLTSKYR ..:. ::::..:..::: ::: CCDS32 QMQAAWDKFLTGVAVVLTEKYR 120 130 140 >>CCDS7753.1 HBB gene_id:3043|Hs108|chr11 (147 aa) initn: 325 init1: 273 opt: 381 Z-score: 516.9 bits: 101.6 E(32554): 1.8e-22 Smith-Waterman score: 381; 43.4% identity (74.5% similar) in 145 aa overlap (3-141:4-146) 10 20 30 40 50 pF1KE1 MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLS-----HGS :.: .:. : : :::: .. : :.::: :... .: :. .: : ::: :. CCDS77 MVHLTPEEKSAVTALWGKV--NVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGN 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 AQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAH .::.::::: :.....::.:.. .....::.:: ::.::: ::.::.. :. .:: : CCDS77 PKVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHH 60 70 80 90 100 110 120 130 140 pF1KE1 LPAEFTPAVHASLDKFLASVSTVLTSKYR . :::: :.:. .: .:.:...:. :: CCDS77 FGKEFTPPVQAAYQKVVAGVANALAHKYH 120 130 140 >>CCDS31376.1 HBD gene_id:3045|Hs108|chr11 (147 aa) initn: 305 init1: 263 opt: 373 Z-score: 506.3 bits: 99.6 E(32554): 7e-22 Smith-Waterman score: 373; 43.4% identity (74.5% similar) in 145 aa overlap (3-141:4-146) 10 20 30 40 50 pF1KE1 MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSH-----GS :.: .:: :.: ::::.. : :.::: :... .: :. .: : ::: :. CCDS31 MVHLTPEEKTAVNALWGKVNVDA--VGGEALGRLLVVYPWTQRFFESFGDLSSPDAVMGN 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 AQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAH .::.::::: :.....::.:.. ...: ::.:: ::.::: ::.::.. :. .:: . CCDS31 PKVKAHGKKVLGAFSDGLAHLDNLKGTFSQLSELHCDKLHVDPENFRLLGNVLVCVLARN 60 70 80 90 100 110 120 130 140 pF1KE1 LPAEFTPAVHASLDKFLASVSTVLTSKYR . :::: ..:. .: .:.:...:. :: CCDS31 FGKEFTPQMQAAYQKVVAGVANALAHKYH 120 130 140 >>CCDS7755.1 HBG2 gene_id:3048|Hs108|chr11 (147 aa) initn: 314 init1: 277 opt: 366 Z-score: 497.0 bits: 97.9 E(32554): 2.3e-21 Smith-Waterman score: 366; 41.4% identity (75.9% similar) in 145 aa overlap (3-141:4-146) 10 20 30 40 50 pF1KE1 MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSHGSA---- .. ::... . ::::... . :.:.: :... .: :. .: : .:: .:: CCDS77 MGHFTEEDKATITSLWGKVNVE--DAGGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGN 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 -QVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAH .::.::::: .: .:. :.::. .... ::.:: ::.::: :::::.. :...:: : CCDS77 PKVKAHGKKVLTSLGDAIKHLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVLVTVLAIH 60 70 80 90 100 110 120 130 140 pF1KE1 LPAEFTPAVHASLDKFLASVSTVLTSKYR . :::: :.:: .:....:...:.:.: CCDS77 FGKEFTPEVQASWQKMVTGVASALSSRYH 120 130 140 >>CCDS7754.1 HBG1 gene_id:3047|Hs108|chr11 (147 aa) initn: 311 init1: 274 opt: 363 Z-score: 493.0 bits: 97.2 E(32554): 3.9e-21 Smith-Waterman score: 363; 41.4% identity (75.9% similar) in 145 aa overlap (3-141:4-146) 10 20 30 40 50 pF1KE1 MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSHGSA---- .. ::... . ::::... . :.:.: :... .: :. .: : .:: .:: CCDS77 MGHFTEEDKATITSLWGKVNVE--DAGGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGN 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 -QVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAH .::.::::: .: .:. :.::. .... ::.:: ::.::: :::::.. :...:: : CCDS77 PKVKAHGKKVLTSLGDATKHLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVLVTVLAIH 60 70 80 90 100 110 120 130 140 pF1KE1 LPAEFTPAVHASLDKFLASVSTVLTSKYR . :::: :.:: .:....:...:.:.: CCDS77 FGKEFTPEVQASWQKMVTAVASALSSRYH 120 130 140 >>CCDS7756.1 HBE1 gene_id:3046|Hs108|chr11 (147 aa) initn: 264 init1: 264 opt: 332 Z-score: 451.9 bits: 89.5 E(32554): 7.5e-19 Smith-Waterman score: 332; 38.6% identity (72.4% similar) in 145 aa overlap (3-141:4-146) 10 20 30 40 50 pF1KE1 MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSHGSA---- .. .:. : . :.:.... : :.::: :... .: :. .: : .:: :: CCDS77 MVHFTAEEKAAVTSLWSKMNVE--EAGGEALGRLLVVYPWTQRFFDSFGNLSSPSAILGN 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 -QVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAH .::.::::: .. .:. ..:.. :.. ::.:: ::.::: :::::.. ... ::.: CCDS77 PKVKAHGKKVLTSFGDAIKNMDNLKPAFAKLSELHCDKLHVDPENFKLLGNVMVIILATH 60 70 80 90 100 110 120 130 140 pF1KE1 LPAEFTPAVHASLDKFLASVSTVLTSKYR . :::: :.:. .:....:. .:. :: CCDS77 FGKEFTPEVQAAWQKLVSAVAIALAHKYH 120 130 140 142 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 02:19:00 2016 done: Mon Nov 7 02:19:00 2016 Total Scan time: 1.710 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]