FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB8927, 273 aa 1>>>pF1KB8927 273 - 273 aa - 273 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.4240+/-0.000804; mu= 8.3078+/- 0.049 mean_var=201.4754+/-40.518, 0's: 0 Z-trim(116.1): 156 B-trim: 898 in 1/52 Lambda= 0.090357 statistics sampled from 16488 (16657) to 16488 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.822), E-opt: 0.2 (0.512), width: 16 Scan time: 2.950 The best scores are: opt bits E(32554) CCDS31305.1 HMX2 gene_id:3167|Hs108|chr10 ( 273) 1882 256.8 1.2e-68 CCDS41575.1 HMX3 gene_id:340784|Hs108|chr10 ( 357) 495 76.1 3.8e-14 CCDS47018.1 HMX1 gene_id:3166|Hs108|chr4 ( 348) 468 72.6 4.3e-13 >>CCDS31305.1 HMX2 gene_id:3167|Hs108|chr10 (273 aa) initn: 1882 init1: 1882 opt: 1882 Z-score: 1345.9 bits: 256.8 E(32554): 1.2e-68 Smith-Waterman score: 1882; 100.0% identity (100.0% similar) in 273 aa overlap (1-273:1-273) 10 20 30 40 50 60 pF1KB8 MGSKEDAGKGCPAAGGVSSFTIQSILGGGPSEAPREPVGWPARKRSLSVSSEEEEPDDGW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 MGSKEDAGKGCPAAGGVSSFTIQSILGGGPSEAPREPVGWPARKRSLSVSSEEEEPDDGW 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 KAPACFCPDQHGPKEQGPKHHPPIPFPCLGTPKGSGGSGPGGLERTPFLSPSHSDFKEEK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 KAPACFCPDQHGPKEQGPKHHPPIPFPCLGTPKGSGGSGPGGLERTPFLSPSHSDFKEEK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 ERLLPAGSPSPGSERPRDGGAERQAGAAKKKTRTVFSRSQVYQLESTFDMKRYLSSSERA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 ERLLPAGSPSPGSERPRDGGAERQAGAAKKKTRTVFSRSQVYQLESTFDMKRYLSSSERA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 CLASSLQLTETQVKTWFQNRRNKWKRQLSAELEAANMAHASAQTLVSMPLVFRDSSLLRV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 CLASSLQLTETQVKTWFQNRRNKWKRQLSAELEAANMAHASAQTLVSMPLVFRDSSLLRV 190 200 210 220 230 240 250 260 270 pF1KB8 PVPRSLAFPAPLYYPGSNLSALPLYNLYNKLDY ::::::::::::::::::::::::::::::::: CCDS31 PVPRSLAFPAPLYYPGSNLSALPLYNLYNKLDY 250 260 270 >>CCDS41575.1 HMX3 gene_id:340784|Hs108|chr10 (357 aa) initn: 543 init1: 455 opt: 495 Z-score: 367.4 bits: 76.1 E(32554): 3.8e-14 Smith-Waterman score: 555; 41.2% identity (62.2% similar) in 291 aa overlap (7-264:72-353) 10 20 30 pF1KB8 MGSKEDAGKGCPAAGGVSSFTIQSILGGGPSEAPRE :.:: : :...:..... : :: CCDS41 HRPPPKPQPPPRTLFAPASAAAAAAAAAAAAAKG--ALEGAAGFALSQV---GDLAFPRF 50 60 70 80 90 40 50 60 70 80 90 pF1KB8 PVGWPARKRSLSVSSEEEEPDDGWKAPACFCP-DQHGPK-EQGPKHHPPIPFPCLGTPKG . ::.. .: . :. : .: : . : : :. : . : : :: . CCDS41 EI--PAQRFALPAHYLERSP--AWWYPYTLTPAGGHLPRPEASEKALLRDSSPASGTDRD 100 110 120 130 140 150 100 110 120 130 pF1KB8 SG----GSGPGGLE---RTP---FLSPSHSDFKEEKERLLP--------AGSPSPGSERP : . : : ..: .: : :. .... . : :.. .::.: CCDS41 SPEPLLKADPDHKELDSKSPDEIILEESDSEESKKEGEAAPGAAGASVGAAAATPGAEDW 160 170 180 190 200 210 140 150 160 170 180 190 pF1KB8 RDGGA--ERQAGAAKKKTRTVFSRSQVYQLESTFDMKRYLSSSERACLASSLQLTETQVK . :. :.. . :::::::::::::.:::::::::::::::::: ::.::.::::::: CCDS41 KKGAESPEKKPACRKKKTRTVFSRSQVFQLESTFDMKRYLSSSERAGLAASLHLTETQVK 220 230 240 250 260 270 200 210 220 230 240 pF1KB8 TWFQNRRNKWKRQLSAELEAANMAHASAQTLVSMPLVFRDSSLLR--------VPVPRS- :::::::::::::.:::::::..::.:: .: .:......: . .::: : CCDS41 IWFQNRRNKWKRQLAAELEAANLSHAAAQRIVRVPILYHENSAAEGAAAAAAGAPVPVSQ 280 290 300 310 320 330 250 260 270 pF1KB8 --LAFPAPLYYPGSNLSALPLYNLYNKLDY :.:: :.:: .:..:: CCDS41 PLLTFPHPVYYSHPVVSSVPLLRPV 340 350 >>CCDS47018.1 HMX1 gene_id:3166|Hs108|chr4 (348 aa) initn: 466 init1: 440 opt: 468 Z-score: 348.5 bits: 72.6 E(32554): 4.3e-13 Smith-Waterman score: 485; 39.4% identity (56.6% similar) in 279 aa overlap (34-273:52-326) 10 20 30 40 50 60 pF1KB8 KEDAGKGCPAAGGVSSFTIQSILGGGPSEAPREPVGWPARKRSLSVSSEEEE---PDDGW :.. . ::.: :. . : CCDS47 ENLLAAEAKGAGRATQGDGSREDEEEDDDDPEDEDAEQARRRRLQRRRQLLAGTGPGGEA 30 40 50 60 70 80 70 80 90 100 110 pF1KB8 KAPACFCPDQHG--PKEQGPKHHPPIPFPCLGT----PKGSGGSGPGGL--ERTPFLSPS .: : . : : :. : ::. . : :. :.. :: : ::: . . :: CCDS47 RARALLGPGALGLGPRPP-PGPGPPFALGCGGAARWYPRAHGGYG-GGLSPDTSDRDSPE 90 100 110 120 130 120 130 140 pF1KB8 HSDFKEEKERLLPAGSPSPGS------------------------ERPRDGGAERQA--- .. . : : : :.::. : : .: : . CCDS47 TGEEMGRAEGAWPRG-PGPGAVQREAAELAARGPAAGTEEASELAEVPAAAGETRGGVGV 140 150 160 170 180 190 150 160 170 180 190 200 pF1KB8 -GAAKKKTRTVFSRSQVYQLESTFDMKRYLSSSERACLASSLQLTETQVKTWFQNRRNKW :. :::::::::::::.:::::::.::::::.::: ::.:::::::::: ::::::::: CCDS47 GGGRKKKTRTVFSRSQVFQLESTFDLKRYLSSAERAGLAASLQLTETQVKIWFQNRRNKW 200 210 220 230 240 250 210 220 230 240 250 260 pF1KB8 KRQLSAELEAANMAHASAQTLVSMPLVFRDSSLLRVPVPRSLAFPAPLYYPGSNLSALPL ::::.::::::... .:: :: .:.....: . . ..: :: :.. :: CCDS47 KRQLAAELEAASLSPPGAQRLVRVPVLYHESPPAAAAAGPPATLPFPLA-PAAPAPPPPL 260 270 280 290 300 310 270 pF1KB8 YNLYNKLDY .. . : : CCDS47 LGFSGALAYPLAAFPAAASVPFLRAQMPGLV 320 330 340 273 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 04:59:14 2016 done: Sun Nov 6 04:59:14 2016 Total Scan time: 2.950 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]