FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE3905, 445 aa
1>>>pF1KE3905 445 - 445 aa - 445 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.8343+/-0.00106; mu= 20.2031+/- 0.064
mean_var=60.6797+/-12.109, 0's: 0 Z-trim(102.7): 32 B-trim: 2 in 1/50
Lambda= 0.164647
statistics sampled from 7047 (7054) to 7047 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.588), E-opt: 0.2 (0.217), width: 16
Scan time: 2.110
The best scores are: opt bits E(32554)
CCDS7071.1 GDI2 gene_id:2665|Hs108|chr10 ( 445) 2937 706.5 1.4e-203
CCDS35452.1 GDI1 gene_id:2664|Hs108|chrX ( 447) 2642 636.4 1.7e-182
CCDS44352.1 GDI2 gene_id:2665|Hs108|chr10 ( 400) 2094 506.2 2.4e-143
CCDS31073.1 CHML gene_id:1122|Hs108|chr1 ( 656) 438 113.0 9.2e-25
CCDS14454.1 CHM gene_id:1121|Hs108|chrX ( 653) 408 105.9 1.3e-22
>>CCDS7071.1 GDI2 gene_id:2665|Hs108|chr10 (445 aa)
initn: 2937 init1: 2937 opt: 2937 Z-score: 3768.5 bits: 706.5 E(32554): 1.4e-203
Smith-Waterman score: 2937; 100.0% identity (100.0% similar) in 445 aa overlap (1-445:1-445)
10 20 30 40 50 60
pF1KE3 MNEEYDVIVLGTGLTECILSGIMSVNGKKVLHMDRNPYYGGESASITPLEDLYKRFKIPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS70 MNEEYDVIVLGTGLTECILSGIMSVNGKKVLHMDRNPYYGGESASITPLEDLYKRFKIPG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE3 SPPESMGRGRDWNVDLIPKFLMANGQLVKMLLYTEVTRYLDFKVTEGSFVYKGGKIYKVP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS70 SPPESMGRGRDWNVDLIPKFLMANGQLVKMLLYTEVTRYLDFKVTEGSFVYKGGKIYKVP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE3 STEAEALASSLMGLFEKRRFRKFLVYVANFDEKDPRTFEGIDPKKTTMRDVYKKFDLGQD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS70 STEAEALASSLMGLFEKRRFRKFLVYVANFDEKDPRTFEGIDPKKTTMRDVYKKFDLGQD
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE3 VIDFTGHALALYRTDDYLDQPCYETINRIKLYSESLARYGKSPYLYPLYGLGELPQGFAR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS70 VIDFTGHALALYRTDDYLDQPCYETINRIKLYSESLARYGKSPYLYPLYGLGELPQGFAR
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE3 LSAIYGGTYMLNKPIEEIIVQNGKVIGVKSEGEIARCKQLICDPSYVKDRVEKVGQVIRV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS70 LSAIYGGTYMLNKPIEEIIVQNGKVIGVKSEGEIARCKQLICDPSYVKDRVEKVGQVIRV
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE3 ICILSHPIKNTNDANSCQIIIPQNQVNRKSDIYVCMISFAHNVAAQGKYIAIVSTTVETK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS70 ICILSHPIKNTNDANSCQIIIPQNQVNRKSDIYVCMISFAHNVAAQGKYIAIVSTTVETK
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE3 EPEKEIRPALELLEPIEQKFVSISDLLVPKDLGTESQIFISRTYDATTHFETTCDDIKNI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS70 EPEKEIRPALELLEPIEQKFVSISDLLVPKDLGTESQIFISRTYDATTHFETTCDDIKNI
370 380 390 400 410 420
430 440
pF1KE3 YKRMTGSEFDFEEMKRKKNDIYGED
:::::::::::::::::::::::::
CCDS70 YKRMTGSEFDFEEMKRKKNDIYGED
430 440
>>CCDS35452.1 GDI1 gene_id:2664|Hs108|chrX (447 aa)
initn: 2642 init1: 2642 opt: 2642 Z-score: 3389.8 bits: 636.4 E(32554): 1.7e-182
Smith-Waterman score: 2642; 86.5% identity (96.8% similar) in 444 aa overlap (1-444:1-444)
10 20 30 40 50 60
pF1KE3 MNEEYDVIVLGTGLTECILSGIMSVNGKKVLHMDRNPYYGGESASITPLEDLYKRFKIPG
:.:::::::::::::::::::::::::::::::::::::::::.::::::.:::::..
CCDS35 MDEEYDVIVLGTGLTECILSGIMSVNGKKVLHMDRNPYYGGESSSITPLEELYKRFQLLE
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE3 SPPESMGRGRDWNVDLIPKFLMANGQLVKMLLYTEVTRYLDFKVTEGSFVYKGGKIYKVP
.:::::::::::::::::::::::::::::::::::::::::::.:::::::::::::::
CCDS35 GPPESMGRGRDWNVDLIPKFLMANGQLVKMLLYTEVTRYLDFKVVEGSFVYKGGKIYKVP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE3 STEAEALASSLMGLFEKRRFRKFLVYVANFDEKDPRTFEGIDPKKTTMRDVYKKFDLGQD
:::.:::::.:::.:::::::::::.::::::.::.::::.::. :.:::::.:::::::
CCDS35 STETEALASNLMGMFEKRRFRKFLVFVANFDENDPKTFEGVDPQTTSMRDVYRKFDLGQD
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE3 VIDFTGHALALYRTDDYLDQPCYETINRIKLYSESLARYGKSPYLYPLYGLGELPQGFAR
:::::::::::::::::::::: ::.::::::::::::::::::::::::::::::::::
CCDS35 VIDFTGHALALYRTDDYLDQPCLETVNRIKLYSESLARYGKSPYLYPLYGLGELPQGFAR
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE3 LSAIYGGTYMLNKPIEEIIVQNGKVIGVKSEGEIARCKQLICDPSYVKDRVEKVGQVIRV
::::::::::::::...::..::::.:::::::.::::::::::::. :::.:.:::::.
CCDS35 LSAIYGGTYMLNKPVDDIIMENGKVVGVKSEGEVARCKQLICDPSYIPDRVRKAGQVIRI
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE3 ICILSHPIKNTNDANSCQIIIPQNQVNRKSDIYVCMISFAHNVAAQGKYIAIVSTTVETK
::::::::::::::::::::::::::::::::::::::.:::::::::::::.::::::
CCDS35 ICILSHPIKNTNDANSCQIIIPQNQVNRKSDIYVCMISYAHNVAAQGKYIAIASTTVETT
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE3 EPEKEIRPALELLEPIEQKFVSISDLLVPKDLGTESQIFISRTYDATTHFETTCDDIKNI
.::::..:::::::::.::::.:::: : : : :::.: : .:::::::::::.:::.:
CCDS35 DPEKEVEPALELLEPIDQKFVAISDLYEPIDDGCESQVFCSCSYDATTHFETTCNDIKDI
370 380 390 400 410 420
430 440
pF1KE3 YKRMTGSEFDFEEMKRKKNDIYGED
::::.:. ::::.::::.::..::
CCDS35 YKRMAGTAFDFENMKRKQNDVFGEAEQ
430 440
>>CCDS44352.1 GDI2 gene_id:2665|Hs108|chr10 (400 aa)
initn: 2091 init1: 2091 opt: 2094 Z-score: 2687.0 bits: 506.2 E(32554): 2.4e-143
Smith-Waterman score: 2562; 89.9% identity (89.9% similar) in 445 aa overlap (1-445:1-400)
10 20 30 40 50 60
pF1KE3 MNEEYDVIVLGTGLTECILSGIMSVNGKKVLHMDRNPYYGGESASITPLEDLYKRFKIPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 MNEEYDVIVLGTGLTECILSGIMSVNGKKVLHMDRNPYYGGESASITPLEDLYKRFKIPG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE3 SPPESMGRGRDWNVDLIPKFLMANGQLVKMLLYTEVTRYLDFKVTEGSFVYKGGKIYKVP
:::::::::::::::::::::::::
CCDS44 SPPESMGRGRDWNVDLIPKFLMANG-----------------------------------
70 80
130 140 150 160 170 180
pF1KE3 STEAEALASSLMGLFEKRRFRKFLVYVANFDEKDPRTFEGIDPKKTTMRDVYKKFDLGQD
::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 ----------LMGLFEKRRFRKFLVYVANFDEKDPRTFEGIDPKKTTMRDVYKKFDLGQD
90 100 110 120 130
190 200 210 220 230 240
pF1KE3 VIDFTGHALALYRTDDYLDQPCYETINRIKLYSESLARYGKSPYLYPLYGLGELPQGFAR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 VIDFTGHALALYRTDDYLDQPCYETINRIKLYSESLARYGKSPYLYPLYGLGELPQGFAR
140 150 160 170 180 190
250 260 270 280 290 300
pF1KE3 LSAIYGGTYMLNKPIEEIIVQNGKVIGVKSEGEIARCKQLICDPSYVKDRVEKVGQVIRV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 LSAIYGGTYMLNKPIEEIIVQNGKVIGVKSEGEIARCKQLICDPSYVKDRVEKVGQVIRV
200 210 220 230 240 250
310 320 330 340 350 360
pF1KE3 ICILSHPIKNTNDANSCQIIIPQNQVNRKSDIYVCMISFAHNVAAQGKYIAIVSTTVETK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 ICILSHPIKNTNDANSCQIIIPQNQVNRKSDIYVCMISFAHNVAAQGKYIAIVSTTVETK
260 270 280 290 300 310
370 380 390 400 410 420
pF1KE3 EPEKEIRPALELLEPIEQKFVSISDLLVPKDLGTESQIFISRTYDATTHFETTCDDIKNI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 EPEKEIRPALELLEPIEQKFVSISDLLVPKDLGTESQIFISRTYDATTHFETTCDDIKNI
320 330 340 350 360 370
430 440
pF1KE3 YKRMTGSEFDFEEMKRKKNDIYGED
:::::::::::::::::::::::::
CCDS44 YKRMTGSEFDFEEMKRKKNDIYGED
380 390 400
>>CCDS31073.1 CHML gene_id:1122|Hs108|chr1 (656 aa)
initn: 455 init1: 204 opt: 438 Z-score: 558.0 bits: 113.0 E(32554): 9.2e-25
Smith-Waterman score: 438; 24.3% identity (62.0% similar) in 334 aa overlap (69-392:226-549)
40 50 60 70 80 90
pF1KE3 YGGESASITPLEDLYKRFKIPGSPPESMGRGRDWNVDLIPKFLMANGQLVKMLLYTEVTR
:: .:.::. :.:...: :. .:. ..:.:
CCDS31 GDKDESKSTVEDKADEPIRNRITYSQIVKEGRRFNIDLVSKLLYSQGLLIDLLIKSDVSR
200 210 220 230 240 250
100 110 120 130 140 150
pF1KE3 YLDFKVTEGSFVYKGGKIYKVPSTEAEALASSLMGLFEKRRFRKFLVYVANFDEKDPRTF
:..:: . .... ::. .:: ..:... :. . . ::: . :::.. .. :. : .
CCDS31 YVEFKNVTRILAFREGKVEQVPCSRADVFNSKELTMVEKRMLMKFLTFCLEY-EQHPDEY
260 270 280 290 300 310
160 170 180 190 200 210
pF1KE3 EGIDPKKTTMRDVYKKFDLGQDVIDFTGHALALYRTDDYLDQPC--YETINRIKLYSESL
... .. .. . : : .. :. :..:. .. : . .: : . . :
CCDS31 QAF--RQCSFSEYLKTKKLTPNLQHFVLHSIAMTS-----ESSCTTIDGLNATKNFLQCL
320 330 340 350 360
220 230 240 250 260 270
pF1KE3 ARYGKSPYLYPLYGLGELPQGFARLSAIYGGTYMLNKPIEEIIV--QNGKVIGVKSE-GE
.:.:..:.:.:::: ::.:::: :. :..:: : : . .. ..: ..:. .. .. :.
CCDS31 GRFGNTPFLFPLYGQGEIPQGFCRMCAVFGGIYCLRHKVQCFVVDKESGRCKAIIDHFGQ
370 380 390 400 410 420
280 290 300 310 320
pF1KE3 IARCKQLICDPSYVKDRVE---KVGQVIRVICILSHPIKNTN-DANSCQIIIPQNQVNRK
: .: . ::..... . :. :.. : .. : .:. : .. .:.: . .
CCDS31 RINAKYFIVEDSYLSEETCSNVQYKQISRAVLITDQSILKTDLDQQTSILIVPPAEPG-A
430 440 450 460 470 480
330 340 350 360 370 380
pF1KE3 SDIYVCMISFAHNVAAQGKYIAIVSTTVETKEPEKEIRPALE-LLEPIEQKFVSISDLLV
. : . . . . :.. .. . .: ..... ... :. : . .. .:
CCDS31 CAVRVTELCSSTMTCMKDTYLVHLTCS-SSKTAREDLESVVKKLFTPYTETEINEEELTK
490 500 510 520 530 540
390 400 410 420 430 440
pF1KE3 PKDLGTESQIFISRTYDATTHFETTCDDIKNIYKRMTGSEFDFEEMKRKKNDIYGED
:. :
CCDS31 PRLLWALYFNMRDSSGISRSSYNGLPSNVYVCSGPDCGLGNEHAVKQAETLFQEIFPTEE
550 560 570 580 590 600
>>CCDS14454.1 CHM gene_id:1121|Hs108|chrX (653 aa)
initn: 532 init1: 195 opt: 408 Z-score: 519.5 bits: 105.9 E(32554): 1.3e-22
Smith-Waterman score: 408; 24.6% identity (58.9% similar) in 418 aa overlap (25-421:174-573)
10 20 30 40
pF1KE3 MNEEYDVIVLGTGLTECILSGIMSVNGKKVLHMDRN---PYYGGESAS-ITPL-
:.:.: : : . : ..:. : .:.
CCDS14 ESLSTMSCEMLTEQTPSSDPENALEVNGAEVTGEKENHCDDKTCVPSTSAEDMSENVPIA
150 160 170 180 190 200
50 60 70 80 90 100
pF1KE3 EDLY---KRFKIPGSPPESMGRGRDWNVDLIPKFLMANGQLVKMLLYTEVTRYLDFKVTE
:: :. .: : . . .:: .:.::. :.:.. : :. .:. ..:.:: .::
CCDS14 EDTTEQPKKNRITYS--QIIKEGRRFNIDLVSKLLYSRGLLIDLLIKSNVSRYAEFKNIT
210 220 230 240 250 260
110 120 130 140 150 160
pF1KE3 GSFVYKGGKIYKVPSTEAEALASSLMGLFEKRRFRKFLVYVANFDEKDPRTFEGIDPKKT
.... :.. .:: ..:... :. . . ::: . :::.. .. :: : ..: . .
CCDS14 RILAFREGRVEQVPCSRADVFNSKQLTMVEKRMLMKFLTFCMEY-EKYPDEYKGYE--EI
270 280 290 300 310
170 180 190 200 210 220
pF1KE3 TMRDVYKKFDLGQDVIDFTGHALALYRTDDYLDQPCYETINRIKLYSESLARYGKSPYLY
:. . : : .. .. :..:. :.. . . .. : . . :.:::..:.:.
CCDS14 TFYEYLKTQKLTPNLQYIVMHSIAM--TSE-TASSTIDGLKATKNFLHCLGRYGNTPFLF
320 330 340 350 360 370
230 240 250 260 270 280
pF1KE3 PLYGLGELPQGFARLSAIYGGTYMLNKPIEEIIV--QNGKVIGVKSE-GEIARCKQLICD
:::: ::::: : :. :..:: : : . .. ..: .. : .. .. :. .... .
CCDS14 PLYGQGELPQCFCRMCAVFGGIYCLRHSVQCLVVDKESRKCKAIIDQFGQRIISEHFLVE
380 390 400 410 420 430
290 300 310 320 330
pF1KE3 PSYVKD----RVEKVGQVIRVICILSHPIKNTNDANSCQII-IPQNQVNRKSDIYVCMIS
:: . ::. :. :.. : .. . .:.. .. .:. .: .. . . . : .
CCDS14 DSYFPENMCSRVQ-YRQISRAVLITDRSVLKTDSDQQISILTVPAEEPGTFA-VRVIELC
440 450 460 470 480 490
340 350 360 370 380 390
pF1KE3 FAHNVAAQGKYIAIVSTTVETKEPEKEIRPALELLEPIEQK-FVSISDLLVPKDLGTESQ
. . .: :.. .. : . : : :: . :: :: ... . .. . .
CCDS14 SSTMTCMKGTYLVHLTCTSS--------KTAREDLESVVQKLFVPYTEMEIENEQVEKPR
500 510 520 530 540
400 410 420 430 440
pF1KE3 IFISRTYDA--TTHFETTC-DDI-KNIYKRMTGSEFDFEEMKRKKNDIYGED
:. . .. .. . .: .:. .:.:
CCDS14 ILWALYFNMRDSSDISRSCYNDLPSNVYVCSGPDCGLGNDNAVKQAETLFQEICPNEDFC
550 560 570 580 590 600
445 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 09:05:30 2016 done: Sun Nov 6 09:05:30 2016
Total Scan time: 2.110 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]