FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE2598, 387 aa
1>>>pF1KE2598 387 - 387 aa - 387 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.9933+/-0.000919; mu= 14.0758+/- 0.055
mean_var=83.7500+/-16.731, 0's: 0 Z-trim(106.7): 19 B-trim: 0 in 0/50
Lambda= 0.140146
statistics sampled from 9152 (9163) to 9152 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.646), E-opt: 0.2 (0.281), width: 16
Scan time: 2.200
The best scores are:                                   opt bits E(32554)
CCDS5617.1 GTPBP10 gene_id:85865|Hs108|chr7    ( 387) 2521 519.5  2e-147
CCDS43614.1 GTPBP10 gene_id:85865|Hs108|chr7   ( 308) 1525 318.1 6.9e-87
CCDS13492.1 MTG2 gene_id:26164|Hs108|chr20     ( 406)  485 107.9 1.7e-23
>>CCDS5617.1 GTPBP10 gene_id:85865|Hs108|chr7 (387 aa)
initn: 2521 init1: 2521 opt: 2521 Z-score: 2760.2 bits: 519.5 E(32554): 2e-147
Smith-Waterman score: 2521; 99.5% identity (99.7% similar) in 387 aa overlap (1-387:1-387)
               10        20        30        40        50        60
pF1KE2 MVHCSCVLFRKYGNFIDKLRLFTRGGSGGMGYPRLGGEGGKGGDVWVVAQNRMTLKQLKD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 MVHCSCVLFRKYGNFIDKLRLFTRGGSGGMGYPRLGGEGGKGGDVWVVAQNRMTLKQLKD
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE2 RYPRKRFVAGVGANSKISALKGSKGKDWEIPVPVGISVTDENGKIIGELSKENDRILVAQ
       ::::::::::::::::::::::::::: :::::::::::::::::::::.::::::::::
CCDS56 RYPRKRFVAGVGANSKISALKGSKGKDCEIPVPVGISVTDENGKIIGELNKENDRILVAQ
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE2 GGLGGKLLTNFLPLKGQKRIIHLDLKLIADVGLVGFPNAGKSSLLSCVSHAKPAIADYAF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 GGLGGKLLTNFLPLKGQKRIIHLDLKLIADVGLVGFPNAGKSSLLSCVSHAKPAIADYAF
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KE2 TTLKPELGKIMYSDFKQISVADLPGLIEGAHMNKGMGHKFLKHIERTRQLLFVVDISGFQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 TTLKPELGKIMYSDFKQISVADLPGLIEGAHMNKGMGHKFLKHIERTRQLLFVVDISGFQ
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KE2 LSSHTQYRTAFETIILLTKELELYKEELQTKPALLAVNKMDLPDAQDKFHELMSQLQNPK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 LSSHTQYRTAFETIILLTKELELYKEELQTKPALLAVNKMDLPDAQDKFHELMSQLQNPK
              250       260       270       280       290       300
              310       320       330       340       350       360
pF1KE2 DFLHLFEKNMIPERTVEFQHIIPISAVTGEGIEELKNCIRKSLDEQANQENDALHKKQLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 DFLHLFEKNMIPERTVEFQHIIPISAVTGEGIEELKNCIRKSLDEQANQENDALHKKQLL
              310       320       330       340       350       360
              370       380
pF1KE2 NLWISDTMSSTEPPSKHAVTTSKMDII
       :::::::::::::::::::::::::::
CCDS56 NLWISDTMSSTEPPSKHAVTTSKMDII
              370       380
>>CCDS43614.1 GTPBP10 gene_id:85865|Hs108|chr7 (308 aa)
initn: 2025 init1: 1506 opt: 1525 Z-score: 1673.4 bits: 318.1 E(32554): 6.9e-87
Smith-Waterman score: 1871; 79.6% identity (79.6% similar) in 387 aa overlap (1-387:1-308)
               10        20        30        40        50        60
pF1KE2 MVHCSCVLFRKYGNFIDKLRLFTRGGSGGMGYPRLGGEGGKGGDVWVVAQNRMTLKQLKD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 MVHCSCVLFRKYGNFIDKLRLFTRGGSGGMGYPRLGGEGGKGGDVWVVAQNRMTLKQLKD
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE2 RYPRKRFVAGVGANSKISALKGSKGKDWEIPVPVGISVTDENGKIIGELSKENDRILVAQ
       ::::::::::::::::
CCDS43 RYPRKRFVAGVGANSK--------------------------------------------
               70
              130       140       150       160       170       180
pF1KE2 GGLGGKLLTNFLPLKGQKRIIHLDLKLIADVGLVGFPNAGKSSLLSCVSHAKPAIADYAF
                                          :::::::::::::::::::::::::
CCDS43 -----------------------------------FPNAGKSSLLSCVSHAKPAIADYAF
                                            80        90       100
              190       200       210       220       230       240
pF1KE2 TTLKPELGKIMYSDFKQISVADLPGLIEGAHMNKGMGHKFLKHIERTRQLLFVVDISGFQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 TTLKPELGKIMYSDFKQISVADLPGLIEGAHMNKGMGHKFLKHIERTRQLLFVVDISGFQ
             110       120       130       140       150       160
              250       260       270       280       290       300
pF1KE2 LSSHTQYRTAFETIILLTKELELYKEELQTKPALLAVNKMDLPDAQDKFHELMSQLQNPK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 LSSHTQYRTAFETIILLTKELELYKEELQTKPALLAVNKMDLPDAQDKFHELMSQLQNPK
             170       180       190       200       210       220
              310       320       330       340       350       360
pF1KE2 DFLHLFEKNMIPERTVEFQHIIPISAVTGEGIEELKNCIRKSLDEQANQENDALHKKQLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 DFLHLFEKNMIPERTVEFQHIIPISAVTGEGIEELKNCIRKSLDEQANQENDALHKKQLL
             230       240       250       260       270       280
              370       380
pF1KE2 NLWISDTMSSTEPPSKHAVTTSKMDII
       :::::::::::::::::::::::::::
CCDS43 NLWISDTMSSTEPPSKHAVTTSKMDII
             290       300
>>CCDS13492.1 MTG2 gene_id:26164|Hs108|chr20 (406 aa)
initn: 668 init1: 447 opt: 485 Z-score: 535.1 bits: 107.9 E(32554): 1.7e-23
Smith-Waterman score: 677; 39.5% identity (62.9% similar) in 367 aa overlap (9-350:68-396)
10 20 30
pF1KE2 MVHCSCVLFRKYGNFIDKLRLFTRGGSGGMGY------
...: :.: :... ::.:: :
CCDS13 RASPRLLSVGRADLAKHQELPGKKLLSEKKLKRY--FVDYRRVLVCGGNGGAGASCFHSE
40 50 60 70 80 90
40 50 60 70 80
pF1KE2 PRL------GGEGGKGGDVWV-VAQNRMTLKQLKDRYPRKRFVAGVGANSKISALKGSKG
:: ::.::.:: : . : :. .:... .:: . : .: ..:: : .:
CCDS13 PRKEFGGPDGGDGGNGGHVILRVDQQVKSLSSVLSRY--QGF-SGEDGGSKNCF--GRSG
100 110 120 130 140 150
90 100 110 120 130
pF1KE2 KDWEIPVPVGISVTDENGKIIGELSKENDRILVAQGGLGGKLLTNFL------PLK----
: :::: ... :.:.....:: .:. ..: :: ::: :: :.
CCDS13 AVLYIRVPVG-TLVKEGGRVVADLSCVGDEYIAALGGAGGKGNRFFLANNNRAPVTCTPG
160 170 180 190 200
140 150 160 170 180 190
pF1KE2 --GQKRIIHLDLKLIADVGLVGFPNAGKSSLLSCVSHAKPAIADYAFTTLKPELGKIMYS
::.:..::.:: .: .:.:::::::::::: .:.:.::.:.: ::::::..: . :
CCDS13 QPGQQRVLHLELKTVAHAGMVGFPNAGKSSLLRAISNARPAVASYPFTTLKPHVGIVHYE
210 220 230 240 250 260
200 210 220 230 240 250
pF1KE2 DFKQISVADLPGLIEGAHMNKGMGHKFLKHIERTRQLLFVVDISGFQLSSHTQYRTAFET
::.:::.::.:.:::.:.:.: ::.:::: : ::::::.: : ::
CCDS13 GHLQIAVADIPGIIRGAHQNRGLGSAFLRHIERCRFLLFVVDLS--QPEPWTQ-------
270 280 290 300 310 320
260 270 280 290 300 310
pF1KE2 IILLTKELELYKEELQTKPALLAVNKMDLPDAQDKFHELMSQLQNPKDFLHLFEKNMIPE
. : :::.:.. :...: ...::.:::.:: . .:::.. ::
CCDS13 VDDLKYELEMYEKGLSARPHAIVANKIDLPEAQAN----LSQLRD-----HLG-------
330 340 350 360
320 330 340 350 360 370
pF1KE2 RTVEFQHIIPISAVTGEGIEELKNCIRKSLDEQANQENDALHKKQLLNLWISDTMSSTEP
:..: .::.:::..:.: .. : :. :
CCDS13 -----QEVIVLSALTGENLEQLLLHLKVLYDAYAEAELGQGRQPLRW
370 380 390 400
380
pF1KE2 PSKHAVTTSKMDII
387 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 16:54:10 2016 done: Tue Nov 8 16:54:10 2016
Total Scan time: 2.200 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]
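
The hit lines listed under "The best scores are:" each carry an identifier, a description, the library sequence length in parentheses, and the opt, bits and E() values. As a minimal sketch of how that table could be pulled out of a report like this one, the Python below assumes the columns are whitespace-separated and that the table ends at the first line starting with ">>". The names Hit, HIT_RE and parse_best_scores are hypothetical helpers written for this illustration; they are not part of the FASTA package.

```python
import re
from typing import Iterable, List, NamedTuple


class Hit(NamedTuple):
    """One row of the 'best scores' table (hypothetical container)."""
    accession: str
    description: str
    length: int
    opt: int
    bits: float
    evalue: float


# Assumed row shape: <id> <description> ( <len>) <opt> <bits> <E>
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*\(\s*(?P<len>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<e>\S+)\s*$"
)


def parse_best_scores(lines: Iterable[str]) -> List[Hit]:
    """Collect hit rows between the table header and the first '>>' line."""
    hits: List[Hit] = []
    in_table = False
    for raw in lines:
        line = raw.strip()
        if line.startswith("The best scores are:"):
            in_table = True
            continue
        if not in_table:
            continue
        if line.startswith(">>"):
            break  # the per-hit alignment section begins here
        m = HIT_RE.match(line)
        if m is None:
            continue  # skip blank or unrecognised lines
        hits.append(
            Hit(
                accession=m.group("acc"),
                description=m.group("desc"),
                length=int(m.group("len")),
                opt=int(m.group("opt")),
                bits=float(m.group("bits")),
                evalue=float(m.group("e")),
            )
        )
    return hits


if __name__ == "__main__":
    report = [
        "The best scores are: opt bits E(32554)",
        "CCDS5617.1 GTPBP10 gene_id:85865|Hs108|chr7 ( 387) 2521 519.5 2e-147",
        "CCDS43614.1 GTPBP10 gene_id:85865|Hs108|chr7 ( 308) 1525 318.1 6.9e-87",
        "CCDS13492.1 MTG2 gene_id:26164|Hs108|chr20 ( 406) 485 107.9 1.7e-23",
        ">>CCDS5617.1 GTPBP10 gene_id:85865|Hs108|chr7 (387 aa)",
    ]
    for hit in parse_best_scores(report):
        print(hit.accession, hit.length, hit.opt, hit.bits, hit.evalue)
```

The regular expression tolerates both single-space and column-padded rows, so it should work whether or not the report's original spacing has been preserved.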