FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE2598, 387 aa
  1>>>pF1KE2598 387 - 387 aa - 387 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.9933+/-0.000919; mu= 14.0758+/- 0.055
 mean_var=83.7500+/-16.731, 0's: 0 Z-trim(106.7): 19 B-trim: 0 in 0/50
 Lambda= 0.140146
 statistics sampled from 9152 (9163) to 9152 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.646), E-opt: 0.2 (0.281), width: 16
 Scan time: 2.200

The best scores are:                                  opt  bits E(32554)
CCDS5617.1 GTPBP10 gene_id:85865|Hs108|chr7   ( 387) 2521 519.5   2e-147
CCDS43614.1 GTPBP10 gene_id:85865|Hs108|chr7  ( 308) 1525 318.1  6.9e-87
CCDS13492.1 MTG2 gene_id:26164|Hs108|chr20    ( 406)  485 107.9  1.7e-23

>>CCDS5617.1 GTPBP10 gene_id:85865|Hs108|chr7 (387 aa)
 initn: 2521 init1: 2521 opt: 2521 Z-score: 2760.2 bits: 519.5 E(32554): 2e-147
Smith-Waterman score: 2521; 99.5% identity (99.7% similar) in 387 aa overlap (1-387:1-387)

       10 20 30 40 50 60
pF1KE2 MVHCSCVLFRKYGNFIDKLRLFTRGGSGGMGYPRLGGEGGKGGDVWVVAQNRMTLKQLKD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 MVHCSCVLFRKYGNFIDKLRLFTRGGSGGMGYPRLGGEGGKGGDVWVVAQNRMTLKQLKD
       10 20 30 40 50 60

       70 80 90 100 110 120
pF1KE2 RYPRKRFVAGVGANSKISALKGSKGKDWEIPVPVGISVTDENGKIIGELSKENDRILVAQ
       ::::::::::::::::::::::::::: :::::::::::::::::::::.::::::::::
CCDS56 RYPRKRFVAGVGANSKISALKGSKGKDCEIPVPVGISVTDENGKIIGELNKENDRILVAQ
       70 80 90 100 110 120

       130 140 150 160 170 180
pF1KE2 GGLGGKLLTNFLPLKGQKRIIHLDLKLIADVGLVGFPNAGKSSLLSCVSHAKPAIADYAF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 GGLGGKLLTNFLPLKGQKRIIHLDLKLIADVGLVGFPNAGKSSLLSCVSHAKPAIADYAF
       130 140 150 160 170 180

       190 200 210 220 230 240
pF1KE2 TTLKPELGKIMYSDFKQISVADLPGLIEGAHMNKGMGHKFLKHIERTRQLLFVVDISGFQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 TTLKPELGKIMYSDFKQISVADLPGLIEGAHMNKGMGHKFLKHIERTRQLLFVVDISGFQ
       190 200 210 220 230 240

       250 260 270 280 290 300
pF1KE2 LSSHTQYRTAFETIILLTKELELYKEELQTKPALLAVNKMDLPDAQDKFHELMSQLQNPK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 LSSHTQYRTAFETIILLTKELELYKEELQTKPALLAVNKMDLPDAQDKFHELMSQLQNPK
       250 260 270 280 290 300

       310 320 330 340 350 360
pF1KE2 DFLHLFEKNMIPERTVEFQHIIPISAVTGEGIEELKNCIRKSLDEQANQENDALHKKQLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 DFLHLFEKNMIPERTVEFQHIIPISAVTGEGIEELKNCIRKSLDEQANQENDALHKKQLL
       310 320 330 340 350 360

       370 380
pF1KE2 NLWISDTMSSTEPPSKHAVTTSKMDII
       :::::::::::::::::::::::::::
CCDS56 NLWISDTMSSTEPPSKHAVTTSKMDII
       370 380

>>CCDS43614.1 GTPBP10 gene_id:85865|Hs108|chr7 (308 aa)
 initn: 2025 init1: 1506 opt: 1525 Z-score: 1673.4 bits: 318.1 E(32554): 6.9e-87
Smith-Waterman score: 1871; 79.6% identity (79.6% similar) in 387 aa overlap (1-387:1-308)

       10 20 30 40 50 60
pF1KE2 MVHCSCVLFRKYGNFIDKLRLFTRGGSGGMGYPRLGGEGGKGGDVWVVAQNRMTLKQLKD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 MVHCSCVLFRKYGNFIDKLRLFTRGGSGGMGYPRLGGEGGKGGDVWVVAQNRMTLKQLKD
       10 20 30 40 50 60

       70 80 90 100 110 120
pF1KE2 RYPRKRFVAGVGANSKISALKGSKGKDWEIPVPVGISVTDENGKIIGELSKENDRILVAQ
       ::::::::::::::::
CCDS43 RYPRKRFVAGVGANSK--------------------------------------------
       70

       130 140 150 160 170 180
pF1KE2 GGLGGKLLTNFLPLKGQKRIIHLDLKLIADVGLVGFPNAGKSSLLSCVSHAKPAIADYAF
                                          :::::::::::::::::::::::::
CCDS43 -----------------------------------FPNAGKSSLLSCVSHAKPAIADYAF
       80 90 100

       190 200 210 220 230 240
pF1KE2 TTLKPELGKIMYSDFKQISVADLPGLIEGAHMNKGMGHKFLKHIERTRQLLFVVDISGFQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 TTLKPELGKIMYSDFKQISVADLPGLIEGAHMNKGMGHKFLKHIERTRQLLFVVDISGFQ
       110 120 130 140 150 160

       250 260 270 280 290 300
pF1KE2 LSSHTQYRTAFETIILLTKELELYKEELQTKPALLAVNKMDLPDAQDKFHELMSQLQNPK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 LSSHTQYRTAFETIILLTKELELYKEELQTKPALLAVNKMDLPDAQDKFHELMSQLQNPK
       170 180 190 200 210 220

       310 320 330 340 350 360
pF1KE2 DFLHLFEKNMIPERTVEFQHIIPISAVTGEGIEELKNCIRKSLDEQANQENDALHKKQLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 DFLHLFEKNMIPERTVEFQHIIPISAVTGEGIEELKNCIRKSLDEQANQENDALHKKQLL
       230 240 250 260 270 280

       370 380
pF1KE2 NLWISDTMSSTEPPSKHAVTTSKMDII
       :::::::::::::::::::::::::::
CCDS43 NLWISDTMSSTEPPSKHAVTTSKMDII
       290 300

>>CCDS13492.1 MTG2 gene_id:26164|Hs108|chr20 (406 aa)
 initn: 668 init1: 447 opt: 485 Z-score: 535.1 bits: 107.9 E(32554): 1.7e-23
Smith-Waterman score: 677; 39.5% identity (62.9% similar) in 367 aa overlap (9-350:68-396)

       10 20 30
pF1KE2 MVHCSCVLFRKYGNFIDKLRLFTRGGSGGMGY------
       ...: :.: :... ::.:: :
CCDS13 RASPRLLSVGRADLAKHQELPGKKLLSEKKLKRY--FVDYRRVLVCGGNGGAGASCFHSE
       40 50 60 70 80 90

       40 50 60 70 80
pF1KE2 PRL------GGEGGKGGDVWV-VAQNRMTLKQLKDRYPRKRFVAGVGANSKISALKGSKG
       :: ::.::.:: : . : :. .:... .:: . : .: ..:: : .:
CCDS13 PRKEFGGPDGGDGGNGGHVILRVDQQVKSLSSVLSRY--QGF-SGEDGGSKNCF--GRSG
       100 110 120 130 140 150

       90 100 110 120 130
pF1KE2 KDWEIPVPVGISVTDENGKIIGELSKENDRILVAQGGLGGKLLTNFL------PLK----
       : :::: ... :.:.....:: .:. ..: :: ::: :: :.
CCDS13 AVLYIRVPVG-TLVKEGGRVVADLSCVGDEYIAALGGAGGKGNRFFLANNNRAPVTCTPG
       160 170 180 190 200

       140 150 160 170 180 190
pF1KE2 --GQKRIIHLDLKLIADVGLVGFPNAGKSSLLSCVSHAKPAIADYAFTTLKPELGKIMYS
       ::.:..::.:: .: .:.:::::::::::: .:.:.::.:.: ::::::..: . :
CCDS13 QPGQQRVLHLELKTVAHAGMVGFPNAGKSSLLRAISNARPAVASYPFTTLKPHVGIVHYE
       210 220 230 240 250 260

       200 210 220 230 240 250
pF1KE2 DFKQISVADLPGLIEGAHMNKGMGHKFLKHIERTRQLLFVVDISGFQLSSHTQYRTAFET
       ::.:::.::.:.:::.:.:.: ::.:::: : ::::::.: : ::
CCDS13 GHLQIAVADIPGIIRGAHQNRGLGSAFLRHIERCRFLLFVVDLS--QPEPWTQ-------
       270 280 290 300 310 320

       260 270 280 290 300 310
pF1KE2 IILLTKELELYKEELQTKPALLAVNKMDLPDAQDKFHELMSQLQNPKDFLHLFEKNMIPE
       . : :::.:.. :...: ...::.:::.:: . .:::.. ::
CCDS13 VDDLKYELEMYEKGLSARPHAIVANKIDLPEAQAN----LSQLRD-----HLG-------
       330 340 350 360

       320 330 340 350 360 370
pF1KE2 RTVEFQHIIPISAVTGEGIEELKNCIRKSLDEQANQENDALHKKQLLNLWISDTMSSTEP
       :..: .::.:::..:.: .. : :. :
CCDS13 -----QEVIVLSALTGENLEQLLLHLKVLYDAYAEAELGQGRQPLRW
       370 380 390 400

       380
pF1KE2 PSKHAVTTSKMDII

387 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 16:54:10 2016 done: Tue Nov 8 16:54:10 2016
 Total Scan time: 2.200 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]