FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE2599, 406 aa 1>>>pF1KE2599 406 - 406 aa - 406 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.7918+/-0.000794; mu= 14.9942+/- 0.048 mean_var=78.7803+/-16.092, 0's: 0 Z-trim(109.5): 19 B-trim: 366 in 1/49 Lambda= 0.144499 statistics sampled from 10903 (10919) to 10903 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.697), E-opt: 0.2 (0.335), width: 16 Scan time: 2.240 The best scores are: opt bits E(32554) CCDS13492.1 MTG2 gene_id:26164|Hs108|chr20 ( 406) 2737 579.9 1.4e-165 CCDS5617.1 GTPBP10 gene_id:85865|Hs108|chr7 ( 387) 485 110.5 2.9e-24 CCDS43614.1 GTPBP10 gene_id:85865|Hs108|chr7 ( 308) 393 91.2 1.4e-18 >>CCDS13492.1 MTG2 gene_id:26164|Hs108|chr20 (406 aa) initn: 2737 init1: 2737 opt: 2737 Z-score: 3086.1 bits: 579.9 E(32554): 1.4e-165 Smith-Waterman score: 2737; 100.0% identity (100.0% similar) in 406 aa overlap (1-406:1-406) 10 20 30 40 50 60 pF1KE2 MAPARCFSARLRTVFQGVGHWALSTWAGLKPSRLLPQRASPRLLSVGRADLAKHQELPGK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 MAPARCFSARLRTVFQGVGHWALSTWAGLKPSRLLPQRASPRLLSVGRADLAKHQELPGK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 KLLSEKKLKRYFVDYRRVLVCGGNGGAGASCFHSEPRKEFGGPDGGDGGNGGHVILRVDQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 KLLSEKKLKRYFVDYRRVLVCGGNGGAGASCFHSEPRKEFGGPDGGDGGNGGHVILRVDQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 QVKSLSSVLSRYQGFSGEDGGSKNCFGRSGAVLYIRVPVGTLVKEGGRVVADLSCVGDEY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 QVKSLSSVLSRYQGFSGEDGGSKNCFGRSGAVLYIRVPVGTLVKEGGRVVADLSCVGDEY 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 IAALGGAGGKGNRFFLANNNRAPVTCTPGQPGQQRVLHLELKTVAHAGMVGFPNAGKSSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 IAALGGAGGKGNRFFLANNNRAPVTCTPGQPGQQRVLHLELKTVAHAGMVGFPNAGKSSL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 LRAISNARPAVASYPFTTLKPHVGIVHYEGHLQIAVADIPGIIRGAHQNRGLGSAFLRHI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 LRAISNARPAVASYPFTTLKPHVGIVHYEGHLQIAVADIPGIIRGAHQNRGLGSAFLRHI 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE2 ERCRFLLFVVDLSQPEPWTQVDDLKYELEMYEKGLSARPHAIVANKIDLPEAQANLSQLR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 ERCRFLLFVVDLSQPEPWTQVDDLKYELEMYEKGLSARPHAIVANKIDLPEAQANLSQLR 310 320 330 340 350 360 370 380 390 400 pF1KE2 DHLGQEVIVLSALTGENLEQLLLHLKVLYDAYAEAELGQGRQPLRW :::::::::::::::::::::::::::::::::::::::::::::: CCDS13 DHLGQEVIVLSALTGENLEQLLLHLKVLYDAYAEAELGQGRQPLRW 370 380 390 400 >>CCDS5617.1 GTPBP10 gene_id:85865|Hs108|chr7 (387 aa) initn: 635 init1: 447 opt: 485 Z-score: 549.2 bits: 110.5 E(32554): 2.9e-24 Smith-Waterman score: 673; 39.2% identity (62.7% similar) in 367 aa overlap (68-396:9-350) 40 50 60 70 80 90 pF1KE2 RASPRLLSVGRADLAKHQELPGKKLLSEKKLKRY--FVDYRRVLVCGGNGGAGASCFHSE ...: :.: :... ::.:: : CCDS56 MVHCSCVLFRKYGNFIDKLRLFTRGGSGGMGY------ 10 20 30 100 110 120 130 140 150 pF1KE2 PRKEFGGPDGGDGGNGGHVILRVDQQVKSLSSVLSRY--QGF-SGEDGGSKNCF--GRSG :: ::.::.:: : . : :. .:... .:: . : .: ..:: : .: CCDS56 PRL------GGEGGKGGDVWV-VAQNRMTLKQLKDRYPRKRFVAGVGANSKISALKGSKG 40 50 60 70 80 160 170 180 190 200 pF1KE2 AVLYIRVPVG-TLVKEGGRVVADLSCVGDEYIAALGGAGGKGNRFFLANNNRAPVTCTPG : :::: ... :.:.....:. .:. ..: :: ::: :: :. CCDS56 KDCEIPVPVGISVTDENGKIIGELNKENDRILVAQGGLGGKLLTNFL------PL----- 90 100 110 120 130 210 220 230 240 250 260 pF1KE2 QPGQQRVLHLELKTVAHAGMVGFPNAGKSSLLRAISNARPAVASYPFTTLKPHVGIVHYE ::.:..::.:: .: .:.:::::::::::: .:.:.::.:.: ::::::..: . : CCDS56 -KGQKRIIHLDLKLIADVGLVGFPNAGKSSLLSCVSHAKPAIADYAFTTLKPELGKIMYS 140 150 160 170 180 190 270 280 290 300 310 320 pF1KE2 GHLQIAVADIPGIIRGAHQNRGLGSAFLRHIERCRFLLFVVDLS--QPEPWTQVDD---- ::.:::.::.:.:::.:.:.: ::.:::: : ::::::.: : :: CCDS56 DFKQISVADLPGLIEGAHMNKGMGHKFLKHIERTRQLLFVVDISGFQLSSHTQYRTAFET 200 210 220 230 240 250 330 340 350 360 pF1KE2 ---LKYELEMYEKGLSARPHAIVANKIDLPEAQAN----LSQLRD-----HLG------- : :::.:.. :...: ...::.:::.:: . .:::.. :: CCDS56 IILLTKELELYKEELQTKPALLAVNKMDLPDAQDKFHELMSQLQNPKDFLHLFEKNMIPE 260 270 280 290 300 310 370 380 390 400 pF1KE2 -----QEVIVLSALTGENLEQLLLHLKVLYDAYAEAELGQGRQPLRW :..: .::.:::..:.: .. : :. : CCDS56 RTVEFQHIIPISAVTGEGIEELKNCIRKSLDEQANQENDALHKKQLLNLWISDTMSSTEP 320 330 340 350 360 370 CCDS56 PSKHAVTTSKMDII 380 >>CCDS43614.1 GTPBP10 gene_id:85865|Hs108|chr7 (308 aa) initn: 513 init1: 358 opt: 393 Z-score: 447.0 bits: 91.2 E(32554): 1.4e-18 Smith-Waterman score: 482; 40.6% identity (62.3% similar) in 244 aa overlap (184-396:35-271) 160 170 180 190 200 210 pF1KE2 YIRVPVGTLVKEGGRVVADLSCVGDEYIAALGGAGGKGNRFFLANNNRAPVTCTPGQPGQ ::: ::::. ... .:: . . . CCDS43 SCVLFRKYGNFIDKLRLFTRGGSGGMGYPRLGGEGGKGGDVWVVAQNRMTLKQLKDRYPR 10 20 30 40 50 60 220 230 240 250 260 270 pF1KE2 QRVLHLELKTVAHAGMVG-FPNAGKSSLLRAISNARPAVASYPFTTLKPHVGIVHYEGHL .: :: .: . :::::::::: .:.:.::.:.: ::::::..: . : CCDS43 KRF-------VAGVGANSKFPNAGKSSLLSCVSHAKPAIADYAFTTLKPELGKIMYSDFK 70 80 90 100 110 280 290 300 310 320 pF1KE2 QIAVADIPGIIRGAHQNRGLGSAFLRHIERCRFLLFVVDLS--QPEPWTQVDD------- ::.:::.::.:.:::.:.:.: ::.:::: : ::::::.: : :: CCDS43 QISVADLPGLIEGAHMNKGMGHKFLKHIERTRQLLFVVDISGFQLSSHTQYRTAFETIIL 120 130 140 150 160 170 330 340 350 360 pF1KE2 LKYELEMYEKGLSARPHAIVANKIDLPEAQAN----LSQLRD-----HLG---------- : :::.:.. :...: ...::.:::.:: . .:::.. :: CCDS43 LTKELELYKEELQTKPALLAVNKMDLPDAQDKFHELMSQLQNPKDFLHLFEKNMIPERTV 180 190 200 210 220 230 370 380 390 400 pF1KE2 --QEVIVLSALTGENLEQLLLHLKVLYDAYAEAELGQGRQPLRW :..: .::.:::..:.: .. : :. : CCDS43 EFQHIIPISAVTGEGIEELKNCIRKSLDEQANQENDALHKKQLLNLWISDTMSSTEPPSK 240 250 260 270 280 290 CCDS43 HAVTTSKMDII 300 406 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 16:45:49 2016 done: Tue Nov 8 16:45:49 2016 Total Scan time: 2.240 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]