FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5409, 349 aa 1>>>pF1KE5409 349 - 349 aa - 349 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.0207+/-0.000373; mu= 13.1887+/- 0.023 mean_var=83.1375+/-16.720, 0's: 0 Z-trim(114.6): 39 B-trim: 357 in 1/55 Lambda= 0.140662 statistics sampled from 24511 (24549) to 24511 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.652), E-opt: 0.2 (0.288), width: 16 Scan time: 7.680 The best scores are: opt bits E(85289) NP_631908 (OMIM: 610726) probable tRNA pseudouridi ( 349) 2208 457.8 1.6e-128 NP_001316790 (OMIM: 610727) probable tRNA pseudour ( 287) 420 94.9 2.3e-19 NP_001316791 (OMIM: 610727) probable tRNA pseudour ( 275) 382 87.2 4.7e-17 NP_056494 (OMIM: 610727) probable tRNA pseudouridi ( 331) 382 87.2 5.5e-17 NP_001316792 (OMIM: 610727) probable tRNA pseudour ( 227) 326 75.8 1.1e-13 >>NP_631908 (OMIM: 610726) probable tRNA pseudouridine s (349 aa) initn: 2208 init1: 2208 opt: 2208 Z-score: 2428.2 bits: 457.8 E(85289): 1.6e-128 Smith-Waterman score: 2208; 100.0% identity (100.0% similar) in 349 aa overlap (1-349:1-349) 10 20 30 40 50 60 pF1KE5 MAASEAAVVSSPSLKTDTSPVLETAGTVAAMAATPSARAAAAVVAAAARTGSEARVSKAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_631 MAASEAAVVSSPSLKTDTSPVLETAGTVAAMAATPSARAAAAVVAAAARTGSEARVSKAA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 LATKLLSLSGVFAVHKPKGPTSAELLNRLKEKLLAEAGMPSPEWTKRKKQTLKIGHGGTL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_631 LATKLLSLSGVFAVHKPKGPTSAELLNRLKEKLLAEAGMPSPEWTKRKKQTLKIGHGGTL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 DSAARGVLVVGIGSGTKMLTSMLSGSKRYTAIGELGKATDTLDSTGRVTEEKPYDKITQE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_631 DSAARGVLVVGIGSGTKMLTSMLSGSKRYTAIGELGKATDTLDSTGRVTEEKPYDKITQE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 DIEGILQKFTGNIMQVPPLYSALKKDGQRLSTLMKRGEVVEAKPARPVTVYSISLQKFQP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_631 DIEGILQKFTGNIMQVPPLYSALKKDGQRLSTLMKRGEVVEAKPARPVTVYSISLQKFQP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 PFFTLDVECGGGFYIRSLVSDIGKELSSCANVLELTRTKQGPFTLEEHALPEDKWTIDDI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_631 PFFTLDVECGGGFYIRSLVSDIGKELSSCANVLELTRTKQGPFTLEEHALPEDKWTIDDI 250 260 270 280 290 300 310 320 330 340 pF1KE5 AQSLEHCSSLFPAELALKKSKPESNEQVLSCEYITLNEPKREDDVIKTC ::::::::::::::::::::::::::::::::::::::::::::::::: NP_631 AQSLEHCSSLFPAELALKKSKPESNEQVLSCEYITLNEPKREDDVIKTC 310 320 330 340 >>NP_001316790 (OMIM: 610727) probable tRNA pseudouridin (287 aa) initn: 256 init1: 146 opt: 420 Z-score: 468.5 bits: 94.9 E(85289): 2.3e-19 Smith-Waterman score: 420; 33.3% identity (59.3% similar) in 270 aa overlap (65-330:6-265) 40 50 60 70 80 90 pF1KE5 PSARAAAAVVAAAARTGSEARVSKAALATKLLSLSGVFAVHKPKGPTSAELLNRLKEKLL : : :.:::.:: : .: . .. .:: NP_001 MGSAGLSRLHGLFAVYKPPGLKWKHLRDTVELQLL 10 20 30 100 110 120 130 140 150 pF1KE5 AEAGMPSPEWTKRKKQTLKIGHGGTLDSAARGVLVVGIGSGTKMLTSMLSG--SKRYTAI : : ... ::.: : ::. : ::::.:.: : ..::.: .. .: ::. NP_001 KVCG---PAFAH-----LKVGVGHRLDAQASGVLVLGVGHGCRLLTDMYNAHLTKDYTVR 40 50 60 70 80 160 170 180 190 200 210 pF1KE5 GELGKATDTLDSTGRVTEEKPYDKITQEDIEGILQKFTGNIMQVPPLYSALKKDGQRLST : :::::: . ::..:. ::..:.: .. :: . :. ... .:: : :. NP_001 GLLGKATDDFREDGRLVEKTTYDHVTREKLDRILAVIQGSHQKALVMYSNLDLKTQEAYE 90 100 110 120 130 140 220 230 240 250 260 270 pF1KE5 LMKRGEVVEAKPARPVTVYSISLQKFQPPFFTLDVECGGGFY--IRSLVSDIGKELSSCA . :: ... :. . .: : :: : :.:.: .:.:: .:: ::.. : NP_001 MAVRG-LIRPMNKSPMLITGIRCLYFAPPEFLLEVQCMHETQKELRKLVHEIGLELKTTA 150 160 170 180 190 200 280 290 300 310 320 330 pF1KE5 NVLELTRTKQGPFTLEEHALPEDKWTIDDIAQSLEHCSSLFPAELALKKSKPESNEQVLS .. ::..: :::. :: . .: . .: .... . ::: . : ...:. : NP_001 VCTQVRRTRDGFFTLDS-ALLRTQWDLTNIQDAIRAATPQVAAELEKSLSPGLDTKQLPS 210 220 230 240 250 260 340 pF1KE5 CEYITLNEPKREDDVIKTC NP_001 PGWSWDSQGPSSTLGLERGAGQ 270 280 >>NP_001316791 (OMIM: 610727) probable tRNA pseudouridin (275 aa) initn: 206 init1: 146 opt: 382 Z-score: 427.1 bits: 87.2 E(85289): 4.7e-17 Smith-Waterman score: 382; 34.1% identity (61.0% similar) in 223 aa overlap (112-330:33-253) 90 100 110 120 130 140 pF1KE5 SAELLNRLKEKLLAEAGMPSPEWTKRKKQTLKIGHGGTLDSAARGVLVVGIGSGTKMLTS ::.: : ::. : ::::.:.: : ..::. NP_001 GSEEKELTLTATSVPSFINHPLVCGPAFAHLKVGVGHRLDAQASGVLVLGVGHGCRLLTD 10 20 30 40 50 60 150 160 170 180 190 pF1KE5 MLSG--SKRYTAIGELGKATDTLDSTGRVTEEKPYDKITQEDIEGILQKFTGNIMQVPPL : .. .: ::. : :::::: . ::..:. ::..:.: .. :: . :. ... . NP_001 MYNAHLTKDYTVRGLLGKATDDFREDGRLVEKTTYDHVTREKLDRILAVIQGSHQKALVM 70 80 90 100 110 120 200 210 220 230 240 250 pF1KE5 YSALKKDGQRLSTLMKRGEVVEAKPARPVTVYSISLQKFQPPFFTLDVECGGGFY--IRS :: : :. . :: ... :. . .: : :: : :.:.: .:. NP_001 YSNLDLKTQEAYEMAVRG-LIRPMNKSPMLITGIRCLYFAPPEFLLEVQCMHETQKELRK 130 140 150 160 170 180 260 270 280 290 300 310 pF1KE5 LVSDIGKELSSCANVLELTRTKQGPFTLEEHALPEDKWTIDDIAQSLEHCSSLFPAELAL :: .:: ::.. : .. ::..: :::. :: . .: . .: .... . ::: NP_001 LVHEIGLELKTTAVCTQVRRTRDGFFTLDS-ALLRTQWDLTNIQDAIRAATPQVAAELEK 190 200 210 220 230 240 320 330 340 pF1KE5 KKSKPESNEQVLSCEYITLNEPKREDDVIKTC . : ...:. : NP_001 SLSPGLDTKQLPSPGWSWDSQGPSSTLGLERGAGQ 250 260 270 >>NP_056494 (OMIM: 610727) probable tRNA pseudouridine s (331 aa) initn: 253 init1: 146 opt: 382 Z-score: 425.9 bits: 87.2 E(85289): 5.5e-17 Smith-Waterman score: 382; 34.1% identity (61.0% similar) in 223 aa overlap (112-330:89-309) 90 100 110 120 130 140 pF1KE5 SAELLNRLKEKLLAEAGMPSPEWTKRKKQTLKIGHGGTLDSAARGVLVVGIGSGTKMLTS ::.: : ::. : ::::.:.: : ..::. NP_056 GSEEKELTLTATSVPSFINHPLVCGPAFAHLKVGVGHRLDAQASGVLVLGVGHGCRLLTD 60 70 80 90 100 110 150 160 170 180 190 pF1KE5 MLSG--SKRYTAIGELGKATDTLDSTGRVTEEKPYDKITQEDIEGILQKFTGNIMQVPPL : .. .: ::. : :::::: . ::..:. ::..:.: .. :: . :. ... . NP_056 MYNAHLTKDYTVRGLLGKATDDFREDGRLVEKTTYDHVTREKLDRILAVIQGSHQKALVM 120 130 140 150 160 170 200 210 220 230 240 250 pF1KE5 YSALKKDGQRLSTLMKRGEVVEAKPARPVTVYSISLQKFQPPFFTLDVECGGGFY--IRS :: : :. . :: ... :. . .: : :: : :.:.: .:. NP_056 YSNLDLKTQEAYEMAVRG-LIRPMNKSPMLITGIRCLYFAPPEFLLEVQCMHETQKELRK 180 190 200 210 220 230 260 270 280 290 300 310 pF1KE5 LVSDIGKELSSCANVLELTRTKQGPFTLEEHALPEDKWTIDDIAQSLEHCSSLFPAELAL :: .:: ::.. : .. ::..: :::. :: . .: . .: .... . ::: NP_056 LVHEIGLELKTTAVCTQVRRTRDGFFTLDS-ALLRTQWDLTNIQDAIRAATPQVAAELEK 240 250 260 270 280 290 320 330 340 pF1KE5 KKSKPESNEQVLSCEYITLNEPKREDDVIKTC . : ...:. : NP_056 SLSPGLDTKQLPSPGWSWDSQGPSSTLGLERGAGQ 300 310 320 330 >>NP_001316792 (OMIM: 610727) probable tRNA pseudouridin (227 aa) initn: 201 init1: 141 opt: 326 Z-score: 367.0 bits: 75.8 E(85289): 1.1e-13 Smith-Waterman score: 326; 31.9% identity (60.4% similar) in 207 aa overlap (128-330:1-205) 100 110 120 130 140 150 pF1KE5 GMPSPEWTKRKKQTLKIGHGGTLDSAARGVLVVGIGSGTKMLTSMLSG--SKRYTAIGEL .:.:.: : ..::.: .. .: ::. : : NP_001 MVLGVGHGCRLLTDMYNAHLTKDYTVRGLL 10 20 30 160 170 180 190 200 210 pF1KE5 GKATDTLDSTGRVTEEKPYDKITQEDIEGILQKFTGNIMQVPPLYSALKKDGQRLSTLMK ::::: . ::..:. ::..:.: .. :: . :. ... .:: : :. . NP_001 GKATDDFREDGRLVEKTTYDHVTREKLDRILAVIQGSHQKALVMYSNLDLKTQEAYEMAV 40 50 60 70 80 90 220 230 240 250 260 270 pF1KE5 RGEVVEAKPARPVTVYSISLQKFQPPFFTLDVECGGGFY--IRSLVSDIGKELSSCANVL :: ... :. . .: : :: : :.:.: .:.:: .:: ::.. : NP_001 RG-LIRPMNKSPMLITGIRCLYFAPPEFLLEVQCMHETQKELRKLVHEIGLELKTTAVCT 100 110 120 130 140 280 290 300 310 320 330 pF1KE5 ELTRTKQGPFTLEEHALPEDKWTIDDIAQSLEHCSSLFPAELALKKSKPESNEQVLSCEY .. ::..: :::. :: . .: . .: .... . ::: . : ...:. : NP_001 QVRRTRDGFFTLDS-ALLRTQWDLTNIQDAIRAATPQVAAELEKSLSPGLDTKQLPSPGW 150 160 170 180 190 200 340 pF1KE5 ITLNEPKREDDVIKTC NP_001 SWDSQGPSSTLGLERGAGQ 210 220 349 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 00:27:49 2016 done: Tue Nov 8 00:27:50 2016 Total Scan time: 7.680 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]