FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE3222, 504 aa
  1>>>pF1KE3222 504 - 504 aa - 504 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.4005+/-0.000842; mu= 17.5408+/- 0.050
 mean_var=69.3197+/-13.439, 0's: 0 Z-trim(106.9): 14 B-trim: 0 in 0/49
 Lambda= 0.154044
 statistics sampled from 9275 (9281) to 9275 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.669), E-opt: 0.2 (0.285), width: 16
 Scan time: 3.240

The best scores are:                                      opt  bits E(32554)
CCDS14477.1 TRMT2B gene_id:79979|Hs108|chrX       ( 504) 3409 766.8        0
CCDS55464.1 TRMT2B gene_id:79979|Hs108|chrX       ( 459) 2415 545.9 3.5e-155
CCDS13774.1 TRMT2A gene_id:27037|Hs108|chr22      ( 625) 1063 245.5  1.3e-64
CCDS58793.1 TRMT2A gene_id:27037|Hs108|chr22      ( 562)  938 217.7  2.7e-56
CCDS82693.1 TRMT2A gene_id:27037|Hs108|chr22      ( 643)  669 158.0    3e-38

>>CCDS14477.1 TRMT2B gene_id:79979|Hs108|chrX (504 aa)
 initn: 3409 init1: 3409 opt: 3409 Z-score: 4092.8 bits: 766.8 E(32554): 0
Smith-Waterman score: 3409; 100.0% identity (100.0% similar) in 504 aa overlap (1-504:1-504)

               10        20        30        40        50        60
pF1KE3 MAGLKRRVPLHSLRYFISMVGLFSKPGLLPWYARNPPGWSQLFLGTVCKGDFTRVIATKC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 MAGLKRRVPLHSLRYFISMVGLFSKPGLLPWYARNPPGWSQLFLGTVCKGDFTRVIATKC
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE3 QKGQKSQKKPSHLGPLDGSWQERLADVVTPLWRLSYEEQLKVKFAAQKKILQRLESYIQM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 QKGQKSQKKPSHLGPLDGSWQERLADVVTPLWRLSYEEQLKVKFAAQKKILQRLESYIQM
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE3 LNGVSVTTAVPKSERLSCLLHPIIPSPVINGYRNKSTFSVNRGPDGNPKTVGFYLGTWRD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 LNGVSVTTAVPKSERLSCLLHPIIPSPVINGYRNKSTFSVNRGPDGNPKTVGFYLGTWRD
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE3 GNVVCVQSNHLKNIPEKHSQVAQYYEVFLRQSPLEPCLVFHEGGYWRELTVRTNSQGHTM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 GNVVCVQSNHLKNIPEKHSQVAQYYEVFLRQSPLEPCLVFHEGGYWRELTVRTNSQGHTM
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE3 AIITFHPQKLSQEELHVQKEIVKEFFIRGPGAACGLTSLYFQESTMTRCSHQQSPYQLLF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 AIITFHPQKLSQEELHVQKEIVKEFFIRGPGAACGLTSLYFQESTMTRCSHQQSPYQLLF
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE3 GEPYIFEELLSLKIRISPDAFFQINTAGAEMLYRTVGELTGVNSDTILLDICCGTGVIGL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 GEPYIFEELLSLKIRISPDAFFQINTAGAEMLYRTVGELTGVNSDTILLDICCGTGVIGL
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE3 SLAQHTSRVLGIELLEQAVEDARWTAAFNGITNSEFHTGQAEKILPGLLKSKEDGQSIVA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 SLAQHTSRVLGIELLEQAVEDARWTAAFNGITNSEFHTGQAEKILPGLLKSKEDGQSIVA
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KE3 VVNPARAGLHYKVIQAIRNFRAIHTLVFVSCKLHGESTRNVIELCCPPDPAKKLLGEPFV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 VVNPARAGLHYKVIQAIRNFRAIHTLVFVSCKLHGESTRNVIELCCPPDPAKKLLGEPFV
              430       440       450       460       470       480

              490       500
pF1KE3 LQQAVPVDLFPHTPHCELVLLFTR
       ::::::::::::::::::::::::
CCDS14 LQQAVPVDLFPHTPHCELVLLFTR
              490       500

>>CCDS55464.1 TRMT2B gene_id:79979|Hs108|chrX (459 aa)
 initn: 3113 init1: 2415 opt: 2415 Z-score: 2899.6 bits: 545.9 E(32554): 3.5e-155
Smith-Waterman score: 3027; 91.1% identity (91.1% similar) in 504 aa overlap (1-504:1-459)

               10        20        30        40        50        60
pF1KE3 MAGLKRRVPLHSLRYFISMVGLFSKPGLLPWYARNPPGWSQLFLGTVCKGDFTRVIATKC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 MAGLKRRVPLHSLRYFISMVGLFSKPGLLPWYARNPPGWSQLFLGTVCKGDFTRVIATKC
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE3 QKGQKSQKKPSHLGPLDGSWQERLADVVTPLWRLSYEEQLKVKFAAQKKILQRLESYIQM
       :::::::::::::::::::::::::::::::::::::::::
CCDS55 QKGQKSQKKPSHLGPLDGSWQERLADVVTPLWRLSYEEQLK-------------------
               70        80        90       100

              130       140       150       160       170
180 pF1KE3 LNGVSVTTAVPKSERLSCLLHPIIPSPVINGYRNKSTFSVNRGPDGNPKTVGFYLGTWRD :::::::::::::::::::::::::::::::::: CCDS55 --------------------------PVINGYRNKSTFSVNRGPDGNPKTVGFYLGTWRD 110 120 130 190 200 210 220 230 240 pF1KE3 GNVVCVQSNHLKNIPEKHSQVAQYYEVFLRQSPLEPCLVFHEGGYWRELTVRTNSQGHTM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 GNVVCVQSNHLKNIPEKHSQVAQYYEVFLRQSPLEPCLVFHEGGYWRELTVRTNSQGHTM 140 150 160 170 180 190 250 260 270 280 290 300 pF1KE3 AIITFHPQKLSQEELHVQKEIVKEFFIRGPGAACGLTSLYFQESTMTRCSHQQSPYQLLF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 AIITFHPQKLSQEELHVQKEIVKEFFIRGPGAACGLTSLYFQESTMTRCSHQQSPYQLLF 200 210 220 230 240 250 310 320 330 340 350 360 pF1KE3 GEPYIFEELLSLKIRISPDAFFQINTAGAEMLYRTVGELTGVNSDTILLDICCGTGVIGL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 GEPYIFEELLSLKIRISPDAFFQINTAGAEMLYRTVGELTGVNSDTILLDICCGTGVIGL 260 270 280 290 300 310 370 380 390 400 410 420 pF1KE3 SLAQHTSRVLGIELLEQAVEDARWTAAFNGITNSEFHTGQAEKILPGLLKSKEDGQSIVA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 SLAQHTSRVLGIELLEQAVEDARWTAAFNGITNSEFHTGQAEKILPGLLKSKEDGQSIVA 320 330 340 350 360 370 430 440 450 460 470 480 pF1KE3 VVNPARAGLHYKVIQAIRNFRAIHTLVFVSCKLHGESTRNVIELCCPPDPAKKLLGEPFV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 VVNPARAGLHYKVIQAIRNFRAIHTLVFVSCKLHGESTRNVIELCCPPDPAKKLLGEPFV 380 390 400 410 420 430 490 500 pF1KE3 LQQAVPVDLFPHTPHCELVLLFTR :::::::::::::::::::::::: CCDS55 LQQAVPVDLFPHTPHCELVLLFTR 440 450 >>CCDS13774.1 TRMT2A gene_id:27037|Hs108|chr22 (625 aa) initn: 573 init1: 371 opt: 1063 Z-score: 1273.7 bits: 245.5 E(32554): 1.3e-64 Smith-Waterman score: 1065; 37.3% identity (63.3% similar) in 520 aa overlap (1-504:77-588) 10 20 30 pF1KE3 MAGLKRRVPLHSLRYFISMVGLFSKPGLLP . .. :.. . ..: :.. :: .: CCDS13 AGAATGPGPQPGLYSYIRDDLFTSEIFKLELQNVPRHASFSDVRRFLGRFGL--QPHKTK 50 60 70 80 90 100 40 50 60 70 80 pF1KE3 WYARNPPGWSQLFLGTVCKGDFTRVIATKCQKGQKSQ---KKPSHLGPL------DGSWQ ... :: : ... . 
::. ::. . .:. :. .: . CCDS13 LFGQ-PPCAFVTFRSAAERDKALRVLHGALWKGRPLSVRLARPKA-DPMARRRRQEGESE 110 120 130 140 150 160 90 100 110 120 130 pF1KE3 E---RLADVVTPLWRLSYEEQLKVKFAAQKKILQRLESYIQMLNGVSVTTAVP---KSER :.:::::::: . : :::. : ...::.: . : : . . . : .. CCDS13 PPVTRVADVVTPLWTVPYAEQLERKQLECEQVLQKLAKEIGSTNRALLPWLLEQRHKHNK 170 180 190 200 210 220 140 150 160 170 180 190 pF1KE3 LSCLLHPIIPSPVINGYRNKSTFSVNRGPDGNPKTVGFYLGTWRDGNVVCVQSNHLKNIP : :. . ::: . :::: : :. : ::. .::: :: .. :. . . .:: CCDS13 ACCPLEGVRPSPQQTEYRNKCEFLVGVGVDGEDNTVGCRLGKYKGGTCAVAAPFDTVHIP 230 240 250 260 270 280 200 210 220 230 240 250 pF1KE3 EKHSQVAQYYEVFLRQSPLEPCLVFHEGGYWRELTVRTNSQGHTMAIITFHPQKLSQEEL : .::.. .. :.:..: :.:..:::::. . ..::: ::::::: ::: CCDS13 EATKQVVKAFQEFIRSTPYSAYDPETYTGHWKQLTVRTSRRHQAMAIAYFHPQKLSPEEL 290 300 310 320 330 340 260 270 280 290 300 310 pF1KE3 HVQKEIVKEFFIRGPGAACGLTSLYFQESTMTRCSHQQS-PYQLLFGEPYIFEELLSLKI : . . : ::: : :.: ::: : . . :.. : . . :. : :.::.: . CCDS13 AELKTSLAQHFTAGPGRASGVTCLYFVEEGQRKTPSQEGLPLEHVAGDRCIHEDLLGLTF 350 360 370 380 390 400 320 330 340 350 360 370 pF1KE3 RISPDAFFQINTAGAEMLYRTVGELTGVNSDTILLDICCGTGVIGLSLAQHTSRVLGIEL :::: ::::.:: .::.:: .. . . ... ...::.:::::.:::.::....::.:.:: CCDS13 RISPHAFFQVNTPAAEVLYTVIQDWAQLDAGSMVLDVCCGTGTIGLALARKVKRVIGVEL 410 420 430 440 450 460 380 390 400 410 420 430 pF1KE3 LEQAVEDARWTAAFNGITNSEFHTGQAEKILPGLLKSKEDGQSIVAVVNPARAGLHYKVI .:::::: .: : ..: ::: :.:: ..: :. :. .: .::...: ::::: ::: CCDS13 CPEAVEDARVNAQDNELSNVEFHCGRAEDLVPTLV-SRLASQHLVAILDPPRAGLHSKVI 470 480 490 500 510 520 440 450 460 470 480 490 pF1KE3 QAIRNFRAIHTLVFVSCKLHGESTRNVIELCCPPDPAKKLLGEPFVLQQAVPVDLFPHTP ::: . .. :..:::. .. . : ..:: : .... 
: :: .:: :::::.:: CCDS13 LAIRRAKNLRRLLYVSCNPRA-AMGNFVDLCRAP--SNRVKGIPFRPVKAVAVDLFPQTP 530 540 550 560 570 500 pF1KE3 HCELVLLFTR :::...:: : CCDS13 HCEMLILFERVEHPNGTGVLGPHSPPAQPTPGPPDNTLQETGTFPSS 580 590 600 610 620 >>CCDS58793.1 TRMT2A gene_id:27037|Hs108|chr22 (562 aa) initn: 462 init1: 371 opt: 938 Z-score: 1124.3 bits: 217.7 E(32554): 2.7e-56 Smith-Waterman score: 940; 36.2% identity (61.7% similar) in 486 aa overlap (1-470:77-557) 10 20 30 pF1KE3 MAGLKRRVPLHSLRYFISMVGLFSKPGLLP . .. :.. . ..: :.. :: .: CCDS58 AGAATGPGPQPGLYSYIRDDLFTSEIFKLELQNVPRHASFSDVRRFLGRFGL--QPHKTK 50 60 70 80 90 100 40 50 60 70 80 pF1KE3 WYARNPPGWSQLFLGTVCKGDFTRVIATKCQKGQKSQ---KKPSHLGPL------DGSWQ ... :: : ... . ::. ::. . .:. :. .: . CCDS58 LFGQ-PPCAFVTFRSAAERDKALRVLHGALWKGRPLSVRLARPKA-DPMARRRRQEGESE 110 120 130 140 150 160 90 100 110 120 130 pF1KE3 E---RLADVVTPLWRLSYEEQLKVKFAAQKKILQRLESYIQMLNGVSVTTAVP---KSER :.:::::::: . : :::. : ...::.: . : : . . . : .. CCDS58 PPVTRVADVVTPLWTVPYAEQLERKQLECEQVLQKLAKEIGSTNRALLPWLLEQRHKHNK 170 180 190 200 210 220 140 150 160 170 180 190 pF1KE3 LSCLLHPIIPSPVINGYRNKSTFSVNRGPDGNPKTVGFYLGTWRDGNVVCVQSNHLKNIP : :. . ::: . :::: : :. : ::. .::: :: .. :. . . .:: CCDS58 ACCPLEGVRPSPQQTEYRNKCEFLVGVGVDGEDNTVGCRLGKYKGGTCAVAAPFDTVHIP 230 240 250 260 270 280 200 210 220 230 240 250 pF1KE3 EKHSQVAQYYEVFLRQSPLEPCLVFHEGGYWRELTVRTNSQGHTMAIITFHPQKLSQEEL : .::.. .. :.:..: :.:..:::::. . ..::: ::::::: ::: CCDS58 EATKQVVKAFQEFIRSTPYSAYDPETYTGHWKQLTVRTSRRHQAMAIAYFHPQKLSPEEL 290 300 310 320 330 340 260 270 280 290 300 310 pF1KE3 HVQKEIVKEFFIRGPGAACGLTSLYFQESTMTRCSHQQS-PYQLLFGEPYIFEELLSLKI : . . : ::: : :.: ::: : . . :.. : . . :. : :.::.: . CCDS58 AELKTSLAQHFTAGPGRASGVTCLYFVEEGQRKTPSQEGLPLEHVAGDRCIHEDLLGLTF 350 360 370 380 390 400 320 330 340 350 360 370 pF1KE3 RISPDAFFQINTAGAEMLYRTVGELTGVNSDTILLDICCGTGVIGLSLAQHTSRVLGIEL :::: ::::.:: .::.:: .. . . ... 
...::.:::::.:::.::....::.:.:: CCDS58 RISPHAFFQVNTPAAEVLYTVIQDWAQLDAGSMVLDVCCGTGTIGLALARKVKRVIGVEL 410 420 430 440 450 460 380 390 400 410 420 430 pF1KE3 LEQAVEDARWTAAFNGITNSEFHTGQAEKILPGLLKSKEDGQSIVAVVNPARAGLHYKVI .:::::: .: : ..: ::: :.:: ..: :. :. .: .::...: ::::: ::: CCDS58 CPEAVEDARVNAQDNELSNVEFHCGRAEDLVPTLV-SRLASQHLVAILDPPRAGLHSKVI 470 480 490 500 510 520 440 450 460 470 480 490 pF1KE3 QAIRNFRAIHTLVFVSCKLHGESTRNVIELCCPPDPAKKLLGEPFVLQQAVPVDLFPHTP ::: . .. :..:::. .. : ::.: CCDS58 LAIRRAKNLRRLLYVSCNPRAAMGNFVDAPLFPPQPLQSPI 530 540 550 560 500 pF1KE3 HCELVLLFTR >>CCDS82693.1 TRMT2A gene_id:27037|Hs108|chr22 (643 aa) initn: 1001 init1: 362 opt: 669 Z-score: 800.3 bits: 158.0 E(32554): 3e-38 Smith-Waterman score: 1025; 36.2% identity (61.2% similar) in 538 aa overlap (1-504:77-606) 10 20 30 pF1KE3 MAGLKRRVPLHSLRYFISMVGLFSKPGLLP . .. :.. . ..: :.. :: .: CCDS82 AGAATGPGPQPGLYSYIRDDLFTSEIFKLELQNVPRHASFSDVRRFLGRFGL--QPHKTK 50 60 70 80 90 100 40 50 60 70 80 pF1KE3 WYARNPPGWSQLFLGTVCKGDFTRVIATKCQKGQKSQ---KKPSHLGPL------DGSWQ ... :: : ... . ::. ::. . .:. :. .: . CCDS82 LFGQ-PPCAFVTFRSAAERDKALRVLHGALWKGRPLSVRLARPKA-DPMARRRRQEGESE 110 120 130 140 150 160 90 100 110 120 130 pF1KE3 E---RLADVVTPLWRLSYEEQLKVKFAAQKKILQRLESYIQMLNGVSVTTAVP---KSER :.:::::::: . : :::. : ...::.: . : : . . . : .. CCDS82 PPVTRVADVVTPLWTVPYAEQLERKQLECEQVLQKLAKEIGSTNRALLPWLLEQRHKHNK 170 180 190 200 210 220 140 150 160 170 180 190 pF1KE3 LSCLLHPIIPSPVINGYRNKSTFSVNRGPDGNPKTVGFYLGTWRDGNVVCVQSNHLKNIP : :. . ::: . :::: : :. : ::. .::: :: .. :. . . .:: CCDS82 ACCPLEGVRPSPQQTEYRNKCEFLVGVGVDGEDNTVGCRLGKYKGGTCAVAAPFDTVHIP 230 240 250 260 270 280 200 210 220 230 240 250 pF1KE3 EKHSQVAQYYEVFLRQSPLEPCLVFHEGGYWRELTVRTNSQGHTMAIITFHPQKLSQEEL : .::.. .. :.:..: :.:..:::::. . ..::: ::::::: ::: CCDS82 EATKQVVKAFQEFIRSTPYSAYDPETYTGHWKQLTVRTSRRHQAMAIAYFHPQKLSPEEL 290 300 310 320 330 340 260 270 280 290 300 310 pF1KE3 HVQKEIVKEFFIRGPGAACGLTSLYFQESTMTRCSHQQS-PYQLLFGEPYIFEELLSLKI : . . : ::: : :.: ::: : . . :.. : . . :. 
: :.::.: . CCDS82 AELKTSLAQHFTAGPGRASGVTCLYFVEEGQRKTPSQEGLPLEHVAGDRCIHEDLLGLTF 350 360 370 380 390 400 320 330 340 350 360 pF1KE3 RISPDAFFQINTAGAEMLYRTVGELTGVNSDTILLDICCGTGVIGLSLA----------- :::: ::::.:: .::.:: .. . . ... ...::.:::::.:::.:: CCDS82 RISPHAFFQVNTPAAEVLYTVIQDWAQLDAGSMVLDVCCGTGTIGLALARGPMYSPPWVG 410 420 430 440 450 460 370 380 390 400 410 pF1KE3 -------QHTSRVLGIELLEQAVEDARWTAAFNGITNSEFHTGQAEKILPGLLKSKEDGQ :...::.:.:: .:::::: .: : ..: ::: :.:: ..: :. :. .: CCDS82 RHHAFLFQKVKRVIGVELCPEAVEDARVNAQDNELSNVEFHCGRAEDLVPTLV-SRLASQ 470 480 490 500 510 520 420 430 440 450 460 470 pF1KE3 SIVAVVNPARAGLHYKVIQAIRNFRAIHTLVFVSCKLHGESTRNVIELCCPPDPAKKLLG .::...: ::::: ::: ::: . .. :..:::. .. . : ..:: : .... : CCDS82 HLVAILDPPRAGLHSKVILAIRRAKNLRRLLYVSCNPRA-AMGNFVDLCRAP--SNRVKG 530 540 550 560 570 480 490 500 pF1KE3 EPFVLQQAVPVDLFPHTPHCELVLLFTR :: .:: :::::.:::::...:: : CCDS82 IPFRPVKAVAVDLFPQTPHCEMLILFERVEHPNGTGVLGPHSPPAQPTPGPPDNTLQETG 580 590 600 610 620 630 504 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Thu Nov 3 09:29:39 2016 done: Thu Nov 3 09:29:40 2016 Total Scan time: 3.240 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]