FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE3172, 403 aa
  1>>>pF1KE3172 403 - 403 aa - 403 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.2487+/-0.00102; mu= 12.9397+/- 0.062
 mean_var=91.8524+/-18.170, 0's: 0 Z-trim(105.5): 21 B-trim: 0 in 0/51
 Lambda= 0.133822
 statistics sampled from 8435 (8446) to 8435 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.623), E-opt: 0.2 (0.259), width: 16
 Scan time: 1.890

The best scores are:                                       opt bits E(32554)
CCDS43122.1 TRMT10C gene_id:54931|Hs108|chr3       ( 403) 2656 523.1 1.8e-148
CCDS43804.1 TRMT10B gene_id:158234|Hs108|chr9      ( 316)  324 72.8 4.9e-13
CCDS69601.1 TRMT10B gene_id:158234|Hs108|chr9      ( 221)  314 70.8 1.4e-12

>>CCDS43122.1 TRMT10C gene_id:54931|Hs108|chr3 (403 aa)
 initn: 2656 init1: 2656 opt: 2656 Z-score: 2779.2 bits: 523.1 E(32554): 1.8e-148
Smith-Waterman score: 2656; 100.0% identity (100.0% similar) in 403 aa overlap (1-403:1-403)

               10        20        30        40        50        60
pF1KE3 MAAFLKMSVSVNFFRPFTRFLVPFTLHRKRNNLTILQRYMSSKIPAVTYPKNESTPPSEE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 MAAFLKMSVSVNFFRPFTRFLVPFTLHRKRNNLTILQRYMSSKIPAVTYPKNESTPPSEE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE3 LELDKWKTTMKSSVQEECVSTISSSKDEDPLAATREFIEMWRLLGREVPEHITEEELKTL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 LELDKWKTTMKSSVQEECVSTISSSKDEDPLAATREFIEMWRLLGREVPEHITEEELKTL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE3 MECVSNTAKKKYLKYLYTKEKVKKARQIKKEMKAAAREEAKNIKLLETTEEDKQKNFLFL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 MECVSNTAKKKYLKYLYTKEKVKKARQIKKEMKAAAREEAKNIKLLETTEEDKQKNFLFL
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE3 RLWDRNMDIAMGWKGAQAMQFGQPLVFDMAYENYMKRKELQNTVSQLLESEGWNRRNVDP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 RLWDRNMDIAMGWKGAQAMQFGQPLVFDMAYENYMKRKELQNTVSQLLESEGWNRRNVDP
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE3 FHIYFCNLKIDGALHRELVKRYQEKWDKLLLTSTEKSHVDLFPKDSIIYLTADSPNVMTT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 FHIYFCNLKIDGALHRELVKRYQEKWDKLLLTSTEKSHVDLFPKDSIIYLTADSPNVMTT
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE3 FRHDKVYVIGSFVDKSMQPGTSLAKAKRLNLATECLPLDKYLQWEIGNKNLTLDQMIRIL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 FRHDKVYVIGSFVDKSMQPGTSLAKAKRLNLATECLPLDKYLQWEIGNKNLTLDQMIRIL
              310       320       330       340       350       360

              370       380       390       400
pF1KE3 LCLKNNGNWQEALQFVPKRKHTGFLEISQHSQEFINRLKKAKT
       :::::::::::::::::::::::::::::::::::::::::::
CCDS43 LCLKNNGNWQEALQFVPKRKHTGFLEISQHSQEFINRLKKAKT
              370       380       390       400

>>CCDS43804.1 TRMT10B gene_id:158234|Hs108|chr9 (316 aa)
 initn: 211 init1: 186 opt: 324 Z-score: 347.6 bits: 72.8 E(32554): 4.9e-13
Smith-Waterman score: 332; 28.7% identity (62.0% similar) in 258 aa overlap (123-374:61-300)

            100       110       120       130        140       150
pF1KE3 ATREFIEMWRLLGREVPEHITEEELKTLMECVSNTAKK-KYLKYLYTKEKVKKARQIKKE
                                     : .:. .: .. . . . .: :  :. .::
CCDS43 GLPEGFQLLQIDAEGECQEGEILATGSTAWCSKNVQRKQRHWEKIVAAKKSK--RKQEKE
               40        50        60        70        80

             160       170       180       190       200       210
pF1KE3 MKAAAREEAKNIKLLETTEEDKQKNFLFLRLWDRNMDIAMGWKGAQAMQFGQPLVFDMAY
        . : : :  .:      ...:.    :::   ..       :  .: . :  : .:...
CCDS43 RRKANRAENPGI----CPQHSKR----FLRALTKD-------KLLEAKHSGPRLCIDLSM
       90       100               110              120       130

             220       230       240       250       260       270
pF1KE3 ENYMKRKELQNTVSQLLESEGWNRRNVDPFHIYFCNLKIDGALHRELVKRYQEKWDKLLL
        .::..:::.  ..:. .  : :..   :: : . ..  :. :..: : :... ... ::
CCDS43 THYMSKKELSRLAGQIRRLYGSNKKADRPFWICLTGFTTDSPLYEECV-RMNDGFSSYLL
           140       150       160       170       180        190

             280       290       300       310       320       330
pF1KE3 TSTEKSHVDLFPKDSIIYLTADSPNVMTTFRHDKVYVIGSFVDKSMQPGTSLAKAKRLNL
         ::..  .::: ....::: :: ...     .:::..:..::.:.:  ... ::.. ..
CCDS43 DITEEDCFSLFPLETLVYLTPDSEHALEDVDLNKVYILGGLVDESIQKKVTFQKAREYSV
            200       210       220       230       240       250

             340       350            360       370       380
pF1KE3 ATECLPLDKYLQWEIGNKN-----LTLDQMIRILLCLKNNGNWQEALQFVPKRKHTGFLE
        :  ::...:.  . ..::     :...:.. ::    .. :: :::.
CCDS43 KTARLPIQEYMVRNQNGKNYHSEILAINQVFDILSTYLETHNWPEALKKGVSSGKGYILR
            260       270       280       290       300       310

        390       400
pF1KE3 ISQHSQEFINRLKKAKT

CCDS43 NSVE

>>CCDS69601.1 TRMT10B gene_id:158234|Hs108|chr9 (221 aa)
 initn: 211 init1: 186 opt: 314 Z-score: 339.5 bits: 70.8 E(32554): 1.4e-12
Smith-Waterman score: 314; 31.1% identity (67.8% similar) in 183 aa overlap (197-374:24-205)

        170       180       190       200       210       220
pF1KE3 ETTEEDKQKNFLFLRLWDRNMDIAMGWKGAQAMQFGQPLVFDMAYENYMKRKELQNTVSQ
                                     .: . :  : .:... .::..:::.  ..:
CCDS69        MVLGICPQHSKRFLRALTKDKLLEAKHSGPRLCIDLSMTHYMSKKELSRLAGQ
                      10        20        30        40        50

        230       240       250       260       270       280
pF1KE3 LLESEGWNRRNVDPFHIYFCNLKIDGALHRELVKRYQEKWDKLLLTSTEKSHVDLFPKDS
       . .  : :..   :: : . ..  :. :..: : :... ... ::  ::..  .::: ..
CCDS69 IRRLYGSNKKADRPFWICLTGFTTDSPLYEECV-RMNDGFSSYLLDITEEDCFSLFPLET
            60        70        80         90       100       110

        290       300       310       320       330       340
pF1KE3 IIYLTADSPNVMTTFRHDKVYVIGSFVDKSMQPGTSLAKAKRLNLATECLPLDKYLQWEI
       ..::: :: ...     .:::..:..::.:.:  ... ::.. .. :  ::...:.  .
CCDS69 LVYLTPDSEHALEDVDLNKVYILGGLVDESIQKKVTFQKAREYSVKTARLPIQEYMVRNQ
            120       130       140       150       160       170

        350            360       370       380       390       400
pF1KE3 GNKN-----LTLDQMIRILLCLKNNGNWQEALQFVPKRKHTGFLEISQHSQEFINRLKKA
       ..::     :...:.. ::    .. :: :::.
CCDS69 NGKNYHSEILAINQVFDILSTYLETHNWPEALKKGVSSGKGYILRNSVE
            180       190       200       210       220

pF1KE3 KT


403 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Mon Nov 7 03:19:29 2016 done: Mon Nov 7 03:19:29 2016
 Total Scan time: 1.890 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]