FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3125, 316 aa 1>>>pF1KE3125 316 - 316 aa - 316 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.2431+/-0.000852; mu= 11.5726+/- 0.051 mean_var=83.7579+/-16.853, 0's: 0 Z-trim(107.8): 14 B-trim: 0 in 0/50 Lambda= 0.140140 statistics sampled from 9790 (9797) to 9790 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.681), E-opt: 0.2 (0.301), width: 16 Scan time: 2.670 The best scores are: opt bits E(32554) CCDS43804.1 TRMT10B gene_id:158234|Hs108|chr9 ( 316) 2105 435.2 3.3e-122 CCDS69601.1 TRMT10B gene_id:158234|Hs108|chr9 ( 221) 1446 301.9 3.1e-82 CCDS69598.1 TRMT10B gene_id:158234|Hs108|chr9 ( 256) 947 201.0 8.2e-52 CCDS69600.1 TRMT10B gene_id:158234|Hs108|chr9 ( 229) 930 197.5 8.1e-51 CCDS3650.1 TRMT10A gene_id:93587|Hs108|chr4 ( 339) 405 91.5 1e-18 CCDS43122.1 TRMT10C gene_id:54931|Hs108|chr3 ( 403) 324 75.1 1e-13 >>CCDS43804.1 TRMT10B gene_id:158234|Hs108|chr9 (316 aa) initn: 2105 init1: 2105 opt: 2105 Z-score: 2307.5 bits: 435.2 E(32554): 3.3e-122 Smith-Waterman score: 2105; 100.0% identity (100.0% similar) in 316 aa overlap (1-316:1-316) 10 20 30 40 50 60 pF1KE3 MDWKLEGSTQKVESPVLQGQEGILEETGEDGLPEGFQLLQIDAEGECQEGEILATGSTAW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 MDWKLEGSTQKVESPVLQGQEGILEETGEDGLPEGFQLLQIDAEGECQEGEILATGSTAW 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 CSKNVQRKQRHWEKIVAAKKSKRKQEKERRKANRAENPGICPQHSKRFLRALTKDKLLEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 CSKNVQRKQRHWEKIVAAKKSKRKQEKERRKANRAENPGICPQHSKRFLRALTKDKLLEA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 KHSGPRLCIDLSMTHYMSKKELSRLAGQIRRLYGSNKKADRPFWICLTGFTTDSPLYEEC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 KHSGPRLCIDLSMTHYMSKKELSRLAGQIRRLYGSNKKADRPFWICLTGFTTDSPLYEEC 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 VRMNDGFSSYLLDITEEDCFSLFPLETLVYLTPDSEHALEDVDLNKVYILGGLVDESIQK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 VRMNDGFSSYLLDITEEDCFSLFPLETLVYLTPDSEHALEDVDLNKVYILGGLVDESIQK 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 KVTFQKAREYSVKTARLPIQEYMVRNQNGKNYHSEILAINQVFDILSTYLETHNWPEALK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 KVTFQKAREYSVKTARLPIQEYMVRNQNGKNYHSEILAINQVFDILSTYLETHNWPEALK 250 260 270 280 290 300 310 pF1KE3 KGVSSGKGYILRNSVE :::::::::::::::: CCDS43 KGVSSGKGYILRNSVE 310 >>CCDS69601.1 TRMT10B gene_id:158234|Hs108|chr9 (221 aa) initn: 1446 init1: 1446 opt: 1446 Z-score: 1589.9 bits: 301.9 E(32554): 3.1e-82 Smith-Waterman score: 1446; 100.0% identity (100.0% similar) in 218 aa overlap (99-316:4-221) 70 80 90 100 110 120 pF1KE3 QRHWEKIVAAKKSKRKQEKERRKANRAENPGICPQHSKRFLRALTKDKLLEAKHSGPRLC :::::::::::::::::::::::::::::: CCDS69 MVLGICPQHSKRFLRALTKDKLLEAKHSGPRLC 10 20 30 130 140 150 160 170 180 pF1KE3 IDLSMTHYMSKKELSRLAGQIRRLYGSNKKADRPFWICLTGFTTDSPLYEECVRMNDGFS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 IDLSMTHYMSKKELSRLAGQIRRLYGSNKKADRPFWICLTGFTTDSPLYEECVRMNDGFS 40 50 60 70 80 90 190 200 210 220 230 240 pF1KE3 SYLLDITEEDCFSLFPLETLVYLTPDSEHALEDVDLNKVYILGGLVDESIQKKVTFQKAR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 SYLLDITEEDCFSLFPLETLVYLTPDSEHALEDVDLNKVYILGGLVDESIQKKVTFQKAR 100 110 120 130 140 150 250 260 270 280 290 300 pF1KE3 EYSVKTARLPIQEYMVRNQNGKNYHSEILAINQVFDILSTYLETHNWPEALKKGVSSGKG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 EYSVKTARLPIQEYMVRNQNGKNYHSEILAINQVFDILSTYLETHNWPEALKKGVSSGKG 160 170 180 190 200 210 310 pF1KE3 YILRNSVE :::::::: CCDS69 YILRNSVE 220 >>CCDS69598.1 TRMT10B gene_id:158234|Hs108|chr9 (256 aa) initn: 1510 init1: 947 opt: 947 Z-score: 1043.6 bits: 201.0 E(32554): 8.2e-52 Smith-Waterman score: 1558; 81.0% identity (81.0% similar) in 316 aa overlap (1-316:1-256) 10 20 30 40 50 60 pF1KE3 MDWKLEGSTQKVESPVLQGQEGILEETGEDGLPEGFQLLQIDAEGECQEGEILATGSTAW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 MDWKLEGSTQKVESPVLQGQEGILEETGEDGLPEGFQLLQIDAEGECQEGEILATGSTAW 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 CSKNVQRKQRHWEKIVAAKKSKRKQEKERRKANRAENPGICPQHSKRFLRALTKDKLLEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 CSKNVQRKQRHWEKIVAAKKSKRKQEKERRKANRAENPGICPQHSKRFLRALTKDKLLEA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 KHSGPRLCIDLSMTHYMSKKELSRLAGQIRRLYGSNKKADRPFWICLTGFTTDSPLYEEC :::::::::::::::::::: CCDS69 KHSGPRLCIDLSMTHYMSKK---------------------------------------- 130 140 190 200 210 220 230 240 pF1KE3 VRMNDGFSSYLLDITEEDCFSLFPLETLVYLTPDSEHALEDVDLNKVYILGGLVDESIQK ::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 -----------LDITEEDCFSLFPLETLVYLTPDSEHALEDVDLNKVYILGGLVDESIQK 150 160 170 180 250 260 270 280 290 300 pF1KE3 KVTFQKAREYSVKTARLPIQEYMVRNQNGKNYHSEILAINQVFDILSTYLETHNWPEALK ::::::::::::::::::::::::::::::::::::::::: :::::::::: CCDS69 KVTFQKAREYSVKTARLPIQEYMVRNQNGKNYHSEILAINQ---------ETHNWPEALK 190 200 210 220 230 240 310 pF1KE3 KGVSSGKGYILRNSVE :::::::::::::::: CCDS69 KGVSSGKGYILRNSVE 250 >>CCDS69600.1 TRMT10B gene_id:158234|Hs108|chr9 (229 aa) initn: 930 init1: 930 opt: 930 Z-score: 1025.8 bits: 197.5 E(32554): 8.1e-51 Smith-Waterman score: 1332; 72.5% identity (72.5% similar) in 316 aa overlap (1-316:1-229) 10 20 30 40 50 60 pF1KE3 MDWKLEGSTQKVESPVLQGQEGILEETGEDGLPEGFQLLQIDAEGECQEGEILATGSTAW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 MDWKLEGSTQKVESPVLQGQEGILEETGEDGLPEGFQLLQIDAEGECQEGEILATGSTAW 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 CSKNVQRKQRHWEKIVAAKKSKRKQEKERRKANRAENPGICPQHSKRFLRALTKDKLLEA :: CCDS69 CS---------------------------------------------------------- 130 140 150 160 170 180 pF1KE3 KHSGPRLCIDLSMTHYMSKKELSRLAGQIRRLYGSNKKADRPFWICLTGFTTDSPLYEEC :::::::::::::::::::::::::::::::::::::::: CCDS69 --------------------ELSRLAGQIRRLYGSNKKADRPFWICLTGFTTDSPLYEEC 70 80 90 100 190 200 210 220 230 240 pF1KE3 VRMNDGFSSYLLDITEEDCFSLFPLETLVYLTPDSEHALEDVDLNKVYILGGLVDESIQK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 VRMNDGFSSYLLDITEEDCFSLFPLETLVYLTPDSEHALEDVDLNKVYILGGLVDESIQK 110 120 130 140 150 160 250 260 270 280 290 300 pF1KE3 KVTFQKAREYSVKTARLPIQEYMVRNQNGKNYHSEILAINQVFDILSTYLETHNWPEALK ::::::::::::::::::::::::::::::::::::::::: :::::::::: CCDS69 KVTFQKAREYSVKTARLPIQEYMVRNQNGKNYHSEILAINQ---------ETHNWPEALK 170 180 190 200 210 310 pF1KE3 KGVSSGKGYILRNSVE :::::::::::::::: CCDS69 KGVSSGKGYILRNSVE 220 >>CCDS3650.1 TRMT10A gene_id:93587|Hs108|chr4 (339 aa) initn: 448 init1: 173 opt: 405 Z-score: 449.5 bits: 91.5 E(32554): 1e-18 Smith-Waterman score: 405; 29.9% identity (62.4% similar) in 274 aa overlap (41-310:21-279) 20 30 40 50 60 pF1KE3 KVESPVLQGQEGILEETGEDGLPEGFQLLQIDAEGECQEGEILATGSTAWCSKNVQR--K :. . : .. :. : ...... : CCDS36 MSSEMLPAFIETSNVDKKQGINEDQEESQKPRLGEGCEPISKRQMKKLIK 10 20 30 40 50 70 80 90 100 110 120 pF1KE3 QRHWEKIVAAKKSKRKQEKERRKANRA-ENPGICPQHSKRFLRALTKDKLLEAKHSGPRL :..::. .:.:::....:.: .: . :... .: .: . :: :: CCDS36 QKQWEEQRELRKQKRKEKRKRKKLERQCQMEPNSDGHDRKRVR---RDVV----HSTLRL 60 70 80 90 100 130 140 150 160 170 180 pF1KE3 CIDLSMTHYMSKKELSRLAGQIRRLYGSNKKADRPFWICLTGFTTDSPLYEECVRMNDGF :: :. : : :....: ::.: :. :..: .: . :: . . : .. . . :. CCDS36 IIDCSFDHLMVLKDIKKLHKQIQRCYAENRRALHPVQFYLT--SHGGQLKKNMDENDKGW 110 120 130 140 150 160 190 200 210 220 230 240 pF1KE3 SSYL-LDITEEDCFSLFPLETLVYLTPDSEHALEDVDLNKVYILGGLVDESIQKKVTFQK .. . : : :. : :.::: :: . :...: .:.:..:::::.. .: .:... CCDS36 VNWKDIHIKPEHYSELIKKEDLIYLTSDSPNILKELDESKAYVIGGLVDHNHHKGLTYKQ 170 180 190 200 210 220 250 260 270 280 290 300 pF1KE3 AREYSVKTARLPIQEYMVRNQNGKNYHSEILAINQVFDILSTYLETHNWPEALKKGVSSG : .:... :.::. ... :. ..::.:.::.:. ::::..: ::. . . CCDS36 ASDYGINHAQLPLGNFVKMNSR------KVLAVNHVFEIILEYLETRDWQEAFFTILPQR 230 240 250 260 270 310 pF1KE3 KGYILRNSVE :: . CCDS36 KGAVPTDKACESASHDNQSVRMEEGGSDSDSSEEEYSRNELDSPHEEKQDKENHTESTVN 280 290 300 310 320 330 >>CCDS43122.1 TRMT10C gene_id:54931|Hs108|chr3 (403 aa) initn: 234 init1: 186 opt: 324 Z-score: 359.8 bits: 75.1 E(32554): 1e-13 Smith-Waterman score: 332; 28.7% identity (62.0% similar) in 258 aa overlap (61-300:123-374) 40 50 60 70 80 pF1KE3 GLPEGFQLLQIDAEGECQEGEILATGSTAWCSKNVQRKQRHWEKIVAAKKSK--RKQEKE : .:. .: .. . . . .: : :. .:: CCDS43 ATREFIEMWRLLGREVPEHITEEELKTLMECVSNTAKK-KYLKYLYTKEKVKKARQIKKE 100 110 120 130 140 150 90 100 110 120 130 pF1KE3 RRKANRAENPGI----CPQHSKR----FLRALTKD-------KLLEAKHSGPRLCIDLSM . : : : .: ...:. ::: .. : .: . : : .:... CCDS43 MKAAAREEAKNIKLLETTEEDKQKNFLFLRLWDRNMDIAMGWKGAQAMQFGQPLVFDMAY 160 170 180 190 200 210 140 150 160 170 180 190 pF1KE3 THYMSKKELSRLAGQIRRLYGSNKKADRPFWICLTGFTTDSPLYEECV-RMNDGFSSYLL .::..:::. ..:. . : :.. :: : . .. :. :..: : :... ... :: CCDS43 ENYMKRKELQNTVSQLLESEGWNRRNVDPFHIYFCNLKIDGALHRELVKRYQEKWDKLLL 220 230 240 250 260 270 200 210 220 230 240 250 pF1KE3 DITEEDCFSLFPLETLVYLTPDSEHALEDVDLNKVYILGGLVDESIQKKVTFQKAREYSV ::.. .::: ....::: :: ... .:::..:..::.:.: ... ::.. .. CCDS43 TSTEKSHVDLFPKDSIIYLTADSPNVMTTFRHDKVYVIGSFVDKSMQPGTSLAKAKRLNL 280 290 300 310 320 330 260 270 280 290 300 310 pF1KE3 KTARLPIQEYMVRNQNGKNYHSEILAINQVFDILSTYLETHNWPEALKKGVSSGKGYILR : ::...:. . ..:: :...:.. :: .. :: :::. CCDS43 ATECLPLDKYLQWEIGNKN-----LTLDQMIRILLCLKNNGNWQEALQFVPKRKHTGFLE 340 350 360 370 380 pF1KE3 NSVE CCDS43 ISQHSQEFINRLKKAKT 390 400 316 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 10:54:54 2016 done: Sun Nov 6 10:54:55 2016 Total Scan time: 2.670 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]