FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5213, 244 aa 1>>>pF1KE5213 244 - 244 aa - 244 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.2581+/-0.000338; mu= 14.3444+/- 0.021 mean_var=62.6481+/-13.127, 0's: 0 Z-trim(113.6): 28 B-trim: 0 in 0/55 Lambda= 0.162039 statistics sampled from 23011 (23039) to 23011 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.654), E-opt: 0.2 (0.27), width: 16 Scan time: 6.710 The best scores are: opt bits E(85289) NP_056158 (OMIM: 610684) CTD nuclear envelope phos ( 244) 1623 387.9 8.7e-108 NP_001137247 (OMIM: 610684) CTD nuclear envelope p ( 244) 1623 387.9 8.7e-108 NP_001008393 (OMIM: 608592) CTD small phosphatase- ( 276) 462 116.5 4.9e-26 XP_016861009 (OMIM: 608592) PREDICTED: CTD small p ( 282) 462 116.5 5e-26 XP_016861008 (OMIM: 608592) PREDICTED: CTD small p ( 297) 462 116.5 5.2e-26 NP_005799 (OMIM: 608592) CTD small phosphatase-lik ( 265) 457 115.3 1.1e-25 NP_001193807 (OMIM: 605323) carboxy-terminal domai ( 260) 436 110.4 3.1e-24 NP_872580 (OMIM: 605323) carboxy-terminal domain R ( 260) 436 110.4 3.1e-24 XP_011509872 (OMIM: 605323) PREDICTED: carboxy-ter ( 393) 436 110.5 4.5e-24 NP_067021 (OMIM: 605323) carboxy-terminal domain R ( 261) 432 109.5 6e-24 XP_011509871 (OMIM: 605323) PREDICTED: carboxy-ter ( 394) 432 109.6 8.5e-24 NP_005721 (OMIM: 608711) carboxy-terminal domain R ( 271) 415 105.5 9.8e-23 XP_005268613 (OMIM: 608711) PREDICTED: carboxy-ter ( 277) 415 105.5 9.9e-23 XP_016860107 (OMIM: 605323) PREDICTED: carboxy-ter ( 275) 370 95.0 1.5e-19 XP_016860105 (OMIM: 605323) PREDICTED: carboxy-ter ( 408) 370 95.1 2e-19 XP_016860106 (OMIM: 605323) PREDICTED: carboxy-ter ( 276) 366 94.1 2.8e-19 XP_016860104 (OMIM: 605323) PREDICTED: carboxy-ter ( 409) 366 94.1 3.9e-19 NP_001316488 (OMIM: 607381) mitochondrial import i ( 240) 218 59.4 6.4e-09 XP_011525793 (OMIM: 607381) PREDICTED: mitochondri ( 341) 218 59.5 8.7e-09 NP_001001563 (OMIM: 607381) mitochondrial import i ( 456) 218 59.6 1.1e-08 NP_001189433 (OMIM: 604168,604927) RNA polymerase ( 842) 162 46.6 0.00016 XP_011524563 (OMIM: 604168,604927) PREDICTED: RNA ( 851) 162 46.6 0.00016 NP_430255 (OMIM: 604168,604927) RNA polymerase II ( 867) 162 46.6 0.00017 NP_001305440 (OMIM: 604168,604927) RNA polymerase ( 867) 162 46.6 0.00017 NP_004706 (OMIM: 604168,604927) RNA polymerase II ( 961) 162 46.6 0.00018 XP_016881567 (OMIM: 604168,604927) PREDICTED: RNA ( 776) 141 41.7 0.0046 >>NP_056158 (OMIM: 610684) CTD nuclear envelope phosphat (244 aa) initn: 1623 init1: 1623 opt: 1623 Z-score: 2056.1 bits: 387.9 E(85289): 8.7e-108 Smith-Waterman score: 1623; 100.0% identity (100.0% similar) in 244 aa overlap (1-244:1-244) 10 20 30 40 50 60 pF1KE5 MMRTQCLLGLRTFVAFAAKLWSFFIYLLRRQIRTVIQYQTVRYDILPLSPVSRNRLAQVK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_056 MMRTQCLLGLRTFVAFAAKLWSFFIYLLRRQIRTVIQYQTVRYDILPLSPVSRNRLAQVK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 RKILVLDLDETLIHSHHDGVLRPTVRPGTPPDFILKVVIDKHPVRFFVHKRPHVDFFLEV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_056 RKILVLDLDETLIHSHHDGVLRPTVRPGTPPDFILKVVIDKHPVRFFVHKRPHVDFFLEV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 VSQWYELVVFTASMEIYGSAVADKLDNSRSILKRRYYRQHCTLELGSYIKDLSVVHSDLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_056 VSQWYELVVFTASMEIYGSAVADKLDNSRSILKRRYYRQHCTLELGSYIKDLSVVHSDLS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 SIVILDNSPGAYRSHPDNAIPIKSWFSDPSDTALLNLLPMLDALRFTADVRSVLSRNLHQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_056 SIVILDNSPGAYRSHPDNAIPIKSWFSDPSDTALLNLLPMLDALRFTADVRSVLSRNLHQ 190 200 210 220 230 240 pF1KE5 HRLW :::: NP_056 HRLW >>NP_001137247 (OMIM: 610684) CTD nuclear envelope phosp (244 aa) initn: 1623 init1: 1623 opt: 1623 Z-score: 2056.1 bits: 387.9 E(85289): 8.7e-108 Smith-Waterman score: 1623; 100.0% identity (100.0% similar) in 244 aa overlap (1-244:1-244) 10 20 30 40 50 60 pF1KE5 MMRTQCLLGLRTFVAFAAKLWSFFIYLLRRQIRTVIQYQTVRYDILPLSPVSRNRLAQVK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MMRTQCLLGLRTFVAFAAKLWSFFIYLLRRQIRTVIQYQTVRYDILPLSPVSRNRLAQVK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 RKILVLDLDETLIHSHHDGVLRPTVRPGTPPDFILKVVIDKHPVRFFVHKRPHVDFFLEV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 RKILVLDLDETLIHSHHDGVLRPTVRPGTPPDFILKVVIDKHPVRFFVHKRPHVDFFLEV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 VSQWYELVVFTASMEIYGSAVADKLDNSRSILKRRYYRQHCTLELGSYIKDLSVVHSDLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 VSQWYELVVFTASMEIYGSAVADKLDNSRSILKRRYYRQHCTLELGSYIKDLSVVHSDLS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 SIVILDNSPGAYRSHPDNAIPIKSWFSDPSDTALLNLLPMLDALRFTADVRSVLSRNLHQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 SIVILDNSPGAYRSHPDNAIPIKSWFSDPSDTALLNLLPMLDALRFTADVRSVLSRNLHQ 190 200 210 220 230 240 pF1KE5 HRLW :::: NP_001 HRLW >>NP_001008393 (OMIM: 608592) CTD small phosphatase-like (276 aa) initn: 443 init1: 237 opt: 462 Z-score: 588.4 bits: 116.5 E(85289): 4.9e-26 Smith-Waterman score: 501; 41.4% identity (69.7% similar) in 198 aa overlap (45-236:84-272) 20 30 40 50 60 pF1KE5 AFAAKLWSFFIYLLRRQIRTVIQYQTVRYDILPL-SPVSRNRLAQVK-----RKILVLDL ..:. :: .. : .: .: .:.:: NP_001 NVEAPPPSSPSVLPPLVEENGGLQKGDQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDL 60 70 80 90 100 110 70 80 90 100 110 120 pF1KE5 DETLIHSHHDGVLRPTVRPGTPPDFILKVVIDKHPVRFFVHKRPHVDFFLEVVSQWYELV ::::.:: . .: . :::. : :: . .: :::::: ::. ..: .: : NP_001 DETLVHS--------SFKPISNADFIVPVEIDGTIHQVYVLKRPHVDEFLQRMGQLFECV 120 130 140 150 160 130 140 150 160 170 180 pF1KE5 VFTASMEIYGSAVADKLDNSRSILKRRYYRQHCTLELGSYIKDLSVVHSDLSSIVILDNS .::::. :.. ::: :: .... : .:. :... :.:.:::: . .::...:.::: NP_001 LFTASLAKYADPVADLLDR-WGVFRARLFRESCVFHRGNYVKDLSRLGRELSKVIIVDNS 170 180 190 200 210 220 190 200 210 220 230 240 pF1KE5 PGAYRSHPDNAIPIKSWFSDPSDTALLNLLPMLDALRFTADVRSVLSRNLHQHRLW :..: ::.::.:..:::.: .:: ::.:.:....: :: :.: : NP_001 PASYIFHPENAVPVQSWFDDMTDTELLDLIPFFEGLSREDDVYSMLHRLCNR 230 240 250 260 270 >>XP_016861009 (OMIM: 608592) PREDICTED: CTD small phosp (282 aa) initn: 443 init1: 237 opt: 462 Z-score: 588.3 bits: 116.5 E(85289): 5e-26 Smith-Waterman score: 501; 41.4% identity (69.7% similar) in 198 aa overlap (45-236:69-257) 20 30 40 50 60 pF1KE5 AFAAKLWSFFIYLLRRQIRTVIQYQTVRYDILPL-SPVSRNRLAQVK-----RKILVLDL ..:. :: .. : .: .: .:.:: XP_016 NVEAPPPSSPSVLPPLVEENGGLQKGDQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDL 40 50 60 70 80 90 70 80 90 100 110 120 pF1KE5 DETLIHSHHDGVLRPTVRPGTPPDFILKVVIDKHPVRFFVHKRPHVDFFLEVVSQWYELV ::::.:: . .: . :::. : :: . .: :::::: ::. ..: .: : XP_016 DETLVHS--------SFKPISNADFIVPVEIDGTIHQVYVLKRPHVDEFLQRMGQLFECV 100 110 120 130 140 150 130 140 150 160 170 180 pF1KE5 VFTASMEIYGSAVADKLDNSRSILKRRYYRQHCTLELGSYIKDLSVVHSDLSSIVILDNS .::::. :.. ::: :: .... : .:. :... :.:.:::: . .::...:.::: XP_016 LFTASLAKYADPVADLLDR-WGVFRARLFRESCVFHRGNYVKDLSRLGRELSKVIIVDNS 160 170 180 190 200 190 200 210 220 230 240 pF1KE5 PGAYRSHPDNAIPIKSWFSDPSDTALLNLLPMLDALRFTADVRSVLSRNLHQHRLW :..: ::.::.:..:::.: .:: ::.:.:....: :: :.: : XP_016 PASYIFHPENAVPVQSWFDDMTDTELLDLIPFFEGLSREDDVYSMLHRLCNSWIHMELFG 210 220 230 240 250 260 XP_016 NFPCCYRAATKWN 270 280 >>XP_016861008 (OMIM: 608592) PREDICTED: CTD small phosp (297 aa) initn: 443 init1: 237 opt: 462 Z-score: 588.0 bits: 116.5 E(85289): 5.2e-26 Smith-Waterman score: 501; 41.4% identity (69.7% similar) in 198 aa overlap (45-236:84-272) 20 30 40 50 60 pF1KE5 AFAAKLWSFFIYLLRRQIRTVIQYQTVRYDILPL-SPVSRNRLAQVK-----RKILVLDL ..:. :: .. : .: .: .:.:: XP_016 NVEAPPPSSPSVLPPLVEENGGLQKGDQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDL 60 70 80 90 100 110 70 80 90 100 110 120 pF1KE5 DETLIHSHHDGVLRPTVRPGTPPDFILKVVIDKHPVRFFVHKRPHVDFFLEVVSQWYELV ::::.:: . .: . :::. : :: . .: :::::: ::. ..: .: : XP_016 DETLVHS--------SFKPISNADFIVPVEIDGTIHQVYVLKRPHVDEFLQRMGQLFECV 120 130 140 150 160 130 140 150 160 170 180 pF1KE5 VFTASMEIYGSAVADKLDNSRSILKRRYYRQHCTLELGSYIKDLSVVHSDLSSIVILDNS .::::. :.. ::: :: .... : .:. :... :.:.:::: . .::...:.::: XP_016 LFTASLAKYADPVADLLDR-WGVFRARLFRESCVFHRGNYVKDLSRLGRELSKVIIVDNS 170 180 190 200 210 220 190 200 210 220 230 240 pF1KE5 PGAYRSHPDNAIPIKSWFSDPSDTALLNLLPMLDALRFTADVRSVLSRNLHQHRLW :..: ::.::.:..:::.: .:: ::.:.:....: :: :.: : XP_016 PASYIFHPENAVPVQSWFDDMTDTELLDLIPFFEGLSREDDVYSMLHRLCNSWIHMELFG 230 240 250 260 270 280 XP_016 NFPCCYRAATKWN 290 >>NP_005799 (OMIM: 608592) CTD small phosphatase-like pr (265 aa) initn: 443 init1: 237 opt: 457 Z-score: 582.4 bits: 115.3 E(85289): 1.1e-25 Smith-Waterman score: 494; 43.8% identity (72.2% similar) in 176 aa overlap (61-236:95-261) 40 50 60 70 80 90 pF1KE5 QIRTVIQYQTVRYDILPLSPVSRNRLAQVKRKILVLDLDETLIHSHHDGVLRPTVRPGTP .: .:.::::::.:: . .: . NP_005 VLPPLVEENGGLQKPPAKYLLPEVTVLDYGKKCVVIDLDETLVHS--------SFKPISN 70 80 90 100 110 100 110 120 130 140 150 pF1KE5 PDFILKVVIDKHPVRFFVHKRPHVDFFLEVVSQWYELVVFTASMEIYGSAVADKLDNSRS :::. : :: . .: :::::: ::. ..: .: :.::::. :.. ::: :: . NP_005 ADFIVPVEIDGTIHQVYVLKRPHVDEFLQRMGQLFECVLFTASLAKYADPVADLLDR-WG 120 130 140 150 160 170 160 170 180 190 200 210 pF1KE5 ILKRRYYRQHCTLELGSYIKDLSVVHSDLSSIVILDNSPGAYRSHPDNAIPIKSWFSDPS ... : .:. :... :.:.:::: . .::...:.::::..: ::.::.:..:::.: . NP_005 VFRARLFRESCVFHRGNYVKDLSRLGRELSKVIIVDNSPASYIFHPENAVPVQSWFDDMT 180 190 200 210 220 230 220 230 240 pF1KE5 DTALLNLLPMLDALRFTADVRSVLSRNLHQHRLW :: ::.:.:....: :: :.: : NP_005 DTELLDLIPFFEGLSREDDVYSMLHRLCNR 240 250 260 >>NP_001193807 (OMIM: 605323) carboxy-terminal domain RN (260 aa) initn: 476 init1: 283 opt: 436 Z-score: 556.0 bits: 110.4 E(85289): 3.1e-24 Smith-Waterman score: 478; 43.5% identity (68.4% similar) in 193 aa overlap (46-234:70-253) 20 30 40 50 60 70 pF1KE5 FAAKLWSFFIYLLRRQIRTVIQYQTVRYDILPLSPVSR---NRLAQVKRKI-LVLDLDET .: .::. . :: . :: .:.::::: NP_001 HSLFCCVCRDDGEALPAHSGAPLLVEENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDET 40 50 60 70 80 90 80 90 100 110 120 130 pF1KE5 LIHSHHDGVLRPTVRPGTPPDFILKVVIDKHPVRFFVHKRPHVDFFLEVVSQWYELVVFT :.:: . .: . :::. : :: . .: :::::: ::. ... .: :.:: NP_001 LVHS--------SFKPVNNADFIIPVEIDGVVHQVYVLKRPHVDEFLQRMGELFECVLFT 100 110 120 130 140 150 140 150 160 170 180 190 pF1KE5 ASMEIYGSAVADKLDNSRSILKRRYYRQHCTLELGSYIKDLSVVHSDLSSIVILDNSPGA ::. :.. ::: ::. . .. : .:. :... :.:.:::: . :: ..::::::.. NP_001 ASLAKYADPVADLLDK-WGAFRARLFRESCVFHRGNYVKDLSRLGRDLRRVLILDNSPAS 160 170 180 190 200 210 200 210 220 230 240 pF1KE5 YRSHPDNAIPIKSWFSDPSDTALLNLLPMLDALRFTADVRSVLSRNLHQHRLW : :::::.:. :::.. ::: : .:::... : . :: ::: NP_001 YVFHPDNAVPVASWFDNMSDTELHDLLPFFEQLSRVDDVYSVLRQPRPGS 220 230 240 250 260 >>NP_872580 (OMIM: 605323) carboxy-terminal domain RNA p (260 aa) initn: 476 init1: 283 opt: 436 Z-score: 556.0 bits: 110.4 E(85289): 3.1e-24 Smith-Waterman score: 478; 43.5% identity (68.4% similar) in 193 aa overlap (46-234:70-253) 20 30 40 50 60 70 pF1KE5 FAAKLWSFFIYLLRRQIRTVIQYQTVRYDILPLSPVSR---NRLAQVKRKI-LVLDLDET .: .::. . :: . :: .:.::::: NP_872 HSLFCCVCRDDGEALPAHSGAPLLVEENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDET 40 50 60 70 80 90 80 90 100 110 120 130 pF1KE5 LIHSHHDGVLRPTVRPGTPPDFILKVVIDKHPVRFFVHKRPHVDFFLEVVSQWYELVVFT :.:: . .: . :::. : :: . .: :::::: ::. ... .: :.:: NP_872 LVHS--------SFKPVNNADFIIPVEIDGVVHQVYVLKRPHVDEFLQRMGELFECVLFT 100 110 120 130 140 150 140 150 160 170 180 190 pF1KE5 ASMEIYGSAVADKLDNSRSILKRRYYRQHCTLELGSYIKDLSVVHSDLSSIVILDNSPGA ::. :.. ::: ::. . .. : .:. :... :.:.:::: . :: ..::::::.. NP_872 ASLAKYADPVADLLDK-WGAFRARLFRESCVFHRGNYVKDLSRLGRDLRRVLILDNSPAS 160 170 180 190 200 210 200 210 220 230 240 pF1KE5 YRSHPDNAIPIKSWFSDPSDTALLNLLPMLDALRFTADVRSVLSRNLHQHRLW : :::::.:. :::.. ::: : .:::... : . :: ::: NP_872 YVFHPDNAVPVASWFDNMSDTELHDLLPFFEQLSRVDDVYSVLRQPRPGS 220 230 240 250 260 >>XP_011509872 (OMIM: 605323) PREDICTED: carboxy-termina (393 aa) initn: 476 init1: 283 opt: 436 Z-score: 553.2 bits: 110.5 E(85289): 4.5e-24 Smith-Waterman score: 478; 43.5% identity (68.4% similar) in 193 aa overlap (46-234:203-386) 20 30 40 50 60 70 pF1KE5 FAAKLWSFFIYLLRRQIRTVIQYQTVRYDILPLSPVSR---NRLAQVKRKI-LVLDLDET .: .::. . :: . :: .:.::::: XP_011 HSLFCCVCRDDGEALPAHSGAPLLVEENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDET 180 190 200 210 220 230 80 90 100 110 120 130 pF1KE5 LIHSHHDGVLRPTVRPGTPPDFILKVVIDKHPVRFFVHKRPHVDFFLEVVSQWYELVVFT :.:: . .: . :::. : :: . .: :::::: ::. ... .: :.:: XP_011 LVHS--------SFKPVNNADFIIPVEIDGVVHQVYVLKRPHVDEFLQRMGELFECVLFT 240 250 260 270 280 140 150 160 170 180 190 pF1KE5 ASMEIYGSAVADKLDNSRSILKRRYYRQHCTLELGSYIKDLSVVHSDLSSIVILDNSPGA ::. :.. ::: ::. . .. : .:. :... :.:.:::: . :: ..::::::.. XP_011 ASLAKYADPVADLLDK-WGAFRARLFRESCVFHRGNYVKDLSRLGRDLRRVLILDNSPAS 290 300 310 320 330 340 200 210 220 230 240 pF1KE5 YRSHPDNAIPIKSWFSDPSDTALLNLLPMLDALRFTADVRSVLSRNLHQHRLW : :::::.:. :::.. ::: : .:::... : . :: ::: XP_011 YVFHPDNAVPVASWFDNMSDTELHDLLPFFEQLSRVDDVYSVLRQPRPGS 350 360 370 380 390 >>NP_067021 (OMIM: 605323) carboxy-terminal domain RNA p (261 aa) initn: 476 init1: 283 opt: 432 Z-score: 550.9 bits: 109.5 E(85289): 6e-24 Smith-Waterman score: 474; 45.3% identity (69.8% similar) in 179 aa overlap (57-234:85-254) 30 40 50 60 70 80 pF1KE5 LLRRQIRTVIQYQTVRYDILPLSPVSRNRLAQVKRKI-LVLDLDETLIHSHHDGVLRPTV :: . :: .:.::::::.:: . NP_067 PAHSGAPLLVEENGAIPKQTPVQYLLPEAKAQDSDKICVVIDLDETLVHS--------SF 60 70 80 90 100 90 100 110 120 130 140 pF1KE5 RPGTPPDFILKVVIDKHPVRFFVHKRPHVDFFLEVVSQWYELVVFTASMEIYGSAVADKL .: . :::. : :: . .: :::::: ::. ... .: :.::::. :.. ::: : NP_067 KPVNNADFIIPVEIDGVVHQVYVLKRPHVDEFLQRMGELFECVLFTASLAKYADPVADLL 110 120 130 140 150 160 150 160 170 180 190 200 pF1KE5 DNSRSILKRRYYRQHCTLELGSYIKDLSVVHSDLSSIVILDNSPGAYRSHPDNAIPIKSW :. . .. : .:. :... :.:.:::: . :: ..::::::..: :::::.:. :: NP_067 DK-WGAFRARLFRESCVFHRGNYVKDLSRLGRDLRRVLILDNSPASYVFHPDNAVPVASW 170 180 190 200 210 220 210 220 230 240 pF1KE5 FSDPSDTALLNLLPMLDALRFTADVRSVLSRNLHQHRLW :.. ::: : .:::... : . :: ::: NP_067 FDNMSDTELHDLLPFFEQLSRVDDVYSVLRQPRPGS 230 240 250 260 244 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 22:33:53 2016 done: Mon Nov 7 22:33:54 2016 Total Scan time: 6.710 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]