FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3130, 323 aa 1>>>pF1KE3130 323 - 323 aa - 323 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.8327+/-0.000445; mu= 0.8679+/- 0.028 mean_var=227.9569+/-45.984, 0's: 0 Z-trim(117.3): 28 B-trim: 416 in 1/55 Lambda= 0.084947 statistics sampled from 29073 (29098) to 29073 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.686), E-opt: 0.2 (0.341), width: 16 Scan time: 8.480 The best scores are: opt bits E(85289) NP_057107 (OMIM: 612021) OTU domain-containing pro ( 323) 2081 267.7 2.3e-71 XP_011515431 (OMIM: 612021) PREDICTED: OTU domain- ( 192) 1228 163.0 4.5e-40 NP_001273674 (OMIM: 612021) OTU domain-containing ( 192) 1228 163.0 4.5e-40 NP_997203 (OMIM: 300714) OTU domain-containing pro ( 288) 1057 142.2 1.2e-33 >>NP_057107 (OMIM: 612021) OTU domain-containing protein (323 aa) initn: 2081 init1: 2081 opt: 2081 Z-score: 1402.3 bits: 267.7 E(85289): 2.3e-71 Smith-Waterman score: 2081; 100.0% identity (100.0% similar) in 323 aa overlap (1-323:1-323) 10 20 30 40 50 60 pF1KE3 MEPRVRVEGWKVPTSRCRFLLARVLGYLVVMEAVLTEELDEEEQLLRRHRKEKKELQAKI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_057 MEPRVRVEGWKVPTSRCRFLLARVLGYLVVMEAVLTEELDEEEQLLRRHRKEKKELQAKI 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 QGMKNAVPKNDKKRRKQLTEDVAKLEKEMEQKHREELEQLKLTTKENKIDSVAVNISNLV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_057 QGMKNAVPKNDKKRRKQLTEDVAKLEKEMEQKHREELEQLKLTTKENKIDSVAVNISNLV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 LENQPPRISKAQKRREKKAALEKEREERIAEAEIENLTGARHMESEKLAQILAARQLEIK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_057 LENQPPRISKAQKRREKKAALEKEREERIAEAEIENLTGARHMESEKLAQILAARQLEIK 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 QIPSDGHCMYKAIEDQLKEKDCALTVVALRSQTAEYMQSHVEDFLPFLTNPNTGDMYTPE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_057 QIPSDGHCMYKAIEDQLKEKDCALTVVALRSQTAEYMQSHVEDFLPFLTNPNTGDMYTPE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 EFQKYCEDIVNTAAWGGQLELRALSHILQTPIEIIQADSPPIIVGEEYSKKPLILVYMRH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_057 EFQKYCEDIVNTAAWGGQLELRALSHILQTPIEIIQADSPPIIVGEEYSKKPLILVYMRH 250 260 270 280 290 300 310 320 pF1KE3 AYGLGEHYNSVTRLVNIVTENCS ::::::::::::::::::::::: NP_057 AYGLGEHYNSVTRLVNIVTENCS 310 320 >>XP_011515431 (OMIM: 612021) PREDICTED: OTU domain-cont (192 aa) initn: 1228 init1: 1228 opt: 1228 Z-score: 840.4 bits: 163.0 E(85289): 4.5e-40 Smith-Waterman score: 1228; 99.5% identity (100.0% similar) in 189 aa overlap (135-323:4-192) 110 120 130 140 150 160 pF1KE3 KENKIDSVAVNISNLVLENQPPRISKAQKRREKKAALEKEREERIAEAEIENLTGARHME .::::::::::::::::::::::::::::: XP_011 MISKEKKAALEKEREERIAEAEIENLTGARHME 10 20 30 170 180 190 200 210 220 pF1KE3 SEKLAQILAARQLEIKQIPSDGHCMYKAIEDQLKEKDCALTVVALRSQTAEYMQSHVEDF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 SEKLAQILAARQLEIKQIPSDGHCMYKAIEDQLKEKDCALTVVALRSQTAEYMQSHVEDF 40 50 60 70 80 90 230 240 250 260 270 280 pF1KE3 LPFLTNPNTGDMYTPEEFQKYCEDIVNTAAWGGQLELRALSHILQTPIEIIQADSPPIIV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 LPFLTNPNTGDMYTPEEFQKYCEDIVNTAAWGGQLELRALSHILQTPIEIIQADSPPIIV 100 110 120 130 140 150 290 300 310 320 pF1KE3 GEEYSKKPLILVYMRHAYGLGEHYNSVTRLVNIVTENCS ::::::::::::::::::::::::::::::::::::::: XP_011 GEEYSKKPLILVYMRHAYGLGEHYNSVTRLVNIVTENCS 160 170 180 190 >>NP_001273674 (OMIM: 612021) OTU domain-containing prot (192 aa) initn: 1228 init1: 1228 opt: 1228 Z-score: 840.4 bits: 163.0 E(85289): 4.5e-40 Smith-Waterman score: 1228; 99.5% identity (100.0% similar) in 189 aa overlap (135-323:4-192) 110 120 130 140 150 160 pF1KE3 KENKIDSVAVNISNLVLENQPPRISKAQKRREKKAALEKEREERIAEAEIENLTGARHME .::::::::::::::::::::::::::::: NP_001 MISKEKKAALEKEREERIAEAEIENLTGARHME 10 20 30 170 180 190 200 210 220 pF1KE3 SEKLAQILAARQLEIKQIPSDGHCMYKAIEDQLKEKDCALTVVALRSQTAEYMQSHVEDF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 SEKLAQILAARQLEIKQIPSDGHCMYKAIEDQLKEKDCALTVVALRSQTAEYMQSHVEDF 40 50 60 70 80 90 230 240 250 260 270 280 pF1KE3 LPFLTNPNTGDMYTPEEFQKYCEDIVNTAAWGGQLELRALSHILQTPIEIIQADSPPIIV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 LPFLTNPNTGDMYTPEEFQKYCEDIVNTAAWGGQLELRALSHILQTPIEIIQADSPPIIV 100 110 120 130 140 150 290 300 310 320 pF1KE3 GEEYSKKPLILVYMRHAYGLGEHYNSVTRLVNIVTENCS ::::::::::::::::::::::::::::::::::::::: NP_001 GEEYSKKPLILVYMRHAYGLGEHYNSVTRLVNIVTENCS 160 170 180 190 >>NP_997203 (OMIM: 300714) OTU domain-containing protein (288 aa) initn: 1056 init1: 507 opt: 1057 Z-score: 724.8 bits: 142.2 E(85289): 1.2e-33 Smith-Waterman score: 1057; 56.7% identity (85.5% similar) in 275 aa overlap (41-314:7-275) 20 30 40 50 60 70 pF1KE3 KVPTSRCRFLLARVLGYLVVMEAVLTEELDEEEQLLRRHRKEKKELQAKIQGMKNAVPKN :....::::..:..::::.:...::.:::. NP_997 MDDPKSEQQRILRRHQRERQELQAQIRSLKNSVPKT 10 20 30 80 90 100 110 120 130 pF1KE3 DKKRRKQLTEDVAKLEKEMEQKHREELEQLKLTTKENKIDSVAVNISNLVLENQPPRISK :: .:::: .:::..: :: ::::.:::... ...:.::. ..... :::.::: :: NP_997 DKTKRKQLLQDVARMEAEMAQKHRQELEKFQ---DDSSIESVVEDLAKMNLENRPPRSSK 40 50 60 70 80 90 140 150 160 170 180 pF1KE3 AQKRREKKAALEKEREERIAEAEI-ENLTGARHMESEKLAQILAARQLEIKQIPSDGHCM :...::. . :.::.: : .::. :.:.: .. : :::: ::.:: ::.: ::.::::: NP_997 AHRKRERMESEERERQESIFQAEMSEHLAGFKREEEEKLAAILGARGLEMKAIPADGHCM 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE3 YKAIEDQLKEKDCALTVVALRSQTAEYMQSHVEDFLPFLTNPNTGDMYTPEEFQKYCEDI :.::.::: ...: :: .:: ::..::..::::..::.:.: . ..:. ::..: NP_997 YRAIQDQLV---FSVSVEMLRCRTASYMKKHVDEFLPFFSNPETSDSFGYDDFMIYCDNI 160 170 180 190 200 210 250 260 270 280 290 300 pF1KE3 VNTAAWGGQLELRALSHILQTPIEIIQADSPPIIVGEEYSKKPLILVYMRHAYGLGEHYN : :.:::::::::::::.:.::::.:::::: .:.:::: :::.::::.:.::.:::::: NP_997 VRTTAWGGQLELRALSHVLKTPIEVIQADSPTLIIGEEYVKKPIILVYLRYAYSLGEHYN 220 230 240 250 260 270 310 320 pF1KE3 SVTRLVNIVTENCS ::: : NP_997 SVTPLEAGAAGGVLPRLL 280 323 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 16:05:43 2016 done: Sun Nov 6 16:05:44 2016 Total Scan time: 8.480 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]