FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4473, 493 aa 1>>>pF1KE4473 493 - 493 aa - 493 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.1083+/-0.000422; mu= 19.2846+/- 0.026 mean_var=69.4096+/-14.432, 0's: 0 Z-trim(109.8): 20 B-trim: 1051 in 1/54 Lambda= 0.153945 statistics sampled from 17984 (17991) to 17984 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.578), E-opt: 0.2 (0.211), width: 16 Scan time: 8.900 The best scores are: opt bits E(85289) NP_000276 (OMIM: 170100,613230) xaa-Pro dipeptidas ( 493) 3352 754.1 2e-217 NP_001159529 (OMIM: 170100,613230) xaa-Pro dipepti ( 429) 2466 557.3 3.1e-158 NP_001159528 (OMIM: 170100,613230) xaa-Pro dipepti ( 452) 1881 427.4 4.2e-119 NP_071381 (OMIM: 613159,613553) probable Xaa-Pro a ( 507) 464 112.7 2.5e-24 NP_003390 (OMIM: 300145,300909) xaa-Pro aminopepti ( 674) 149 42.8 0.0036 >>NP_000276 (OMIM: 170100,613230) xaa-Pro dipeptidase is (493 aa) initn: 3352 init1: 3352 opt: 3352 Z-score: 4024.4 bits: 754.1 E(85289): 2e-217 Smith-Waterman score: 3352; 100.0% identity (100.0% similar) in 493 aa overlap (1-493:1-493) 10 20 30 40 50 60 pF1KE4 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 TGVLFRQESFFHWAFGVTEPGCYGVIDVDTGKSTLFVPRLPASHATWMGKIHSKEHFKEK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 TGVLFRQESFFHWAFGVTEPGCYGVIDVDTGKSTLFVPRLPASHATWMGKIHSKEHFKEK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 YAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 YAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEI 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 VECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKEYELESLFEHYCYSRGGMRHSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 VECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKEYELESLFEHYCYSRGGMRHSS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 YTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 YTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTA 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 DQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 DQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLG 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 AVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 AVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE4 LLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 LLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMA 430 440 450 460 470 480 490 pF1KE4 GCDKAFTPFSGPK ::::::::::::: NP_000 GCDKAFTPFSGPK 490 >>NP_001159529 (OMIM: 170100,613230) xaa-Pro dipeptidase (429 aa) initn: 2456 init1: 2456 opt: 2466 Z-score: 2961.8 bits: 557.3 E(85289): 3.1e-158 Smith-Waterman score: 2761; 87.0% identity (87.0% similar) in 493 aa overlap (1-493:1-429) 10 20 30 40 50 60 pF1KE4 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 TGVLFRQESFFHWAFGVTEPGCYGVIDVDTGKSTLFVPRLPASHATWMGKIHSKEHFKEK ::::::: NP_001 TGVLFRQ----------------------------------------------------- 130 140 150 160 170 180 pF1KE4 YAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEI ::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 -----------IASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEI 70 80 90 100 110 190 200 210 220 230 240 pF1KE4 VECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKEYELESLFEHYCYSRGGMRHSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 VECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKEYELESLFEHYCYSRGGMRHSS 120 130 140 150 160 170 250 260 270 280 290 300 pF1KE4 YTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 YTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTA 180 190 200 210 220 230 310 320 330 340 350 360 pF1KE4 DQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 DQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLG 240 250 260 270 280 290 370 380 390 400 410 420 pF1KE4 AVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 AVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH 300 310 320 330 340 350 430 440 450 460 470 480 pF1KE4 LLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 LLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMA 360 370 380 390 400 410 490 pF1KE4 GCDKAFTPFSGPK ::::::::::::: NP_001 GCDKAFTPFSGPK 420 >>NP_001159528 (OMIM: 170100,613230) xaa-Pro dipeptidase (452 aa) initn: 1879 init1: 1879 opt: 1881 Z-score: 2259.3 bits: 427.4 E(85289): 4.2e-119 Smith-Waterman score: 3009; 91.7% identity (91.7% similar) in 493 aa overlap (1-493:1-452) 10 20 30 40 50 60 pF1KE4 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 TGVLFRQESFFHWAFGVTEPGCYGVIDVDTGKSTLFVPRLPASHATWMGKIHSKEHFKEK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 TGVLFRQESFFHWAFGVTEPGCYGVIDVDTGKSTLFVPRLPASHATWMGKIHSKEHFKEK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 YAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 YAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEI 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 VECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKEYELESLFEHYCYSRGGMRHSS ::: :::::::::::::::: NP_001 VEC-----------------------------------------LFEHYCYSRGGMRHSS 190 250 260 270 280 290 300 pF1KE4 YTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 YTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTA 200 210 220 230 240 250 310 320 330 340 350 360 pF1KE4 DQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 DQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLG 260 270 280 290 300 310 370 380 390 400 410 420 pF1KE4 AVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 AVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH 320 330 340 350 360 370 430 440 450 460 470 480 pF1KE4 LLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 LLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMA 380 390 400 410 420 430 490 pF1KE4 GCDKAFTPFSGPK ::::::::::::: NP_001 GCDKAFTPFSGPK 440 450 >>NP_071381 (OMIM: 613159,613553) probable Xaa-Pro amino (507 aa) initn: 383 init1: 130 opt: 464 Z-score: 557.7 bits: 112.7 E(85289): 2.5e-24 Smith-Waterman score: 573; 29.1% identity (56.6% similar) in 477 aa overlap (23-485:72-506) 10 20 30 40 pF1KE4 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGS---IVVLQ .:: :..: ..:. :.:. .:::. NP_071 PNRYLGQPSPFTHPHLLRPGEVTPGLSQVEYALRRHKLMSLIQKEAQGQSGTDQTVVVLS 50 60 70 80 90 100 50 60 70 80 90 100 pF1KE4 GGEETQRYCTDTGVLFRQESFFHWAFGVTEPGCYGVIDVDTGK------STLFVPRLPAS . : . .: :.:.. : . : :: :.. :: . ::::: : NP_071 N--PTYYMSNDIPYTFHQDNNFLYLCGFQEPDSILVLQSLPGKQLPSHKAILFVPRRDPS 110 120 130 140 150 110 120 130 140 150 160 pF1KE4 HATWMGKIHSKEHFKEKYAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASF . : : . . .::.. ..:. .: ..: . .. : . . . NP_071 RELWDGPRSGTDGAIALTGVDEAYTLEEFQHLLPKMKAETNMVWYDWMRPSHAQLHSDYM 160 170 180 190 200 210 170 180 190 200 210 pF1KE4 DGIS--KFEVNNTI--LHPEIVECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKE . .. : . .: . .. : . :..:. :.: .. ..:..:.: :.: . :. ..: NP_071 QPLTEAKAKSKNKVRGVQQLIQRLRLIKSPAEIERMQIAGKLTSQAFIETMFTSKAPVEE 220 230 240 250 260 270 220 230 240 250 260 270 pF1KE4 YELESLFEHYCYSRGGMRHSSYTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGG : . :: : .::. .: . ..:. : .::: . :.. :..:.: :.: : NP_071 AFLYAKFEFECRARGA-DILAYPPVVAGGNRSNTLHY----VKNNQLIKDGEMVLLDGGC 280 290 300 310 320 330 280 290 300 310 320 330 pF1KE4 EYYCFASDITCSFPANGKFTADQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLE : :..:::: ..:.::.::: : .:::::. .: .. ::. ... . . . NP_071 ESSCYVSDITRTWPVNGRFTAPQAELYEAVLEIQRDCLALCFPGTSLENIYSMMLTLIGQ 340 350 360 370 380 390 340 350 360 370 380 390 pF1KE4 ELAHMGILSGSVDAMVQAHLGAVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRT .: .::.. .. . . . :: .::.::.::::. .: ::: NP_071 KLKDLGIMK-NIKENNAFKAARKYCPHHVGHYLGMDVHDTPDMP-----------RSL-- 400 410 420 430 440 400 410 420 430 440 450 pF1KE4 ARHLQPGMVLTVEPGIYFIDHLLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVT- ::::::.:.:::::. . .... ..:::.: ::::.::::: NP_071 --PLQPGMVITIEPGIYIPED---------------DKDAPEKFRGLG-VRIEDDVVVTQ 450 460 470 480 460 470 480 490 pF1KE4 DSGIELLTCVPRTVEEIEACMAGCDKAFTPFSGPK :: . : . :. ...:: :..: NP_071 DSPLILSADCPKEMNDIEQI---CSQAS 490 500 >>NP_003390 (OMIM: 300145,300909) xaa-Pro aminopeptidase (674 aa) initn: 138 init1: 68 opt: 149 Z-score: 177.9 bits: 42.8 E(85289): 0.0036 Smith-Waterman score: 149; 27.5% identity (52.1% similar) in 211 aa overlap (240-437:415-616) 210 220 230 240 250 260 pF1KE4 MKAVKVGMKEYELESLFEHYCYSRGGMRHSSYTCICGSGENSAVLHYGHAGAPNDRTIQN :. : .:: :.:. ::. . : : ... NP_003 KNVPKGTVDEFSGAEIVDKFRGEEQFSSGPSFETISASGLNAALAHYSPTKELN-RKLSS 390 400 410 420 430 440 270 280 290 300 310 320 pF1KE4 GDMCLFDMGGEYYCFASDITCSFPANGKFTADQKAVYEAVLRS----SRAVMGAMKPGVW .: :.: ::.:. ..::: . : .: :: .: :: . :: .. : : NP_003 DEMYLLDSGGQYWDGTTDITRTVHW-GTPSAFQKEAYTRVLIGNIDLSRLIFPAATSGRM 450 460 470 480 490 500 330 340 350 360 370 pF1KE4 WPDMHRLADRIHLEELAHMGILSG-SVDAMVQAHLGAV-FMPHGL----GHFLGIDVHDV .. .: : . ..: .: .. .. .: : :. ... : : .: . NP_003 ---VEAFARRALWDAGLNYGHGTGHGIGNFLCVHEWPVGFQSNNIAMAKGMFTSI---EP 510 520 530 540 550 380 390 400 410 420 430 pF1KE4 GGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH---LLDEALADPARASFLN : : .: : . . :. :: :: : . :. . :.: .: .: . ..:: NP_003 GYYKDGEFGIRLEDVALVVEAKTKYPGSYLTFEV-VSFVPYDRNLIDVSLLSPEHLQYLN 560 570 580 590 600 610 440 450 460 470 480 490 pF1KE4 REVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMAGCDKAFTPFSGPK : NP_003 RYYQTIREKVGPELQRRQLLEEFEWLQQHTEPLAARAPDTASWASVLVVSTLAILGWSV 620 630 640 650 660 670 493 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 00:53:33 2016 done: Sun Nov 6 00:53:34 2016 Total Scan time: 8.900 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]