FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4382, 403 aa 1>>>pF1KE4382 403 - 403 aa - 403 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.8554+/-0.000386; mu= 19.4900+/- 0.024 mean_var=62.3872+/-12.803, 0's: 0 Z-trim(112.3): 4 B-trim: 1018 in 1/52 Lambda= 0.162378 statistics sampled from 21110 (21112) to 21110 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.633), E-opt: 0.2 (0.248), width: 16 Scan time: 7.100 The best scores are: opt bits E(85289) NP_002155 (OMIM: 147435) indoleamine 2,3-dioxygena ( 403) 2708 643.1 3.4e-184 NP_919270 (OMIM: 612129) indoleamine 2,3-dioxygena ( 420) 1140 275.8 1.3e-73 >>NP_002155 (OMIM: 147435) indoleamine 2,3-dioxygenase 1 (403 aa) initn: 2708 init1: 2708 opt: 2708 Z-score: 3427.8 bits: 643.1 E(85289): 3.4e-184 Smith-Waterman score: 2708; 100.0% identity (100.0% similar) in 403 aa overlap (1-403:1-403) 10 20 30 40 50 60 pF1KE4 MAHAMENSWTISKEYHIDEEVGFALPNPQENLPDFYNDWMFIAKHLPDLIESGQLRERVE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 MAHAMENSWTISKEYHIDEEVGFALPNPQENLPDFYNDWMFIAKHLPDLIESGQLRERVE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 KLNMLSIDHLTDHKSQRLARLVLGCITMAYVWGKGHGDVRKVLPRNIAVPYCQLSKKLEL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 KLNMLSIDHLTDHKSQRLARLVLGCITMAYVWGKGHGDVRKVLPRNIAVPYCQLSKKLEL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 PPILVYADCVLANWKKKDPNKPLTYENMDVLFSFRDGDCSKGFFLVSLLVEIAAASAIKV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 PPILVYADCVLANWKKKDPNKPLTYENMDVLFSFRDGDCSKGFFLVSLLVEIAAASAIKV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 IPTVFKAMQMQERDTLLKALLEIASCLEKALQVFHQIHDHVNPKAFFSVLRIYLSGWKGN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 IPTVFKAMQMQERDTLLKALLEIASCLEKALQVFHQIHDHVNPKAFFSVLRIYLSGWKGN 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 PQLSDGLVYEGFWEDPKEFAGGSAGQSSVFQCFDVLLGIQQTAGGGHAAQFLQDMRRYMP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 PQLSDGLVYEGFWEDPKEFAGGSAGQSSVFQCFDVLLGIQQTAGGGHAAQFLQDMRRYMP 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 PAHRNFLCSLESNPSVREFVLSKGDAGLREAYDACVKALVSLRSYHLQIVTKYILIPASQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 PAHRNFLCSLESNPSVREFVLSKGDAGLREAYDACVKALVSLRSYHLQIVTKYILIPASQ 310 320 330 340 350 360 370 380 390 400 pF1KE4 QPKENKTSEDPSKLEAKGTGGTDLMNFLKTVRSTTEKSLLKEG ::::::::::::::::::::::::::::::::::::::::::: NP_002 QPKENKTSEDPSKLEAKGTGGTDLMNFLKTVRSTTEKSLLKEG 370 380 390 400 >>NP_919270 (OMIM: 612129) indoleamine 2,3-dioxygenase 2 (420 aa) initn: 1145 init1: 815 opt: 1140 Z-score: 1442.4 bits: 275.8 E(85289): 1.3e-73 Smith-Waterman score: 1140; 44.3% identity (75.5% similar) in 388 aa overlap (15-400:32-416) 10 20 30 40 pF1KE4 MAHAMENSWTISKEYHIDEEVGFALPNPQENLPDFYNDWMFIAK :::.:: :: ::. ..::: : :: ::. NP_919 LHFHYYDTSNKIMEPHRPNVKTAVPLSLESYHISEEYGFLLPDSLKELPDHYRPWMEIAN 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE4 HLPDLIESGQLRERVEKLNMLSIDHLTDHKSQRLARLVLGCITMAYVWGKGHGDVRKVLP .::.::.. ::. .:.:. .:: . : :. ::::.:::. .::.::: .:... .::: NP_919 KLPQLIDAHQLQAHVDKMPLLSCQFLKGHREQRLAHLVLSFLTMGYVWQEGEAQPAEVLP 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE4 RNIAVPYCQLSKKLELPPILVYADCVLANWKKKDPNKPLTYENMDVLFSFRDGDCSKGFF ::.:.:. ..:..: ::::::..: ::.:: ::::. : :.....:: :. .::. NP_919 RNLALPFVEVSRNLGLPPILVHSDLVLTNWTKKDPDGFLEIGNLETIISFPGGESLHGFI 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE4 LVSLLVEIAAASAIKVIPTVFKAMQMQERDTLLKALLEIASCLEKALQVFHQIHDHVNPK ::. ::: :. .::.. . .:. . ....::.:: .. .. ... :.::.:.: NP_919 LVTALVEKEAVPGIKALVQATNAILQPNQEALLQALQRLRLSIQDITKTLGQMHDYVDPD 190 200 210 220 230 240 230 240 250 260 270 280 pF1KE4 AFFSVLRIYLSGWKGNPQLSDGLVYEGFWEDPKEFAGGSAGQSSVFQCFDVLLGIQQTAG :.. .::.::::: :: . ::.::: ..: ...::::.::.:.. :: .:::... NP_919 IFYAGIRIFLSGWKDNPAMPAGLMYEGVSQEPLKYSGGSAAQSTVLHAFDEFLGIRHSKE 250 260 270 280 290 300 290 300 310 320 330 340 pF1KE4 GGHAAQFLQDMRRYMPPAHRNFLCSLESNPSVREFVLSKGDAGLREAYDACVKALVSLRS .: .:: :: ::::.:. :. ...: ::.:...::.:. : ::. ::.::. ::: NP_919 SG---DFLYRMRDYMPPSHKAFIEDIHSAPSLRDYILSSGQDHLLTAYNQCVQALAELRS 310 320 330 340 350 350 360 370 380 390 400 pF1KE4 YHLQIVTKYILIPASQ--QPKENKTSEDPSKLEAKGTGGTDLMNFLKTVRSTTEKSLLKE ::. .::::.. :.. . : :. :. :. .::::: .:.:::.::. : .:.: NP_919 YHITMVTKYLITAAAKAKHGKPNHLPGPPQALKDRGTGGTAVMSFLKSVRDKTLESILHP 360 370 380 390 400 410 pF1KE4 G NP_919 RG 420 403 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 22:58:39 2016 done: Sat Nov 5 22:58:40 2016 Total Scan time: 7.100 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]