FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4514, 546 aa 1>>>pF1KE4514 546 - 546 aa - 546 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.3800+/-0.000295; mu= 14.9738+/- 0.019 mean_var=123.2379+/-24.784, 0's: 0 Z-trim(121.1): 80 B-trim: 0 in 0/61 Lambda= 0.115532 statistics sampled from 37024 (37133) to 37024 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.757), E-opt: 0.2 (0.435), width: 16 Scan time: 11.950 The best scores are: opt bits E(85289) NP_056480 (OMIM: 231550,605378) aladin isoform 1 [ ( 546) 3852 653.1 6.5e-187 XP_011537080 (OMIM: 231550,605378) PREDICTED: alad ( 560) 3122 531.4 2.8e-150 NP_001166937 (OMIM: 231550,605378) aladin isoform ( 513) 2580 441.0 4.1e-123 XP_011537082 (OMIM: 231550,605378) PREDICTED: alad ( 527) 1850 319.4 1.8e-86 XP_016865474 (OMIM: 606929) PREDICTED: THO complex ( 263) 183 41.2 0.0047 NP_115737 (OMIM: 606929) THO complex subunit 3 [Ho ( 351) 183 41.4 0.0058 >>NP_056480 (OMIM: 231550,605378) aladin isoform 1 [Homo (546 aa) initn: 3852 init1: 3852 opt: 3852 Z-score: 3476.7 bits: 653.1 E(85289): 6.5e-187 Smith-Waterman score: 3852; 100.0% identity (100.0% similar) in 546 aa overlap (1-546:1-546) 10 20 30 40 50 60 pF1KE4 MCSLGLFPPPPPRGQVTLYEHNNELVTGSSYESPPPDFRGQWINLPVLQLTKDPLKTPGR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_056 MCSLGLFPPPPPRGQVTLYEHNNELVTGSSYESPPPDFRGQWINLPVLQLTKDPLKTPGR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 LDHGTRTAFIHHREQVWKRCINIWRDVGLFGVLNEIANSEEEVFEWVKTASGWALALCRW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_056 LDHGTRTAFIHHREQVWKRCINIWRDVGLFGVLNEIANSEEEVFEWVKTASGWALALCRW 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 ASSLHGSLFPHLSLRSEDLIAEFAQVTNWSSCCLRVFAWHPHTNKFAVALLDDSVRVYNA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_056 ASSLHGSLFPHLSLRSEDLIAEFAQVTNWSSCCLRVFAWHPHTNKFAVALLDDSVRVYNA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 SSTIVPSLKHRLQRNVASLAWKPLSASVLAVACQSCILIWTLDPTSLSTRPSSGCAQVLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_056 SSTIVPSLKHRLQRNVASLAWKPLSASVLAVACQSCILIWTLDPTSLSTRPSSGCAQVLS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 HPGHTPVTSLAWAPSGGRLLSASPVDAAIRVWDVSTETCVPLPWFRGGGVTNLLWSPDGS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_056 HPGHTPVTSLAWAPSGGRLLSASPVDAAIRVWDVSTETCVPLPWFRGGGVTNLLWSPDGS 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 KILATTPSAVFRVWEAQMWTCERWPTLSGRCQTGCWSPDGSRLLFTVLGEPLIYSLSFPE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_056 KILATTPSAVFRVWEAQMWTCERWPTLSGRCQTGCWSPDGSRLLFTVLGEPLIYSLSFPE 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 RCGEGKGCVGGAKSATIVADLSETTIQTPDGEERLGGEAHSMVWDPSGERLAVLMKGKPR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_056 RCGEGKGCVGGAKSATIVADLSETTIQTPDGEERLGGEAHSMVWDPSGERLAVLMKGKPR 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE4 VQDGKPVILLFRTRNSPVFELLPCGIIQGEPGAQPQLITFHPSFNKGALLSVGWSTGRIA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_056 VQDGKPVILLFRTRNSPVFELLPCGIIQGEPGAQPQLITFHPSFNKGALLSVGWSTGRIA 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE4 HIPLYFVNAQFPRFSPVLGRAQEPPAGGGGSIHDLPLFTETSPTSAPWDPLPGPPPVLPH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_056 HIPLYFVNAQFPRFSPVLGRAQEPPAGGGGSIHDLPLFTETSPTSAPWDPLPGPPPVLPH 490 500 510 520 530 540 pF1KE4 SPHSHL :::::: NP_056 SPHSHL >>XP_011537080 (OMIM: 231550,605378) PREDICTED: aladin i (560 aa) initn: 3119 init1: 3119 opt: 3122 Z-score: 2818.9 bits: 531.4 E(85289): 2.8e-150 Smith-Waterman score: 3814; 97.5% identity (97.5% similar) in 560 aa overlap (1-546:1-560) 10 20 30 40 50 60 pF1KE4 MCSLGLFPPPPPRGQVTLYEHNNELVTGSSYESPPPDFRGQWINLPVLQLTKDPLKTPGR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 MCSLGLFPPPPPRGQVTLYEHNNELVTGSSYESPPPDFRGQWINLPVLQLTKDPLKTPGR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 LDHGTRTAFIHHREQVWKRCINIWRDVGLFGVLNEIANSEEEVFEWVKTASGWALALCRW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 LDHGTRTAFIHHREQVWKRCINIWRDVGLFGVLNEIANSEEEVFEWVKTASGWALALCRW 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 ASSLHGSLFPHLSLRSEDLIAEFAQVTNWSSCCLRVFAWHPHTNKFAVALLDDSVRVYNA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 ASSLHGSLFPHLSLRSEDLIAEFAQVTNWSSCCLRVFAWHPHTNKFAVALLDDSVRVYNA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 SSTIVPSLKHRLQRNVASLAWKPLSASVLAVACQSCILIWTLDPTSLSTRPSSGCAQVLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 SSTIVPSLKHRLQRNVASLAWKPLSASVLAVACQSCILIWTLDPTSLSTRPSSGCAQVLS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 HPGHTPVTSLAWAPSGGRLLSASPVDAAIRVWDVSTETCVPLPWFRGGGVTNLLWSPDGS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 HPGHTPVTSLAWAPSGGRLLSASPVDAAIRVWDVSTETCVPLPWFRGGGVTNLLWSPDGS 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 KILATTPSAVFRVWEAQMWTCERWPTLSGRCQTGCWSPDGSRLLFTVLGEPLIYSLSFPE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 KILATTPSAVFRVWEAQMWTCERWPTLSGRCQTGCWSPDGSRLLFTVLGEPLIYSLSFPE 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 RCGEGKGCVGGAKSATIVADLSETTIQTPDGEERLGGEAHSMVWDPSGERLAVLMKGKPR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 RCGEGKGCVGGAKSATIVADLSETTIQTPDGEERLGGEAHSMVWDPSGERLAVLMKGKPR 370 380 390 400 410 420 430 440 450 460 pF1KE4 VQDGKPVILLFRTRNSPVFELLPC--------------GIIQGEPGAQPQLITFHPSFNK :::::::::::::::::::::::: :::::::::::::::::::::: XP_011 VQDGKPVILLFRTRNSPVFELLPCSLLASGCLLTFSSSGIIQGEPGAQPQLITFHPSFNK 430 440 450 460 470 480 470 480 490 500 510 520 pF1KE4 GALLSVGWSTGRIAHIPLYFVNAQFPRFSPVLGRAQEPPAGGGGSIHDLPLFTETSPTSA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 GALLSVGWSTGRIAHIPLYFVNAQFPRFSPVLGRAQEPPAGGGGSIHDLPLFTETSPTSA 490 500 510 520 530 540 530 540 pF1KE4 PWDPLPGPPPVLPHSPHSHL :::::::::::::::::::: XP_011 PWDPLPGPPPVLPHSPHSHL 550 560 >>NP_001166937 (OMIM: 231550,605378) aladin isoform 2 [H (513 aa) initn: 3593 init1: 2562 opt: 2580 Z-score: 2331.2 bits: 441.0 E(85289): 4.1e-123 Smith-Waterman score: 3534; 94.0% identity (94.0% similar) in 546 aa overlap (1-546:1-513) 10 20 30 40 50 60 pF1KE4 MCSLGLFPPPPPRGQVTLYEHNNELVTGSSYESPPPDFRGQWINLPVLQLTKDPLKTPGR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MCSLGLFPPPPPRGQVTLYEHNNELVTGSSYESPPPDFRGQWINLPVLQLTKDPLKTPGR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 LDHGTRTAFIHHREQVWKRCINIWRDVGLFGVLNEIANSEEEVFEWVKTASGWALALCRW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 LDHGTRTAFIHHREQVWKRCINIWRDVGLFGVLNEIANSEEEVFEWVKTASGWALALCRW 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 ASSLHGSLFPHLSLRSEDLIAEFAQVTNWSSCCLRVFAWHPHTNKFAVALLDDSVRVYNA :::::::::::::::::::::::::::: : NP_001 ASSLHGSLFPHLSLRSEDLIAEFAQVTN---C---------------------------- 130 140 190 200 210 220 230 240 pF1KE4 SSTIVPSLKHRLQRNVASLAWKPLSASVLAVACQSCILIWTLDPTSLSTRPSSGCAQVLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 --TIVPSLKHRLQRNVASLAWKPLSASVLAVACQSCILIWTLDPTSLSTRPSSGCAQVLS 150 160 170 180 190 200 250 260 270 280 290 300 pF1KE4 HPGHTPVTSLAWAPSGGRLLSASPVDAAIRVWDVSTETCVPLPWFRGGGVTNLLWSPDGS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 HPGHTPVTSLAWAPSGGRLLSASPVDAAIRVWDVSTETCVPLPWFRGGGVTNLLWSPDGS 210 220 230 240 250 260 310 320 330 340 350 360 pF1KE4 KILATTPSAVFRVWEAQMWTCERWPTLSGRCQTGCWSPDGSRLLFTVLGEPLIYSLSFPE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 KILATTPSAVFRVWEAQMWTCERWPTLSGRCQTGCWSPDGSRLLFTVLGEPLIYSLSFPE 270 280 290 300 310 320 370 380 390 400 410 420 pF1KE4 RCGEGKGCVGGAKSATIVADLSETTIQTPDGEERLGGEAHSMVWDPSGERLAVLMKGKPR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 RCGEGKGCVGGAKSATIVADLSETTIQTPDGEERLGGEAHSMVWDPSGERLAVLMKGKPR 330 340 350 360 370 380 430 440 450 460 470 480 pF1KE4 VQDGKPVILLFRTRNSPVFELLPCGIIQGEPGAQPQLITFHPSFNKGALLSVGWSTGRIA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 VQDGKPVILLFRTRNSPVFELLPCGIIQGEPGAQPQLITFHPSFNKGALLSVGWSTGRIA 390 400 410 420 430 440 490 500 510 520 530 540 pF1KE4 HIPLYFVNAQFPRFSPVLGRAQEPPAGGGGSIHDLPLFTETSPTSAPWDPLPGPPPVLPH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 HIPLYFVNAQFPRFSPVLGRAQEPPAGGGGSIHDLPLFTETSPTSAPWDPLPGPPPVLPH 450 460 470 480 490 500 pF1KE4 SPHSHL :::::: NP_001 SPHSHL 510 >>XP_011537082 (OMIM: 231550,605378) PREDICTED: aladin i (527 aa) initn: 2860 init1: 1829 opt: 1850 Z-score: 1673.5 bits: 319.4 E(85289): 1.8e-86 Smith-Waterman score: 3496; 91.6% identity (91.6% similar) in 560 aa overlap (1-546:1-527) 10 20 30 40 50 60 pF1KE4 MCSLGLFPPPPPRGQVTLYEHNNELVTGSSYESPPPDFRGQWINLPVLQLTKDPLKTPGR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 MCSLGLFPPPPPRGQVTLYEHNNELVTGSSYESPPPDFRGQWINLPVLQLTKDPLKTPGR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 LDHGTRTAFIHHREQVWKRCINIWRDVGLFGVLNEIANSEEEVFEWVKTASGWALALCRW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 LDHGTRTAFIHHREQVWKRCINIWRDVGLFGVLNEIANSEEEVFEWVKTASGWALALCRW 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 ASSLHGSLFPHLSLRSEDLIAEFAQVTNWSSCCLRVFAWHPHTNKFAVALLDDSVRVYNA :::::::::::::::::::::::::::: : XP_011 ASSLHGSLFPHLSLRSEDLIAEFAQVTN---C---------------------------- 130 140 190 200 210 220 230 240 pF1KE4 SSTIVPSLKHRLQRNVASLAWKPLSASVLAVACQSCILIWTLDPTSLSTRPSSGCAQVLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 --TIVPSLKHRLQRNVASLAWKPLSASVLAVACQSCILIWTLDPTSLSTRPSSGCAQVLS 150 160 170 180 190 200 250 260 270 280 290 300 pF1KE4 HPGHTPVTSLAWAPSGGRLLSASPVDAAIRVWDVSTETCVPLPWFRGGGVTNLLWSPDGS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 HPGHTPVTSLAWAPSGGRLLSASPVDAAIRVWDVSTETCVPLPWFRGGGVTNLLWSPDGS 210 220 230 240 250 260 310 320 330 340 350 360 pF1KE4 KILATTPSAVFRVWEAQMWTCERWPTLSGRCQTGCWSPDGSRLLFTVLGEPLIYSLSFPE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 KILATTPSAVFRVWEAQMWTCERWPTLSGRCQTGCWSPDGSRLLFTVLGEPLIYSLSFPE 270 280 290 300 310 320 370 380 390 400 410 420 pF1KE4 RCGEGKGCVGGAKSATIVADLSETTIQTPDGEERLGGEAHSMVWDPSGERLAVLMKGKPR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 RCGEGKGCVGGAKSATIVADLSETTIQTPDGEERLGGEAHSMVWDPSGERLAVLMKGKPR 330 340 350 360 370 380 430 440 450 460 pF1KE4 VQDGKPVILLFRTRNSPVFELLPC--------------GIIQGEPGAQPQLITFHPSFNK :::::::::::::::::::::::: :::::::::::::::::::::: XP_011 VQDGKPVILLFRTRNSPVFELLPCSLLASGCLLTFSSSGIIQGEPGAQPQLITFHPSFNK 390 400 410 420 430 440 470 480 490 500 510 520 pF1KE4 GALLSVGWSTGRIAHIPLYFVNAQFPRFSPVLGRAQEPPAGGGGSIHDLPLFTETSPTSA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 GALLSVGWSTGRIAHIPLYFVNAQFPRFSPVLGRAQEPPAGGGGSIHDLPLFTETSPTSA 450 460 470 480 490 500 530 540 pF1KE4 PWDPLPGPPPVLPHSPHSHL :::::::::::::::::::: XP_011 PWDPLPGPPPVLPHSPHSHL 510 520 >>XP_016865474 (OMIM: 606929) PREDICTED: THO complex sub (263 aa) initn: 183 init1: 81 opt: 183 Z-score: 175.8 bits: 41.2 E(85289): 0.0047 Smith-Waterman score: 183; 29.4% identity (55.5% similar) in 119 aa overlap (243-360:9-125) 220 230 240 250 260 270 pF1KE4 CQSCILIWTLDPTSLSTRPSSGCAQVLSHPGH-TPVTSLAWAPSGGRLLSASPVDAAIRV :: : .: : ::. :. .. : .::. XP_016 MVKENNYRGHGDSVDQLCWHPSNPDLFVTASGDKTIRI 10 20 30 280 290 300 310 320 330 pF1KE4 WDVSTETCVPLPWFRGGGVTNLLWSPDGSKILATTPSAVFRVWEAQMWTCERWPTLSGRC ::: : :. .: .. :. :::::. : . . . : .:. . .. . XP_016 WDVRTTKCIATVNTKGENI-NICWSPDGQTIAVGNKDDVVTFIDAKTHRSKAEEQFKFEV 40 50 60 70 80 90 340 350 360 370 380 390 pF1KE4 QTGCWSPDGSRLLFTVLGEPLIYSLSFPERCGEGKGCVGGAKSATIVADLSETTIQTPDG . :. :.. ..: . :. : ::.:: XP_016 NEISWNNDNN-MFFLTNGNGCINILSYPELKPVQSINAHPSNCICIKFDPMGKYFATGSA 100 110 120 130 140 150 >>NP_115737 (OMIM: 606929) THO complex subunit 3 [Homo s (351 aa) initn: 163 init1: 81 opt: 183 Z-score: 174.2 bits: 41.4 E(85289): 0.0058 Smith-Waterman score: 183; 29.4% identity (55.5% similar) in 119 aa overlap (243-360:97-213) 220 230 240 250 260 270 pF1KE4 CQSCILIWTLDPTSLSTRPSSGCAQVLSHPGH-TPVTSLAWAPSGGRLLSASPVDAAIRV :: : .: : ::. :. .. : .::. NP_115 GRRLASGSFDKTASVFLLEKDRLVKENNYRGHGDSVDQLCWHPSNPDLFVTASGDKTIRI 70 80 90 100 110 120 280 290 300 310 320 330 pF1KE4 WDVSTETCVPLPWFRGGGVTNLLWSPDGSKILATTPSAVFRVWEAQMWTCERWPTLSGRC ::: : :. .: .. :. :::::. : . . . : .:. . .. . NP_115 WDVRTTKCIATVNTKGENI-NICWSPDGQTIAVGNKDDVVTFIDAKTHRSKAEEQFKFEV 130 140 150 160 170 180 340 350 360 370 380 390 pF1KE4 QTGCWSPDGSRLLFTVLGEPLIYSLSFPERCGEGKGCVGGAKSATIVADLSETTIQTPDG . :. :.. ..: . :. : ::.:: NP_115 NEISWNNDNN-MFFLTNGNGCINILSYPELKPVQSINAHPSNCICIKFDPMGKYFATGSA 190 200 210 220 230 240 546 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 00:20:01 2016 done: Sun Nov 6 00:20:03 2016 Total Scan time: 11.950 Total Display time: 0.040 Function used was FASTA [36.3.4 Apr, 2011]