FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE4488, 510 aa
1>>>pF1KE4488 510 - 510 aa - 510 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.0117+/-0.00092; mu= 11.1584+/- 0.055
mean_var=131.8774+/-26.370, 0's: 0 Z-trim(109.9): 17 B-trim: 0 in 0/52
Lambda= 0.111683
statistics sampled from 11223 (11228) to 11223 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.703), E-opt: 0.2 (0.345), width: 16
Scan time: 3.370
The best scores are: opt bits E(32554)
CCDS11974.1 LMAN1 gene_id:3998|Hs108|chr18 ( 510) 3454 568.0 8.5e-162
CCDS10270.1 LMAN1L gene_id:79748|Hs108|chr15 ( 526) 1000 172.7 9.3e-43
CCDS4417.1 LMAN2 gene_id:10960|Hs108|chr5 ( 356) 570 103.2 4.9e-22
CCDS2023.1 LMAN2L gene_id:81562|Hs108|chr2 ( 348) 528 96.5 5.3e-20
>>CCDS11974.1 LMAN1 gene_id:3998|Hs108|chr18 (510 aa)
initn: 3454 init1: 3454 opt: 3454 Z-score: 3018.2 bits: 568.0 E(32554): 8.5e-162
Smith-Waterman score: 3454; 99.8% identity (100.0% similar) in 510 aa overlap (1-510:1-510)
10 20 30 40 50 60
pF1KE4 MAGSRQRGLRARVRPLFCALLLSLGRFVRGDGVGGDPAVALPHRRFEYKYSFKGPHLVQS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MAGSRQRGLRARVRPLFCALLLSLGRFVRGDGVGGDPAVALPHRRFEYKYSFKGPHLVQS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 DGTVPFWAHAGNAIPSSDQIRVAPSLKSQRGSVWTKTKAAFENWEVEVTFRVTGRGRIGA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 DGTVPFWAHAGNAIPSSDQIRVAPSLKSQRGSVWTKTKAAFENWEVEVTFRVTGRGRIGA
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 DGLAIWYAENQGLEGPVFGSADLWNGVGIFFDSFDNDGKKNNPAIVIIGNNGQIHYDHQN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 DGLAIWYAENQGLEGPVFGSADLWNGVGIFFDSFDNDGKKNNPAIVIIGNNGQIHYDHQN
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 DGASQALASCQRDFRNKPYPVRAKITYYQNTLTVMINNGFTPDKNDYEFCAKVENMIIPA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 DGASQALASCQRDFRNKPYPVRAKITYYQNTLTVMINNGFTPDKNDYEFCAKVENMIIPA
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 QGHFGISAATGGLADDHDVLSFLTFQLTEPGKEPPTPDKEISEKEKEKYQEEFEHFQQEL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 QGHFGISAATGGLADDHDVLSFLTFQLTEPGKEPPTPDKEISEKEKEKYQEEFEHFQQEL
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE4 DKKKEEFQKGHPDLQGQPAEEIFESVGDRELRQVFEGQNRIHLEIKQLNRQLDMILDEQR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 DKKKEEFQKGHPDLQGQPAEEIFESVGDRELRQVFEGQNRIHLEIKQLNRQLDMILDEQR
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE4 RYVSSLTEEISKRGAGMPGQHGQITQQELDTVVKTQHEILRQVNEMKNSLSETVRLVSGM
:::::::::::::::::::::::::::::::::::::::::::::::::.::::::::::
CCDS11 RYVSSLTEEISKRGAGMPGQHGQITQQELDTVVKTQHEILRQVNEMKNSMSETVRLVSGM
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE4 QHPGSAGGVYETTQHFIDIKEHLHIVKRDIDNLVQRNMPSNEKPKCPELPPFPSCLSTVH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 QHPGSAGGVYETTQHFIDIKEHLHIVKRDIDNLVQRNMPSNEKPKCPELPPFPSCLSTVH
430 440 450 460 470 480
490 500 510
pF1KE4 FIIFVVVQTVLFIGYIMYRSQQEAAAKKFF
::::::::::::::::::::::::::::::
CCDS11 FIIFVVVQTVLFIGYIMYRSQQEAAAKKFF
490 500 510
>>CCDS10270.1 LMAN1L gene_id:79748|Hs108|chr15 (526 aa)
initn: 900 init1: 435 opt: 1000 Z-score: 881.1 bits: 172.7 E(32554): 9.3e-43
Smith-Waterman score: 1000; 35.5% identity (66.3% similar) in 501 aa overlap (15-501:9-486)
10 20 30 40 50 60
pF1KE4 MAGSRQRGLRARVRPLFCALLLSLGRFVRGDGVGGDPAVALPHRRFEYKYSFKGPHLVQS
:::: ::: : : : : :::::: :::::.:.
CCDS10 MPAVSGPGPLFCLLLLLLDPHSPETGC---P----PLRRFEYKLSFKGPRLALP
10 20 30 40
70 80 90 100 110 120
pF1KE4 DGTVPFWAHAGNAIPSSDQIRVAPSLKSQRGSVWTKTKAAFENWEVEVTFRVTGRGRIGA
. .:::.: :.:: . ...:..::.... :.::..... : ::::: .:::: :: ::
CCDS10 GAGIPFWSHHGDAILGLEEVRLTPSMRNRSGAVWSRASVPFSAWEVEVQMRVTGLGRRGA
50 60 70 80 90 100
130 140 150 160 170 180
pF1KE4 DGLAIWYAENQGLEGPVFGSADLWNGVGIFFDSFDNDGKKNNPAIVIIGNNGQIHYDHQN
.:.:.::....: : :.:. :.:.:::::: .: ...::: .....:.: .. .
CCDS10 QGMAVWYTRGRGHVGSVLGGLASWDGIGIFFDSPAED-TQDSPAIRVLASDGHIPSEQPG
110 120 130 140 150 160
190 200 210 220 230 240
pF1KE4 DGASQALASCQRDFRNKPYPVRAKITYYQNTLTVMINNGFTPDKNDYEFCAKVENMIIPA
:::::.:.::. ::::.:.: ::.:::. . : . .:.:.::. . :::. : ...
CCDS10 DGASQGLGSCHWDFRNRPHPFRARITYWGQRLRMSLNSGLTPS-DPGEFCVDVGPLLLVP
170 180 190 200 210 220
250 260 270 280 290
pF1KE4 QGHFGISAATGGLADDHDVLSFLTFQLTEPGKE-PPTPDKEISEKEKEKYQEEFEHFQQE
: ::.::::: :::::::::::::.:.::. : :: : :. .. . : : .
CCDS10 GGFFGVSAATGTLADDHDVLSFLTFSLSEPSPEVPPQPFLEM-QQLRLARQLEGLWARLG
230 240 250 260 270 280
300 310 320 330 340 350
pF1KE4 LDKKKEEFQKGHPDLQGQPAEEIF---ESVG-DRELRQVFEGQNRIHLEIKQLNRQLDMI
: ... :. . ::. .:..: :..: :.. :...: .. .. : .::
CCDS10 LGTREDVTPKSDSEAQGE-GERLFDLEETLGRHRRILQALRGLSK---QLAQAERQWKKQ
290 300 310 320 330 340
360 370 380 390 400
pF1KE4 LDE--QRRYVSSLTEEISKRGAGMPGQHGQITQQ------ELDTVVKTQHEILRQVNEMK
: : : .. . . : . . ::. :..... .. .... : .:. ..::.
CCDS10 LGPPGQARPDGGWALDASCQIPSTPGRGGHLSMSLNKDSAKVGALLHGQWTLLQALQEMR
350 360 370 380 390 400
410 420 430 440 450 460
pF1KE4 NSLSETVRLVSGMQHPGSAGGVYETTQHFIDIKEHLHIVKRDIDNLVQRNMPSNEKPKCP
.. .::... : :. .::... . : ...... . .. . . :. :
CCDS10 DA---AVRMAAEAQVSYLPVGI---EHHFLELDHILGLLQEELRGPAK---AAAKAPRPP
410 420 430 440 450
470 480 490 500 510
pF1KE4 ELPPFPS-CLSTVHFIIFVVVQTVLFIGYIMYRSQQEAAAKKFF
:: : ::. :......::: :.::. .:..
CCDS10 GQPPRASSCLQPGIFLFYLLIQTVGFFGYVHFRQELNKSLQECLSTGSLPLGPAPHTPRA
460 470 480 490 500 510
CCDS10 LGILRRQPLPASMPA
520
>>CCDS4417.1 LMAN2 gene_id:10960|Hs108|chr5 (356 aa)
initn: 408 init1: 160 opt: 570 Z-score: 509.1 bits: 103.2 E(32554): 4.9e-22
Smith-Waterman score: 570; 33.2% identity (64.8% similar) in 301 aa overlap (15-310:32-316)
10 20 30 40
pF1KE4 MAGSRQRGLRARVRPLFCALLLSLGRFVRGDGVGGDPAVALPHR
::: ::: :: : .: . :. :
CCDS44 AAEGWIWRWGWGRRCLGRPGLLGPGPGPTTPLF--LLLLLGS-VTADITDGNS----EHL
10 20 30 40 50
50 60 70 80 90 100
pF1KE4 RFEYKYSFKGPHLVQSDGTVPFWAHAGNAIPSSDQIRVAPSLKSQRGSVWTKTKAAFENW
. :. :. :. .....:.: :... .:. .:..:. .:..::.:.. ...:
CCDS44 KREH--SLIKPYQGVGSSSMPLWDFQGSTMLTSQYVRLTPDERSKEGSIWNHQPCFLKDW
60 70 80 90 100 110
110 120 130 140 150 160
pF1KE4 EVEVTFRVTGRGR--IGADGLAIWYAENQGLEGPVFGSADLWNGVGIFFDSFDND--GKK
:..: :.: : :. . .::.:.::.... . :::::: : ..:..::.:.. :: ..
CCDS44 EMHVHFKVHGTGKKNLHGDGIALWYTRDRLVPGPVFGSKDNFHGLAIFLDTYPNDETTER
120 130 140 150 160 170
170 180 190 200 210 220
pF1KE4 NNPAIVIIGNNGQIHYDHQNDGASQALASCQRDFRNKPYPVRAKITYYQNTLTVMINNGF
: : .. :::.. :::..:: ::.: ::::. . . . : .. :::: .
CCDS44 VFPYISVMVNNGSLSYDHSKDGRWTELAGCTADFRNRDHDTFLAVRYSRGRLTVMTD---
180 190 200 210 220
230 240 250 260 270 280
pF1KE4 TPDKNDYEFCAKVENMIIPAQGHFGISAATGGLADDHDVLSFLTFQLTEPGKEPPTPDKE
:::... : . .. .:. .:: ::.:: :.:.::..:. ::: :::.:
CCDS44 LEDKNEWKNCIDITGVRLPTGYYFGASAGTGDLSDNHDIISMKLFQLMV----EHTPDEE
230 240 250 260 270 280
290 300 310 320 330
pF1KE4 ISEKEKEKYQEEF-EHFQQELDKKKEEFQKGHPDLQGQPAEEIFESVGDRELRQVFEGQN
. : . . .: . ....: .:..:
CCDS44 SIDWTKIEPSVNFLKSPKDNVDDPTGNFRSGPLTGWRVFLLLLCALLGIVVCAVVGAVVF
290 300 310 320 330 340
340 350 360 370 380 390
pF1KE4 RIHLEIKQLNRQLDMILDEQRRYVSSLTEEISKRGAGMPGQHGQITQQELDTVVKTQHEI
CCDS44 QKRQERNKRFY
350
>>CCDS2023.1 LMAN2L gene_id:81562|Hs108|chr2 (348 aa)
initn: 265 init1: 180 opt: 528 Z-score: 472.6 bits: 96.5 E(32554): 5.3e-20
Smith-Waterman score: 528; 33.6% identity (62.7% similar) in 271 aa overlap (6-268:15-275)
10 20 30 40
pF1KE4 MAGSRQRGLRARVRPLFCALLLSLGRFVRGDGVGGDPAVALPHRRFEY---
.: : :: . ::: :: :.: : . . :::
CCDS20 MAATLGPLGSWQQWRRCLSARDGSRMLLLLLLLGS---GQG----PQQVGAGQTFEYLKR
10 20 30 40 50
50 60 70 80 90 100
pF1KE4 KYSFKGPHLVQSDGTVPFWAHAGNAIPSSDQIRVAPSLKSQRGSVWTKTKAAFENWEVEV
..:.. :. . :. .: :::. .. ::..:...:..:..:... ...::..:
CCDS20 EHSLSKPYQGVGTGSSSLWNLMGNAMVMTQYIRLTPDMQSKQGALWNRVPCFLRDWELQV
60 70 80 90 100 110
110 120 130 140 150 160
pF1KE4 TFRVTGRGR--IGADGLAIWYAENQGLEGPVFGSADLWNGVGIFFDSFDNDGKKNN---P
:.. :.:. . .:::::::.... :::::. : . :.:.: :.. :. :... :
CCDS20 HFKIHGQGKKNLHGDGLAIWYTKDRMQPGPVFGNMDKFVGLGVFVDTYPNEEKQQERVFP
120 130 140 150 160 170
170 180 190 200 210 220
pF1KE4 AIVIIGNNGQIHYDHQNDGASQALASCQRDFRNKPYPVRAKITYYQNTLTVMINNGFTPD
: . :::.. :::. :: :..: :: : . : : . ::.:..
CCDS20 YISAMVNNGSLSYDHERDGRPTELGGCTAIVRNLHYDTFLVIRYVKRHLTIMMD---IDG
180 190 200 210 220 230
230 240 250 260 270 280
pF1KE4 KNDYEFCAKVENMIIPAQGHFGISAATGGLADDHDVLSFLTFQLTEPGKEPPTPDKEISE
:.... : .: .. .: .:: :. :: :.:.:::.:. :.::
CCDS20 KHEWRDCIEVPGVRLPRGYYFGTSSITGDLSDNHDVISLKLFELTVERTPEEEKLHRDVF
240 250 260 270 280 290
290 300 310 320 330 340
pF1KE4 KEKEKYQEEFEHFQQELDKKKEEFQKGHPDLQGQPAEEIFESVGDRELRQVFEGQNRIHL
CCDS20 LPSVDNMKLPEMTAPLPPLSGLALFLIVFFSLVFSVFAIVIGIILYNKWQEQSRKRFY
300 310 320 330 340
510 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 00:48:31 2016 done: Sun Nov 6 00:48:31 2016
Total Scan time: 3.370 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]