FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7682, 406 aa
1>>>pF1KB7682 406 - 406 aa - 406 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 14.3037+/-0.000452; mu= -22.0431+/- 0.028
mean_var=728.8475+/-155.715, 0's: 0 Z-trim(126.4): 166 B-trim: 858 in 1/60
Lambda= 0.047507
statistics sampled from 51931 (52184) to 51931 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.847), E-opt: 0.2 (0.612), width: 16
Scan time: 11.300
The best scores are:                                       opt  bits E(85289)
NP_612157 (OMIM: 606903) pygopus homolog 2 [Homo s ( 406) 3000 220.1 7.7e-57
NP_056432 (OMIM: 606902) pygopus homolog 1 isoform ( 419)  556  52.6 2.1e-06
NP_001317255 (OMIM: 606902) pygopus homolog 1 isof ( 419)  556  52.6 2.1e-06
XP_011519748 (OMIM: 606902) PREDICTED: pygopus hom ( 419)  556  52.6 2.1e-06
NP_006239 (OMIM: 168810) basic salivary proline-ri ( 416)  444  44.9 0.00043
NP_005030 (OMIM: 180989) basic salivary proline-ri ( 331)  394  41.4  0.0039
>>NP_612157 (OMIM: 606903) pygopus homolog 2 [Homo sapie (406 aa)
initn: 3000 init1: 3000 opt: 3000 Z-score: 1141.4 bits: 220.1 E(85289): 7.7e-57
Smith-Waterman score: 3000; 100.0% identity (100.0% similar) in 406 aa overlap (1-406:1-406)
10 20 30 40 50 60
pF1KB7 MAASAPPPPDKLEGGGGPAPPPAPPSTGRKQGKAGLQMKSPEKKRRKSNTQGPAYSHLTE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_612 MAASAPPPPDKLEGGGGPAPPPAPPSTGRKQGKAGLQMKSPEKKRRKSNTQGPAYSHLTE
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB7 FAPPPTPMVDHLVASNPFEDDFGAPKVGVAAPPFLGSPVPFGGFRVQGGMAGQVPPGYST
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_612 FAPPPTPMVDHLVASNPFEDDFGAPKVGVAAPPFLGSPVPFGGFRVQGGMAGQVPPGYST
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB7 GGGGGPQPLRRQPPPFPPNPMGPAFNMPPQGPGYPPPGNMNFPSQPFNQPLGQNFSPPSG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_612 GGGGGPQPLRRQPPPFPPNPMGPAFNMPPQGPGYPPPGNMNFPSQPFNQPLGQNFSPPSG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB7 QMMPGPVGGFGPMISPTMGQPPRAELGPPSLSQRFAQPGAPFGPSPLQRPGQGLPSLPPN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_612 QMMPGPVGGFGPMISPTMGQPPRAELGPPSLSQRFAQPGAPFGPSPLQRPGQGLPSLPPN
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB7 TSPFPGPDPGFPGPGGEDGGKPLNPPASTAFPQEPHSGSPAAAVNGNQPSFPPNSSGRGG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_612 TSPFPGPDPGFPGPGGEDGGKPLNPPASTAFPQEPHSGSPAAAVNGNQPSFPPNSSGRGG
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB7 GTPDANSLAPPGKAGGGSGPQPPPGLVYPCGACRSEVNDDQDAILCEASCQKWFHRECTG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_612 GTPDANSLAPPGKAGGGSGPQPPPGLVYPCGACRSEVNDDQDAILCEASCQKWFHRECTG
310 320 330 340 350 360
370 380 390 400
pF1KB7 MTESAYGLLTTEASAVWACDLCLKTKEIQSVYIREGMGQLVAANDG
::::::::::::::::::::::::::::::::::::::::::::::
NP_612 MTESAYGLLTTEASAVWACDLCLKTKEIQSVYIREGMGQLVAANDG
370 380 390 400
>>NP_056432 (OMIM: 606902) pygopus homolog 1 isoform 1 [ (419 aa)
initn: 634 init1: 407 opt: 556 Z-score: 236.0 bits: 52.6 E(85289): 2.1e-06
Smith-Waterman score: 874; 37.6% identity (59.2% similar) in 431 aa overlap (2-405:3-418)
10 20 30 40 50
pF1KB7 MAASAPPPPDKLEGGGGPAPPPAPPSTGRKQGKAGLQMKSPEKKRRKSNTQGPAYSHLT
: ..: : :. . :: : : :.:. ::.::.::.:::::.. :.
NP_056 MPAENSPAPAYKVSSHGGD-------SGLDGLGGPGVQLGSPDKKKRKANTQGPSFPPLS
10 20 30 40 50
60 70 80 90 100 110
pF1KB7 EFAPPPTPMVDHLVASNPFEDDFG--APKVGVAAPPFLGSPVP-FGGFRVQGGMAGQVPP
:.::::.: :::::.:::.:... . : .. :.:: : :::. . : .:::
NP_056 EYAPPPNPNSDHLVAANPFDDNYNTISYKPLPSSNPYLGPGYPGFGGYSTFR-MPPHVPP
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB7 GYSTGGGGGPQPLRRQPPPFPPNPMGPAFNMPPQGPGYPPPGNMNFPSQPFNQPLGQNFS
.:. : : :: :: ::: ::.: .:: : .. .. : : .: . .:. :.:: .
NP_056 RMSSPYCG-PYSLRNQPHPFPQNPLGMGFNRP-HAFNFGPHDNSSFGNPSYNNALSQNVN
120 130 140 150 160 170
180 190 200 210 220
pF1KB7 PPSGQMMPGPVGGFG---PMISPTMGQPPRAE----------LGPPSLSQRFAQPGAPFG
:. .. .:. .:. :. . ...: : .: .. : : ::
NP_056 MPNQHFRQNPAENFSQIPPQNASQVSNPDLASNFVPGNNSNFTSPLESNHSFIPPPNTFG
180 190 200 210 220 230
230 240 250 260 270
pF1KB7 ----PSPLQRPGQG-LPSLPPNTSPFPGPDPGFPGPGGEDGGKPLNPPASTAFPQEPHSG
: : : :: . :.: : : .. .... . : ..: :: .
NP_056 QAKAPPPKQDFTQGATKNTNQNSSAHP-PHLNMDDTVNQSNIELKNVNRNNAVNQENSRS
240 250 260 270 280
280 290 300 310 320 330
pF1KB7 SPAAAVNGNQPSFPPNSSGRGGGTPDANSLAPPGKAG------GGSGPQPPPGLVYPCGA
: . :.:.: . :. . :. :: . .:.. : :. .: :::::
NP_056 SSTEATNNNPANGTQNKPRQPRGAADACTTEKSNKSSLHPNRHGHSSSDP----VYPCGI
290 300 310 320 330 340
340 350 360 370 380 390
pF1KB7 CRSEVNDDQDAILCEASCQKWFHRECTGMTESAYGLLTTEASAVWACDLCLKTKEIQSVY
: .::::::::::::::::::::: ::::::.::::::.::::::.:: :. :..: .
NP_056 CTNEVNDDQDAILCEASCQKWFHRICTGMTETAYGLLTAEASAVWGCDTCMADKDVQLMR
350 360 370 380 390 400
400
pF1KB7 IREGMGQLVAANDG
:: .: ....:
NP_056 TRETFGPSAVGSDA
410
>>NP_001317255 (OMIM: 606902) pygopus homolog 1 isoform (419 aa)
initn: 634 init1: 407 opt: 556 Z-score: 236.0 bits: 52.6 E(85289): 2.1e-06
Smith-Waterman score: 865; 38.7% identity (60.8% similar) in 401 aa overlap (32-405:26-418)
10 20 30 40 50 60
pF1KB7 AASAPPPPDKLEGGGGPAPPPAPPSTGRKQGKAGLQMKSPEKKRRKSNTQGPAYSHLTEF
: :.:. ::.::.::.:::::.. :.:.
NP_001 MSAEQEKDPISLKRVRGGDSGLDGLGGPGVQLGSPDKKKRKANTQGPSFPPLSEY
10 20 30 40 50
70 80 90 100 110
pF1KB7 APPPTPMVDHLVASNPFEDDFG--APKVGVAAPPFLGSPVP-FGGFRVQGGMAGQVPPGY
::::.: :::::.:::.:... . : .. :.:: : :::. . : .::: .
NP_001 APPPNPNSDHLVAANPFDDNYNTISYKPLPSSNPYLGPGYPGFGGYSTFR-MPPHVPPRM
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB7 STGGGGGPQPLRRQPPPFPPNPMGPAFNMPPQGPGYPPPGNMNFPSQPFNQPLGQNFSPP
:. : : :: :: ::: ::.: .:: : .. .. : : .: . .:. :.:: . :
NP_001 SSPYCG-PYSLRNQPHPFPQNPLGMGFNRP-HAFNFGPHDNSSFGNPSYNNALSQNVNMP
120 130 140 150 160 170
180 190 200 210 220
pF1KB7 SGQMMPGPVGGFG---PMISPTMGQPPRAE----------LGPPSLSQRFAQPGAPFG--
. .. .:. .:. :. . ...: : .: .. : : ::
NP_001 NQHFRQNPAENFSQIPPQNASQVSNPDLASNFVPGNNSNFTSPLESNHSFIPPPNTFGQA
180 190 200 210 220 230
230 240 250 260 270 280
pF1KB7 --PSPLQRPGQG-LPSLPPNTSPFPGPDPGFPGPGGEDGGKPLNPPASTAFPQEPHSGSP
: : : :: . :.: : : .. .... . : ..: :: .:
NP_001 KAPPPKQDFTQGATKNTNQNSSAHP-PHLNMDDTVNQSNIELKNVNRNNAVNQENSRSSS
240 250 260 270 280 290
290 300 310 320 330
pF1KB7 AAAVNGNQPSFPPNSSGRGGGTPDANSLAPPGKAG------GGSGPQPPPGLVYPCGACR
. :.:.: . :. . :. :: . .:.. : :. .: ::::: :
NP_001 TEATNNNPANGTQNKPRQPRGAADACTTEKSNKSSLHPNRHGHSSSDP----VYPCGICT
300 310 320 330 340
340 350 360 370 380 390
pF1KB7 SEVNDDQDAILCEASCQKWFHRECTGMTESAYGLLTTEASAVWACDLCLKTKEIQSVYIR
.::::::::::::::::::::: ::::::.::::::.::::::.:: :. :..: . :
NP_001 NEVNDDQDAILCEASCQKWFHRICTGMTETAYGLLTAEASAVWGCDTCMADKDVQLMRTR
350 360 370 380 390 400
400
pF1KB7 EGMGQLVAANDG
: .: ....:
NP_001 ETFGPSAVGSDA
410
>>XP_011519748 (OMIM: 606902) PREDICTED: pygopus homolog (419 aa)
initn: 634 init1: 407 opt: 556 Z-score: 236.0 bits: 52.6 E(85289): 2.1e-06
Smith-Waterman score: 865; 38.7% identity (60.8% similar) in 401 aa overlap (32-405:26-418)
10 20 30 40 50 60
pF1KB7 AASAPPPPDKLEGGGGPAPPPAPPSTGRKQGKAGLQMKSPEKKRRKSNTQGPAYSHLTEF
: :.:. ::.::.::.:::::.. :.:.
XP_011 MSAEQEKDPISLKRVRGGDSGLDGLGGPGVQLGSPDKKKRKANTQGPSFPPLSEY
10 20 30 40 50
70 80 90 100 110
pF1KB7 APPPTPMVDHLVASNPFEDDFG--APKVGVAAPPFLGSPVP-FGGFRVQGGMAGQVPPGY
::::.: :::::.:::.:... . : .. :.:: : :::. . : .::: .
XP_011 APPPNPNSDHLVAANPFDDNYNTISYKPLPSSNPYLGPGYPGFGGYSTFR-MPPHVPPRM
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB7 STGGGGGPQPLRRQPPPFPPNPMGPAFNMPPQGPGYPPPGNMNFPSQPFNQPLGQNFSPP
:. : : :: :: ::: ::.: .:: : .. .. : : .: . .:. :.:: . :
XP_011 SSPYCG-PYSLRNQPHPFPQNPLGMGFNRP-HAFNFGPHDNSSFGNPSYNNALSQNVNMP
120 130 140 150 160 170
180 190 200 210 220
pF1KB7 SGQMMPGPVGGFG---PMISPTMGQPPRAE----------LGPPSLSQRFAQPGAPFG--
. .. .:. .:. :. . ...: : .: .. : : ::
XP_011 NQHFRQNPAENFSQIPPQNASQVSNPDLASNFVPGNNSNFTSPLESNHSFIPPPNTFGQA
180 190 200 210 220 230
230 240 250 260 270 280
pF1KB7 --PSPLQRPGQG-LPSLPPNTSPFPGPDPGFPGPGGEDGGKPLNPPASTAFPQEPHSGSP
: : : :: . :.: : : .. .... . : ..: :: .:
XP_011 KAPPPKQDFTQGATKNTNQNSSAHP-PHLNMDDTVNQSNIELKNVNRNNAVNQENSRSSS
240 250 260 270 280 290
290 300 310 320 330
pF1KB7 AAAVNGNQPSFPPNSSGRGGGTPDANSLAPPGKAG------GGSGPQPPPGLVYPCGACR
. :.:.: . :. . :. :: . .:.. : :. .: ::::: :
XP_011 TEATNNNPANGTQNKPRQPRGAADACTTEKSNKSSLHPNRHGHSSSDP----VYPCGICT
300 310 320 330 340
340 350 360 370 380 390
pF1KB7 SEVNDDQDAILCEASCQKWFHRECTGMTESAYGLLTTEASAVWACDLCLKTKEIQSVYIR
.::::::::::::::::::::: ::::::.::::::.::::::.:: :. :..: . :
XP_011 NEVNDDQDAILCEASCQKWFHRICTGMTETAYGLLTAEASAVWGCDTCMADKDVQLMRTR
350 360 370 380 390 400
400
pF1KB7 EGMGQLVAANDG
: .: ....:
XP_011 ETFGPSAVGSDA
410
>>NP_006239 (OMIM: 168810) basic salivary proline-rich p (416 aa)
initn: 223 init1: 223 opt: 444 Z-score: 194.5 bits: 44.9 E(85289): 0.00043
Smith-Waterman score: 482; 32.1% identity (47.3% similar) in 349 aa overlap (6-324:72-395)
10 20 30
pF1KB7 MAASAPPPPDKLEGGG--GPAPPPAPPSTGRKQGK
:::: : .: : : .:: :. ::
NP_006 QGGNKPQGPPSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQGGNKPQGPPPPGKPQGP
50 60 70 80 90 100
40 50 60 70 80 90
pF1KB7 A--GLQMKSPEKKRRKSNTQGPAYSHLTEFAPPPTPMVDHLVASNPFEDDFGAPKVGVAA
: . .::.. : . : .. . .::: : . .: . . :. :
NP_006 PPQGDKSRSPRSPPGKPQGPPPQGGNQPQ-GPPPPPGKPQ----GPPPQGGNKPQ-GPPP
110 120 130 140 150
100 110 120 130 140
pF1KB7 PPFLGSPVPFGGFRVQGGMAGQVPPGYSTGGG--GGPQPLRRQPPPFPPNPMGPAFNMPP
: .: : : . . ... ::: : :: :: . ::: : .:.:: ::
NP_006 PGKPQGPPPQGDNKSR---SSRSPPGKPQGPPPQGGNQP--QGPPPPPGKPQGP----PP
160 170 180 190 200
150 160 170 180 190 200
pF1KB7 QG---P-GYPPPGNMNFPSQPFNQPLGQNFSPPSGQMMPGPVGGFGPMISPTMGQPPRAE
:: : : ::::. . : .. . :::. . : : :: :. : ::
NP_006 QGGNKPQGPPPPGKPQGPPPQGDNKSQSARSPPGKPQGPPPQGGNQPQGPP---PPPGKP
210 220 230 240 250 260
210 220 230 240 250
pF1KB7 LGPPSLSQRFAQ----PGAPFGPSPL-------QRPGQGLPSLPP----NTSPFPGPDPG
::: . : :: : :: : .: : :. :: : : : ::
NP_006 QGPPPQGGNKPQGPPPPGKPQGPPPQGGSKSRSSRSPPGKPQGPPPQGGNQPQGPPPPPG
270 280 290 300 310 320
260 270 280 290 300
pF1KB7 FP-GPGGEDGGKPLNPPASTAFPQEPHSGSPAAAVNGNQPSFPPNSSGRGGGTPDA---N
: :: . :.:: .:: : .:.. : .. .. . :: :. : :. :
NP_006 KPQGPPPQGGNKPQGPPP----PGKPQGPPPQGGSKSRSARSPP---GKPQGPPQQEGNN
330 340 350 360 370
310 320 330 340 350 360
pF1KB7 SLAPPGKAGGG-SGPQPPPGLVYPCGACRSEVNDDQDAILCEASCQKWFHRECTGMTESA
.:: :::. . :: ::
NP_006 PQGPPPPAGGNPQQPQAPPAGQPQGPPRPPQGGRPSRPPQ
380 390 400 410
>>NP_005030 (OMIM: 180989) basic salivary proline-rich p (331 aa)
initn: 222 init1: 222 opt: 394 Z-score: 177.2 bits: 41.4 E(85289): 0.0039
Smith-Waterman score: 446; 34.2% identity (45.9% similar) in 342 aa overlap (1-324:31-310)
10 20 30
pF1KB7 MAASAPPPPDKLEGGGGPAPPPAPPSTGRK
. :. : :. .::. : :: :: :.
NP_005 MLLILLSVALLALSSAQNLNEDVSQEESPSLIAGNPQGPSP-QGGNKPQGPPPPP--GKP
10 20 30 40 50
40 50 60 70 80 90
pF1KB7 QGKAGLQMKSPEKKRRKSNTQGPAYSHLTEFAPPPTPMVDHLVASNPFEDDFGAPKVGVA
:: : . : ::: :: :. . : : .:.
NP_005 QGP-------PPQGGNK--PQGPP--------PPGKPQ-----GPPPQGDKSRSPR----
60 70 80 90
100 110 120 130 140
pF1KB7 APPFLGSPVPFGGFRVQGGMAGQVPPGYSTGGGGGPQPL---RRQPPPFPPNPMGPAFNM
.:: :.: : ::: : :: : :: : : : :: : .:.::
NP_005 SPP--GKP---QGPPPQGGNQPQGPPP-PPGKPQGPPPQGGNRPQGPPPPGKPQGP----
100 110 120 130 140
150 160 170 180 190 200
pF1KB7 PPQG-----PGYPPPGNMNFPSQPFNQPLGQNFSPPSGQMM-PGPVGGFGPMISPTMGQP
:::: : :: .. : : ::: : :: :. . : : :: :. : :.:
NP_005 PPQGDKSRSPRSPPGKPQGPPPQGGNQPQGP--PPPPGKPQGPPPQGGKKPQGPPPPGKP
150 160 170 180 190
210 220 230 240 250
pF1KB7 PRAELGPPSLSQ--RFAQ--PGAPFGPSPLQRPGQGLPSLPPNTSPFPGPDPGFP-GPGG
::: .. : .: :: : :: : : . :. :: : :: : ::
NP_005 Q----GPPPQGDKSRSSQSPPGKPQGPPP---QGGNQPQGPP-------PPPGKPQGPPP
200 210 220 230 240
260 270 280 290 300 310
pF1KB7 EDGGKPLNPPASTAFPQEPHSGSPAAAVNGNQPSFPPNSSGRGGGTPDA---NSLAPPGK
. :.:: .:: : .:. : :: . . .: . : :. : :. : .::
NP_005 QGGNKPQGPPP----PGKPQ-GPPAQGGSKSQSARSP--PGKPQGPPQQEGNNPQGPPPP
250 260 270 280 290
320 330 340 350 360 370
pF1KB7 AGGG-SGPQPPPGLVYPCGACRSEVNDDQDAILCEASCQKWFHRECTGMTESAYGLLTTE
:::. . :: ::
NP_005 AGGNPQQPQAPPAGQPQGPPRPPQGGRPSRPPQ
300 310 320 330
406 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 10:04:44 2016 done: Sat Nov 5 10:04:46 2016
Total Scan time: 11.300 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]