FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE5350, 306 aa
1>>>pF1KE5350 306 - 306 aa - 306 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.6802+/-0.000345; mu= 13.6754+/- 0.021
mean_var=72.3665+/-14.691, 0's: 0 Z-trim(116.0): 16 B-trim: 740 in 1/55
Lambda= 0.150767
statistics sampled from 26894 (26900) to 26894 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.693), E-opt: 0.2 (0.315), width: 16
Scan time: 6.880
The best scores are:                                        opt  bits E(85289)
NP_001265352 (OMIM: 607894) polycystic kidney dise ( 306) 2007 445.5 6.2e-125
NP_001070248 (OMIM: 607894) polycystic kidney dise ( 991) 2007 445.8 1.7e-124
NP_001265354 (OMIM: 607894) polycystic kidney dise (1774) 1857 413.2 1.8e-114
NP_443124 (OMIM: 607894) polycystic kidney disease (2459) 1857 413.3 2.4e-114
>>NP_001265352 (OMIM: 607894) polycystic kidney disease (306 aa)
initn: 2007 init1: 2007 opt: 2007 Z-score: 2364.0 bits: 445.5 E(85289): 6.2e-125
Smith-Waterman score: 2007; 99.7% identity (99.7% similar) in 306 aa overlap (1-306:1-306)
10 20 30 40 50 60
pF1KE5 MGEDSPVAMFSWYLDNTPTEQAEPLPDACRLRGFWPRSLTLLQSNTSTLLLNSSFLQSRG
::::::::::::::::::::::::: ::::::::::::::::::::::::::::::::::
NP_001 MGEDSPVAMFSWYLDNTPTEQAEPLLDACRLRGFWPRSLTLLQSNTSTLLLNSSFLQSRG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE5 EVIRIRATALTRHAYGEDTYVISTVPPREVPACTIAPEEGTVLTSFAIFCNASTALGPLE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 EVIRIRATALTRHAYGEDTYVISTVPPREVPACTIAPEEGTVLTSFAIFCNASTALGPLE
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE5 FCFCLESGSCLHCGPEPALPSVYLPLGEENNDFVLTVVISATNRAGDTQQTQAMAKVALG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 FCFCLESGSCLHCGPEPALPSVYLPLGEENNDFVLTVVISATNRAGDTQQTQAMAKVALG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE5 DTCVEDVAFQAAVSEKIPTALQGEGGPEQLLQLAKAVSSMLNQEHESQGSGQSLSIDVRQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 DTCVEDVAFQAAVSEKIPTALQGEGGPEQLLQLAKAVSSMLNQEHESQGSGQSLSIDVRQ
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE5 KVREHVLGSLSAVTTGLEDVQRVQELAEVLREVTCRSKELTPSAQGSCMGDSWEGAPPAA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 KVREHVLGSLSAVTTGLEDVQRVQELAEVLREVTCRSKELTPSAQGSCMGDSWEGAPPAA
250 260 270 280 290 300
pF1KE5 HVSHAR
::::::
NP_001 HVSHAR
>>NP_001070248 (OMIM: 607894) polycystic kidney disease (991 aa)
initn: 2007 init1: 2007 opt: 2007 Z-score: 2356.1 bits: 445.8 E(85289): 1.7e-124
Smith-Waterman score: 2007; 99.7% identity (99.7% similar) in 306 aa overlap (1-306:686-991)
10 20 30
pF1KE5 MGEDSPVAMFSWYLDNTPTEQAEPLPDACR
::::::::::::::::::::::::: ::::
NP_001 SAPWELRPRVSCERNCRPVNASKDILLRVTMGEDSPVAMFSWYLDNTPTEQAEPLLDACR
660 670 680 690 700 710
40 50 60 70 80 90
pF1KE5 LRGFWPRSLTLLQSNTSTLLLNSSFLQSRGEVIRIRATALTRHAYGEDTYVISTVPPREV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 LRGFWPRSLTLLQSNTSTLLLNSSFLQSRGEVIRIRATALTRHAYGEDTYVISTVPPREV
720 730 740 750 760 770
100 110 120 130 140 150
pF1KE5 PACTIAPEEGTVLTSFAIFCNASTALGPLEFCFCLESGSCLHCGPEPALPSVYLPLGEEN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 PACTIAPEEGTVLTSFAIFCNASTALGPLEFCFCLESGSCLHCGPEPALPSVYLPLGEEN
780 790 800 810 820 830
160 170 180 190 200 210
pF1KE5 NDFVLTVVISATNRAGDTQQTQAMAKVALGDTCVEDVAFQAAVSEKIPTALQGEGGPEQL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 NDFVLTVVISATNRAGDTQQTQAMAKVALGDTCVEDVAFQAAVSEKIPTALQGEGGPEQL
840 850 860 870 880 890
220 230 240 250 260 270
pF1KE5 LQLAKAVSSMLNQEHESQGSGQSLSIDVRQKVREHVLGSLSAVTTGLEDVQRVQELAEVL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 LQLAKAVSSMLNQEHESQGSGQSLSIDVRQKVREHVLGSLSAVTTGLEDVQRVQELAEVL
900 910 920 930 940 950
280 290 300
pF1KE5 REVTCRSKELTPSAQGSCMGDSWEGAPPAAHVSHAR
::::::::::::::::::::::::::::::::::::
NP_001 REVTCRSKELTPSAQGSCMGDSWEGAPPAAHVSHAR
960 970 980 990
>>NP_001265354 (OMIM: 607894) polycystic kidney disease (1774 aa)
initn: 1847 init1: 1847 opt: 1857 Z-score: 2175.9 bits: 413.2 E(85289): 1.8e-114
Smith-Waterman score: 1857; 94.8% identity (96.1% similar) in 305 aa overlap (1-305:1-298)
10 20 30 40 50 60
pF1KE5 MGEDSPVAMFSWYLDNTPTEQAEPLPDACRLRGFWPRSLTLLQSNTSTLLLNSSFLQSRG
::::::::::::::::::::::::: ::::::::::::::::::::::::::::::::::
NP_001 MGEDSPVAMFSWYLDNTPTEQAEPLLDACRLRGFWPRSLTLLQSNTSTLLLNSSFLQSRG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE5 EVIRIRATALTRHAYGEDTYVISTVPPREVPACTIAPEEGTVLTSFAIFCNASTALGPLE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 EVIRIRATALTRHAYGEDTYVISTVPPREVPACTIAPEEGTVLTSFAIFCNASTALGPLE
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE5 FCFCLESGSCLHCGPEPALPSVYLPLGEENNDFVLTVVISATNRAGDTQQTQAMAKVALG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 FCFCLESGSCLHCGPEPALPSVYLPLGEENNDFVLTVVISATNRAGDTQQTQAMAKVALG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE5 DTCVEDVAFQAAVSEKIPTALQGEGGPEQLLQLAKAVSSMLNQEHESQGSGQSLSIDVRQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 DTCVEDVAFQAAVSEKIPTALQGEGGPEQLLQLAKAVSSMLNQEHESQGSGQSLSIDVRQ
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE5 KVREHVLGSLSAVTTGLEDVQRVQELAEVLREVTCRSKELTPSAQGSCMGDSWEGAPPAA
::::::::::::::::::::::::::::::::::::::::::::: ::..
NP_001 KVREHVLGSLSAVTTGLEDVQRVQELAEVLREVTCRSKELTPSAQ-------WEASLALQ
250 260 270 280 290
pF1KE5 HVSHAR
:.:.:
NP_001 HASEALLTVSAKARPEDQRRQAATRDLFQAVGSVLEASLSNRPEEPAEASSSQIATVLRL
300 310 320 330 340 350
>>NP_443124 (OMIM: 607894) polycystic kidney disease pro (2459 aa)
initn: 1847 init1: 1847 opt: 1857 Z-score: 2173.7 bits: 413.3 E(85289): 2.4e-114
Smith-Waterman score: 1857; 94.8% identity (96.1% similar) in 305 aa overlap (1-305:686-983)
10 20 30
pF1KE5 MGEDSPVAMFSWYLDNTPTEQAEPLPDACR
::::::::::::::::::::::::: ::::
NP_443 SAPWELRPRVSCERNCRPVNASKDILLRVTMGEDSPVAMFSWYLDNTPTEQAEPLLDACR
660 670 680 690 700 710
40 50 60 70 80 90
pF1KE5 LRGFWPRSLTLLQSNTSTLLLNSSFLQSRGEVIRIRATALTRHAYGEDTYVISTVPPREV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_443 LRGFWPRSLTLLQSNTSTLLLNSSFLQSRGEVIRIRATALTRHAYGEDTYVISTVPPREV
720 730 740 750 760 770
100 110 120 130 140 150
pF1KE5 PACTIAPEEGTVLTSFAIFCNASTALGPLEFCFCLESGSCLHCGPEPALPSVYLPLGEEN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_443 PACTIAPEEGTVLTSFAIFCNASTALGPLEFCFCLESGSCLHCGPEPALPSVYLPLGEEN
780 790 800 810 820 830
160 170 180 190 200 210
pF1KE5 NDFVLTVVISATNRAGDTQQTQAMAKVALGDTCVEDVAFQAAVSEKIPTALQGEGGPEQL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_443 NDFVLTVVISATNRAGDTQQTQAMAKVALGDTCVEDVAFQAAVSEKIPTALQGEGGPEQL
840 850 860 870 880 890
220 230 240 250 260 270
pF1KE5 LQLAKAVSSMLNQEHESQGSGQSLSIDVRQKVREHVLGSLSAVTTGLEDVQRVQELAEVL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_443 LQLAKAVSSMLNQEHESQGSGQSLSIDVRQKVREHVLGSLSAVTTGLEDVQRVQELAEVL
900 910 920 930 940 950
280 290 300
pF1KE5 REVTCRSKELTPSAQGSCMGDSWEGAPPAAHVSHAR
::::::::::::::: ::.. :.:.:
NP_443 REVTCRSKELTPSAQ-------WEASLALQHASEALLTVSAKARPEDQRRQAATRDLFQA
960 970 980 990 1000
NP_443 VGSVLEASLSNRPEEPAEASSSQIATVLRLLRVMEHVQTTLLLGKLPGGLPAMLATPSIS
1010 1020 1030 1040 1050 1060
306 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 00:01:25 2016 done: Tue Nov 8 00:01:26 2016
Total Scan time: 6.880 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]
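
Note: the per-hit summary lines in the report above (the ">>" header, the "initn: ... opt: ..." line and the "Smith-Waterman score" line) follow a regular layout, so they can be collected programmatically. The following is a minimal sketch, assuming the report is available as plain text in exactly this layout. The function name parse_hits and the dictionary keys are hypothetical and are not part of the FASTA package.

```python
import re

# Hypothetical helper (not part of FASTA): extracts per-hit summary
# statistics from a report laid out like the one above.
HEADER = re.compile(r"^>>(?!>)(\S+) .*\((\d+) aa\)$")
SCORES = re.compile(
    r"opt:\s*(\d+)\s+Z-score:\s*([\d.]+)\s+bits:\s*([\d.]+)\s+E\(\d+\):\s*(\S+)"
)
OVERLAP = re.compile(
    r"([\d.]+)% identity \(([\d.]+)% similar\) in (\d+) aa overlap "
    r"\((\d+)-(\d+):(\d+)-(\d+)\)"
)


def parse_hits(text):
    """Return one dict per '>>' hit block found in a FASTA report."""
    hits = []
    for raw in text.splitlines():
        line = raw.strip()
        m = HEADER.match(line)
        if m:
            # A '>>' line opens a new hit block.
            hits.append({"accession": m.group(1), "length": int(m.group(2))})
            continue
        if not hits:
            continue  # still in the report preamble
        m = SCORES.search(line)
        if m:
            hits[-1].update(
                opt=int(m.group(1)),
                z_score=float(m.group(2)),
                bits=float(m.group(3)),
                e_value=float(m.group(4)),
            )
            continue
        m = OVERLAP.search(line)
        if m:
            hits[-1].update(
                identity=float(m.group(1)),
                similar=float(m.group(2)),
                overlap=int(m.group(3)),
                query_range=(int(m.group(4)), int(m.group(5))),
                target_range=(int(m.group(6)), int(m.group(7))),
            )
    return hits


# Sample lines copied from the report above.
SAMPLE = """\
>>NP_001265352 (OMIM: 607894) polycystic kidney disease (306 aa)
initn: 2007 init1: 2007 opt: 2007 Z-score: 2364.0 bits: 445.5 E(85289): 6.2e-125
Smith-Waterman score: 2007; 99.7% identity (99.7% similar) in 306 aa overlap (1-306:1-306)
>>NP_001070248 (OMIM: 607894) polycystic kidney disease (991 aa)
initn: 2007 init1: 2007 opt: 2007 Z-score: 2356.1 bits: 445.8 E(85289): 1.7e-124
Smith-Waterman score: 2007; 99.7% identity (99.7% similar) in 306 aa overlap (1-306:686-991)
"""

hits = parse_hits(SAMPLE)
for hit in hits:
    print(hit["accession"], hit["opt"], hit["bits"], hit["e_value"], hit["target_range"])
```

The sketch reads only the summary lines and ignores the alignment rows, so it does not depend on column alignment being preserved in the text.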