FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE4493, 513 aa
1>>>pF1KE4493 513 - 513 aa - 513 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.4586+/-0.000701; mu= 19.7593+/- 0.043
mean_var=103.2722+/-19.969, 0's: 0 Z-trim(113.3): 78 B-trim: 43 in 1/53
Lambda= 0.126207
statistics sampled from 13826 (13909) to 13826 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.756), E-opt: 0.2 (0.427), width: 16
Scan time: 2.900
The best scores are: opt bits E(32554)
CCDS45231.1 SPINT1 gene_id:6692|Hs108|chr15 ( 513) 3652 675.2 4.7e-194
CCDS10067.1 SPINT1 gene_id:6692|Hs108|chr15 ( 529) 2153 402.3 7e-112
CCDS5632.1 TFPI2 gene_id:7980|Hs108|chr7 ( 235) 362 75.8 5.9e-14
>>CCDS45231.1 SPINT1 gene_id:6692|Hs108|chr15 (513 aa)
initn: 3652 init1: 3652 opt: 3652 Z-score: 3597.3 bits: 675.2 E(32554): 4.7e-194
Smith-Waterman score: 3652; 100.0% identity (100.0% similar) in 513 aa overlap (1-513:1-513)
10 20 30 40 50 60
pF1KE4 MAPARTMARARLAPAGIPAVALWLLCTLGLQGTQAGPPPAPPGLPAGADCLNSFTAGVPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 MAPARTMARARLAPAGIPAVALWLLCTLGLQGTQAGPPPAPPGLPAGADCLNSFTAGVPG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 FVLDTNASVSNGATFLESPTVRRGWDCVRACCTTQNCNLALVELQPDRGEDAIAACFLIN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 FVLDTNASVSNGATFLESPTVRRGWDCVRACCTTQNCNLALVELQPDRGEDAIAACFLIN
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 CLYEQNFVCKFAPREGFINYLTREVYRSYRQLRTQGFGGSGIPKAWAGIDLKVQPQEPLV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 CLYEQNFVCKFAPREGFINYLTREVYRSYRQLRTQGFGGSGIPKAWAGIDLKVQPQEPLV
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 LKDVENTDWRLLRGDTDVRVERKDPNQVELWGLKEGTYLFQLTVTSSDHPEDTANVTVTV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 LKDVENTDWRLLRGDTDVRVERKDPNQVELWGLKEGTYLFQLTVTSSDHPEDTANVTVTV
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 LSTKQTEDYCLASNKVGRCRGSFPRWYYDPTEQICKSFVYGGCLGNKNNYLREEECILAC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 LSTKQTEDYCLASNKVGRCRGSFPRWYYDPTEQICKSFVYGGCLGNKNNYLREEECILAC
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE4 RGVQGPSMERRHPVCSGTCQPTQFRCSNGCCIDSFLECDDTPNCPDASDEAACEKYTSGF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 RGVQGPSMERRHPVCSGTCQPTQFRCSNGCCIDSFLECDDTPNCPDASDEAACEKYTSGF
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE4 DELQRIHFPSDKGHCVDLPDTGLCKESIPRWYYNPFSEHCARFTYGGCYGNKNNFEEEQQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 DELQRIHFPSDKGHCVDLPDTGLCKESIPRWYYNPFSEHCARFTYGGCYGNKNNFEEEQQ
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE4 CLESCRGISKKDVFGLRREIPIPSTGSVEMAVAVFLVICIVVVVAILGYCFFKNQRKDFH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 CLESCRGISKKDVFGLRREIPIPSTGSVEMAVAVFLVICIVVVVAILGYCFFKNQRKDFH
430 440 450 460 470 480
490 500 510
pF1KE4 GHHHHPPPTPASSTVSTTEDTEHLVYNHTTRPL
:::::::::::::::::::::::::::::::::
CCDS45 GHHHHPPPTPASSTVSTTEDTEHLVYNHTTRPL
490 500 510
>>CCDS10067.1 SPINT1 gene_id:6692|Hs108|chr15 (529 aa)
initn: 2318 init1: 2130 opt: 2153 Z-score: 2122.1 bits: 402.3 E(32554): 7e-112
Smith-Waterman score: 3610; 97.0% identity (97.0% similar) in 529 aa overlap (1-513:1-529)
10 20 30 40 50 60
pF1KE4 MAPARTMARARLAPAGIPAVALWLLCTLGLQGTQAGPPPAPPGLPAGADCLNSFTAGVPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MAPARTMARARLAPAGIPAVALWLLCTLGLQGTQAGPPPAPPGLPAGADCLNSFTAGVPG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 FVLDTNASVSNGATFLESPTVRRGWDCVRACCTTQNCNLALVELQPDRGEDAIAACFLIN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 FVLDTNASVSNGATFLESPTVRRGWDCVRACCTTQNCNLALVELQPDRGEDAIAACFLIN
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 CLYEQNFVCKFAPREGFINYLTREVYRSYRQLRTQGFGGSGIPKAWAGIDLKVQPQEPLV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 CLYEQNFVCKFAPREGFINYLTREVYRSYRQLRTQGFGGSGIPKAWAGIDLKVQPQEPLV
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 LKDVENTDWRLLRGDTDVRVERKDPNQVELWGLKEGTYLFQLTVTSSDHPEDTANVTVTV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 LKDVENTDWRLLRGDTDVRVERKDPNQVELWGLKEGTYLFQLTVTSSDHPEDTANVTVTV
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 LSTKQTEDYCLASNKVGRCRGSFPRWYYDPTEQICKSFVYGGCLGNKNNYLREEECILAC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 LSTKQTEDYCLASNKVGRCRGSFPRWYYDPTEQICKSFVYGGCLGNKNNYLREEECILAC
250 260 270 280 290 300
310 320 330 340
pF1KE4 RGVQG----------------PSMERRHPVCSGTCQPTQFRCSNGCCIDSFLECDDTPNC
::::: :::::::::::::::::::::::::::::::::::::::
CCDS10 RGVQGGPLRGSSGAQATFPQGPSMERRHPVCSGTCQPTQFRCSNGCCIDSFLECDDTPNC
310 320 330 340 350 360
350 360 370 380 390 400
pF1KE4 PDASDEAACEKYTSGFDELQRIHFPSDKGHCVDLPDTGLCKESIPRWYYNPFSEHCARFT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 PDASDEAACEKYTSGFDELQRIHFPSDKGHCVDLPDTGLCKESIPRWYYNPFSEHCARFT
370 380 390 400 410 420
410 420 430 440 450 460
pF1KE4 YGGCYGNKNNFEEEQQCLESCRGISKKDVFGLRREIPIPSTGSVEMAVAVFLVICIVVVV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 YGGCYGNKNNFEEEQQCLESCRGISKKDVFGLRREIPIPSTGSVEMAVAVFLVICIVVVV
430 440 450 460 470 480
470 480 490 500 510
pF1KE4 AILGYCFFKNQRKDFHGHHHHPPPTPASSTVSTTEDTEHLVYNHTTRPL
:::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 AILGYCFFKNQRKDFHGHHHHPPPTPASSTVSTTEDTEHLVYNHTTRPL
490 500 510 520
>>CCDS5632.1 TFPI2 gene_id:7980|Hs108|chr7 (235 aa)
initn: 364 init1: 176 opt: 362 Z-score: 364.1 bits: 75.8 E(32554): 5.9e-14
Smith-Waterman score: 362; 34.0% identity (54.8% similar) in 197 aa overlap (243-431:29-215)
220 230 240 250 260 270
pF1KE4 LKEGTYLFQLTVTSSDHPEDTANVTVTVLSTKQTEDYCLASNKVGRCRGSFPRWYYDPTE
: .. . :: : ::. . :.:::
CCDS56 MDPARPLGLSILLLFLTEAALGDAAQEPTGNNAEICLLPLDYGPCRALLLRYYYDRYT
10 20 30 40 50
280 290 300 310 320
pF1KE4 QICKSFVYGGCLGNKNNYLREEECILACRGVQG-PSMERRH----PVCSGTCQPTQFRCS
: :..:.:::: :: ::. : : :: .. :.. : . : :. . : :
CCDS56 QSCRQFLYGGCEGNANNFYTWEACDDACWRIEKVPKVCRLQVSVDDQCEGSTEKYFFNLS
60 70 80 90 100 110
330 340 350 360 370 380
pF1KE4 NGCCIDSFLE--CDDTPNCPDASDEAACEKYTSGFDELQRIHFPSDKGHCVDLPDTGLCK
. : ..:. : . :::.: :: ..: :: : . : :::.
CCDS56 SMTC-EKFFSGGCHRNRIENRFPDEATCM----GFCAPKKI--PS---FCYSPKDEGLCS
120 130 140 150 160
390 400 410 420 430 440
pF1KE4 ESIPRWYYNPFSEHCARFTYGGCYGNKNNFEEEQQCLESC-RGISKKDVFGLRREIPIPS
.. :.:.:: . : ::: :: :: ::: ...: ..: ....::
CCDS56 ANVTRYYFNPRYRTCDAFTYTGCGGNDNNFVSREDCKRACAKALKKKKKMPKLRFASRIR
170 180 190 200 210 220
450 460 470 480 490 500
pF1KE4 TGSVEMAVAVFLVICIVVVVAILGYCFFKNQRKDFHGHHHHPPPTPASSTVSTTEDTEHL
CCDS56 KIRKKQF
230
513 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 18:35:06 2016 done: Mon Nov 7 18:35:07 2016
Total Scan time: 2.900 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]