FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4493, 513 aa 1>>>pF1KE4493 513 - 513 aa - 513 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4586+/-0.000701; mu= 19.7593+/- 0.043 mean_var=103.2722+/-19.969, 0's: 0 Z-trim(113.3): 78 B-trim: 43 in 1/53 Lambda= 0.126207 statistics sampled from 13826 (13909) to 13826 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.756), E-opt: 0.2 (0.427), width: 16 Scan time: 2.900 The best scores are: opt bits E(32554) CCDS45231.1 SPINT1 gene_id:6692|Hs108|chr15 ( 513) 3652 675.2 4.7e-194 CCDS10067.1 SPINT1 gene_id:6692|Hs108|chr15 ( 529) 2153 402.3 7e-112 CCDS5632.1 TFPI2 gene_id:7980|Hs108|chr7 ( 235) 362 75.8 5.9e-14 >>CCDS45231.1 SPINT1 gene_id:6692|Hs108|chr15 (513 aa) initn: 3652 init1: 3652 opt: 3652 Z-score: 3597.3 bits: 675.2 E(32554): 4.7e-194 Smith-Waterman score: 3652; 100.0% identity (100.0% similar) in 513 aa overlap (1-513:1-513) 10 20 30 40 50 60 pF1KE4 MAPARTMARARLAPAGIPAVALWLLCTLGLQGTQAGPPPAPPGLPAGADCLNSFTAGVPG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 MAPARTMARARLAPAGIPAVALWLLCTLGLQGTQAGPPPAPPGLPAGADCLNSFTAGVPG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 FVLDTNASVSNGATFLESPTVRRGWDCVRACCTTQNCNLALVELQPDRGEDAIAACFLIN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 FVLDTNASVSNGATFLESPTVRRGWDCVRACCTTQNCNLALVELQPDRGEDAIAACFLIN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 CLYEQNFVCKFAPREGFINYLTREVYRSYRQLRTQGFGGSGIPKAWAGIDLKVQPQEPLV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 CLYEQNFVCKFAPREGFINYLTREVYRSYRQLRTQGFGGSGIPKAWAGIDLKVQPQEPLV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 LKDVENTDWRLLRGDTDVRVERKDPNQVELWGLKEGTYLFQLTVTSSDHPEDTANVTVTV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 LKDVENTDWRLLRGDTDVRVERKDPNQVELWGLKEGTYLFQLTVTSSDHPEDTANVTVTV 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 LSTKQTEDYCLASNKVGRCRGSFPRWYYDPTEQICKSFVYGGCLGNKNNYLREEECILAC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 LSTKQTEDYCLASNKVGRCRGSFPRWYYDPTEQICKSFVYGGCLGNKNNYLREEECILAC 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 RGVQGPSMERRHPVCSGTCQPTQFRCSNGCCIDSFLECDDTPNCPDASDEAACEKYTSGF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 RGVQGPSMERRHPVCSGTCQPTQFRCSNGCCIDSFLECDDTPNCPDASDEAACEKYTSGF 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 DELQRIHFPSDKGHCVDLPDTGLCKESIPRWYYNPFSEHCARFTYGGCYGNKNNFEEEQQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 DELQRIHFPSDKGHCVDLPDTGLCKESIPRWYYNPFSEHCARFTYGGCYGNKNNFEEEQQ 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE4 CLESCRGISKKDVFGLRREIPIPSTGSVEMAVAVFLVICIVVVVAILGYCFFKNQRKDFH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 CLESCRGISKKDVFGLRREIPIPSTGSVEMAVAVFLVICIVVVVAILGYCFFKNQRKDFH 430 440 450 460 470 480 490 500 510 pF1KE4 GHHHHPPPTPASSTVSTTEDTEHLVYNHTTRPL ::::::::::::::::::::::::::::::::: CCDS45 GHHHHPPPTPASSTVSTTEDTEHLVYNHTTRPL 490 500 510 >>CCDS10067.1 SPINT1 gene_id:6692|Hs108|chr15 (529 aa) initn: 2318 init1: 2130 opt: 2153 Z-score: 2122.1 bits: 402.3 E(32554): 7e-112 Smith-Waterman score: 3610; 97.0% identity (97.0% similar) in 529 aa overlap (1-513:1-529) 10 20 30 40 50 60 pF1KE4 MAPARTMARARLAPAGIPAVALWLLCTLGLQGTQAGPPPAPPGLPAGADCLNSFTAGVPG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MAPARTMARARLAPAGIPAVALWLLCTLGLQGTQAGPPPAPPGLPAGADCLNSFTAGVPG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 FVLDTNASVSNGATFLESPTVRRGWDCVRACCTTQNCNLALVELQPDRGEDAIAACFLIN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 FVLDTNASVSNGATFLESPTVRRGWDCVRACCTTQNCNLALVELQPDRGEDAIAACFLIN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 CLYEQNFVCKFAPREGFINYLTREVYRSYRQLRTQGFGGSGIPKAWAGIDLKVQPQEPLV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 CLYEQNFVCKFAPREGFINYLTREVYRSYRQLRTQGFGGSGIPKAWAGIDLKVQPQEPLV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 LKDVENTDWRLLRGDTDVRVERKDPNQVELWGLKEGTYLFQLTVTSSDHPEDTANVTVTV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LKDVENTDWRLLRGDTDVRVERKDPNQVELWGLKEGTYLFQLTVTSSDHPEDTANVTVTV 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 LSTKQTEDYCLASNKVGRCRGSFPRWYYDPTEQICKSFVYGGCLGNKNNYLREEECILAC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LSTKQTEDYCLASNKVGRCRGSFPRWYYDPTEQICKSFVYGGCLGNKNNYLREEECILAC 250 260 270 280 290 300 310 320 330 340 pF1KE4 RGVQG----------------PSMERRHPVCSGTCQPTQFRCSNGCCIDSFLECDDTPNC ::::: ::::::::::::::::::::::::::::::::::::::: CCDS10 RGVQGGPLRGSSGAQATFPQGPSMERRHPVCSGTCQPTQFRCSNGCCIDSFLECDDTPNC 310 320 330 340 350 360 350 360 370 380 390 400 pF1KE4 PDASDEAACEKYTSGFDELQRIHFPSDKGHCVDLPDTGLCKESIPRWYYNPFSEHCARFT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 PDASDEAACEKYTSGFDELQRIHFPSDKGHCVDLPDTGLCKESIPRWYYNPFSEHCARFT 370 380 390 400 410 420 410 420 430 440 450 460 pF1KE4 YGGCYGNKNNFEEEQQCLESCRGISKKDVFGLRREIPIPSTGSVEMAVAVFLVICIVVVV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 YGGCYGNKNNFEEEQQCLESCRGISKKDVFGLRREIPIPSTGSVEMAVAVFLVICIVVVV 430 440 450 460 470 480 470 480 490 500 510 pF1KE4 AILGYCFFKNQRKDFHGHHHHPPPTPASSTVSTTEDTEHLVYNHTTRPL ::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 AILGYCFFKNQRKDFHGHHHHPPPTPASSTVSTTEDTEHLVYNHTTRPL 490 500 510 520 >>CCDS5632.1 TFPI2 gene_id:7980|Hs108|chr7 (235 aa) initn: 364 init1: 176 opt: 362 Z-score: 364.1 bits: 75.8 E(32554): 5.9e-14 Smith-Waterman score: 362; 34.0% identity (54.8% similar) in 197 aa overlap (243-431:29-215) 220 230 240 250 260 270 pF1KE4 LKEGTYLFQLTVTSSDHPEDTANVTVTVLSTKQTEDYCLASNKVGRCRGSFPRWYYDPTE : .. . :: : ::. . :.::: CCDS56 MDPARPLGLSILLLFLTEAALGDAAQEPTGNNAEICLLPLDYGPCRALLLRYYYDRYT 10 20 30 40 50 280 290 300 310 320 pF1KE4 QICKSFVYGGCLGNKNNYLREEECILACRGVQG-PSMERRH----PVCSGTCQPTQFRCS : :..:.:::: :: ::. : : :: .. :.. : . : :. . : : CCDS56 QSCRQFLYGGCEGNANNFYTWEACDDACWRIEKVPKVCRLQVSVDDQCEGSTEKYFFNLS 60 70 80 90 100 110 330 340 350 360 370 380 pF1KE4 NGCCIDSFLE--CDDTPNCPDASDEAACEKYTSGFDELQRIHFPSDKGHCVDLPDTGLCK . : ..:. : . :::.: :: ..: :: : . : :::. CCDS56 SMTC-EKFFSGGCHRNRIENRFPDEATCM----GFCAPKKI--PS---FCYSPKDEGLCS 120 130 140 150 160 390 400 410 420 430 440 pF1KE4 ESIPRWYYNPFSEHCARFTYGGCYGNKNNFEEEQQCLESC-RGISKKDVFGLRREIPIPS .. :.:.:: . : ::: :: :: ::: ...: ..: ....:: CCDS56 ANVTRYYFNPRYRTCDAFTYTGCGGNDNNFVSREDCKRACAKALKKKKKMPKLRFASRIR 170 180 190 200 210 220 450 460 470 480 490 500 pF1KE4 TGSVEMAVAVFLVICIVVVVAILGYCFFKNQRKDFHGHHHHPPPTPASSTVSTTEDTEHL CCDS56 KIRKKQF 230 513 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 18:35:06 2016 done: Mon Nov 7 18:35:07 2016 Total Scan time: 2.900 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]