FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1302, 145 aa
1>>>pF1KE1302 145 - 145 aa - 145 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.7770+/-0.000669; mu= 15.2291+/- 0.040
mean_var=80.6959+/-15.723, 0's: 0 Z-trim(112.5): 17 B-trim: 2 in 1/49
Lambda= 0.142774
statistics sampled from 13233 (13245) to 13233 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.776), E-opt: 0.2 (0.407), width: 16
Scan time: 1.760
The best scores are:                                  opt  bits E(32554)
CCDS203.1 PLA2G2D gene_id:26279|Hs108|chr1    ( 145) 1110 237.1 2.9e-63
CCDS201.1 PLA2G2A gene_id:5320|Hs108|chr1     ( 144)  520 115.6 1.1e-26
CCDS202.1 PLA2G5 gene_id:5322|Hs108|chr1      ( 138)  473 105.9 8.8e-24
CCDS204.2 PLA2G2F gene_id:64600|Hs108|chr1    ( 211)  475 106.5 8.9e-24
CCDS72721.1 PLA2G2D gene_id:26279|Hs108|chr1  (  62)  432  97.0 1.8e-21
CCDS200.1 PLA2G2E gene_id:30814|Hs108|chr1    ( 142)  427  96.4 6.4e-21
CCDS10555.1 PLA2G10 gene_id:8399|Hs108|chr16  ( 165)  383  87.4 3.8e-18
CCDS9195.1 PLA2G1B gene_id:5319|Hs108|chr12   ( 148)  325  75.4 1.4e-14
CCDS47919.1 OC90 gene_id:729330|Hs108|chr8    ( 477)  271  64.8 6.9e-11
>>CCDS203.1 PLA2G2D gene_id:26279|Hs108|chr1 (145 aa)
initn: 1110 init1: 1110 opt: 1110 Z-score: 1249.2 bits: 237.1 E(32554): 2.9e-63
Smith-Waterman score: 1110; 100.0% identity (100.0% similar) in 145 aa overlap (1-145:1-145)
               10        20        30        40        50        60
pF1KE1 MELALLCGLVVMAGVIPIQGGILNLNKMVKQVTGKMPILSYWPYGCHCGLGGRGQPKDAT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 MELALLCGLVVMAGVIPIQGGILNLNKMVKQVTGKMPILSYWPYGCHCGLGGRGQPKDAT
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE1 DWCCQTHDCCYDHLKTQGCSIYKDYYRYNFSQGNIHCSDKGSWCEQQLCACDKEVAFCLK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 DWCCQTHDCCYDHLKTQGCSIYKDYYRYNFSQGNIHCSDKGSWCEQQLCACDKEVAFCLK
               70        80        90       100       110       120
              130       140
pF1KE1 RNLDTYQKRLRFYWRPHCRGQTPGC
       :::::::::::::::::::::::::
CCDS20 RNLDTYQKRLRFYWRPHCRGQTPGC
              130       140
>>CCDS201.1 PLA2G2A gene_id:5320|Hs108|chr1 (144 aa)
initn: 495 init1: 329 opt: 520 Z-score: 592.5 bits: 115.6 E(32554): 1.1e-26
Smith-Waterman score: 520; 46.9% identity (73.1% similar) in 145 aa overlap (1-145:1-144)
10 20 30 40 50 60
pF1KE1 MELALLCGLVVMAGVIPIQGGILNLNKMVKQVTGKMPILSYWPYGCHCGLGGRGQPKDAT
:. :: ..... :.. .:...:...:.: .::: ::: ::::::.::::.:::::
CCDS20 MKTLLLLAVIMIFGLLQAHGNLVNFHRMIKLTTGKEAALSYGFYGCHCGVGGRGSPKDAT
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 DWCCQTHDCCYDHLKTQGCSIYKDYYRYNFSQGNIHCSDKGSWCEQQLCACDKEVAFCLK
: :: :::::: .:. .::. :... : . : :. . : :..::: ::: .: :.
CCDS20 DRCCVTHDCCYKRLEKRGCGTKFLSYKFSNSGSRITCAKQDS-CRSQLCECDKAAATCFA
70 80 90 100 110
130 140
pF1KE1 RNLDTYQKRLRFYWRPHCRGQTPGC
:: ::.:. ..: ::::.:: :
CCDS20 RNKTTYNKKYQYYSNKHCRGSTPRC
120 130 140
>>CCDS202.1 PLA2G5 gene_id:5322|Hs108|chr1 (138 aa)
initn: 463 init1: 343 opt: 473 Z-score: 540.4 bits: 105.9 E(32554): 8.8e-24
Smith-Waterman score: 473; 45.3% identity (75.0% similar) in 128 aa overlap (12-138:11-137)
10 20 30 40 50
pF1KE1 MELALLCGLVVMAGVIP-IQGGILNLNKMVKQVTGKMPILSYWPYGCHCGLGGRGQPKDA
.: .: .:::.:.:..:...:::: . .: :::.:: :::: :::.
CCDS20 MKGLLPLAWFLACSVPAVQGGLLDLKSMIEKVTGKNALTNYGFYGCYCGWGGRGTPKDG
10 20 30 40 50
60 70 80 90 100 110
pF1KE1 TDWCCQTHDCCYDHLKTQGCSIYKDYYRYNFSQGNIHCSDKGSWCEQQLCACDKEVAFCL
::::: .:: :: .:. .::.: . :.: :. : . : . : .:. .:::::.....::
CCDS20 TDWCCWAHDHCYGRLEEKGCNIRTQSYKYRFAWGVVTC-EPGPFCHVNLCACDRKLVYCL
60 70 80 90 100 110
120 130 140
pF1KE1 KRNLDTYQKRLRFYWRPHCRGQTPGC
:::: .:. . ... :
CCDS20 KRNLRSYNPQYQYFPNILCS
120 130
>>CCDS204.2 PLA2G2F gene_id:64600|Hs108|chr1 (211 aa)
initn: 457 init1: 367 opt: 475 Z-score: 540.4 bits: 106.5 E(32554): 8.9e-24
Smith-Waterman score: 475; 45.4% identity (74.5% similar) in 141 aa overlap (9-145:50-188)
10 20 30
pF1KE1 MELALLCGLVVMAGVI--PIQGGILNLNKMVKQVTGKM
....:: . .:..:::. ::. :::.
CCDS20 CFSGWRGPRFGASCPSRTSRSSLGMKKFFTVAILAGSVLSTAHGSLLNLKAMVEAVTGRS
20 30 40 50 60 70
40 50 60 70 80 90
pF1KE1 PILSYWPYGCHCGLGGRGQPKDATDWCCQTHDCCYDHLKTQGCSIYKDYYRYNFSQGN-I
:::. :::.::::::::::: .::::..:::::..: ::: : :.: ... ... :
CCDS20 AILSFVGYGCYCGLGGRGQPKDEVDWCCHAHDCCYQELFDQGCHPYVDHYDHTIENNTEI
80 90 100 110 120 130
100 110 120 130 140
pF1KE1 HCSD-KGSWCEQQLCACDKEVAFCLKRNLDTYQKRLRFYWRPHCRGQTPGC
::: . . :..: : :::....:: . ::... : . .:.: ::.:
CCDS20 VCSDLNKTECDKQTCMCDKNMVLCLMNQ--TYREEYRGFLNVYCQGPTPNCSIYEPPPEE
140 150 160 170 180 190
CCDS20 VTCSHQSPAPPAPP
200 210
>>CCDS72721.1 PLA2G2D gene_id:26279|Hs108|chr1 (62 aa)
initn: 431 init1: 431 opt: 432 Z-score: 499.0 bits: 97.0 E(32554): 1.8e-21
Smith-Waterman score: 432; 98.4% identity (98.4% similar) in 63 aa overlap (1-63:1-62)
               10        20        30        40        50        60
pF1KE1 MELALLCGLVVMAGVIPIQGGILNLNKMVKQVTGKMPILSYWPYGCHCGLGGRGQPKDAT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 MELALLCGLVVMAGVIPIQGGILNLNKMVKQVTGKMPILSYWPYGCHCGLGGRGQPKDAT
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE1 DWCCQTHDCCYDHLKTQGCSIYKDYYRYNFSQGNIHCSDKGSWCEQQLCACDKEVAFCLK
       : :
CCDS72 D-C
>>CCDS200.1 PLA2G2E gene_id:30814|Hs108|chr1 (142 aa)
initn: 351 init1: 210 opt: 427 Z-score: 489.0 bits: 96.4 E(32554): 6.4e-21
Smith-Waterman score: 427; 39.2% identity (67.1% similar) in 143 aa overlap (3-145:7-142)
10 20 30 40 50
pF1KE1 MELALLCGLVVMAGVIPIQGGILNLNKMVKQVTGKMPILSYWPYGCHCGLGGRGQP
:..:: ::.. . :...... :....::: :.: :::.::.:: :
CCDS20 MKSPHVLVFLCLLVAL-----VTGNLVQFGVMIEKMTGK-SALQYNDYGCYCGIGGSHWP
10 20 30 40 50
60 70 80 90 100 110
pF1KE1 KDATDWCCQTHDCCYDHLKTQGCSIYKDYYRYNFSQGNIHCSDKGSWCEQQLCACDKEVA
: :::::..::::: .:. :: . : .. :. .: :. . . :.. : :::..:
CCDS20 VDQTDWCCHAHDCCYGRLEKLGCEPKLEKYLFSVSERGIFCAGRTT-CQRLTCECDKRAA
60 70 80 90 100 110
120 130 140
pF1KE1 FCLKRNLDTYQKRLRFYWRPHCRGQTPGC
.:..::: ::... : : : :: :
CCDS20 LCFRRNLGTYNRKYAHYPNKLCTGPTPPC
120 130 140
>>CCDS10555.1 PLA2G10 gene_id:8399|Hs108|chr16 (165 aa)
initn: 331 init1: 331 opt: 383 Z-score: 439.3 bits: 87.4 E(32554): 3.8e-18
Smith-Waterman score: 383; 40.8% identity (63.2% similar) in 125 aa overlap (21-145:43-164)
10 20 30 40 50
pF1KE1 MELALLCGLVVMAGVIPIQGGILNLNKMVKQVTGKMPILSYWPYGCHCGL
:::.: : : . :: .: ::: :::
CCDS10 LLLLPSLLLLLLLPGPGSGEASRILRVHRRGILELAGTVGCVGPRTPI-AYMKYGCFCGL
20 30 40 50 60 70
60 70 80 90 100 110
pF1KE1 GGRGQPKDATDWCCQTHDCCYDHLKTQGCSIYKDYYRYNFSQGNIHCSDKGSWCEQQLCA
::.:::.:: ::::. ::::: . . ::: . : .. . .. :. . :.. ::
CCDS10 GGHGQPRDAIDWCCHGHDCCYTRAEEAGCSPKTERYSWQCVNQSVLCGPAENKCQELLCK
80 90 100 110 120 130
120 130 140
pF1KE1 CDKEVAFCLKRNLDTYQKRLRFYWRPHCRGQTPGC
::.:.: :: .. :. . :: . :. ..: :
CCDS10 CDQEIANCLAQT--EYNLKYLFYPQFLCEPDSPKCD
140 150 160
>>CCDS9195.1 PLA2G1B gene_id:5319|Hs108|chr12 (148 aa)
initn: 312 init1: 185 opt: 325 Z-score: 375.3 bits: 75.4 E(32554): 1.4e-14
Smith-Waterman score: 325; 40.2% identity (64.4% similar) in 132 aa overlap (1-121:1-130)
10 20 30 40 50
pF1KE1 MELALLCGLVVMA----GVIPIQGGILNLNKMVKQVT-GKMPILSYWPYGCHCGLGGRGQ
:.: .: :...: :. : .. .. ::.: : :. :.: : :::.::::: :
CCDS91 MKLLVLAVLLTVAAADSGISP--RAVWQFRKMIKCVIPGSDPFLEYNNYGCYCGLGGSGT
10 20 30 40 50
60 70 80 90 100
pF1KE1 PKDATDWCCQTHDCCYDHLKT-QGCSI-----YKDYYRYNFSQGNIHCSDKGSWCEQQLC
: : : :::::: :::. : ..:.. : : :. : . : ::.:.. :: .:
CCDS91 PVDELDKCCQTHDNCYDQAKKLDSCKFLLDNPYTHTYSYSCSGSAITCSSKNKECEAFIC
60 70 80 90 100 110
110 120 130 140
pF1KE1 ACDKEVAFCLKRNLDTYQKRLRFYWRPHCRGQTPGC
::...:.:...
CCDS91 NCDRNAAICFSKAPYNKAHKNLDTKKYCQS
120 130 140
>>CCDS47919.1 OC90 gene_id:729330|Hs108|chr8 (477 aa)
initn: 375 init1: 236 opt: 271 Z-score: 308.9 bits: 64.8 E(32554): 6.9e-11
Smith-Waterman score: 271; 32.0% identity (59.0% similar) in 122 aa overlap (25-145:308-425)
10 20 30 40 50
pF1KE1 MELALLCGLVVMAGVIPIQGGILNLNKMVKQVTGKMPILSYWPYGCHCGLGGRG
:..:. .:.. : . :::.:: :::
CCDS47 NDPEETTEKACDRFTFLHLGSGDNMQVMPQLGEMLFCLTSRCP-EEFESYGCYCGQEGRG
280 290 300 310 320 330
60 70 80 90 100 110
pF1KE1 QPKDATDWCCQTHDCCYDHLKTQGCSIYK-DYYRYNFSQGNIHCSDKGSWCEQQLCACDK
.:.: : :: .: :: .... :: . . . . . .:. . : ::. :::::.
CCDS47 EPRDDLDRCCLSHHCCLEQVRRLGCLLERLPWSPVVCVDHTPKCGGQ-SLCEKLLCACDQ
340 350 360 370 380 390
120 130 140
pF1KE1 EVAFCLKRNLDTYQKRLRFYWRPHCRGQTPGC
.: :. .... :. : : :: .:
CCDS47 TAAECMTSA--SFNQSLKSPSRLGCPGQPAACEDSLHPVPAAPTLGSSSEEDSEEDPPQE
400 410 420 430 440 450
CCDS47 DLGRAKRFLRKSLGPLGIGPLHGR
460 470
145 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 03:40:56 2016 done: Mon Nov 7 03:40:57 2016
Total Scan time: 1.760 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]