FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1302, 145 aa
  1>>>pF1KE1302 145 - 145 aa - 145 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 4.7770+/-0.000669; mu= 15.2291+/- 0.040
 mean_var=80.6959+/-15.723, 0's: 0 Z-trim(112.5): 17 B-trim: 2 in 1/49
 Lambda= 0.142774
 statistics sampled from 13233 (13245) to 13233 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.776), E-opt: 0.2 (0.407), width: 16
 Scan time: 1.760

The best scores are:                                       opt  bits E(32554)
CCDS203.1 PLA2G2D gene_id:26279|Hs108|chr1         ( 145) 1110 237.1 2.9e-63
CCDS201.1 PLA2G2A gene_id:5320|Hs108|chr1          ( 144)  520 115.6 1.1e-26
CCDS202.1 PLA2G5 gene_id:5322|Hs108|chr1           ( 138)  473 105.9 8.8e-24
CCDS204.2 PLA2G2F gene_id:64600|Hs108|chr1         ( 211)  475 106.5 8.9e-24
CCDS72721.1 PLA2G2D gene_id:26279|Hs108|chr1       (  62)  432  97.0 1.8e-21
CCDS200.1 PLA2G2E gene_id:30814|Hs108|chr1         ( 142)  427  96.4 6.4e-21
CCDS10555.1 PLA2G10 gene_id:8399|Hs108|chr16       ( 165)  383  87.4 3.8e-18
CCDS9195.1 PLA2G1B gene_id:5319|Hs108|chr12        ( 148)  325  75.4 1.4e-14
CCDS47919.1 OC90 gene_id:729330|Hs108|chr8         ( 477)  271  64.8 6.9e-11

>>CCDS203.1 PLA2G2D gene_id:26279|Hs108|chr1            (145 aa)
 initn: 1110 init1: 1110 opt: 1110  Z-score: 1249.2  bits: 237.1 E(32554): 2.9e-63
Smith-Waterman score: 1110; 100.0% identity (100.0% similar) in 145 aa overlap (1-145:1-145)

               10        20        30        40        50        60
pF1KE1 MELALLCGLVVMAGVIPIQGGILNLNKMVKQVTGKMPILSYWPYGCHCGLGGRGQPKDAT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 MELALLCGLVVMAGVIPIQGGILNLNKMVKQVTGKMPILSYWPYGCHCGLGGRGQPKDAT
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 DWCCQTHDCCYDHLKTQGCSIYKDYYRYNFSQGNIHCSDKGSWCEQQLCACDKEVAFCLK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 DWCCQTHDCCYDHLKTQGCSIYKDYYRYNFSQGNIHCSDKGSWCEQQLCACDKEVAFCLK
               70        80        90       100       110       120
130 140 pF1KE1 RNLDTYQKRLRFYWRPHCRGQTPGC ::::::::::::::::::::::::: CCDS20 RNLDTYQKRLRFYWRPHCRGQTPGC 130 140 >>CCDS201.1 PLA2G2A gene_id:5320|Hs108|chr1 (144 aa) initn: 495 init1: 329 opt: 520 Z-score: 592.5 bits: 115.6 E(32554): 1.1e-26 Smith-Waterman score: 520; 46.9% identity (73.1% similar) in 145 aa overlap (1-145:1-144) 10 20 30 40 50 60 pF1KE1 MELALLCGLVVMAGVIPIQGGILNLNKMVKQVTGKMPILSYWPYGCHCGLGGRGQPKDAT :. :: ..... :.. .:...:...:.: .::: ::: ::::::.::::.::::: CCDS20 MKTLLLLAVIMIFGLLQAHGNLVNFHRMIKLTTGKEAALSYGFYGCHCGVGGRGSPKDAT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 DWCCQTHDCCYDHLKTQGCSIYKDYYRYNFSQGNIHCSDKGSWCEQQLCACDKEVAFCLK : :: :::::: .:. .::. :... : . : :. . : :..::: ::: .: :. CCDS20 DRCCVTHDCCYKRLEKRGCGTKFLSYKFSNSGSRITCAKQDS-CRSQLCECDKAAATCFA 70 80 90 100 110 130 140 pF1KE1 RNLDTYQKRLRFYWRPHCRGQTPGC :: ::.:. ..: ::::.:: : CCDS20 RNKTTYNKKYQYYSNKHCRGSTPRC 120 130 140 >>CCDS202.1 PLA2G5 gene_id:5322|Hs108|chr1 (138 aa) initn: 463 init1: 343 opt: 473 Z-score: 540.4 bits: 105.9 E(32554): 8.8e-24 Smith-Waterman score: 473; 45.3% identity (75.0% similar) in 128 aa overlap (12-138:11-137) 10 20 30 40 50 pF1KE1 MELALLCGLVVMAGVIP-IQGGILNLNKMVKQVTGKMPILSYWPYGCHCGLGGRGQPKDA .: .: .:::.:.:..:...:::: . .: :::.:: :::: :::. CCDS20 MKGLLPLAWFLACSVPAVQGGLLDLKSMIEKVTGKNALTNYGFYGCYCGWGGRGTPKDG 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 TDWCCQTHDCCYDHLKTQGCSIYKDYYRYNFSQGNIHCSDKGSWCEQQLCACDKEVAFCL ::::: .:: :: .:. .::.: . :.: :. : . : . : .:. .:::::.....:: CCDS20 TDWCCWAHDHCYGRLEEKGCNIRTQSYKYRFAWGVVTC-EPGPFCHVNLCACDRKLVYCL 60 70 80 90 100 110 120 130 140 pF1KE1 KRNLDTYQKRLRFYWRPHCRGQTPGC :::: .:. . ... : CCDS20 KRNLRSYNPQYQYFPNILCS 120 130 >>CCDS204.2 PLA2G2F gene_id:64600|Hs108|chr1 (211 aa) initn: 457 init1: 367 opt: 475 Z-score: 540.4 bits: 106.5 E(32554): 8.9e-24 Smith-Waterman score: 475; 45.4% identity (74.5% similar) in 141 aa overlap (9-145:50-188) 10 20 30 pF1KE1 MELALLCGLVVMAGVI--PIQGGILNLNKMVKQVTGKM ....:: . .:..:::. ::. :::. 
CCDS20 CFSGWRGPRFGASCPSRTSRSSLGMKKFFTVAILAGSVLSTAHGSLLNLKAMVEAVTGRS 20 30 40 50 60 70 40 50 60 70 80 90 pF1KE1 PILSYWPYGCHCGLGGRGQPKDATDWCCQTHDCCYDHLKTQGCSIYKDYYRYNFSQGN-I :::. :::.::::::::::: .::::..:::::..: ::: : :.: ... ... : CCDS20 AILSFVGYGCYCGLGGRGQPKDEVDWCCHAHDCCYQELFDQGCHPYVDHYDHTIENNTEI 80 90 100 110 120 130 100 110 120 130 140 pF1KE1 HCSD-KGSWCEQQLCACDKEVAFCLKRNLDTYQKRLRFYWRPHCRGQTPGC ::: . . :..: : :::....:: . ::... : . .:.: ::.: CCDS20 VCSDLNKTECDKQTCMCDKNMVLCLMNQ--TYREEYRGFLNVYCQGPTPNCSIYEPPPEE 140 150 160 170 180 190 CCDS20 VTCSHQSPAPPAPP 200 210 >>CCDS72721.1 PLA2G2D gene_id:26279|Hs108|chr1 (62 aa) initn: 431 init1: 431 opt: 432 Z-score: 499.0 bits: 97.0 E(32554): 1.8e-21 Smith-Waterman score: 432; 98.4% identity (98.4% similar) in 63 aa overlap (1-63:1-62) 10 20 30 40 50 60 pF1KE1 MELALLCGLVVMAGVIPIQGGILNLNKMVKQVTGKMPILSYWPYGCHCGLGGRGQPKDAT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS72 MELALLCGLVVMAGVIPIQGGILNLNKMVKQVTGKMPILSYWPYGCHCGLGGRGQPKDAT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 DWCCQTHDCCYDHLKTQGCSIYKDYYRYNFSQGNIHCSDKGSWCEQQLCACDKEVAFCLK : : CCDS72 D-C >>CCDS200.1 PLA2G2E gene_id:30814|Hs108|chr1 (142 aa) initn: 351 init1: 210 opt: 427 Z-score: 489.0 bits: 96.4 E(32554): 6.4e-21 Smith-Waterman score: 427; 39.2% identity (67.1% similar) in 143 aa overlap (3-145:7-142) 10 20 30 40 50 pF1KE1 MELALLCGLVVMAGVIPIQGGILNLNKMVKQVTGKMPILSYWPYGCHCGLGGRGQP :..:: ::.. . :...... :....::: :.: :::.::.:: : CCDS20 MKSPHVLVFLCLLVAL-----VTGNLVQFGVMIEKMTGK-SALQYNDYGCYCGIGGSHWP 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 KDATDWCCQTHDCCYDHLKTQGCSIYKDYYRYNFSQGNIHCSDKGSWCEQQLCACDKEVA : :::::..::::: .:. :: . : .. :. .: :. . . :.. : :::..: CCDS20 VDQTDWCCHAHDCCYGRLEKLGCEPKLEKYLFSVSERGIFCAGRTT-CQRLTCECDKRAA 60 70 80 90 100 110 120 130 140 pF1KE1 FCLKRNLDTYQKRLRFYWRPHCRGQTPGC .:..::: ::... 
: : : :: : CCDS20 LCFRRNLGTYNRKYAHYPNKLCTGPTPPC 120 130 140 >>CCDS10555.1 PLA2G10 gene_id:8399|Hs108|chr16 (165 aa) initn: 331 init1: 331 opt: 383 Z-score: 439.3 bits: 87.4 E(32554): 3.8e-18 Smith-Waterman score: 383; 40.8% identity (63.2% similar) in 125 aa overlap (21-145:43-164) 10 20 30 40 50 pF1KE1 MELALLCGLVVMAGVIPIQGGILNLNKMVKQVTGKMPILSYWPYGCHCGL :::.: : : . :: .: ::: ::: CCDS10 LLLLPSLLLLLLLPGPGSGEASRILRVHRRGILELAGTVGCVGPRTPI-AYMKYGCFCGL 20 30 40 50 60 70 60 70 80 90 100 110 pF1KE1 GGRGQPKDATDWCCQTHDCCYDHLKTQGCSIYKDYYRYNFSQGNIHCSDKGSWCEQQLCA ::.:::.:: ::::. ::::: . . ::: . : .. . .. :. . :.. :: CCDS10 GGHGQPRDAIDWCCHGHDCCYTRAEEAGCSPKTERYSWQCVNQSVLCGPAENKCQELLCK 80 90 100 110 120 130 120 130 140 pF1KE1 CDKEVAFCLKRNLDTYQKRLRFYWRPHCRGQTPGC ::.:.: :: .. :. . :: . :. ..: : CCDS10 CDQEIANCLAQT--EYNLKYLFYPQFLCEPDSPKCD 140 150 160 >>CCDS9195.1 PLA2G1B gene_id:5319|Hs108|chr12 (148 aa) initn: 312 init1: 185 opt: 325 Z-score: 375.3 bits: 75.4 E(32554): 1.4e-14 Smith-Waterman score: 325; 40.2% identity (64.4% similar) in 132 aa overlap (1-121:1-130) 10 20 30 40 50 pF1KE1 MELALLCGLVVMA----GVIPIQGGILNLNKMVKQVT-GKMPILSYWPYGCHCGLGGRGQ :.: .: :...: :. : .. .. ::.: : :. :.: : :::.::::: : CCDS91 MKLLVLAVLLTVAAADSGISP--RAVWQFRKMIKCVIPGSDPFLEYNNYGCYCGLGGSGT 10 20 30 40 50 60 70 80 90 100 pF1KE1 PKDATDWCCQTHDCCYDHLKT-QGCSI-----YKDYYRYNFSQGNIHCSDKGSWCEQQLC : : : :::::: :::. : ..:.. : : :. : . : ::.:.. :: .: CCDS91 PVDELDKCCQTHDNCYDQAKKLDSCKFLLDNPYTHTYSYSCSGSAITCSSKNKECEAFIC 60 70 80 90 100 110 110 120 130 140 pF1KE1 ACDKEVAFCLKRNLDTYQKRLRFYWRPHCRGQTPGC ::...:.:... CCDS91 NCDRNAAICFSKAPYNKAHKNLDTKKYCQS 120 130 140 >>CCDS47919.1 OC90 gene_id:729330|Hs108|chr8 (477 aa) initn: 375 init1: 236 opt: 271 Z-score: 308.9 bits: 64.8 E(32554): 6.9e-11 Smith-Waterman score: 271; 32.0% identity (59.0% similar) in 122 aa overlap (25-145:308-425) 10 20 30 40 50 pF1KE1 MELALLCGLVVMAGVIPIQGGILNLNKMVKQVTGKMPILSYWPYGCHCGLGGRG :..:. .:.. : . 
:::.:: ::: CCDS47 NDPEETTEKACDRFTFLHLGSGDNMQVMPQLGEMLFCLTSRCP-EEFESYGCYCGQEGRG 280 290 300 310 320 330 60 70 80 90 100 110 pF1KE1 QPKDATDWCCQTHDCCYDHLKTQGCSIYK-DYYRYNFSQGNIHCSDKGSWCEQQLCACDK .:.: : :: .: :: .... :: . . . . . .:. . : ::. :::::. CCDS47 EPRDDLDRCCLSHHCCLEQVRRLGCLLERLPWSPVVCVDHTPKCGGQ-SLCEKLLCACDQ 340 350 360 370 380 390 120 130 140 pF1KE1 EVAFCLKRNLDTYQKRLRFYWRPHCRGQTPGC .: :. .... :. : : :: .: CCDS47 TAAECMTSA--SFNQSLKSPSRLGCPGQPAACEDSLHPVPAAPTLGSSSEEDSEEDPPQE 400 410 420 430 440 450 CCDS47 DLGRAKRFLRKSLGPLGIGPLHGR 460 470 145 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 03:40:56 2016 done: Mon Nov 7 03:40:57 2016 Total Scan time: 1.760 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]