FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1360, 239 aa 1>>>pF1KE1360 239 - 239 aa - 239 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.9073+/-0.000688; mu= 12.0810+/- 0.042 mean_var=75.0317+/-14.708, 0's: 0 Z-trim(110.1): 9 B-trim: 13 in 1/50 Lambda= 0.148065 statistics sampled from 11373 (11381) to 11373 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.719), E-opt: 0.2 (0.35), width: 16 Scan time: 1.920 The best scores are: opt bits E(32554) CCDS9614.1 PSME2 gene_id:5721|Hs108|chr14 ( 239) 1550 339.8 9.8e-94 CCDS9612.1 PSME1 gene_id:5720|Hs108|chr14 ( 249) 559 128.1 5.4e-30 CCDS61415.1 PSME1 gene_id:5720|Hs108|chr14 ( 233) 486 112.5 2.5e-25 CCDS41930.1 PSME1 gene_id:5720|Hs108|chr14 ( 250) 481 111.4 5.6e-25 CCDS45689.1 PSME3 gene_id:10197|Hs108|chr17 ( 254) 424 99.2 2.6e-21 CCDS59290.1 PSME3 gene_id:10197|Hs108|chr17 ( 265) 424 99.2 2.7e-21 CCDS82133.1 PSME3 gene_id:10197|Hs108|chr17 ( 193) 397 93.4 1.1e-19 CCDS11442.1 PSME3 gene_id:10197|Hs108|chr17 ( 267) 310 74.9 5.9e-14 >>CCDS9614.1 PSME2 gene_id:5721|Hs108|chr14 (239 aa) initn: 1550 init1: 1550 opt: 1550 Z-score: 1796.3 bits: 339.8 E(32554): 9.8e-94 Smith-Waterman score: 1550; 99.6% identity (99.6% similar) in 239 aa overlap (1-239:1-239) 10 20 30 40 50 60 pF1KE1 MAKPCGVRLSGEARKQVEVFRQNLFQEAEEFLYRFLPQKIIYLNQLLQEDSLNVADLTSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS96 MAKPCGVRLSGEARKQVEVFRQNLFQEAEEFLYRFLPQKIIYLNQLLQEDSLNVADLTSL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 RAPLDIPIPDPPPKDDEMETDKQEKKEVPKCGFLPGNEKVLSLLALVKPEVWTLKEKCIL :::::::::::::::::::::::::::: ::::::::::::::::::::::::::::::: CCDS96 RAPLDIPIPDPPPKDDEMETDKQEKKEVHKCGFLPGNEKVLSLLALVKPEVWTLKEKCIL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 VITWIQHLIPKIEDGNDFGVAIQEKVLERVNAVKTKVEAFQTTISKYFSERGDAVAKASK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS96 VITWIQHLIPKIEDGNDFGVAIQEKVLERVNAVKTKVEAFQTTISKYFSERGDAVAKASK 130 140 150 160 170 180 190 200 210 220 230 pF1KE1 ETHVMDYRALVHERDEAAYGELRAMVLDLRAFYAELYHIISSNLEKIVNPKGEEKPSMY ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS96 ETHVMDYRALVHERDEAAYGELRAMVLDLRAFYAELYHIISSNLEKIVNPKGEEKPSMY 190 200 210 220 230 >>CCDS9612.1 PSME1 gene_id:5720|Hs108|chr14 (249 aa) initn: 763 init1: 536 opt: 559 Z-score: 652.0 bits: 128.1 E(32554): 5.4e-30 Smith-Waterman score: 746; 48.4% identity (74.0% similar) in 246 aa overlap (7-239:4-249) 10 20 30 40 50 60 pF1KE1 MAKPCGVRLSGEARKQVEVFRQNLFQEAEEFLYRFLPQKIIYLNQLLQEDSLNVADLTSL .:.. ::. .:.:::..: ..:..: ..:.:: :. .:.: .:: :.:..: CCDS96 MAMLRVQPEAQAKVDVFREDLCTKTENLLGSYFPKKISELDAFLKEPALNEANLSNL 10 20 30 40 50 70 80 90 100 pF1KE1 RAPLDIPIPDPPPKDDEMETDKQEKKEV-------------PKCGFLPGNEKVLSLLALV .::::::.::: . .. : ::..:: : :: . :::.. :: . CCDS96 KAPLDIPVPDPVKEKEKEERKKQQEKEDKDEKKKGEDEDKGPPCGPVNCNEKIVVLLQRL 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE1 KPEVWTLKEKCILVITWIQHLIPKIEDGNDFGVAIQEKVLERVNAVKTKVEAFQTTISKY :::. . :. :: ::.: ::.:::::.::::.::::.: .....::.:.:.: :::: CCDS96 KPEIKDVIEQLNLVTTWLQLQIPRIEDGNNFGVAVQEKVFELMTSLHTKLEGFHTQISKY 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE1 FSERGDAVAKASKETHVMDYRALVHERDEAAYGELRAMVLDLRAFYAELYHIISSNLEKI ::::::::.::.:. :: ::: :::: ::: : ..: ::...: :: :: :: .:.::. CCDS96 FSERGDAVTKAAKQPHVGDYRQLVHELDEAEYRDIRLMVMEIRNAYAVLYDIILKNFEKL 180 190 200 210 220 230 230 pF1KE1 VNPKGEEKPSMY .:.:: : .: CCDS96 KKPRGETKGMIY 240 >>CCDS61415.1 PSME1 gene_id:5720|Hs108|chr14 (233 aa) initn: 685 init1: 458 opt: 486 Z-score: 568.1 bits: 112.5 E(32554): 2.5e-25 Smith-Waterman score: 673; 47.4% identity (73.9% similar) in 230 aa overlap (7-222:4-233) 10 20 30 40 50 60 pF1KE1 MAKPCGVRLSGEARKQVEVFRQNLFQEAEEFLYRFLPQKIIYLNQLLQEDSLNVADLTSL .:.. ::. .:.:::..: ..:..: ..:.:: :. .:.: .:: :.:..: CCDS61 MAMLRVQPEAQAKVDVFREDLCTKTENLLGSYFPKKISELDAFLKEPALNEANLSNL 10 20 30 40 50 70 80 90 100 pF1KE1 RAPLDIPIPDPPPKDDEMETDKQEKKEV-------------PKCGFLPGNEKVLSLLALV .::::::.::: . .. : ::..:: : :: . :::.. :: . CCDS61 KAPLDIPVPDPVKEKEKEERKKQQEKEDKDEKKKGEDEDKGPPCGPVNCNEKIVVLLQRL 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE1 KPEVWTLKEKCILVITWIQHLIPKIEDGNDFGVAIQEKVLERVNAVKTKVEAFQTTISKY :::. . :. :: ::.: ::.:::::.::::.::::.: .....::.:.:.: :::: CCDS61 KPEIKDVIEQLNLVTTWLQLQIPRIEDGNNFGVAVQEKVFELMTSLHTKLEGFHTQISKY 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE1 FSERGDAVAKASKETHVMDYRALVHERDEAAYGELRAMVLDLR-AFYAELYHIISSNLEK ::::::::.::.:. :: ::: :::: ::: : ..: ::...: :. .: .. :: CCDS61 FSERGDAVTKAAKQPHVGDYRQLVHELDEAEYRDIRLMVMEIRNAYVRRLCYMTSS 180 190 200 210 220 230 230 pF1KE1 IVNPKGEEKPSMY >>CCDS41930.1 PSME1 gene_id:5720|Hs108|chr14 (250 aa) initn: 685 init1: 458 opt: 481 Z-score: 561.9 bits: 111.4 E(32554): 5.6e-25 Smith-Waterman score: 668; 48.2% identity (74.1% similar) in 220 aa overlap (7-213:4-223) 10 20 30 40 50 60 pF1KE1 MAKPCGVRLSGEARKQVEVFRQNLFQEAEEFLYRFLPQKIIYLNQLLQEDSLNVADLTSL .:.. ::. .:.:::..: ..:..: ..:.:: :. .:.: .:: :.:..: CCDS41 MAMLRVQPEAQAKVDVFREDLCTKTENLLGSYFPKKISELDAFLKEPALNEANLSNL 10 20 30 40 50 70 80 90 100 pF1KE1 RAPLDIPIPDPPPKDDEMETDKQEKKEV-------------PKCGFLPGNEKVLSLLALV .::::::.::: . .. : ::..:: : :: . :::.. :: . CCDS41 KAPLDIPVPDPVKEKEKEERKKQQEKEDKDEKKKGEDEDKGPPCGPVNCNEKIVVLLQRL 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE1 KPEVWTLKEKCILVITWIQHLIPKIEDGNDFGVAIQEKVLERVNAVKTKVEAFQTTISKY :::. . :. :: ::.: ::.:::::.::::.::::.: .....::.:.:.: :::: CCDS41 KPEIKDVIEQLNLVTTWLQLQIPRIEDGNNFGVAVQEKVFELMTSLHTKLEGFHTQISKY 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE1 FSERGDAVAKASKETHVMDYRALVHERDEAAYGELRAMVLDLRAFYAELYHIISSNLEKI ::::::::.::.:. :: ::: :::: ::: : ..: ::...: : CCDS41 FSERGDAVTKAAKQPHVGDYRQLVHELDEAEYRDIRLMVMEIRNAYVRRQGQGRGGQRQL 180 190 200 210 220 230 230 pF1KE1 VNPKGEEKPSMY CCDS41 SQATHSLTLQARG 240 250 >>CCDS45689.1 PSME3 gene_id:10197|Hs108|chr17 (254 aa) initn: 580 init1: 397 opt: 424 Z-score: 496.0 bits: 99.2 E(32554): 2.6e-21 Smith-Waterman score: 535; 33.6% identity (66.8% similar) in 250 aa overlap (7-239:5-254) 10 20 30 40 50 60 pF1KE1 MAKPCGVRLSGEARKQVEVFRQNLFQEAEEFLYRFLPQKIIYLNQLLQEDSLNVADLTSL .... :.. .:. ::. . .:::... :.:.:.. :...:.: ::. :::.. CCDS45 MASLLKVDQEVKLKVDSFRERITSEAEDLVANFFPKKLLELDSFLKEPILNIHDLTQI 10 20 30 40 50 70 80 90 100 pF1KE1 RAPLDIPIPDP---PPKDDEMETDKQEKKEVPKC--------------GFLPGNEKVLSL .. ...:.::: . : .. .:... .: :.: .:...... CCDS45 HSDMNLPVPDPILLTNSHDGLDGPTYKKRRLDECEEAFQGTKVFVMPNGMLKSNQQLVDI 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE1 LALVKPEVWTLKEKCILVITWIQHLIPKIEDGNDFGVAIQEKVLERVNAVKTKVEAFQTT . ::::. : ::: : :.: :::.:::::.:::.:::... .. .:.... .. CCDS45 IEKVKPEIRLLIEKCNTVKMWVQLLIPRIEDGNNFGVSIQEETVAELRTVESEAASYLDQ 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE1 ISKYFSERGDAVAKASKETHVMDYRALVHERDEAAYGELRAMVLDLRAFYAELYHIISSN ::.:. :. :.: .: :: ::: : : :: : :: .. .:: :. :. .: .: CCDS45 ISRYYITRAKLVSKIAKYPHVEDYRRTVTEIDEKEYISLRLIISELRNQYVTLHDMILKN 180 190 200 210 220 230 230 pF1KE1 LEKIVNPKGEEKPSMY .::: :.. . ..: CCDS45 IEKIKRPRSSNAETLY 240 250 >>CCDS59290.1 PSME3 gene_id:10197|Hs108|chr17 (265 aa) initn: 571 init1: 397 opt: 424 Z-score: 495.7 bits: 99.2 E(32554): 2.7e-21 Smith-Waterman score: 526; 34.4% identity (66.4% similar) in 241 aa overlap (16-239:25-265) 10 20 30 40 50 pF1KE1 MAKPCGVRLSGEARKQVEVFRQNLFQEAEEFLYRFLPQKIIYLNQLLQEDS .:. ::. . .:::... :.:.:.. :...:.: CCDS59 MEKWILKKIKYLQSGGLSASYYSYKVDSFRERITSEAEDLVANFFPKKLLELDSFLKEPI 10 20 30 40 50 60 60 70 80 90 pF1KE1 LNVADLTSLRAPLDIPIPDP---PPKDDEMETDKQEKKEVPKC--------------GFL ::. :::.... ...:.::: . : .. .:... .: :.: CCDS59 LNIHDLTQIHSDMNLPVPDPILLTNSHDGLDGPTYKKRRLDECEEAFQGTKVFVMPNGML 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE1 PGNEKVLSLLALVKPEVWTLKEKCILVITWIQHLIPKIEDGNDFGVAIQEKVLERVNAVK .:....... ::::. : ::: : :.: :::.:::::.:::.:::... .. .:. CCDS59 KSNQQLVDIIEKVKPEIRLLIEKCNTVKMWVQLLIPRIEDGNNFGVSIQEETVAELRTVE 130 140 150 160 170 180 160 170 180 190 200 210 pF1KE1 TKVEAFQTTISKYFSERGDAVAKASKETHVMDYRALVHERDEAAYGELRAMVLDLRAFYA ... .. ::.:. :. :.: .: :: ::: : : :: : :: .. .:: :. CCDS59 SEAASYLDQISRYYITRAKLVSKIAKYPHVEDYRRTVTEIDEKEYISLRLIISELRNQYV 190 200 210 220 230 240 220 230 pF1KE1 ELYHIISSNLEKIVNPKGEEKPSMY :. .: .:.::: :.. . ..: CCDS59 TLHDMILKNIEKIKRPRSSNAETLY 250 260 >>CCDS82133.1 PSME3 gene_id:10197|Hs108|chr17 (193 aa) initn: 450 init1: 397 opt: 397 Z-score: 466.7 bits: 93.4 E(32554): 1.1e-19 Smith-Waterman score: 405; 34.2% identity (63.2% similar) in 193 aa overlap (64-239:1-193) 40 50 60 70 80 90 pF1KE1 RFLPQKIIYLNQLLQEDSLNVADLTSLRAPLDIPIPDP---PPKDDEMETDKQEKKEVPK ...:.::: . : .. .:... . CCDS82 MNLPVPDPILLTNSHDGLDGPTYKKRRLDE 10 20 30 100 110 120 130 pF1KE1 C--------------GFLPGNEKVLSLLALVKPEVWTLKEKCILVITWIQHLIPKIEDGN : :.: .:....... ::::. : ::: : :.: :::.::::: CCDS82 CEEAFQGTKVFVMPNGMLKSNQQLVDIIEKVKPEIRLLIEKCNTVKMWVQLLIPRIEDGN 40 50 60 70 80 90 140 150 160 170 180 190 pF1KE1 DFGVAIQEKVLERVNAVKTKVEAFQTTISKYFSERGDAVAKASKETHVMDYRALVHERDE .:::.:::... .. .:.... .. ::.:. :. :.: .: :: ::: : : :: CCDS82 NFGVSIQEETVAELRTVESEAASYLDQISRYYITRAKLVSKIAKYPHVEDYRRTVTEIDE 100 110 120 130 140 150 200 210 220 230 pF1KE1 AAYGELRAMVLDLRAFYAELYHIISSNLEKIVNPKGEEKPSMY : :: .. .:: :. :. .: .:.::: :.. . ..: CCDS82 KEYISLRLIISELRNQYVTLHDMILKNIEKIKRPRSSNAETLY 160 170 180 190 >>CCDS11442.1 PSME3 gene_id:10197|Hs108|chr17 (267 aa) initn: 469 init1: 286 opt: 310 Z-score: 364.0 bits: 74.9 E(32554): 5.9e-14 Smith-Waterman score: 503; 31.9% identity (63.9% similar) in 263 aa overlap (7-239:5-267) 10 20 30 40 50 60 pF1KE1 MAKPCGVRLSGEARKQVEVFRQNLFQEAEEFLYRFLPQKIIYLNQLLQEDSLNVADLTSL .... :.. .:. ::. . .:::... :.:.:.. :...:.: ::. :::.. CCDS11 MASLLKVDQEVKLKVDSFRERITSEAEDLVANFFPKKLLELDSFLKEPILNIHDLTQI 10 20 30 40 50 70 80 90 100 pF1KE1 RAPLDIPIPDP---PPKDDEMETDKQEKKEVPKC--------------GFLPGNEKVLSL .. ...:.::: . : .. .:... .: :.: .:...... CCDS11 HSDMNLPVPDPILLTNSHDGLDGPTYKKRRLDECEEAFQGTKVFVMPNGMLKSNQQLVDI 60 70 80 90 100 110 110 120 130 140 150 pF1KE1 LALVKPEVWTLKEKC-------------ILVITWIQHLIPKIEDGNDFGVAIQEKVLERV . ::::. : ::: . : :.: :::.:::::.:::.:::... .. CCDS11 IEKVKPEIRLLIEKCNTPSGKGPHICFDLQVKMWVQLLIPRIEDGNNFGVSIQEETVAEL 120 130 140 150 160 170 160 170 180 190 200 210 pF1KE1 NAVKTKVEAFQTTISKYFSERGDAVAKASKETHVMDYRALVHERDEAAYGELRAMVLDLR .:.... .. ::.:. :. :.: .: :: ::: : : :: : :: .. .:: CCDS11 RTVESEAASYLDQISRYYITRAKLVSKIAKYPHVEDYRRTVTEIDEKEYISLRLIISELR 180 190 200 210 220 230 220 230 pF1KE1 AFYAELYHIISSNLEKIVNPKGEEKPSMY :. :. .: .:.::: :.. . ..: CCDS11 NQYVTLHDMILKNIEKIKRPRSSNAETLY 240 250 260 239 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 02:28:23 2016 done: Mon Nov 7 02:28:23 2016 Total Scan time: 1.920 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]