FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3959, 507 aa 1>>>pF1KE3959 507 - 507 aa - 507 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.1893+/-0.000793; mu= 12.1026+/- 0.048 mean_var=169.3289+/-34.174, 0's: 0 Z-trim(114.4): 14 B-trim: 0 in 0/52 Lambda= 0.098562 statistics sampled from 14968 (14981) to 14968 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.777), E-opt: 0.2 (0.46), width: 16 Scan time: 3.340 The best scores are: opt bits E(32554) CCDS10013.1 MKRN3 gene_id:7681|Hs108|chr15 ( 507) 3585 521.6 8e-148 CCDS5860.1 MKRN1 gene_id:23608|Hs108|chr7 ( 482) 1441 216.7 4.6e-56 CCDS47725.1 MKRN1 gene_id:23608|Hs108|chr7 ( 329) 925 143.2 4.3e-34 CCDS63545.1 MKRN2 gene_id:23609|Hs108|chr3 ( 373) 903 140.1 4.1e-33 CCDS33702.1 MKRN2 gene_id:23609|Hs108|chr3 ( 416) 900 139.7 5.9e-33 >>CCDS10013.1 MKRN3 gene_id:7681|Hs108|chr15 (507 aa) initn: 3585 init1: 3585 opt: 3585 Z-score: 2767.4 bits: 521.6 E(32554): 8e-148 Smith-Waterman score: 3585; 100.0% identity (100.0% similar) in 507 aa overlap (1-507:1-507) 10 20 30 40 50 60 pF1KE3 MEEPAAPSEAHEAAGAQAGAEAAREGVSGPDLPVCEPSGESAAPDSALPHAARGWAPFPV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MEEPAAPSEAHEAAGAQAGAEAAREGVSGPDLPVCEPSGESAAPDSALPHAARGWAPFPV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 APVPAHLRRGGLRPAPASGGGAWPSPLPSRSSGIWTKQIICRYYIHGQCKEGENCRYSHD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 APVPAHLRRGGLRPAPASGGGAWPSPLPSRSSGIWTKQIICRYYIHGQCKEGENCRYSHD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 LSGRKMATEGGVSPPGASAGGGPSTAAHIEPPTQEVAEAPPAASSLSLPVIGSAAERGFF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LSGRKMATEGGVSPPGASAGGGPSTAAHIEPPTQEVAEAPPAASSLSLPVIGSAAERGFF 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 EAERDNADRGAAGGAGVESWADAIEFVPGQPYRGRWVASAPEAPLQSSETERKQMAVGSG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 EAERDNADRGAAGGAGVESWADAIEFVPGQPYRGRWVASAPEAPLQSSETERKQMAVGSG 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 LRFCYYASRGVCFRGESCMYLHGDICDMCGLQTLHPMDAAQREEHMRACIEAHEKDMELS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LRFCYYASRGVCFRGESCMYLHGDICDMCGLQTLHPMDAAQREEHMRACIEAHEKDMELS 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE3 FAVQRGMDKVCGICMEVVYEKANPNDRRFGILSNCNHSFCIRCIRRWRSARQFENRIVKS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 FAVQRGMDKVCGICMEVVYEKANPNDRRFGILSNCNHSFCIRCIRRWRSARQFENRIVKS 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE3 CPQCRVTSELVIPSEFWVEEEEEKQKLIQQYKEAMSNKACRYFAEGRGNCPFGDTCFYKH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 CPQCRVTSELVIPSEFWVEEEEEKQKLIQQYKEAMSNKACRYFAEGRGNCPFGDTCFYKH 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE3 EYPEGWGDEPPGPGGGSFSAYWHQLVEPVRMGEGNMLYKSIKKELVVLRLASLLFKRFLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 EYPEGWGDEPPGPGGGSFSAYWHQLVEPVRMGEGNMLYKSIKKELVVLRLASLLFKRFLS 430 440 450 460 470 480 490 500 pF1KE3 LRDELPFSEDQWDLLHYELEEYFNLIL ::::::::::::::::::::::::::: CCDS10 LRDELPFSEDQWDLLHYELEEYFNLIL 490 500 >>CCDS5860.1 MKRN1 gene_id:23608|Hs108|chr7 (482 aa) initn: 1572 init1: 1126 opt: 1441 Z-score: 1120.0 bits: 216.7 E(32554): 4.6e-56 Smith-Waterman score: 1619; 51.5% identity (71.1% similar) in 505 aa overlap (20-507:2-482) 10 20 30 40 50 60 pF1KE3 MEEPAAPSEAHEAAGAQAGAEAAREGVSGPDLPVCEPSGESAAPDSALPHAARGWAPFPV :::: :... :: .:: .: :: . .:.:. CCDS58 MAEAATPGTTATT------SGAGAAAATA---AAASPTPIPT 10 20 30 70 80 90 100 110 120 pF1KE3 APVPAHLRRGGLRPAPASGGGAWPSPLPSRSSGIWTKQIICRYYIHGQCKEGENCRYSHD . .:. : :: .:::. . :.: ::::. :::..:: ::::.::::::: CCDS58 VTAPS-LGAGG------GGGGS------DGSGGGWTKQVTCRYFMHGVCKEGDNCRYSHD 40 50 60 70 80 130 140 150 160 170 pF1KE3 LSGRKMATEGGVSPPGASAGGGPSTAAHIEPPTQEVAEAPP--------AASSLSLPVIG :: ... : : : .: :: : : :.:::: ..: CCDS58 LSDSPYSVVCKYFQRGYCIYGDRCRYEHSKPLKQEEATATELTTKSSLAASSSLS-SIVG 90 100 110 120 130 180 190 200 210 220 pF1KE3 SAAERGFFEAERDNADRGAAGGAGVESWADAIEFVPGQPYRGRWVASAPEAPLQSS---- .: . ::: :.. :. ::: :.:..:::::::::: :: . : :::::.: CCDS58 PLVEMNTGEAESRNSNF-ATVGAGSEDWVNAIEFVPGQPYCGRTAPSCTEAPLQGSVTKE 140 150 160 170 180 190 230 240 250 260 270 280 pF1KE3 ETERKQMAVGSGLRFCYYASRGVCFRGESCMYLHGDICDMCGLQTLHPMDAAQREEHMRA :.:..: :: . ..: ::. : : ::.:.::::: :::::::.::::::::: .:... CCDS58 ESEKEQTAVETKKQLCPYAAVGECRYGENCVYLHGDSCDMCGLQVLHPMDAAQRSQHIKS 200 210 220 230 240 250 290 300 310 320 330 340 pF1KE3 CIEAHEKDMELSFAVQRGMDKVCGICMEVVYEKANPNDRRFGILSNCNHSFCIRCIRRWR :::::::::::::::::. : :::::::::::::::..:::::::::::..:..:::.:: CCDS58 CIEAHEKDMELSFAVQRSKDMVCGICMEVVYEKANPSERRFGILSNCNHTYCLKCIRKWR 260 270 280 290 300 310 350 360 370 380 390 400 pF1KE3 SARQFENRIVKSCPQCRVTSELVIPSEFWVEEEEEKQKLIQQYKEAMSNKACRYFAEGRG ::.:::..:.::::.::.::..:::::.::::.::::::: .::::::::::::: :::: CCDS58 SAKQFESKIIKSCPECRITSNFVIPSEYWVEEKEEKQKLILKYKEAMSNKACRYFDEGRG 320 330 340 350 360 370 410 420 430 440 450 460 pF1KE3 NCPFGDTCFYKHEYPEGWGDEPPGPGGGSFSAYWHQ----LVEPVRMGEGNMLYKSIKKE .:::: .::::: ::.: .:: :. : : : . : .. :.. . . ..: CCDS58 SCPFGGNCFYKHAYPDGRREEPQRQKVGTSSRYRAQRRNHFWELIEERENSNPFDNDEEE 380 390 400 410 420 430 470 480 490 500 pF1KE3 LVVLRLASLLFKRFLSL-RDELPFSEDQWDLLHYELEEYFNLIL .:...:. .:. . . ::: :::.:::.: :::....: : CCDS58 VVTFELGEMLLMLLAAGGDDELTDSEDEWDLFHDELEDFYDLDL 440 450 460 470 480 >>CCDS47725.1 MKRN1 gene_id:23608|Hs108|chr7 (329 aa) initn: 1042 init1: 687 opt: 925 Z-score: 725.6 bits: 143.2 E(32554): 4.3e-34 Smith-Waterman score: 1103; 51.4% identity (69.3% similar) in 352 aa overlap (20-359:2-329) 10 20 30 40 50 60 pF1KE3 MEEPAAPSEAHEAAGAQAGAEAAREGVSGPDLPVCEPSGESAAPDSALPHAARGWAPFPV :::: :... :: .:: .: :: . .:.:. CCDS47 MAEAATPGTTATT------SGAGAAAATA---AAASPTPIPT 10 20 30 70 80 90 100 110 120 pF1KE3 APVPAHLRRGGLRPAPASGGGAWPSPLPSRSSGIWTKQIICRYYIHGQCKEGENCRYSHD . .:. : :: .:::. . :.: ::::. :::..:: ::::.::::::: CCDS47 VTAPS-LGAGG------GGGGS------DGSGGGWTKQVTCRYFMHGVCKEGDNCRYSHD 40 50 60 70 80 130 140 150 160 170 pF1KE3 LSGRKMATEGGVSPPGASAGGGPSTAAHIEPPTQEVAEAPP--------AASSLSLPVIG :: ... : : : .: :: : : :.:::: ..: CCDS47 LSDSPYSVVCKYFQRGYCIYGDRCRYEHSKPLKQEEATATELTTKSSLAASSSLS-SIVG 90 100 110 120 130 180 190 200 210 220 pF1KE3 SAAERGFFEAERDNADRGAAGGAGVESWADAIEFVPGQPYRGRWVASAPEAPLQSS---- .: . ::: :.. :. ::: :.:..:::::::::: :: . : :::::.: CCDS47 PLVEMNTGEAESRNSNF-ATVGAGSEDWVNAIEFVPGQPYCGRTAPSCTEAPLQGSVTKE 140 150 160 170 180 190 230 240 250 260 270 280 pF1KE3 ETERKQMAVGSGLRFCYYASRGVCFRGESCMYLHGDICDMCGLQTLHPMDAAQREEHMRA :.:..: :: . ..: ::. : : ::.:.::::: :::::::.::::::::: .:... CCDS47 ESEKEQTAVETKKQLCPYAAVGECRYGENCVYLHGDSCDMCGLQVLHPMDAAQRSQHIKS 200 210 220 230 240 250 290 300 310 320 330 340 pF1KE3 CIEAHEKDMELSFAVQRGMDKVCGICMEVVYEKANPNDRRFGILSNCNHSFCIRCIRRWR :::::::::::::::::. : :::::::::::::::..:::::::::::..:..:::.:: CCDS47 CIEAHEKDMELSFAVQRSKDMVCGICMEVVYEKANPSERRFGILSNCNHTYCLKCIRKWR 260 270 280 290 300 310 350 360 370 380 390 400 pF1KE3 SARQFENRIVKSCPQCRVTSELVIPSEFWVEEEEEKQKLIQQYKEAMSNKACRYFAEGRG ::.:::..:.: CCDS47 SAKQFESKIIK 320 >>CCDS63545.1 MKRN2 gene_id:23609|Hs108|chr3 (373 aa) initn: 956 init1: 864 opt: 903 Z-score: 708.0 bits: 140.1 E(32554): 4.1e-33 Smith-Waterman score: 903; 41.6% identity (69.4% similar) in 320 aa overlap (121-433:1-317) 100 110 120 130 140 150 pF1KE3 SSGIWTKQIICRYYIHGQCKEGENCRYSHDLSGRKMATEGGVSPPGASAGGGPSTAAHIE .: .... . . :.:.:::. .: :: CCDS63 MSTKQITCRYDHTRPSAAAGGAVGTMAHSV 10 20 30 160 170 180 190 200 pF1KE3 PPTQEVAEAPPAASSLSLPVIGSAAERGFFEAERDNA--DRGAAGGAGVESWADAIEFVP : . ::. . :. : .. : : : .: . ::. .: : .. . . : CCDS63 PSPAFHSPHPPSEVTASI-VKTNSHEPGKRE-KRTLVLRDRNLSGMAERKTQPSMVS-NP 40 50 60 70 80 210 220 230 240 250 260 pF1KE3 G-----QPYRGRWVASAPEAPLQSSETERKQMAVGSGLRFCYYASRGVCFRGESCMYLHG : :: : .: .. . . . . .. ..: ::. : : :..:.:::: CCDS63 GSCSDPQPSPEMKPHSYLDAIRSGLDDVEASSSYSNEQQLCPYAAAGECRFGDACVYLHG 90 100 110 120 130 140 270 280 290 300 310 320 pF1KE3 DICDMCGLQTLHPMDAAQREEHMRACIEAHEKDMELSFAVQRGMDKVCGICMEVVYEKAN ..:..: ::.:::.: ::. : . :. . :..:: .:: : ..::::.:::::. :::. CCDS63 EVCEICRLQVLHPFDPEQRKAHEKICMLTFEHEMEKAFAFQASQDKVCSICMEVILEKAS 150 160 170 180 190 200 330 340 350 360 370 380 pF1KE3 PNDRRFGILSNCNHSFCIRCIRRWRSARQFENRIVKSCPQCRVTSELVIPSEFWVEEEEE ..:::::::::::..:. :::.:: :.:::: :.::::.::: ::.:::: .:::.... CCDS63 ASERRFGILSNCNHTYCLSCIRQWRCAKQFENPIIKSCPECRVISEFVIPSVYWVEDQNK 210 220 230 240 250 260 390 400 410 420 430 440 pF1KE3 KQKLIQQYKEAMSNKACRYFAEGRGNCPFGDTCFYKHEYPEGWGDEPPGPGGGSFSAYWH :..::. .:..:..:::.:: .:.:.::::. :.:.: ::.: :: : CCDS63 KNELIEAFKQGMGKKACKYFEQGKGTCPFGSKCLYRHAYPDGRLAEPEKPRKQLSSQGTV 270 280 290 300 310 320 450 460 470 480 490 500 pF1KE3 QLVEPVRMGEGNMLYKSIKKELVVLRLASLLFKRFLSLRDELPFSEDQWDLLHYELEEYF CCDS63 RFFNSVRLWDFIENRESRHVPNNEDVDMTELGDLFMHLSGVESSEP 330 340 350 360 370 >>CCDS33702.1 MKRN2 gene_id:23609|Hs108|chr3 (416 aa) initn: 1057 init1: 864 opt: 900 Z-score: 705.1 bits: 139.7 E(32554): 5.9e-33 Smith-Waterman score: 1005; 42.1% identity (67.6% similar) in 361 aa overlap (96-433:3-360) 70 80 90 100 110 120 pF1KE3 HLRRGGLRPAPASGGGAWPSPLPSRSSGIWTKQIICRYYIHGQCKEGENCRYSHDLSGRK :::: :::..:: :.:: .: .::::.. : CCDS33 MSTKQITCRYFMHGVCREGSQCLFSHDLANSK 10 20 30 130 140 150 160 pF1KE3 MAT-----EGGV-----------SPPGASAGGGPSTAAHIEPPTQEVAEAPPAASSLSLP .: . : . :.:.:::. .: :: : . ::. . :. CCDS33 PSTICKYYQKGYCAYGTRCRYDHTRPSAAAGGAVGTMAHSVPSPAFHSPHPPSEVTASI- 40 50 60 70 80 90 170 180 190 200 210 220 pF1KE3 VIGSAAERGFFEAERDNA--DRGAAGGAGVESWADAIEFVPG-----QPYRGRWVASAPE : .. : : : .: . ::. .: : .. . . :: :: : . CCDS33 VKTNSHEPGKRE-KRTLVLRDRNLSGMAERKTQPSMVS-NPGSCSDPQPSPEMKPHSYLD 100 110 120 130 140 230 240 250 260 270 280 pF1KE3 APLQSSETERKQMAVGSGLRFCYYASRGVCFRGESCMYLHGDICDMCGLQTLHPMDAAQR : .. . . . . .. ..: ::. : : :..:.::::..:..: ::.:::.: :: CCDS33 AIRSGLDDVEASSSYSNEQQLCPYAAAGECRFGDACVYLHGEVCEICRLQVLHPFDPEQR 150 160 170 180 190 200 290 300 310 320 330 340 pF1KE3 EEHMRACIEAHEKDMELSFAVQRGMDKVCGICMEVVYEKANPNDRRFGILSNCNHSFCIR . : . :. . :..:: .:: : ..::::.:::::. :::. ..:::::::::::..:. CCDS33 KAHEKICMLTFEHEMEKAFAFQASQDKVCSICMEVILEKASASERRFGILSNCNHTYCLS 210 220 230 240 250 260 350 360 370 380 390 400 pF1KE3 CIRRWRSARQFENRIVKSCPQCRVTSELVIPSEFWVEEEEEKQKLIQQYKEAMSNKACRY :::.:: :.:::: :.::::.::: ::.:::: .:::....:..::. .:..:..:::.: CCDS33 CIRQWRCAKQFENPIIKSCPECRVISEFVIPSVYWVEDQNKKNELIEAFKQGMGKKACKY 270 280 290 300 310 320 410 420 430 440 450 460 pF1KE3 FAEGRGNCPFGDTCFYKHEYPEGWGDEPPGPGGGSFSAYWHQLVEPVRMGEGNMLYKSIK : .:.:.::::. :.:.: ::.: :: : CCDS33 FEQGKGTCPFGSKCLYRHAYPDGRLAEPEKPRKQLSSQGTVRFFNSVRLWDFIENRESRH 330 340 350 360 370 380 470 480 490 500 pF1KE3 KELVVLRLASLLFKRFLSLRDELPFSEDQWDLLHYELEEYFNLIL CCDS33 VPNNEDVDMTELGDLFMHLSGVESSEP 390 400 410 507 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 08:33:18 2016 done: Sun Nov 6 08:33:19 2016 Total Scan time: 3.340 Total Display time: 0.040 Function used was FASTA [36.3.4 Apr, 2011]