FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3912, 416 aa 1>>>pF1KE3912 416 - 416 aa - 416 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.8530+/-0.000841; mu= 11.2430+/- 0.051 mean_var=125.0800+/-25.200, 0's: 0 Z-trim(110.9): 25 B-trim: 0 in 0/54 Lambda= 0.114678 statistics sampled from 11935 (11954) to 11935 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.732), E-opt: 0.2 (0.367), width: 16 Scan time: 3.080 The best scores are: opt bits E(32554) CCDS33702.1 MKRN2 gene_id:23609|Hs108|chr3 ( 416) 2927 495.2 4.7e-140 CCDS63545.1 MKRN2 gene_id:23609|Hs108|chr3 ( 373) 2563 435.0 5.8e-122 CCDS5860.1 MKRN1 gene_id:23608|Hs108|chr7 ( 482) 1038 182.8 6.3e-46 CCDS10013.1 MKRN3 gene_id:7681|Hs108|chr15 ( 507) 900 159.9 4.9e-39 CCDS47725.1 MKRN1 gene_id:23608|Hs108|chr7 ( 329) 609 111.7 1.1e-24 >>CCDS33702.1 MKRN2 gene_id:23609|Hs108|chr3 (416 aa) initn: 2927 init1: 2927 opt: 2927 Z-score: 2628.0 bits: 495.2 E(32554): 4.7e-140 Smith-Waterman score: 2927; 99.8% identity (100.0% similar) in 416 aa overlap (1-416:1-416) 10 20 30 40 50 60 pF1KE3 MSTKQITCRYFMHGVCREGSQCLFSHDLANSKPSTICKYYQKGYCAYGTRCRYDHTRPSA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 MSTKQITCRYFMHGVCREGSQCLFSHDLANSKPSTICKYYQKGYCAYGTRCRYDHTRPSA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 AAGGAVGTMAHSVPSPAFHSPHPPSEVTASIVKTNSHEPGKREKRTLVLRDRNLSGMAER :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 AAGGAVGTMAHSVPSPAFHSPHPPSEVTASIVKTNSHEPGKREKRTLVLRDRNLSGMAER 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 KTQPSMVSNPGSCSDPQPSPEMKPHSYLDAIRSGLDDVEASSSYSNEQQLCPYAAAGECR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 KTQPSMVSNPGSCSDPQPSPEMKPHSYLDAIRSGLDDVEASSSYSNEQQLCPYAAAGECR 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 FGDACVYLHGEVCEICRLQVLHPFDPEQRKAHEKICMLTFEHEMEKAFAFQANQDKVCSI ::::::::::::::::::::::::::::::::::::::::::::::::::::.::::::: CCDS33 FGDACVYLHGEVCEICRLQVLHPFDPEQRKAHEKICMLTFEHEMEKAFAFQASQDKVCSI 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 CMEVILEKASASERRFGILSNCNHTYCLSCIRQWRCAKQFENPIIKSCPECRVISEFVIP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 CMEVILEKASASERRFGILSNCNHTYCLSCIRQWRCAKQFENPIIKSCPECRVISEFVIP 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE3 SVYWVEDQNKKNELIEAFKQGMGKKACKYFEQGKGTCPFGSKCLYRHAYPDGRLAEPEKP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 SVYWVEDQNKKNELIEAFKQGMGKKACKYFEQGKGTCPFGSKCLYRHAYPDGRLAEPEKP 310 320 330 340 350 360 370 380 390 400 410 pF1KE3 RKQLSSQGTVRFFNSVRLWDFIENRESRHVPNNEDVDMTELGDLFMHLSGVESSEP :::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 RKQLSSQGTVRFFNSVRLWDFIENRESRHVPNNEDVDMTELGDLFMHLSGVESSEP 370 380 390 400 410 >>CCDS63545.1 MKRN2 gene_id:23609|Hs108|chr3 (373 aa) initn: 2563 init1: 2563 opt: 2563 Z-score: 2303.2 bits: 435.0 E(32554): 5.8e-122 Smith-Waterman score: 2563; 99.7% identity (100.0% similar) in 366 aa overlap (51-416:8-373) 30 40 50 60 70 80 pF1KE3 QCLFSHDLANSKPSTICKYYQKGYCAYGTRCRYDHTRPSAAAGGAVGTMAHSVPSPAFHS :::::::::::::::::::::::::::::: CCDS63 MSTKQITCRYDHTRPSAAAGGAVGTMAHSVPSPAFHS 10 20 30 90 100 110 120 130 140 pF1KE3 PHPPSEVTASIVKTNSHEPGKREKRTLVLRDRNLSGMAERKTQPSMVSNPGSCSDPQPSP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 PHPPSEVTASIVKTNSHEPGKREKRTLVLRDRNLSGMAERKTQPSMVSNPGSCSDPQPSP 40 50 60 70 80 90 150 160 170 180 190 200 pF1KE3 EMKPHSYLDAIRSGLDDVEASSSYSNEQQLCPYAAAGECRFGDACVYLHGEVCEICRLQV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 EMKPHSYLDAIRSGLDDVEASSSYSNEQQLCPYAAAGECRFGDACVYLHGEVCEICRLQV 100 110 120 130 140 150 210 220 230 240 250 260 pF1KE3 LHPFDPEQRKAHEKICMLTFEHEMEKAFAFQANQDKVCSICMEVILEKASASERRFGILS ::::::::::::::::::::::::::::::::.::::::::::::::::::::::::::: CCDS63 LHPFDPEQRKAHEKICMLTFEHEMEKAFAFQASQDKVCSICMEVILEKASASERRFGILS 160 170 180 190 200 210 270 280 290 300 310 320 pF1KE3 NCNHTYCLSCIRQWRCAKQFENPIIKSCPECRVISEFVIPSVYWVEDQNKKNELIEAFKQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 NCNHTYCLSCIRQWRCAKQFENPIIKSCPECRVISEFVIPSVYWVEDQNKKNELIEAFKQ 220 230 240 250 260 270 330 340 350 360 370 380 pF1KE3 GMGKKACKYFEQGKGTCPFGSKCLYRHAYPDGRLAEPEKPRKQLSSQGTVRFFNSVRLWD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 GMGKKACKYFEQGKGTCPFGSKCLYRHAYPDGRLAEPEKPRKQLSSQGTVRFFNSVRLWD 280 290 300 310 320 330 390 400 410 pF1KE3 FIENRESRHVPNNEDVDMTELGDLFMHLSGVESSEP :::::::::::::::::::::::::::::::::::: CCDS63 FIENRESRHVPNNEDVDMTELGDLFMHLSGVESSEP 340 350 360 370 >>CCDS5860.1 MKRN1 gene_id:23608|Hs108|chr7 (482 aa) initn: 1285 init1: 975 opt: 1038 Z-score: 938.0 bits: 182.8 E(32554): 6.3e-46 Smith-Waterman score: 1329; 46.8% identity (73.3% similar) in 408 aa overlap (3-408:56-451) 10 20 30 pF1KE3 MSTKQITCRYFMHGVCREGSQCLFSHDLANSK :::.::::::::::.::..: .::::..: CCDS58 ASPTPIPTVTAPSLGAGGGGGGSDGSGGGWTKQVTCRYFMHGVCKEGDNCRYSHDLSDSP 30 40 50 60 70 80 40 50 60 70 80 90 pF1KE3 PSTICKYYQKGYCAYGTRCRYDHTRPSAAAGGAVGTMAHSVPSPAFHSPHPPSEVTASIV :..:::.:.::: :: ::::.:..: :..: . : : : : ... .: CCDS58 YSVVCKYFQRGYCIYGDRCRYEHSKPLKQEE-ATATELTTKSSLAASSSL--SSIVGPLV 90 100 110 120 130 140 100 110 120 130 140 150 pF1KE3 KTNSHEPGKREKRTLVLRDRNLSGMAERKTQPSMVSNPGSCSDPQPSPEMKPHSYLDAIR . :. : .:.. .. .: . . .: . :. :: : :.. CCDS58 EMNTGEAESRNSNFATVG----AGSEDWVNAIEFVPGQPYCGRTAPSCTEAP---LQGSV 150 160 170 180 190 160 170 180 190 200 210 pF1KE3 SGLDDVEASSSYSNEQQLCPYAAAGECRFGDACVYLHGEVCEICRLQVLHPFDPEQRKAH . .. . ... ...:::::::.::::.:. ::::::. :..: ::::::.: ::. : CCDS58 TKEESEKEQTAVETKKQLCPYAAVGECRYGENCVYLHGDSCDMCGLQVLHPMDAAQRSQH 200 210 220 230 240 250 220 230 240 250 260 270 pF1KE3 EKICMLTFEHEMEKAFAFQANQDKVCSICMEVILEKASASERRFGILSNCNHTYCLSCIR : :. . :..:: .:: : ..: ::.:::::. :::. :::::::::::::::::.::: CCDS58 IKSCIEAHEKDMELSFAVQRSKDMVCGICMEVVYEKANPSERRFGILSNCNHTYCLKCIR 260 270 280 290 300 310 280 290 300 310 320 330 pF1KE3 QWRCAKQFENPIIKSCPECRVISEFVIPSVYWVEDQNKKNELIEAFKQGMGKKACKYFEQ .:: :::::. :::::::::. :.::::: ::::....:..:: .:..:..:::.::.. CCDS58 KWRSAKQFESKIIKSCPECRITSNFVIPSEYWVEEKEEKQKLILKYKEAMSNKACRYFDE 320 330 340 350 360 370 340 350 360 370 380 390 pF1KE3 GKGTCPFGSKCLYRHAYPDGRLAEPEKPRKQLSSQGTVRFFNSVRLWDFIENRESRHVPN :.:.::::..:.:.::::::: ::.. . ::. .. : ..:..::.::. . . CCDS58 GRGSCPFGGNCFYKHAYPDGRREEPQRQKVGTSSRYRAQRRN--HFWELIEERENSNPFD 380 390 400 410 420 430 400 410 pF1KE3 N--EDVDMTELGDLFMHLSGVESSEP : :.: :::.... : CCDS58 NDEEEVVTFELGEMLLMLLAAGGDDELTDSEDEWDLFHDELEDFYDLDL 440 450 460 470 480 >>CCDS10013.1 MKRN3 gene_id:7681|Hs108|chr15 (507 aa) initn: 1057 init1: 864 opt: 900 Z-score: 814.3 bits: 159.9 E(32554): 4.9e-39 Smith-Waterman score: 1005; 41.9% identity (67.2% similar) in 360 aa overlap (3-360:96-433) 10 20 30 pF1KE3 MSTKQITCRYFMHGVCREGSQCLFSHDLANSK :::: :::..:: :.:: .: .::::.. : CCDS10 HLRRGGLRPAPASGGGAWPSPLPSRSSGIWTKQIICRYYIHGQCKEGENCRYSHDLSGRK 70 80 90 100 110 120 40 50 60 70 80 90 pF1KE3 PSTICKYYQKGYCAYGTRCRYDHTRPSAAAGGAVGTMAHSVPSPAFHSPHPPSEVTASI- .: . : . :.:.:::. .: :: : . ::. . :. CCDS10 MAT-----EGGV-----------SPPGASAGGGPSTAAHIEPPTQEVAEAPPAASSLSLP 130 140 150 160 100 110 120 130 140 150 pF1KE3 VKTNSHEPGKREKRTLVLRDRNLSGMAERKTQPSMVS-NPGSCSDPQPSPEMKPHSYLDA : .. : : : . ::. .: : .. . . :: :: : .: CCDS10 VIGSAAERGFFEAERDNA-DRGAAGGAGVESWADAIEFVPG-----QPYRGRWVASAPEA 170 180 190 200 210 220 160 170 180 190 200 210 pF1KE3 IRSGLDDVEASSSYSNEQQLCPYAAAGECRFGDACVYLHGEVCEICRLQVLHPFDPEQRK .. . . . . .. ..: ::. : : :..:.::::..:..: ::.:::.: ::. CCDS10 PLQSSETERKQMAVGSGLRFCYYASRGVCFRGESCMYLHGDICDMCGLQTLHPMDAAQRE 230 240 250 260 270 280 220 230 240 250 260 270 pF1KE3 AHEKICMLTFEHEMEKAFAFQANQDKVCSICMEVILEKASASERRFGILSNCNHTYCLSC : . :. . :..:: .:: : ..::::.:::::. :::. ..:::::::::::..:. : CCDS10 EHMRACIEAHEKDMELSFAVQRGMDKVCGICMEVVYEKANPNDRRFGILSNCNHSFCIRC 290 300 310 320 330 340 280 290 300 310 320 330 pF1KE3 IRQWRCAKQFENPIIKSCPECRVISEFVIPSVYWVEDQNKKNELIEAFKQGMGKKACKYF ::.:: :.:::: :.::::.::: ::.:::: .:::....:..::. .:..:..:::.:: CCDS10 IRRWRSARQFENRIVKSCPQCRVTSELVIPSEFWVEEEEEKQKLIQQYKEAMSNKACRYF 350 360 370 380 390 400 340 350 360 370 380 390 pF1KE3 EQGKGTCPFGSKCLYRHAYPDGRLAEPEKPRKQLSSQGTVRFFNSVRLWDFIENRESRHV .:.:.::::. :.:.: ::.: :: : CCDS10 AEGRGNCPFGDTCFYKHEYPEGWGDEPPGPGGGSFSAYWHQLVEPVRMGEGNMLYKSIKK 410 420 430 440 450 460 >>CCDS47725.1 MKRN1 gene_id:23608|Hs108|chr7 (329 aa) initn: 910 init1: 600 opt: 609 Z-score: 556.8 bits: 111.7 E(32554): 1.1e-24 Smith-Waterman score: 900; 46.8% identity (70.4% similar) in 284 aa overlap (3-286:56-329) 10 20 30 pF1KE3 MSTKQITCRYFMHGVCREGSQCLFSHDLANSK :::.::::::::::.::..: .::::..: CCDS47 ASPTPIPTVTAPSLGAGGGGGGSDGSGGGWTKQVTCRYFMHGVCKEGDNCRYSHDLSDSP 30 40 50 60 70 80 40 50 60 70 80 90 pF1KE3 PSTICKYYQKGYCAYGTRCRYDHTRPSAAAGGAVGTMAHSVPSPAFHSPHPPSEVTASIV :..:::.:.::: :: ::::.:..: :..: . : : : : ... .: CCDS47 YSVVCKYFQRGYCIYGDRCRYEHSKPLKQEE-ATATELTTKSSLAASSSL--SSIVGPLV 90 100 110 120 130 140 100 110 120 130 140 150 pF1KE3 KTNSHEPGKREKRTLVLRDRNLSGMAERKTQPSMVSNPGSCSDPQPSPEMKPHSYLDAIR . :. : .:.. .. .: . . .: . :. :: : :.. CCDS47 EMNTGEAESRNSNFATVG----AGSEDWVNAIEFVPGQPYCGRTAPSCTEAP---LQGSV 150 160 170 180 190 160 170 180 190 200 210 pF1KE3 SGLDDVEASSSYSNEQQLCPYAAAGECRFGDACVYLHGEVCEICRLQVLHPFDPEQRKAH . .. . ... ...:::::::.::::.:. ::::::. :..: ::::::.: ::. : CCDS47 TKEESEKEQTAVETKKQLCPYAAVGECRYGENCVYLHGDSCDMCGLQVLHPMDAAQRSQH 200 210 220 230 240 250 220 230 240 250 260 270 pF1KE3 EKICMLTFEHEMEKAFAFQANQDKVCSICMEVILEKASASERRFGILSNCNHTYCLSCIR : :. . :..:: .:: : ..: ::.:::::. :::. :::::::::::::::::.::: CCDS47 IKSCIEAHEKDMELSFAVQRSKDMVCGICMEVVYEKANPSERRFGILSNCNHTYCLKCIR 260 270 280 290 300 310 280 290 300 310 320 330 pF1KE3 QWRCAKQFENPIIKSCPECRVISEFVIPSVYWVEDQNKKNELIEAFKQGMGKKACKYFEQ .:: :::::. ::: CCDS47 KWRSAKQFESKIIK 320 416 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 09:01:06 2016 done: Sun Nov 6 09:01:07 2016 Total Scan time: 3.080 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]