GENSCAN 1.0    Date run: 6-Nov-116    Time: 12:30:55

Sequence gi568815586f:49652494_49861780 : 209287 bp : 46.60% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.14 Intr -    834    733  102  0  0   60   82   181 0.973  15.07
 1.13 Intr -   1381   1232  150  1  0  107   53   230 0.991  21.76
 1.12 Intr -   1809   1699  111  0  0  129   99   243 0.999  30.08
 1.11 Intr -   2491   2417   75  1  0   36   77   178 0.989  11.21
 1.10 Intr -   4004   3911   94  1  1  104   78    45 0.994   5.07
 1.09 Intr -   4406   4330   77  2  2  104  113    56 0.999   8.01
 1.08 Intr -   4697   4589  109  1  1  123  110    40 0.974   9.99
 1.07 Intr -   6101   5949  153  1  0   78   96   136 0.994  12.59
 1.06 Intr -   9556   9473   84  0  0  119   84    88 0.858  10.44
 1.05 Intr -  13415  13339   77  2  2   85   92    31 0.928   1.51
 1.04 Intr -  13714  13634   81  1  0   90   67    48 0.789   2.73
 1.03 Intr -  16061  15978   84  2  0   71   80   133 0.570  10.82
 1.02 Intr -  18752  18645  108  2  0   89   29    55 0.111   0.18
 1.01 Init -  35073  35059   15  0  0   77  116    23 0.027   3.71
 1.00 Prom -  35349  35310   40                              -5.66

 2.03 PlyA -  35580  35575    6                               1.05
 2.02 Term -  38982  38905   78  0  0   93   44    71 0.014   0.96
 2.01 Init -  54687  54562  126  0  0  104   85   188 0.405  18.49
 2.00 Prom -  58762  58723   40                              -6.16

 3.00 Prom +  61546  61585   40                              -3.26
 3.01 Init +  69021  69030   10  2  1   98  116     8 0.917   5.20
 3.02 Intr +  79070  79224  155  0  2   37  115    20 0.002  -0.61
 3.03 Intr +  99971 100056   86  1  2  133   92   101 0.983  13.62
 3.04 Intr + 100480 100588  109  1  1   80  110   101 0.997  11.79
 3.05 Intr + 103142 103262  121  1  1   81   98     8 0.852   1.17
 3.06 Intr + 105734 105782   49  0  1   94   93    16 0.784   0.44
 3.07 Intr + 105890 105987   98  2  2   86  107    30 0.721   4.35
 3.08 Intr + 106190 106269   80  0  2   94  110   -42 0.841  -2.13
 3.09 Intr + 106728 106828  101  2  2   36  105   106 0.944   6.01
 3.10 Intr + 109211 109286   76  2  1   79   96    66 0.890   6.02
 3.11 Intr + 125485 125547   63  0  0   88   58    56 0.226   1.51
 3.12 Intr + 125746 125869  124  0  1   93   94    48 0.341   6.06
 3.13 Term + 137456 137466   11  0  2  115   37     0 0.007  -4.04
 3.14 PlyA + 137708 137713    6                               1.05

 4.16 PlyA - 138678 138673    6                               1.05
 4.15 Term - 139558 139346  213  1  0   84   55   214 0.974  14.93
 4.14 Intr - 140095 139953  143  2  2  100   81    86 0.996   9.17
 4.13 Intr - 140493 140185  309  1  0  111  101   135 0.862  13.38
 4.12 Intr - 140940 140859   82  0  1   52   76    60 0.815   0.41
 4.11 Intr - 141403 141241  163  0  1   93  115   177 0.997  20.88
 4.10 Intr - 144901 142272 2630  1  2   98   94  1564 0.871 145.36
 4.09 Intr - 149474 149355  120  2  0   27   96   146 0.998   9.99
 4.08 Intr - 150503 150465   39  2  0  110  113    39 0.992   7.02
 4.07 Intr - 150672 150604   69  0  0  103   58    95 0.981   7.38
 4.06 Intr - 151557 151429  129  0  0  101   64   126 0.984  12.39
 4.05 Intr - 153548 153523   26  0  2  144   70    36 0.561   5.14
 4.04 Intr - 156959 156889   71  1  2  100   60    29 0.394   0.13
 4.03 Intr - 162493 162391  103  2  1   44  116    53 0.024   2.93
 4.02 Intr - 173480 173450   31  2  1  128   47    10 0.057  -1.40
 4.01 Init - 175879 175829   51  1  0  107  109    68 0.974   9.98
 4.00 Prom - 182072 182033   40                              -7.76

 5.05 PlyA - 183571 183566    6                               1.05
 5.04 Term - 186522 185878  645  0  0   61   50   306 0.993  18.23
 5.03 Intr - 190645 190361  285  1  0   -3   84   240 0.194  12.04
 5.02 Intr - 198837 198721  117  0  0  103  116   -33 0.634   1.66
 5.01 Init - 200061 200053    9  0  0  125   91     7 0.852   4.85

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:49652494_49861780|GENSCAN_predicted_peptide_1|440_aa MKKKKARTPQGGDFAQFFKPRGPSVSSSVHLVEKYEPQLLTSSMNLPPDKARLLRQYDNE KKWDLICDQERFQVKNPPHTYIQKLQSFLDPSVTRKKFRRRVQESTKVLRELEISLRTNH IGWVREFLNDENKGLDVLVDYLSFAQCSVMFDFEGLESGDDGAFDKLRSWSRSIEDLQPP SALSAPFTNSLARSARQSVLRYSTLPGRRALKNSRLVSQKDDVHVCILCLRAIMNYQYGF NLVMSHPHAVNEIALSLNNKNPRTKALVLELLAAVCLVRGGHEIILAAFDNFKEVCKELH RFEKLMEYFRNEDSNIDFMVACMQFINIVVHSVEDMNFRVHLQYEFTKLGLEEFLQKSRH TESEKLQVQIQAYLDNVFDVGGLLEDAETKNVALEKVEELEEHVSHLTEKLLDLENENMM RVAELEKQLLQREKELESIK >gi568815586f:49652494_49861780|GENSCAN_predicted_CDS_1|1320_bp atgaagaaaaagaaggccaggactccccaagggggtgactttgcccagtttttcaaaccg aggggaccaagtgtttcctcatcagttcacctggtggagaagtatgagccacagctgctg acaagctccatgaacctgcctccagacaaggcccggctcctgcggcagtatgacaatgag aagaaatgggatctgatctgtgaccaggaacgattccaggtgaagaatcctccccacact tacattcagaaactccagagcttcttggaccccagtgtaactcggaagaagttcaggagg agggtgcaggagtcaaccaaagtactaagggagctggagatctctcttcgcaccaaccac attgggtgggtgcgggaatttctgaatgatgaaaacaaaggcctggatgtactggtggat tacctgtcctttgcccagtgttctgtcatgtttgactttgagggtctggaaagtggtgac gatggtgcatttgacaaactccggtcctggagcaggtcaatcgaggacctgcagccaccc agcgccctgtcggcccccttcaccaacagcctcgctcgctctgcgcgccagtctgtgctc cggtatagcactctccctgggcgcagggccctgaagaactcccgcctagtgagccagaag gatgacgtccacgtctgtatcctttgtctcagagccatcatgaactatcagtacggattc aacctggtcatgtcccacccccatgctgtcaatgagattgcacttagcctcaataacaag aatccaaggaccaaagcccttgtcttagagcttctggcagctgtgtgtttggtgcgagga ggtcacgaaatcatccttgctgcctttgacaatttcaaagaggtatgcaaggagctgcac cgctttgagaagctgatggagtatttccggaatgaggacagcaatattgacttcatggtg gcctgcatgcagttcatcaacatcgtggtgcactcggtggaggacatgaacttccgggtc cacctgcagtatgagtttaccaagctggggctagaggagttcctgcagaagtcaaggcac acagagagcgagaagctgcaggtgcagattcaggcatatctggacaacgtgtttgatgtc gggggtttgttggaggatgctgagaccaagaatgtagccctggagaaggtggaggagttg gaggagcatgtgtcccatctcacagagaagcttctggacctagagaatgaaaacatgatg 
cgggtggcagaactagagaagcagctgctacagcgggagaaggaactagagagcatcaag >gi568815586f:49652494_49861780|GENSCAN_predicted_peptide_2|67_aa MGNLESAEGVPGEPPSVPLLLPPGKMPMPEPCELEERFALVLASESCISSLGTAIALEVH KENKPEL >gi568815586f:49652494_49861780|GENSCAN_predicted_CDS_2|204_bp atgggcaacctggagagcgccgagggggtcccgggagagcccccctctgtcccgttgttg ctgccgcccggcaagatgccgatgcctgagccctgtgagctggaggaaaggttcgccctg gtgctggccagtgaaagctgcatctccagccttggcacagccatagccctagaagtccac aaagagaataaacctgagctctag >gi568815586f:49652494_49861780|GENSCAN_predicted_peptide_3|360_aa MAEANYIGFTFKIYPESTSHHLYHYHFGVNLLTVLLLPLNLHNLFSNSSQCYCKKSGDCC TDSGTMNIFDRKINFDALLKFSHITPSTQQHLKKVYASFALCMFVAAAGAYVHMVTHFIQ AGLLSALGSLILMIWLMATPHSHETEQKRLGLLAGFAFLTGVGLGPALEFCIAVNPSILP TAFMGTAMIFTCFTLSALYARRRSYLFLGGILMSALSLLLLSSLGNVFFGSIWLFQANLY VGLVVMCGFVLFDTQLIIEKAEHGDQDYIWHCIDLFLDFITVFRKLMMILAMNEKPGPSP LSPVAIPYLRDFPIHTNSQPYICKTPQLHLLLVFPLPSQAAGSWLPLEDTALPAMPSESF >gi568815586f:49652494_49861780|GENSCAN_predicted_CDS_3|1083_bp atggcggaagcaaattatattggttttaccttcaaaatatatccagaatctacttcccac cacctctaccactaccactttggtgtgaaccttctaactgtactgcttttacccttgaat ctccacaatttattttcaaacagcagccagtgttactgtaagaagagtggagactgctgc acggactctggaaccatgaacatatttgatcgaaagatcaactttgatgcgcttttaaaa ttttctcatataaccccgtcaacgcagcagcacctgaagaaggtctatgcaagttttgcc ctttgtatgtttgtggcggctgcaggggcctatgtccatatggtcactcatttcattcag gctggcctgctgtctgccttgggctccctgatattgatgatttggctgatggcaacacct catagccatgaaactgaacagaaaagactgggacttcttgctggatttgcattccttaca ggagttggcctgggccctgccctggagttttgtattgctgtcaaccccagcatccttccc actgctttcatgggcacggcaatgatctttacctgcttcaccctcagtgcactctatgcc aggcgccgtagctacctctttctgggaggtatcttgatgtcagccctgagcttgttgctt ttgtcttccctggggaatgttttctttggatccatttggcttttccaggcaaacctgtat gtgggactggtggtcatgtgtggcttcgtcctttttgatactcaactcattattgaaaag gccgaacatggagatcaagattatatctggcactgcattgatctcttcttagatttcatt actgtcttcagaaaactcatgatgatcctggccatgaatgaaaagcccggcccctctccg ctgtctccagtggccatcccatatcttcgcgacttccccatccacacgaattcgcagccc 
tacatctgcaaaactccccaacttcatcttctcctggtgttccctttgccttcgcaggca gcaggttcctggctccccctggaggacacggccctccctgcaatgccctcagaatcattt tag >gi568815586f:49652494_49861780|GENSCAN_predicted_peptide_4|1392_aa MDLADPACPGPAQHLISGRTSQELGSPAYNLSLISENIKQPKLEDILQNVDQSLQKCYND ERGLWGEQLVNEGLTRGQEALAKGRGKVTVLDARGQMSEAMDQPAGGPGNPRPGEGDDGS MEPGTCQELLHRLRELEAENSALAQANENQRETYERCLDEVANHVVQALLNQKDLREECI KLKKRVFDLERQNQMLSALFQQKLQLTTGSLPQVCWEQQLRPGGPGPPAAPPPALDALSP FLRKKAQILEVLRALEETDPLLLCSPATPWRPPGQGPGSPEPINGELCGPPQPEPSPWAP CLLLGPGNLGGLLHWERLLGGLGGEEDTGRPWGPSRGPPQAQGTSSGPNCAPGSSSSSSS DEAGDPNEAPSPDTLLGALARRQLNLGQLLEDTESYLQAFLAGAAGPLNGDHPGPGQSSS PDQAPPQLSKSKGLPKSAWGGGTPEAHRPGFGATSEGQGPLPFLSMFMGAGDAPLGSRPG HPHSSSQVKSKLQIGPPSPGEAQGPLLPSPARGLKFLKLPPTSEKSPSPGGPQLSPQLPR NSRIPCRNSGSDGSPSPLLARRGLGGGELSPEGAQGLPTSPSPCYTTPDSTQLRPPQSAL STTLSPGPVVSPCYENILDLSRSTFRGPSPEPPPSPLQVPTYPQLTLEVPQAPEVLRSPG VPPSPCLPESYPYGSPQEKSLDKAGSESPHPGRRTPGNSSKKPSQGSGRRPGDPGSTPLR DRLAALGKLKTGPEGALGSEKNGVPARPGTEKTRGPGKSGESAGDMVPSIHRPLEQLEAK GGIRGAVALGTNSLKQQEPGLMGDPGARVYSSHSMGARVDLEPVSPRSCLTKVELAKSRL AGALCPQVPRTPAKVPTSAPSLGKPNKSPHSSPTKLPSKSPTKVVPRPGAPLVTKESPKP DKGKGPPWADCGSTTAQSTPLVPGPTDPSQGPEGLAPHSAIEEKVMKGIEENVLRLQGQE RAPGAEVKHRNTSSIASWFGLKKSKLPALNRRTEATKNKEGAGGGSPLRREVKMEARKLE AESLNISKLMAKAEDLRRALEEEKAYLSSRARPRPGGPAPGPNTGLGQVQGQLAGMYQGA DTFMQQLLNRVDGKELPSKSWREPKPEYGDFQPVSSDPKSPWPACGPRNGLVGPLQGCGK PPGKPSSEPGRREEMPSEDSLAEPVPTSHFTACGSLTRTLDSGIGTFPPPDHGSSGTPSK NLPKTKPPRLDPPPGVPPARPPPLTKVPRRAHTLEREVPGIEELLVSGRHPSMPAFPALL PAAPGHRGHETCPDDPCEDPGPTPPVQLAKNWTFPNTRAAGSSSDPLMCPPRQLEGLPRT PMALPVDRKRSQEPSRPSPTPQGPPFGGSRTPSTSDMAEEGRVASGGPPGLETSESLSDS LYDSLSSCGSQG >gi568815586f:49652494_49861780|GENSCAN_predicted_CDS_4|4179_bp atggacctggcggatccggcctgccctggcccagctcaacacctgatttcgggtaggact tctcaggagttgggctctccagcatataacctcagtctaattagtgagaacatcaaacaa cccaaattggaggacattctccaaaatgttgaccagtcccttcagaagtgttacaatgat gaaagaggactctggggggaacagctggtgaatgaggggctcaccaggggccaggaggcc ttggctaagggccggggcaaggtgaccgttctggatgctagaggccagatgtcagaggcc 
atggaccagccagctgggggtcctggaaacccaaggccaggagagggtgatgatggcagc atggagccaggcacctgccaggagcttctgcaccgactgcgggagctggaggcagagaac tcggcacttgcccaggccaacgaaaaccagcgggagacttatgagcgctgtctggacgag gttgccaaccatgtggtacaggcgttgctgaaccagaaggacctgcgagaggagtgcatc aagctgaagaagagagtgtttgacctggaacggcagaaccagatgctgagtgccctgttt cagcagaaactccagctcacgacaggctcgctccctcaggtatgttgggagcagcagctg aggccaggaggcccaggccccccagccgccccacccccagcgctggatgccctatccccg ttccttcggaagaaggcccagattctggaggtgctgagagccctggaagagactgacccc ttgcttctctgctcacctgccaccccctggcggcctccaggccaggggcctggctcccca gagcccatcaacggcgagctgtgtggcccgcctcagcctgaaccctcaccctgggcgccc tgcctgctgctaggccctggcaacctgggaggcctgctgcactgggagcgcctcttgggg ggtctgggaggggaagaggacactgggcggccctggggtcctagcaggggacctcctcag gcccagggcaccagctctggcccaaactgtgccccaggcagcagctcctcctcctcttct gatgaggcaggtgaccccaatgaggcacccagccccgacaccctgctcggtgccctggcc cgcagacagttgaacctgggccagctccttgaggacacagagtcttacctacaggccttc ctggccggggctgcaggcccactcaatggggaccacccaggtcctgggcagtcatcctcc ccagaccaggcgcccccacagctgtctaagtccaaaggcctccccaagtcagcttggggt gggggtaccccagaggcccacaggccaggcttcggtgctacctcagagggccaggggccc ctccccttccttagcatgttcatgggtgctggggatgccccactgggctctcggcctggc cacccccattcctcatctcaggtgaaaagcaagctccaaattggccccccttctcctggg gaagctcagggaccccttctgccctctccagctaggggtctcaagtttctaaagctgcct ccaacctcggagaagagccccagcccaggaggcccgcagctcagtccccagctcccccgg aactcgcgaatcccctgtcggaacagtggctcagacggcagcccctccccactgttggcc cgaaggggtctgggtggaggagagctgtccccagagggggcgcaaggcctgcccaccagc ccttcaccctgctacacaaccccagactccacacagctcagacccccgcagtcagccttg tccaccacgctgtccccaggcccagtggtgtctccctgctatgagaacattctggacctt tctcggagcacctttagggggccttccccagagccacctccatccccactgcaggtgccc acctacccacagctaactctggaggtaccacaggcccctgaggtcctcagaagccctgga gtaccccccagtccttgcctcccagaatcgtacccctatgggagcccccaagagaagagt ttggacaaggcaggctcggagtctccccatcccggccgcaggaccccaggcaactcatcc aagaagcccagccaggggtcaggacggcgacctggggatcctggcagcacacctctgcgg gacagactggcggccctggggaagctgaagacaggccccgagggggccctgggctcagag 
aagaatggggtgccagccaggcctggcaccgaaaagacccggggacctgggaagtcaggg gagagtgctggagacatggtgccctccatccacaggccactggagcagctagaagccaag ggggggatacggggggcagtggccttgggcacaaacagcctgaagcagcaggaacctgga cttatgggggatcccggggcccgagtctactcctctcactccatgggggcccgggtggac ctggagcctgtctcaccaaggagctgcctcaccaaagtggagctggccaagagccggctg gcaggggccctgtgcccccaggtaccccgtacccctgccaaagtgccaacctcagcccca agcctgggcaagcccaataagagccctcacagcagccccaccaagctcccttccaagtca ccaaccaaggtggtgcctcgacctggggctccgctagtcaccaaggagtcccccaagcct gacaaagggaagggccctccctgggcagactgtggtagtaccacggcccagtccacaccc ctagtacctggccccactgacccaagtcagggccctgaggggctggccccacactcagcc atcgaggagaaggtgatgaagggcattgaggagaacgtgctgcggctccagggccaggag cgagcccctggcgccgaggtcaagcaccgcaacaccagcagcatcgccagctggttcggc cttaagaagagcaagctgccagcgctgaaccgccgcacagaggccaccaagaacaaggag ggggctggcgggggctccccgctccggagggaagtcaagatggaagcccggaagctggag gccgagagcctcaacatctccaagctgatggccaaggcggaagacctgcgtcgggcactg gaagaggagaaggcctacctaagcagccgggcccggccacggcctggtggcccagcccca gggcccaacacggggctggggcaggtgcagggccagctggctggcatgtaccaaggtgca gacaccttcatgcagcagctgctaaacagggtggatggcaaggagctgccatccaagagc tggcgggagcccaagcctgagtacggggatttccagccggtgtcttctgaccccaagagc ccctggccagcctgtgggccccggaatggcctggtgggccctcttcagggctgcggaaaa cctcctggaaagccgagcagcgagccagggaggcgggaagagatgccctcggaggacagc ctggccgagccagtgcccacctcacacttcacagcctgtggctccttgactcgaaccttg gacagtggcattgggaccttcccacccccagaccatggtagcagtgggacccccagcaag aatcttcctaagaccaagccaccgcggctggatcccccacctggggtacccccagctcgg cccccaccccttaccaaagtcccccgccgcgcccacacactggagcgggaggtgccaggc atagaggagctgctggtgagtgggcggcaccccagcatgccagccttccctgcactgcta cccgctgctccgggccaccggggccatgagacctgtcctgatgatccctgtgaagaccca ggccccacccctcctgtccagctggccaagaactggaccttccccaatactagggcagcc ggcagctcctcggaccctctcatgtgcccaccccgacaactggaggggctgcccaggacc cccatggccctgcccgtggaccggaagcgaagccaggagcccagccgcccgtcccctacg ccccagggcccacctttcgggggtagccgcacccccagcacttcggacatggccgaggaa ggcagagtggccagcgggggccccccagggctggagacctcggagtctctcagtgactca ctctacgactcgctgtcctcttgtgggagtcagggctga 
>gi568815586f:49652494_49861780|GENSCAN_predicted_peptide_5|351_aa MAQIDVYQLIANLVLLIHLPLSHSFEALYHFTSKCLNDYLEKQVPGFRFRHGPEAVLRLM AVPTELDGGSVKETAAEEESRVLAPGAAPFGNFPHYSRFHPPEQRLRLLPPELLRQLFPE SPENGPILGLDVGCNSGDLSVALYKHFLSLPDGETCSDASREFRLLCCDIDPVLVKRAEK ECPFPDALTFITLDFMNQRTRKVLLSSFLSQFGRSVFDIGFCMSITMWIHLNHGDHGLWE FLAHLSSLCHYLLVEPQPWKCYRAAARRLRKLGLHDFDHFHSLAIRGDMPNQIVQILTQD HGMELICCFGNTSWDRSLLLFRAKQTIETHPIPESLIEKGKEKNRLSFQKQ >gi568815586f:49652494_49861780|GENSCAN_predicted_CDS_5|1056_bp atggcccagattgatgtttatcaactcatagccaatcttgttttacttatacatcttcct ctctcccattcttttgaagcattatatcattttaccagtaaatgtttaaacgactattta gagaagcaggttcccggcttccggttccgccacggcccagaggctgtgttgaggctaatg gcggtgcccacggaactggatggagggagtgttaaggagaccgcagcggaagaggaatcg cgagttctggcacctggcgccgccccgttcggaaattttcctcattattctcgcttccac cctccggagcaacggctccgcctcctgcccccggagctgcttcgacagctctttcctgag agtcccgagaacgggccgattctggggctcgacgtggggtgtaactccggggatctgagt gtggctctatacaaacacttcctctccctacctgacggggaaacctgctcagatgcctca agagaattccgtctcctctgctgcgacatagatccagtcctggtgaagcgagccgaaaaa gaatgtccttttcctgatgccttgacttttatcaccctggacttcatgaatcaaaggacc cggaaggttctcttgagctctttcttaagccaatttggacgttcagtttttgacattggc ttctgcatgtcaataaccatgtggattcatctgaatcatggagaccatggcctatgggag ttcctggcccatctttcctccctctgccactacctccttgtggagccccaaccctggaag tgttaccgggcagctgcaaggcgtctccgaaagctgggactccatgattttgaccacttc cactcccttgccatccgaggtgacatgcccaatcagattgtgcagatcttgacccaggat catggcatggaattaatatgttgctttggcaacaccagttgggacagaagccttctgctc ttcagggcaaaacaaaccatagagactcatccaatccctgaatcactgatagaaaaaggg aaagaaaagaacagattaagtttccagaagcagtga