GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:23:19 Sequence gi568815595f:68535705_68737024 : 201320 bp : 38.97% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3052 3176 125 2 2 58 89 82 0.422 4.68 1.02 Term + 28658 29065 408 1 0 52 42 208 0.460 6.83 1.03 PlyA + 29794 29799 6 1.05 2.00 Prom + 35651 35690 40 -6.05 2.01 Sngl + 36372 36659 288 2 0 58 42 225 0.490 10.24 2.02 PlyA + 36665 36670 6 1.05 3.08 PlyA - 42326 42321 6 1.05 3.07 Term - 43473 43060 414 0 0 62 47 197 0.587 7.28 3.06 Intr - 47314 46998 317 2 2 22 86 100 0.032 -1.94 3.05 Intr - 67351 67172 180 2 0 71 95 86 0.447 6.52 3.04 Intr - 75788 75505 284 1 2 101 7 161 0.195 5.44 3.03 Intr - 77500 77379 122 1 2 25 27 125 0.564 -1.53 3.02 Intr - 83254 83141 114 1 0 -22 99 132 0.652 3.02 3.01 Init - 84753 84739 15 0 0 99 80 7 0.718 1.48 3.00 Prom - 85328 85289 40 -7.35 4.04 PlyA - 86032 86027 6 1.05 4.03 Term - 89383 89093 291 1 0 27 42 218 0.697 5.46 4.02 Intr - 98533 98240 294 1 0 34 54 323 0.091 19.78 4.01 Init - 98732 98679 54 2 0 48 23 42 0.566 -4.96 4.00 Prom - 99267 99228 40 -11.64 5.00 Prom + 99412 99451 40 -9.15 5.01 Init + 100001 100691 691 1 1 94 98 942 0.661 91.06 5.02 Term + 100887 101323 437 1 2 -88 39 609 0.756 32.86 5.03 PlyA + 101517 101522 6 1.05 6.03 PlyA - 101852 101847 6 1.05 6.02 Term - 108330 107980 351 0 0 -8 48 232 0.394 3.10 6.01 Init - 108843 108718 126 0 0 76 70 86 0.487 4.33 6.00 Prom - 114815 114776 40 -3.65 7.00 Prom + 121833 121872 40 -1.25 7.01 Init + 125432 125513 82 1 1 78 105 51 0.927 6.98 7.02 Term + 132921 133045 125 1 2 25 41 158 0.691 2.37 7.03 PlyA + 133168 133173 6 1.05 8.00 Prom + 135462 135501 40 -4.85 8.01 Init + 137167 137189 23 0 2 84 127 17 0.555 4.45 8.02 Intr + 139232 139275 44 2 2 121 80 11 0.097 0.57 8.03 Term + 168362 168588 227 0 2 57 42 191 0.304 7.36 8.04 PlyA + 169409 169414 6 1.05 9.00 Prom + 173626 173665 40 -3.75 9.01 Init + 181005 181076 72 2 0 59 115 72 0.727 8.12 9.02 Intr + 182792 182867 76 1 1 57 105 26 0.604 -0.63 9.03 Intr + 189550 189761 212 2 2 90 96 68 0.449 5.51 9.04 Term + 191781 191984 204 2 0 93 34 110 0.644 2.49 9.05 PlyA + 193555 193560 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:68535705_68737024|GENSCAN_predicted_peptide_1|177_aa XSIVIGKWWCEMEPCLEGEECKTLPDNSGWMCATGNKIKTTRTKGSLDSKLKIPERFLHF RAYETFHEASASYQPAKKREQAQRTTPEMLKNRLGSGIKYSCPHSTKQNSGLWPHLTVGK AGLKKKKSPLVHLEGKRDSLLNAHSVPTITSSPGYSSPESYGKPYKLAQAANKNTKA >gi568815595f:68535705_68737024|GENSCAN_predicted_CDS_1|534_bp ncctccatagtgattgggaaatggtggtgtgagatggagccttgcctagaaggagaagaa tgtaagacactccctgacaattctggatggatgtgcgcaacaggcaacaaaattaagacc acgagaactaaggggtccttggattccaagctaaaaatacctgagaggtttctacatttc agggcctatgaaaccttccacgaagcctctgcttcctatcagccagcaaagaagagagag caagctcagaggaccactccagagatgttaaagaacaggcttggaagtggtataaaatac agctgcccacattccactaagcagaactctggcttatggccccacctaactgtagggaaa gctggtttaaaaaaaaaaaaaagtccacttgtacacctcgaagggaaaagagatagcttg ctgaatgcccattcagtgcccactataacctcctctccagggtactcctctccagaaagc tatgggaagccatataagttggctcaagctgccaataagaacaccaaagcctaa >gi568815595f:68535705_68737024|GENSCAN_predicted_peptide_2|95_aa MSKMTEIEFRIWIGRKIIEVQENIKTQSKEAKKHNKVLQELTDKIASIEKNVTNLIELQN TLQEFYNAVASIDRKIDQVEERISELEDDLYEIRH >gi568815595f:68535705_68737024|GENSCAN_predicted_CDS_2|288_bp atgtctaaaatgacagaaatagaattcagaatatggataggaaggaagattatcgaagtt caggagaacatcaaaactcaatccaaggaagctaagaaacacaataaagtactacaggag ctgacagacaaaatagccagtatagaaaagaatgtaacaaacctgatagagctgcaaaac acactacaagaattttacaatgcagttgcaagtattgatagaaaaatcgaccaagttgag gaaagaatctcagagcttgaagacgacctgtatgaaataagacattag >gi568815595f:68535705_68737024|GENSCAN_predicted_peptide_3|481_aa MATSQPVRTKQQKDEGETESNGLRICGHILTGQVKLRDLKMVQWVEARDAAKHLAIHRTA PTTKNYPVENVNSAKVEKSHSIITNPLVALTQPVGPGVQIWQRAKKGRPGGSQVGKSDKV RGSVGLHDEDMEKREDGCAENKKGLGHGKKEHKQGAGVEKSNGNTVNVESFELDPLNYPC IPLLGFSLPTCYSNTAIVIVTVQGPGIAGFIQMAIVLVGMTPSGLMISKLYKELKQLNAE NLLKLISNFSKVSVYQINVQKSQAFLHTINRQTESQITSEFPFTIASKRIKYLEMQLTRD VKNLFRENYKPLLNKIKEDTNKWKNVPYSWIGRINIVKITILPKLADYNGLGSQKDFGSR QASAAVDFGCATWYASLSSQTIPTQHCANHSSSGLRVLPSTATVAVITDLANMLETYPES LQSQAFCEGHSQTKPNCEDRNKYFFNVQTPMHDHKNQEQSEKHEKTKQNKDQYQSDKIKF Q >gi568815595f:68535705_68737024|GENSCAN_predicted_CDS_3|1446_bp atggccaccagccagcctgtacgaacaaagcagcagaaagatgaaggtgaaacagaatcg aacgggttgagaatttgtggtcatattctcactggtcaggttaaacttcgtgacctcaaa atggtgcagtgggtagaggccagggatgctgctaaacatcttgcaatacacagaacagcc cccacaacgaagaattatcccgtggaaaatgtcaatagtgccaaggttgagaaatctcac tctattatcactaatccactggttgctctcactcagccagtgggcccaggggtgcagatc tggcaaagggcaaagaagggcagacccggggggtcacaggtggggaagagtgacaaggtg agaggttcagttggtctccacgatgaagacatggagaagagagaggatgggtgtgcagag aataaaaaggggttgggtcatggcaagaaagagcacaagcaaggtgctggtgtagagaag agcaatggaaatacagtaaatgttgaaagttttgagttggatcccttgaattatccctgt attccactcttgggattctctctccccacatgctactctaatactgctatagttattgtg acagtccaaggccctggaatagcaggtttcattcagatggccatcgtccttgtaggcatg actccttcaggtctgatgatatccaaattatataaggaactcaaacaactcaatgccgaa aatctccttaagctgataagcaacttcagcaaagtctcagtataccaaatcaacgtgcaa aaatcacaagcattcctacacaccattaacagacaaacagagagccaaatcacgagtgaa ttcccattcacaattgcttccaagagaataaaatacctggaaatgcaactaacaagggat gtgaaaaacctcttcagggagaactacaaaccactgctcaacaaaataaaagaggacaca aacaaatggaagaatgttccatactcatggataggaagaatcaatattgtgaaaataacc atactgcccaagctggctgactacaatggccttggatctcagaaagacttcggcagcagg caggcctcagcagctgtagactttggctgtgccacatggtatgccagcctcagcagccag acaattccaactcagcattgtgccaaccacagcagttctgggcttagggtactccctagc actgcaacagttgcagtgatcacagacttagcaaacatgctagagacctacccagaatct ctgcaatcacaggcattctgtgaagggcattcccagacaaagccaaactgtgaagacaga aataagtacttcttcaatgtgcagacacctatgcatgaccacaagaatcaagaacaatca gagaaacatgaaaaaacaaaacaaaacaaagaccaatatcaaagtgacaaaataaagttc cagtga >gi568815595f:68535705_68737024|GENSCAN_predicted_peptide_4|212_aa MPADRREAKFLVDGPVGAALGITTKISRGIVEIVRDVQLIKTGDKVGASEAIRLNVSPIP WRIIKPVSDNGSIYIPEVLYITGDTLHPLFQGVCMVPVFADWLPSCCISTPFYHQRNIVL EILARAIRQEKKIKGIQTGIEEVKLLLFADDMTVFLENSKGSSKKLLELMNEFSKISEYK INLHKSVALLYTNSGQAENQIKKSTPFTIAAK >gi568815595f:68535705_68737024|GENSCAN_predicted_CDS_4|639_bp atgcctgcagacaggagagaggcaaaattcttagttgatggaccagttggagccgcatta ggcatcaccaccaaaatctccaggggcatcgttgaaatcgtaagggatgtgcagctcatt aagactggagacaaagtgggagccagtgaagccatacggctgaacgtctcccctattcct tggcggatcatcaagccagtgtctgacaatggcagcatctacatccctgaagtgctttac atcacaggggacactctgcatcctctcttccagggtgtctgcatggtgccagtgtttgca gattggttacccagctgttgcatcagtaccccattttatcatcaacggaacatagtactg gaaatcctagccagagcaatcagacaagagaaaaaaataaagggcatccaaactggtata gaagaagtcaaactgttgctgtttgctgatgatatgactgtattcctagaaaactctaaa ggctcctccaaaaagctcctagaactgatgaatgaattcagcaaaatttcagaatacaaa attaatctacacaaatcagtagctctgctgtacaccaatagcggccaagctgagaatcaa attaagaaatcaaccccttttacaatagctgcaaaataa >gi568815595f:68535705_68737024|GENSCAN_predicted_peptide_5|375_aa MGQSQSGGHGPGGGKKDDKDKKKKYEPPVPTRVGKKKKKTKGPDAASKLPLVTPHTQCRL KLLKLERIKDYLLMEEEFIRNQEQMKPLEEKQEEERSKVDDLRGTPMSVGTLEEIIDDNH AIVSTSVGSEHYVSILSFVDKDLLEPGCSVLLNHKVHAVIGVLMDDTDPLVTVMKVEKAP QETYADIGGLDNQIQEIKESVELPLTHPEYYEEMGIKPPKGVILYGPPGTDSNSGGEREI QRTMLELLNQLDGFDSRGDVKVIMATNRIETLDPALIRPGRIDRKIEFPLPDEKTKKRIF QIHTSRMTLADDVTLDDLIMAKDDLSGADIKAICTEAGLMALRERRMKVTNEDFKKSKEN VLYKKQEGTPEGLYL >gi568815595f:68535705_68737024|GENSCAN_predicted_CDS_5|1128_bp atgggtcaaagtcagagtggtggtcatggtcctggaggtggcaagaaggatgacaaggac aagaaaaagaaatatgaacctcctgtaccaactagagtggggaaaaagaagaagaaaaca aagggaccagatgctgccagcaaactgccactggtgacacctcacactcagtgccggtta aaattactgaagttagagagaattaaagactatcttctcatggaggaagaattcattaga aatcaggaacaaatgaaaccattagaagaaaagcaagaggaggaaagatcaaaagtggat gatctgagggggaccccgatgtcagtaggaaccttggaagagatcattgatgacaatcat gccatcgtgtctacatctgtgggctcagaacactacgtcagcattctttcatttgtagac aaggatctgctggaacctggctgctcggtcctgctcaaccacaaggtgcatgccgtgata ggggtgctgatggatgacacggatcccctggtcacagtgatgaaggtagaaaaggccccc caggagacctatgcagatattggggggttggacaaccaaattcaggaaattaaggaatct gtggagcttcctctcacccatcctgaatattatgaagagatgggtataaagcctcctaag ggggtcattctctatggtccacctggcacagactccaattctggtggtgagagagaaatt cagcgaacaatgttggaactgctgaaccagttggatggatttgattctaggggagatgtg aaagttatcatggccacaaaccgaatagaaactttggatccagcacttatcagaccaggc cgcattgacaggaagattgagttccccctgcctgatgaaaagacgaagaagcgcatcttt cagattcacacaagcaggatgacgctggctgatgatgtaaccctggacgacctgatcatg gctaaagatgacctctctggtgctgacatcaaggcaatctgtacagaagctggtctgatg gccttaagagaacgtagaatgaaagtaacaaatgaagacttcaaaaaatctaaagaaaat gttctttataagaaacaggaaggcacccctgaggggctgtatctctaa >gi568815595f:68535705_68737024|GENSCAN_predicted_peptide_6|158_aa MCGPHHAIAASAHVNLATAALPLPVQVYSTCVHMDATTPHLLGTEEYKSKKTPSEGQQCQ ILKEHQPTQMRNNQLKNSGNSKSQSVFLPPNNHTSSSAMVLNQAEMAEKTDIEFSIWIVT KIIEIQEKVETQSKESKESNKMKQELNQTDLIELNNSL >gi568815595f:68535705_68737024|GENSCAN_predicted_CDS_6|477_bp atgtgtggacctcaccacgctattgccgccagtgcacacgtgaacctcgccactgctgcc ctgcccctgccagtgcaggtgtacagtacatgtgtgcatatggatgccaccaccccacat ttgctgggcactgaagaatacaaaagcaaaaaaaccccatccgaaggacagcaatgtcaa atactaaaggaacatcagcccacacagatgagaaataaccagctcaagaactctggcaac tcaaaaagtcagagtgtcttcttacctccaaacaatcacactagctcttcagcaatggtt cttaaccaggctgaaatggctgaaaagacagacattgaattcagtatctggatagtaaca aagatcattgagattcaggagaaagttgaaacccaatccaaggaatctaaggaatccaat aaaatgaaacaagagctcaaccaaactgatctgatagagctaaataactcactataa >gi568815595f:68535705_68737024|GENSCAN_predicted_peptide_7|68_aa MNQYQSMAWGLGTPVLESRRRRCDGNRGCLAASYPPDAGSTPFPPPLSPSVNNQKCLQTL PNVTWGVK >gi568815595f:68535705_68737024|GENSCAN_predicted_CDS_7|207_bp atgaaccagtaccagtccatggcctggggattggggactcctgttctagagagtcggaga aggagatgtgatggaaacagaggttgtttggcagcatcctacccaccagatgccggcagc acaccctttccacccccactctccccaagtgtgaacaatcaaaaatgtctccagacattg ccaaatgtcacctggggagtaaaatag >gi568815595f:68535705_68737024|GENSCAN_predicted_peptide_8|97_aa MVVKSMKRSRCPQLSYLLLFSAGPAPVQGEGITKPEHQGTGNSGTTLEFCLPNAIRDIGK GKDFMKTSKAIATKAKIDKCDLIKELLQSKINYQQSE >gi568815595f:68535705_68737024|GENSCAN_predicted_CDS_8|294_bp atggtggtgaaatccatgaaacgatcccgctgtccacagttatcctatctcctcttgttc tcagcaggtcctgctcctgtccaaggggaggggattacaaagcctgagcaccaagggaca ggaaattcagggaccaccttagaattctgcctacccaatgccattcgagacataggcaaa ggcaaggatttcatgaagacatcaaaagcaattgcaacaaaagcaaaaattgacaaatgt gatcttattaaagagcttctgcagagcaaaataaactatcaacagagtgaatag >gi568815595f:68535705_68737024|GENSCAN_predicted_peptide_9|187_aa MAFLALWVVLPPQEKYFSPSYDDQRRWIYNSAHQLQRARVLPHICGVMGGHFQPATAGQL EFQASESYEVPWVGAHRMRLLGSLDSAPCLGKCTGSPALLEFLGLEYAKLPGFPKCPSEQ WCSDVHQIVIKGQGIYLSMKKAQEFFNILPNRQLRATGSNASRVFEFVPSLLAQISDLET KKKHETI >gi568815595f:68535705_68737024|GENSCAN_predicted_CDS_9|564_bp atggcatttctagctctctgggttgttcttcctcctcaagagaaatacttcagtccaagc tacgatgaccagaggagatggatttacaattctgcccaccagctccagagagcaagggtg cttcctcatatctgtggtgtcatgggaggccatttccagcctgccactgctggccagctg gaattccaagccagtgagtcttatgaggtgccatgggtgggggcccacagaatgaggctg cttggttccctggattcagccccttgcctaggcaaatgcacaggatctcctgccttgctg gaattcctggggctggagtatgcaaaactcccgggtttccctaaatgccccagtgagcag tggtgctctgatgtccaccagatagttattaaaggccaaggcatttacttgtccatgaag aaagcacaagagtttttcaacattctgccaaatcgacagctaagggcaacaggaagcaat gcttctcgagtgtttgagtttgtcccttctttacttgctcaaatttctgacttggaaaca aagaaaaaacatgaaacgatctag