GENSCAN 1.0    Date run: 5-Nov-116    Time: 23:28:29

Sequence gi568815597r:39974064_40197238 : 223175 bp : 42.33% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    917    985   69  1  0   62   62   100 0.644   5.90
 1.02 Term +   7926   8084  159  2  0   80   44    82 0.088  -0.04
 1.03 PlyA +   8442   8447    6                               1.05

 2.04 PlyA -   8478   8473    6                               1.05
 2.03 Term -  40994  40891  104  0  2   92   49    91 0.785   3.06
 2.02 Intr -  52499  52340  160  2  1  101   78   115 0.532  10.44
 2.01 Init -  61228  61223    6  1  0   70  102    10 0.184   0.96
 2.00 Prom -  63322  63283   40                              -7.65

 3.00 Prom +  64809  64848   40                              -9.25
 3.01 Init +  66104  66404  301  1  1   67   77   157 0.239   9.96
 3.02 Intr +  66641  66738   98  0  2    9   78    81 0.163  -1.99
 3.03 Intr +  68367  68533  167  2  2   -7   87   177 0.246   5.94
 3.04 Intr +  85271  85395  125  2  2   58  121   106 0.516  10.21
 3.05 Intr +  86004  86107  104  1  2   73   75   124 0.835   8.57
 3.06 Intr +  87672  87749   78  2  0   91  100    79 0.994   8.23
 3.07 Intr +  90164  90307  144  1  0   99   89   118 0.997  12.56
 3.08 Intr +  90411  90496   86  2  2  101   75    88 0.943   6.50
 3.09 Intr +  92152  92257  106  1  1   82   61    61 0.852   2.10
 3.10 Intr +  93477  93654  178  2  1   88  101    73 0.709   7.17
 3.11 Intr +  95507  95811  305  0  2   -9   89   270 0.385  12.78
 3.12 Intr +  96096  96219  124  2  1   60   95    71 0.999   4.24
 3.13 Intr +  96367  96449   83  2  2  114   93    99 0.999  11.44
 3.14 Intr +  96773  96916  144  1  0  124   78   139 0.999  16.06
 3.15 Term +  97387  97470   84  0  0  130   47    98 0.999   6.57
 3.16 PlyA +  98526  98531    6                               1.05

 4.10 PlyA -  98667  98662    6                               1.05
 4.09 Term - 100120  99998  123  1  0  115   47   133 0.981   9.30
 4.08 Intr - 102850 102779   72  1  0  101   93     3 0.619   0.78
 4.07 Intr - 104595 104497   99  0  0   95   78   167 0.718  15.79
 4.06 Intr - 106424 106334   91  1  1  113  102    87 0.998  11.68
 4.05 Intr - 115449 115347  103  1  1   31   92   149 0.770   7.91
 4.04 Intr - 117336 117266   71  2  2   83   88     5 0.574  -2.09
 4.03 Intr - 118109 117982  128  2  2  117   46    40 0.632   1.26
 4.02 Intr - 118444 118335  110  2  2   70   94    39 0.625   1.78
 4.01 Init - 123175 123052  124  1  1  106   92   170 0.979  17.58
 4.00 Prom - 138344 138305   40                              -3.35

 5.04 PlyA - 142723 142718    6                               1.05
 5.03 Term - 147089 146975  115  1  1   60   48    76 0.265  -2.14
 5.02 Intr - 149416 149354   63  0  0  120   76    42 0.528   3.11
 5.01 Init - 151579 151521   59  1  2   72  116    28 0.901   4.83
 5.00 Prom - 158511 158472   40                              -6.15

 6.04 PlyA - 158545 158540    6                               1.05
 6.03 Term - 159349 159096  254  2  2   40   38   331 0.664  18.12
 6.02 Intr - 160875 160699  177  1  0    2    4   260 0.672   7.97
 6.01 Init - 173444 173393   52  2  1   53   82    32 0.091   0.48
 6.00 Prom - 173489 173450   40                              -5.45

 7.00 Prom + 178932 178971   40                              -6.15
 7.01 Init + 187337 187573  237  1  0  101   94   290 0.050  26.96
 7.02 Intr + 214992 215146  155  2  2   87   93   125 0.889  10.85
 7.03 Intr + 216709 216790   82  1  1   64  109    31 0.467   1.52
 7.04 Intr + 221569 221701  133  0  1   87  115    23 0.120   4.20

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 187593 187158  436  2  1   96   43   357 0.803  25.77
S.002 Init - 193194 193177   18  0  0   78  116    43 0.860   4.94

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:39974064_40197238|GENSCAN_predicted_peptide_1|75_aa
MSGIDEYKTSGGSGDRKGLIWQLGSDSRASPPAIYGFQQMGLAISVNTKGQLIVWEGLFY
SLLFLKGVAEAEDET

>gi568815597r:39974064_40197238|GENSCAN_predicted_CDS_1|228_bp
atgagtggtattgatgagtacaagacgagtggaggcagtggagatagaaaaggactgatt
tggcagctgggctcagactccagagctagtcctccagcaatctatggctttcagcagatg
ggtttggccatttctgttaacacaaagggacaactcatagtgtgggaggggctgttctac
tccttgctgttccttaaaggagtagcagaggcagaggacgagacctag

>gi568815597r:39974064_40197238|GENSCAN_predicted_peptide_2|89_aa
MKEERPVRAVEELLSNDVGQVDLGSARDGLCDQGVDCVHNVLHPPSQRESAVRCRDNYGR
VPEIWPMESWTVLHALSPSVATLNVVCSR

>gi568815597r:39974064_40197238|GENSCAN_predicted_CDS_2|270_bp
atgaaggaagagaggcctgtgagagctgtagaggaacttctctccaatgatgtggggcag
gtggatcttggcagtgcaagggatgggctatgtgaccaaggggtggactgtgtgcacaat
gtcttacatccaccttcacaaagagaatctgctgtgaggtgtagagataactatggccgt
gtgcctgagatctggccaatggaatcgtggacagttctccatgctctttctccttctgtg
gcaactctgaatgtcgtgtgttcaagatga

>gi568815597r:39974064_40197238|GENSCAN_predicted_peptide_3|708_aa
MSRLIAFPRHKRGNRGMHAQRQAERGLDHHRELTAARIPALRESWPPVVPRPFTDCSLKE
PASAIETKLTEELVTLLFFPERSLARDFCPSWARASGAIPVRARKWVARRLLGGSCRKRR
AADRSPEVRRNSEGIRKRDFQAETTTGARTWRTGQLKDHDEFSVTGILVMAGEVEEVRCG
QVSHMWRTFRWSIMADMQNLVERLERAVGRLEAVSHTSDMHRGYADSPSKAGAAPYVQAF
DSLLAGPVAEYLKISKEIGGDVQKHAEMVHTGLKLERALLVTASQCQQPAENKLSDLLAP
ISEQIKEVITFREKNRGSKLFNHLSAVSESIQALGWVAMAPKPGPYVKEMNDAAMFYTNR
VLKEYKDVDKKHVDWVKAYLSIWTELQAYIKEFHTTGLAWSKTGPVAKELSGLPSGPSAG
SCPPPPPPCPPPPPVSTISCSYESASRSSLFAQINQGESITHENISKALVLALNFLLRMF
NMTIAAQRSGMKDRNKVAVVLTALKHVSDDMKTHKNPALKAQSGPVRSGPKPFSAPKPQT
SPSPKRATKKEPAVLELEGKKWRVENQENVSNLVIEDTELKQVAYIYKCVNTTLQIKGKI
NSITVDNCKKLGLVFDDVVGIVEIINSKDVKVQVMGKVPTISINKTDGCHAYLSKNSLDC
EIVSAKSSEMNVLIPTEGGDFNEFPVPEQFKTLWNGQKLVTTVTEIAG

>gi568815597r:39974064_40197238|GENSCAN_predicted_CDS_3|2127_bp
atgtccagactaattgccttcccacgtcacaaacggggaaaccgaggtatgcacgcccag
agacaggcggagcgtggcctcgatcaccaccgagagctgacggccgcccggattcccgcc
ctcagagaatcctggcccccagtcgttccaagaccctttacggactgcagcttaaaggaa
ccggcctctgccattgagaccaaactgacagaggaactggtcactcttctgttctttcca
gagaggagcttagccagggacttctgcccttcctgggcccgggcctcgggggcgattccg
gtgagggcccggaagtgggtcgcgcggagattgctgggcggttcttgccggaagcggaga
gcggctgatcgcagtccggaggtgaggcggaactctgagggaatccggaaaagggatttc
caggcagagactactacaggtgcaaggacctggagaaccggacagctgaaagaccatgac
gagttcagtgtgactggaattctagtaatggctggggaagtggaggaagtgagatgtggt
caggtcagtcatatgtggaggacattcaggtggtccattatggctgacatgcaaaatctg
gtagaaagattggagagggcagtgggccgcctggaggcagtatctcatacctctgacatg
caccgtgggtatgcagacagtccttcaaaagcaggagcagctccatatgtgcaggcattt
gactcgctgcttgctggtcctgtggcagagtacttgaagatcagtaaagagattggggga
gacgtgcagaaacatgcggagatggtccacacaggtttgaagttggagcgagctctgttg
gttacagcttctcagtgtcaacagccagcagaaaataagctttccgatttgttggcaccc
atctcagagcagatcaaagaagtgataacctttcgggagaagaaccgaggcagcaagttg
tttaatcacctgtcagctgtcagcgaaagtatccaggccctgggctgggtggctatggct
cccaagcctggcccttatgtgaaagaaatgaatgatgccgccatgttttatacaaaccga
gtcctcaaagagtacaaagatgtggataagaagcatgtagactgggtcaaagcttattta
agtatatggacagagctgcaggcttacattaaggagttccataccaccggactggcctgg
agcaaaacggggcctgtggcaaaagaactgagcggactgccatctggaccctctgccgga
tcatgtcctcctccccctccaccatgcccccctcctcccccagtctctaccatttcatgc
tcatatgagtctgcttcccgctcatcactgttcgcgcagattaatcagggggagagcatt
acacatgagaacatctcaaaggctctggttttggctttgaattttctgttaaggatgttt
aacatgacgatagcagctcaaaggtctggtatgaaggacaggaacaaggtagctgttgtt
cttacagccctgaaacatgtatctgatgacatgaagactcacaagaaccctgccctgaag
gctcagagtggtccagtacgcagtggccccaaaccattctctgcacctaaaccccaaacc
agcccatcccccaaacgagccacaaagaaggagccagctgtacttgaactggagggcaag
aagtggagagtggaaaatcaggaaaatgtttccaacctggtgattgaggacacagagctg
aaacaggtggcttacatatacaagtgtgtcaacacgacattgcaaatcaagggcaaaatt
aactccattacagtagataactgtaagaaacttggcctggtattcgatgacgtggtgggc
attgtggagataatcaacagtaaggatgtcaaagttcaggtaatgggtaaagtgccaacc
atatccatcaacaaaacagatggctgccatgcttacctgagcaagaattccctggattgt
gaaatagtcagtgccaaatcttccgagatgaatgtcctcattcctacagaaggcggtgac
tttaatgaattcccagttcctgagcagttcaagaccctatggaacgggcagaagttggtc
accacagtgacagaaattgctggataa

>gi568815597r:39974064_40197238|GENSCAN_predicted_peptide_4|306_aa
MASPGCLWLLAVALLPWTCASRALQHLDPPAPLPLVIWHGMGDSCCNPLSMGAIKKMVEK
KIPGIYVLSLEIGKTLMEDVENSFFLNVNSQVTTVCQALAKDPKLQQGYNAMGFSQGGQF
LRAVAQRCPSPPMINLISVGGQHQGVFGLPRCPGESSHICDFIRKTLNAGAYSKVVQERL
VQAEYWHDPIKEDVYRNHSIFLADINQERGINESYKKNLMALKKFVMVKFLNDSIVDPVD
SEWFGFYRSGQAKETIPLQETSLYTQDRLGLKEMDNAGQLVFLATEGDHLQLSEEWFYAH
IIPFLG

>gi568815597r:39974064_40197238|GENSCAN_predicted_CDS_4|921_bp
atggcgtcgcccggctgcctgtggctcttggctgtggctctcctgccatggacctgcgct
tctcgggcgctgcagcatctggacccgccggcgccgctgccgttggtgatctggcatggg
atgggagacagctgttgcaatcccttaagcatgggtgctattaaaaaaatggtggagaag
aaaatacctggaatttacgtcttatctttagagattgggaagaccctgatggaggacgtg
gagaacagcttcttcttgaatgtcaattcccaagtaacaacagtgtgtcaggcacttgct
aaggatcctaaattgcagcaaggctacaatgctatgggattctcccagggaggccaattt
ctgagggcagtggctcagagatgcccttcacctcccatgatcaatctgatctcggttggg
ggacaacatcaaggtgtttttggactccctcgatgcccaggagagagctctcacatctgt
gacttcatccgaaaaacactgaatgctggggcgtactccaaagttgttcaggaacgcctc
gtgcaagccgaatactggcatgaccccataaaggaggatgtgtatcgcaaccacagcatc
ttcttggcagatataaatcaggagcggggtatcaatgagtcctacaagaaaaacctgatg
gccctgaagaagtttgtgatggtgaaattcctcaatgattccattgtggaccctgtagat
tcggagtggtttggattttacagaagtggccaagccaaggaaaccattcccttacaggag
acctccctgtacacacaggaccgcctggggctaaaggaaatggacaatgcaggacagcta
gtgtttctggctacagaaggggaccatcttcagttgtctgaagaatggttttatgcccac
atcataccattccttggatga

>gi568815597r:39974064_40197238|GENSCAN_predicted_peptide_5|78_aa
MYLLGMFVPKHSKHQDLTRRARLSQTELSEILYEGIDFKLLNVLKKRDLLDLHLDVPKTK
AKSMNDSLLATSDPNVRH

>gi568815597r:39974064_40197238|GENSCAN_predicted_CDS_5|237_bp
atgtatttgcttggaatgtttgttccaaaacattccaagcatcaagatctcacaagaagg
gcacgtctgtcacaaacagaactttcagaaatcctttatgaaggtatagacttcaaactc
ctcaatgttttgaagaaaagggatctccttgatcttcatcttgatgttcctaagacaaaa
gccaaatcaatgaatgactcattgctggccacatctgacccaaatgttagacactga

>gi568815597r:39974064_40197238|GENSCAN_predicted_peptide_6|160_aa
MRAAQERSAPMIQLPPTAWVTEQDPVEEEKKEEKETEKEKGKEEEEEEEDEETYYEIQVI
LSMRKKKRRRRRRRRPSHEASVGSLGDKGSHNLKAPKAEGPEQGGSVLVVLVEANEDMVG
AQLLLGELLENCKAILAPLGQRTTRNLRCRGCRRSARLAS

>gi568815597r:39974064_40197238|GENSCAN_predicted_CDS_6|483_bp
atgagagcagcacaggaaagatctgcccccatgattcagttacctcccacagcctgggtg
acagaacaagaccctgttgaagaagaaaagaaggaggagaaggagacagagaaggagaag
gggaaagaggaggaggaggaagaggaggatgaggagacctactatgagattcaagtgatt
cttagtatgaggaagaagaagaggaggaggaggaggaggaggagacctagccatgaagca
agtgtcgggtctcttggggacaaggggtctcacaatctcaaagccccaaaagctgaaggt
ccggagcaaggcggctctgtcctcgtggttcttgtggaagcaaatgaagacatggtcggc
gctcagctgctcctcggcgaactgctggagaactgcaaagctatcctcgctcccctgggg
cagcgcaccaccaggaatctccgatgcagaggctgccgccgctcagcacggctcgccagt
taa

>gi568815597r:39974064_40197238|GENSCAN_predicted_peptide_7|203_aa
MADGKGDAAAVAGAGAEAPAVAGAGDGVETESMVRGHRPVSPAPGASGLRPCLWQLETEL
REQEVSEVSSLNYCRSFCQTLLQYASNKNASEHIVYLLEVYRLAIQSFASARPYLTTECE
DVLLVLGRLVLSCFELLLSVSESELPCEVWLPFLQSLQESHDALLEFGNNNLQILVHVTK
EGVWKNPVLLKILSQQPVETEEX

>gi568815597r:39974064_40197238|GENSCAN_predicted_CDS_7|609_bp
atggcggacggaaagggagacgccgccgctgtcgccggggctggggctgaggctccggcg
gtagcgggagccggagatggagtcgagactgagtccatggttcggggtcatcgccccgta
tctccagcgccgggagcctcgggactgcggccgtgtctgtggcagctggagacagagctg
agggagcaagaggtgtcggaggtctcatctttgaactactgccggagcttctgccagacc
ttattgcaatatgcaagcaacaagaatgcatcagaacatattgtgtatcttctggaggta
tatcgacttgccatccaaagctttgccagtgcacgtccatacttaactactgaatgtgaa
gatgtcctcttagtgcttggcagattagtactgagttgtttcgaattactgctttcagtg
tctgaaagtgaactgccatgtgaagtctggctaccattccttcagtctctacaggagtca
catgatgcattattggaatttgggaataataacctacaaatattggttcatgttaccaag
gaaggggtgtggaaaaacccagttcttcttaaaattctgtctcaacagccagtagaaacg
gaggaagnn