GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:32:59

Sequence gi568815597r:150166525_150368810 : 202286 bp : 45.21% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   2582   2754  173  1  2   75   86    89 0.332   5.47
 1.02 Intr +  20913  20953   41  0  2  111   84    22 0.226   1.97
 1.03 Intr +  23226  23366  141  1  0   52   57    81 0.396   1.72
 1.04 Term +  26903  26940   38  0  2   74   47    85 0.513   0.50
 1.05 PlyA +  29396  29401    6                               1.05

 2.02 PlyA -  32055  32050    6                               1.05
 2.01 Sngl -  33308  33015  294  2  0   53   37   204 0.910   7.50
 2.00 Prom -  46730  46691   40                              -5.46

 3.08 PlyA -  47421  47416    6                               1.05
 3.07 Term -  54237  54167   71  1  2   50   36   145 0.983   3.60
 3.06 Intr -  56716  56662   55  1  1   18   96   136 0.719   5.95
 3.05 Intr -  60271  60084  188  2  2   48   68   355 0.984  28.91
 3.04 Intr -  62713  62548  166  1  1   58   51   143 0.983   7.13
 3.03 Intr -  64169  64047  123  2  0   25  119    40 0.749   1.58
 3.02 Intr -  65402  65253  150  2  0   92  106    76 0.999  10.16
 3.01 Init -  69262  69209   54  1  0   93   78    79 0.952   8.68
 3.00 Prom -  84794  84755   40                              -4.36

 4.00 Prom +  91214  91253   40                              -4.26
 4.01 Init +  91605  91659   55  2  1   53   61    99 0.932   3.22
 4.02 Intr +  94935  95114  180  1  0   57   98   182 0.867  15.94
 4.03 Intr +  95634  95776  143  1  2   99   66    25 0.998   1.47
 4.04 Intr +  96001  96096   96  0  0  114   78    46 0.988   6.41
 4.05 Intr +  96280  96346   67  0  1  146  105     1 0.979   6.08
 4.06 Intr +  96518  96675  158  0  2   36  111   111 0.966   7.93
 4.07 Intr +  96775  96895  121  0  1   96   55    86 0.987   6.17
 4.08 Intr +  97135  97155   21  2  0  101  115    17 0.802   3.22
 4.09 Term +  98461  98489   29  2  2  103   43    23 0.294  -2.46
 4.10 PlyA +  98534  98539    6                              -1.75

 5.08 PlyA -  98898  98893    6                               1.05
 5.07 Term -  99670  99606   65  2  2  116   55    92 0.992   6.75
 5.06 Intr - 100132 100009  124  1  1  101   47   118 0.995   9.16
 5.05 Intr - 100678 100551  128  2  2   97  105    50 0.996   8.00
 5.04 Intr - 100954 100832  123  2  0   83  110    44 0.985   6.66
 5.03 Intr - 101265 101192   74  2  2  100   89    37 0.999   4.05
 5.02 Intr - 101603 101433  171  1  0   99   60   206 0.999  17.96
 5.01 Init - 102286 102174  113  1  2  100   96   155 0.954  15.39
 5.00 Prom - 104769 104730   40                             -10.74

 6.00 Prom + 105153 105192   40                              -4.36
 6.01 Init + 106294 106339   46  0  1   69  106    72 0.964   6.29
 6.02 Intr + 107563 107646   84  2  0  128  101    57 0.995  10.79
 6.03 Intr + 113194 113217   24  2  0   97  116    -2 0.166   1.50
 6.04 Intr + 116814 117109  296  1  2  114  116    68 0.532   8.93
 6.05 Intr + 117281 117356   76  1  1   83   74    51 0.991   2.29
 6.06 Intr + 117902 117980   79  0  1  131   98    52 0.995   9.11
 6.07 Intr + 118073 118184  112  2  1   52  116    21 0.788   1.68
 6.08 Term + 119906 120430  525  1  0  104   41   101 0.614   1.26
 6.09 PlyA + 124814 124819    6                               1.05

 7.00 Prom + 124839 124878   40                              -7.76
 7.01 Init + 127843 127925   83  0  2   65   93    98 0.736   8.44
 7.02 Term + 141524 141704  181  2  1   91   54   228 0.975  16.78
 7.03 PlyA + 141800 141805    6                               1.05

 8.00 Prom + 145696 145735   40                              -8.36
 8.01 Init + 158419 158563  145  0  1   65   96   132 0.997  11.98
 8.02 Intr + 159227 159357  131  0  2   90  100    84 0.989  10.21
 8.03 Intr + 161796 161942  147  2  0   74   92   141 0.975  13.53
 8.04 Intr + 166455 166675  221  2  2  116   86   252 0.995  24.90
 8.05 Intr + 168411 168717  307  0  1   73   65   297 0.994  22.35
 8.06 Intr + 171636 171802  167  2  2   81   65   158 0.858  11.56
 8.07 Intr + 173874 173953   80  0  2   46   95    29 0.981  -1.41
 8.08 Intr + 176785 176928  144  2  0   47   52    77 0.535   0.25
 8.09 Intr + 177638 177737  100  0  1   99  100    56 0.989   7.07
 8.10 Intr + 177910 178023  114  1  0   76   82    47 0.845   2.46
 8.11 Intr + 179494 179612  119  1  2   64   84   168 0.999  14.11
 8.12 Intr + 179884 179967   84  2  0   98   23    97 0.838   3.99
 8.13 Intr + 182633 182694   62  0  2   47   92    63 0.905   1.05
 8.14 Term + 186309 186455  147  2  0   82   53   113 0.958   5.10
 8.15 PlyA + 189537 189542    6                               1.05

 9.03 PlyA - 190034 190029    6                               1.05
 9.02 Term - 197248 197142  107  2  2   32   52    86 0.418  -2.03
 9.01 Intr - 198412 198091  322  1  1   70   47   291 0.258  18.63

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------
S.001 Term - 155054 154942  113  0  2   45   47   164 0.813   6.72

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:150166525_150368810|GENSCAN_predicted_peptide_1|130_aa
MGILADVQLQVGPPGPWLHLVVIAPVPECITGIGIFSSWGSPDVGPLLYDIRAIMWGRDP
AQSHIRCNYRAVKLKVIELYNRQALISRPLEWKSVLATVTVNTECQLDWIEGRKVLILAG
INVGIKVSSY

>gi568815597r:150166525_150368810|GENSCAN_predicted_CDS_1|393_bp
atggggattctggctgatgtccagcttcaagtgggtccaccggggccgtggctccacctg
gtggtcattgccccagtccctgagtgtataactgggattggtatattcagcagttgggga
agccccgatgtggggcccctgctttatgatataagagctatcatgtggggaagagacccc
gcccaaagtcacatccgttgcaattatcgggcagtgaaattgaaagtcatagagctctac
aatcgtcaagctcttattagcagaccgttggaatggaaatctgttttagctactgtaacg
gttaatactgagtgtcaacttgattggattgaaggacgcaaagtattgatcctggctggc
atcaatgtgggcatcaaggtgtcctcctactaa

>gi568815597r:150166525_150368810|GENSCAN_predicted_peptide_2|97_aa
MDDFEEFKTSVEEGTADVVEIARGLDLEVELEDESELPQSHDKTLTDVELLLMDKQRKRF
LEMESTCEDSVNIVEMTTKDLKYYNNLVVKAVAGFED

>gi568815597r:150166525_150368810|GENSCAN_predicted_CDS_2|294_bp
atggatgactttgaggagtttaagacatcagtggaggaaggaactgcagatgtggtagaa
atagcaaggggactagatttagaagtggagcttgaagatgaatctgaattgcctcaatct
catgataaaactttaacagatgtggagttgcttcttatggataagcaaagaaagagattt
cttgagatggaatctacttgtgaagattctgtgaacatagtggaaatgacaacaaaggat
ttaaaatattacaacaacttagttgttaaagcagtggcagggtttgaggattga

>gi568815597r:150166525_150368810|GENSCAN_predicted_peptide_3|268_aa
MEMKKKINLELRNRSPEEVTELVLDNCLCVNGEIEGLNDTFKELEFLSMANVELSSLARL
PSLNKLRKLELSDNIISGGLEVLAEKCPNLTYLNLSGNKIKDLSTVEALQNLKNLKSLDL
FNCEITNLEDYRESIFELLQQITYLDGFDQEDNEAPDSEEEDDEDGDEDDEEEEENEAGP
PEGYEEEEEEEEEEDEDEDEDEDEAGSELGEGEEEVGLSYLMKEEIQDEEDDDDYVEEGE
EEEEEEEGGLRGEKRKRDAEDDGEEEDD

>gi568815597r:150166525_150368810|GENSCAN_predicted_CDS_3|807_bp
atggagatgaagaagaagattaacctggagttaaggaacagatccccggaggaggtgaca
gagttagtccttgataattgcctgtgtgtcaatggggaaattgaaggcctgaatgatact
ttcaaagaactagaatttctgagtatggctaatgtggaactaagttcgctggcccggctt
cccagcttaaataaacttcgaaaattggagcttagtgataatataatttctggaggcttg
gaagtcctggcagagaaatgtccaaatcttacctacctcaatctgagtggaaacaaaata
aaagatctcagtacagtagaagctctgcaaaatcttaaaaatttgaaaagtcttgacctg
tttaactgtgagatcacaaacctggaagattatagagaaagtatttttgaactactgcag
caaatcacatacttagatggatttgatcaggaggataatgaagcgccggactctgaagag
gaggatgatgaggatggcgatgaagatgatgaagaggaagaggaaaatgaagctggtcca
ccggaaggatatgaggaagaggaggaggaagaggaagaggaggatgaggatgaggatgaa
gatgaagatgaagcaggttcagagttgggagagggagaagaggaagtgggcctctcatac
ttaatgaaagaagaaattcaggatgaagaagatgatgatgactatgttgaagaaggggaa
gaagaggaagaagaggaagaaggaggtcttcgaggggagaagaggaaacgagatgctgaa
gacgatggagaggaagaagatgactag

>gi568815597r:150166525_150368810|GENSCAN_predicted_peptide_4|289_aa
MLFSALLLEVIWILAADGGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQPH
GYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINS
EATFAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHK
DQKTSVPPFNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGT
LFSTEEEPSKLLVQNYRALQPLNQRMVFASFIQAGSSYTTDILRDLSLG

>gi568815597r:150166525_150368810|GENSCAN_predicted_CDS_4|870_bp
atgttgttctccgccctcctgctggaggtgatttggatcctggctgcagatgggggccca
catggtcaggaccattggccagcctcttaccctgagtgtggaaacaatgcccagtcgccc
atcgatattcagacagacagtgtgacatttgaccctgatttgcctgctctgcagccccac
ggatatgaccagcctggcaccgagcctttggacctgcacaacaatggccacacagtgcaa
ctctctctgccctctaccctgtatctgggtggacttccccgaaaatatgtagctgcccag
ctccacctgcactggggtcagaaaggatccccaggggggtcagaacaccagatcaacagt
gaagccacatttgcagagctccacattgtacattatgactctgattcctatgacagcttg
agtgaggctgctgagaggcctcagggcctggctgtcctgggcatcctaattgaggtgggt
gagactaagaatatagcttatgaacacattctgagtcacttgcatgaagtcaggcataaa
gatcagaagacctcagtgcctcccttcaacctaagagagctgctccccaaacagctgggg
cagtacttccgctacaatggctcgctcacaactcccccttgctaccagagtgtgctctgg
acagttttttatagaaggtcccagatttcaatggaacagctggaaaagcttcaggggaca
ttgttctccacagaagaggagccctctaagcttctggtacagaactaccgagcccttcag
cctctcaatcagcgcatggtctttgcttctttcatccaagcaggatcctcgtataccaca
gatatactgcgggatctctccttaggataa
>gi568815597r:150166525_150368810|GENSCAN_predicted_peptide_5|265_aa
MGAAVFFGCTFVAFGPAFALFLITVAGDPLRVIILVAGAFFWLVSLLLASVVWFILVHVT
DRSDARLQYGLLIFGAAVSVLLQEVFRFAYYKLLKKADEGLASLSEDGRSPISIRQMAYV
SGLSFGIISGVFSVINILADALGPGVVGIHGDSPYYFLTSAFLTAAIILLHTFWGVVFFD
ACERRRYWALGLVVGSHLLTSGLTFLNPWYEASLLPIYAVTVSMGLWAFITAGGSLRSIQ
RSLLCRRQEDSRVMVYSALRIPPED

>gi568815597r:150166525_150368810|GENSCAN_predicted_CDS_5|798_bp
atgggggctgcggtgtttttcggctgcactttcgtcgcgttcggcccggccttcgcgctt
ttcttgatcactgtggctggggacccgcttcgcgttatcatcctggtcgcaggggcattt
ttctggctggtctccctgctcctggcctctgtggtctggttcatcttggtccatgtgacc
gaccggtcagatgcccggctccagtacggcctcctgatttttggtgctgctgtctctgtc
cttctacaggaggtgttccgctttgcctactacaagctgcttaagaaggcagatgagggg
ttagcatcgctgagtgaggacggaagatcacccatctccatccgccagatggcctatgtt
tctggtctctccttcggtatcatcagtggtgtcttctctgttatcaatattttggctgat
gcacttgggccaggtgtggttgggatccatggagactcaccctattacttcctgacttca
gcctttctgacagcagccattatcctgctccataccttttggggagttgtgttctttgat
gcctgtgagaggagacggtactgggctttgggcctggtggttgggagtcacctactgaca
tcgggactgacattcctgaacccctggtatgaggccagcctgctgcccatctatgcagtc
actgtttccatggggctctgggccttcatcacagctggagggtccctccgaagtattcag
cgcagcctcttgtgccgacggcaggaggacagtcgggtgatggtgtattctgccctgcgc
atcccacccgaggactga

>gi568815597r:150166525_150368810|GENSCAN_predicted_peptide_6|413_aa
MDVLFVAIFAVPLILGQEYEDEERLGEDEYYQVVYYYTVTPSYGGDVFHVEVNSDFGFPS
DSEREDKGAHGPRPDTVGQRGGSRPSPGPIRCRHRSKVSGNQHTPSHPKQRGSASPMAGS
GAKRSRDGELETSLNTQGCTTEGDLLFAQKCKELQGFIPPLTDLLNGLKMGRFERGLSSF
QQSVAMDRIQRIVGVLQKPQMGERYLGTLLQVEGMLKTWFPQIAAQKSSLGGGKHQLTKH
FPSHHSDSAASSPASPMEKMDQTQLGHLALKPKQPWHLTQWPAMNLTWIHTTPICNPPLS
SPGTISFSHGPLGTGTGIGVILFLQHGVQPFTHSAPTTPVPPTTASPVIPGEPMKLSGEG
PRCYSLPVTLPSDWSYTLSPPSLPTLARKMTIGHREQQRSHPPVAADAHLLNL

>gi568815597r:150166525_150368810|GENSCAN_predicted_CDS_6|1242_bp
atggatgtcctctttgtagccatctttgctgtgccacttatcctgggacaagaatatgag
gatgaagaaagactgggagaggatgaatattatcaggtggtctattattatacagtcacc
cccagttatggtggggatgtatttcatgtagaagtgaacagtgactttggcttcccctct
gatagtgagagggaggacaagggggcccatgggcccaggccagacactgttgggcagagg
ggaggttcacggcccagcccgggtcctatccgctgcaggcatcgatcgaaggtttccggt
aaccagcatacaccatctcatccgaaacagcggggttcggcttctcctatggcaggatct
ggggcgaaaagatcaagagatggtgaactggagaccagtctaaacacccaaggttgtacc
acagagggagacctgctgtttgcccagaagtgtaaagaactccaaggatttatacctcct
ctcacagacctactcaatgggctgaagatgggtcgttttgagagaggattaagcagtttt
cagcagagtgtggcaatggacaggatccagcgtattgtaggtgttttgcagaagccacag
atgggggaacgttacctaggaaccttgctacaggtagaagggatgttaaagacttggttt
ccacaaatagctgcccagaagtcatcattgggtggtggcaagcatcagctgaccaagcat
tttccaagccaccacagtgattcagctgcttcctctcctgcatctcctatggaaaagatg
gaccagacacagctaggacatctagctttaaaaccaaagcagccttggcacctcacacaa
tggccagctatgaacctcacctggatccacaccactccaatttgcaacccccctctcagc
tccccaggtactatctcctttagccatggtcctttaggcactggaaccggcattggcgtc
attcttttcctccagcatggagtgcaacccttcacccactctgccccaaccaccccagtc
ccacctactacagcatctcctgtcatccctggtgagcctatgaaactatctggagagggt
cctcgttgctacagtttgccagtaactctgccatcagactggagctataccctatcccct
cccagtctacccaccttggccagaaagatgaccataggacaccgggagcagcagagaagc
catcctccagttgctgctgatgctcatcttctcaacctctag

>gi568815597r:150166525_150368810|GENSCAN_predicted_peptide_7|87_aa
MAKHLKFIARTVMVQEGNVESAYRTLNRILTMDGLIEDIKHRRYYEKPCCRRQRESYERC
RRIYNMEMARKINFLMRKNRADPWQGC

>gi568815597r:150166525_150368810|GENSCAN_predicted_CDS_7|264_bp
atggcaaaacatctgaagttcatcgccaggactgtgatggtacaggaagggaacgtggaa
agcgcatacaggaccctaaacagaatcctcactatggatgggctcattgaggacattaag
catcggcggtattatgagaagccatgctgccggcgacagagggaaagctatgaaaggtgc
cggcggatctacaacatggaaatggctcgcaagatcaacttcttgatgcgaaagaatcgg
gcagatccgtggcagggctgctga

>gi568815597r:150166525_150368810|GENSCAN_predicted_peptide_8|655_aa
MALSKRELDELKPWIEKTVKRVLGFSEPTVVTAALNCVGKGMDKKKAADHLKPFLDDSTL
RFVDKLFEAVEEGRSSRHSKSSSDRSRKRELKEVFGDDSEISKESSGVKKRRIPRFEEVE
EEPEVIPGPPSESPGMLTKLQPKTPSSSQPERLPIGNTIQPSQAATFMNDAIEKARKAAE
LQARIQAQLALKPGLIGNANMVGLANLHAMGIAPPKVELKDQTKPTPLILDEQGRTVDAT
GKEIELTHRMPTLKANIRAVKREQFKQQLKEKPSEDMESNTFFDPRVSIAPSQRQRRTFK
FHDKGKFEKIAQRLRTKAQLEKLQAEISQAARKTGIHTSTRLALIAPKKELKEGDIPEIE
WWDSYIIPNGFDLTEENPKREDYFGITNLVEHPAQLNPPVDNDTPVTLGVYLTKKEQKKL
RRQTRREAQKELQEKVRLGLMPPPEPKVRISNLMRVLGTEAVQDPTKVEAHVRAQMAKRQ
KAHEEANAARKLTAEQRKVKKIKKLKEDISQGVHISVYRVRNLSNPAKKFKIEANAGQLY
LTGVVVLHKDVNVVVVEGGPKAQKKFKRLMLHRIKWDEQTSNTKGDDDEESDEEAVKKTN
KCVLVWEGTAKDRSFGEMKFKQCPTENMAREHFKKHGAEHYWDLALSESVLESTD

>gi568815597r:150166525_150368810|GENSCAN_predicted_CDS_8|1968_bp
atggcactgtcaaagagggagctggatgagctgaaaccatggatagagaagacagtgaag
agggtcctgggtttctcagagcctacggtggtcacagcagcattgaactgtgtggggaag
ggcatggacaagaagaaggcagccgatcatctgaaaccttttcttgatgattctactctc
cgatttgtggacaaactgtttgaggctgtggaggaaggccgaagctctaggcattccaag
tctagcagtgacaggagcagaaaacgagagctaaaggaggtgtttggtgatgactctgag
atctctaaagaatcatcaggagtaaagaagcgacgaataccccgttttgaggaggtggaa
gaagagccagaggtgatccctgggcctccatcagagagccctggcatgctgactaagctc
cagccaaagactccttcttcctcccaaccagaacgacttcctattggcaacactattcag
ccctcccaggctgccactttcatgaatgatgccattgagaaggcaaggaaagcagctgaa
ctgcaagctcgaatccaagcccagctggcactgaagccaggactcatcggcaatgccaac
atggtgggcctggctaatctccatgccatgggcattgctcccccgaaggtggagttaaaa
gaccaaacgaaacctacaccactgatcctggatgagcaagggcgcactgtagatgcaaca
ggcaaggagattgagctgacacaccgcatgcctactctgaaagccaatattcgtgctgtg
aagagggaacaattcaagcaacaactaaaggaaaagccatcagaagacatggaatccaat
accttttttgacccccgagtctccattgccccttcccagcgccagagacgcacttttaaa
ttccatgacaagggcaaatttgagaagattgctcagcgattacggacaaaggctcaactg
gagaagctacaggcagagatttcacaagcagctcgaaaaacaggcatccatacttcgact
aggcttgccctcattgctcctaagaaggagctaaaggaaggagatattcctgaaattgag
tggtgggactcttacataatccccaatggctttgatcttacagaggaaaatcccaagaga
gaagattattttggaatcacaaatcttgttgaacatccagcccagctcaatcctccagtt
gacaatgacacaccagttactctgggagtatatcttaccaagaaggaacagaaaaaactt
cggagacaaacaaggagggaagcacagaaggaactacaagaaaaagtcaggctgggcctg
atgcctcctccagaacccaaagtgagaatttctaatttgatgcgagtattaggaacagaa
gctgttcaagaccccacgaaggtagaagcccacgtcagagctcagatggcaaaaagacag
aaagcgcatgaagaggccaacgctgcccgaaaactcacagcagaacagagaaaggtcaag
aaaattaaaaagcttaaagaagacatttcacagggggtacacatatctgtatatagagtt
cgaaatttgagcaacccagccaagaagttcaagattgaagccaatgctgggcaactgtac
ctgacaggggtggtggtactgcacaaggatgtcaacgtggtagtagtggaagggggcccc
aaggcccagaagaaatttaagcgtcttatgctgcatcggataaagtgggatgaacagaca
tctaacacaaagggagatgatgatgaggagtctgatgaggaagctgtgaagaaaaccaac
aaatgtgtactagtctgggagggtacagccaaagaccggagctttggagagatgaagttt
aaacagtgtcctacagagaacatggctcgtgagcatttcaaaaagcatggggctgaacac
tactgggaccttgcgctgagtgaatctgtgttagagtccactgattga

>gi568815597r:150166525_150368810|GENSCAN_predicted_peptide_9|142_aa
SPPTLTSPEPLHPMIDDSTVVFFVLYTPRRQALNGLHGVGYRLEFSIQRGLQSPCRRGRR
GGGLTAASAAAGRHLPGLHKRCCSSGGGGGGGGGSGQNNHCECVKAVGGCSHRPQLLGGA
TRDKMAGSHLAAPQKMRMRQIQ

>gi568815597r:150166525_150368810|GENSCAN_predicted_CDS_9|429_bp
tcacccccaacactcacatctccggagccacttcatccaatgatagacgatagtactgtg
gtgttttttgttctctatacaccaagacgacaagccttgaatggactccatggtgttggt
taccgactggaattttcgatccaacgaggactccagagcccctgccgaagaggccgacga
ggaggaggccttactgctgcctccgccgccgccggccgccatcttcccggtttgcacaag
cgctgctgctcctctggcggcggcggcggcggcggcggcgggagcgggcaaaacaatcac
tgcgagtgcgtgaaagccgtagggggctgcagccatcgcccccaactcctcggaggagca
acgcgcgacaaaatggctggcagccatcttgcggcaccacaaaagatgcgcatgcgccag
atccagtga