GENSCAN 1.0	Date run:  8-Nov-116	Time: 09:29:46

Sequence gi568815597f:150172818_150379735 : 206918 bp : 44.97% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    156    198   43  2  1   65   86    45 0.649   2.58
 1.02 Intr +    923    997   75  0  0   63  115    28 0.123   2.49
 1.03 Intr +  16933  17073  141  2  0   52   57    81 0.399   1.72
 1.04 Term +  20610  20647   38  1  2   74   47    85 0.517   0.50
 1.05 PlyA +  23103  23108    6                               1.05

 2.02 PlyA -  25762  25757    6                               1.05
 2.01 Sngl -  27015  26722  294  0  0   53   37   204 0.910   7.50
 2.00 Prom -  40437  40398   40                              -5.46

 3.08 PlyA -  41128  41123    6                               1.05
 3.07 Term -  47944  47874   71  2  2   50   36   145 0.983   3.60
 3.06 Intr -  50423  50369   55  2  1   18   96   136 0.719   5.95
 3.05 Intr -  53978  53791  188  0  2   48   68   355 0.984  28.91
 3.04 Intr -  56420  56255  166  2  1   58   51   143 0.983   7.13
 3.03 Intr -  57876  57754  123  0  0   25  119    40 0.749   1.58
 3.02 Intr -  59109  58960  150  0  0   92  106    76 0.999  10.16
 3.01 Init -  62969  62916   54  2  0   93   78    79 0.952   8.68
 3.00 Prom -  78501  78462   40                              -4.36

 4.00 Prom +  84921  84960   40                              -4.26
 4.01 Init +  85312  85366   55  0  1   53   61    99 0.932   3.22
 4.02 Intr +  88642  88821  180  2  0   57   98   182 0.867  15.94
 4.03 Intr +  89341  89483  143  2  2   99   66    25 0.998   1.47
 4.04 Intr +  89708  89803   96  1  0  114   78    46 0.988   6.41
 4.05 Intr +  89987  90053   67  1  1  146  105     1 0.979   6.08
 4.06 Intr +  90225  90382  158  1  2   36  111   111 0.966   7.93
 4.07 Intr +  90482  90602  121  1  1   96   55    86 0.987   6.17
 4.08 Intr +  90842  90862   21  0  0  101  115    17 0.802   3.22
 4.09 Term +  92168  92196   29  0  2  103   43    23 0.294  -2.46
 4.10 PlyA +  92241  92246    6                              -1.75

 5.08 PlyA -  92605  92600    6                               1.05
 5.07 Term -  93377  93313   65  0  2  116   55    92 0.992   6.75
 5.06 Intr -  93839  93716  124  2  1  101   47   118 0.995   9.16
 5.05 Intr -  94385  94258  128  0  2   97  105    50 0.996   8.00
 5.04 Intr -  94661  94539  123  0  0   83  110    44 0.985   6.66
 5.03 Intr -  94972  94899   74  0  2  100   89    37 0.999   4.05
 5.02 Intr -  95310  95140  171  2  0   99   60   206 0.999  17.96
 5.01 Init -  95993  95881  113  2  2  100   96   155 0.954  15.39
 5.00 Prom -  98476  98437   40                             -10.74

 6.00 Prom +  98860  98899   40                              -4.36
 6.01 Init + 100001 100046   46  1  1   69  106    72 0.964   6.29
 6.02 Intr + 101270 101353   84  0  0  128  101    57 0.995  10.79
 6.03 Intr + 106901 106924   24  0  0   97  116    -2 0.166   1.50
 6.04 Intr + 110521 110816  296  2  2  114  116    68 0.532   8.93
 6.05 Intr + 110988 111063   76  2  1   83   74    51 0.991   2.29
 6.06 Intr + 111609 111687   79  1  1  131   98    52 0.995   9.11
 6.07 Intr + 111780 111891  112  0  1   52  116    21 0.788   1.68
 6.08 Term + 113613 114137  525  2  0  104   41   101 0.614   1.26
 6.09 PlyA + 118521 118526    6                               1.05

 7.00 Prom + 118546 118585   40                              -7.76
 7.01 Init + 121550 121632   83  1  2   65   93    98 0.736   8.44
 7.02 Term + 135231 135411  181  0  1   91   54   228 0.975  16.78
 7.03 PlyA + 135507 135512    6                               1.05

 8.00 Prom + 139403 139442   40                              -8.36
 8.01 Init + 152126 152270  145  1  1   65   96   132 0.997  11.98
 8.02 Intr + 152934 153064  131  1  2   90  100    84 0.989  10.21
 8.03 Intr + 155503 155649  147  0  0   74   92   141 0.975  13.53
 8.04 Intr + 160162 160382  221  0  2  116   86   252 0.995  24.90
 8.05 Intr + 162118 162424  307  1  1   73   65   297 0.994  22.35
 8.06 Intr + 165343 165509  167  0  2   81   65   158 0.858  11.56
 8.07 Intr + 167581 167660   80  1  2   46   95    29 0.981  -1.41
 8.08 Intr + 170492 170635  144  0  0   47   52    77 0.535   0.25
 8.09 Intr + 171345 171444  100  1  1   99  100    56 0.989   7.07
 8.10 Intr + 171617 171730  114  2  0   76   82    47 0.845   2.46
 8.11 Intr + 173201 173319  119  2  2   64   84   168 0.999  14.11
 8.12 Intr + 173591 173674   84  0  0   98   23    97 0.838   3.99
 8.13 Intr + 176340 176401   62  1  2   47   92    63 0.905   1.05
 8.14 Term + 180016 180162  147  0  0   82   53   113 0.957   5.10
 8.15 PlyA + 183244 183249    6                               1.05

 9.03 PlyA - 183741 183736    6                               1.05
 9.02 Term - 190955 190849  107  0  2   32   52    86 0.414  -2.03
 9.01 Intr - 192119 191798  322  2  1   70   47   291 0.184  18.63

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 148761 148649  113  1  2   45   47   164 0.813   6.72

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:150172818_150379735|GENSCAN_predicted_peptide_1|98_aa
MSDPNCLDSGQTSQGHMLLWRTEKGEQTPIMTPNSKRTEVKLKVIELYNRQALISRPLEW
KSVLATVTVNTECQLDWIEGRKVLILAGINVGIKVSSY

>gi568815597f:150172818_150379735|GENSCAN_predicted_CDS_1|297_bp
atgtctgaccccaactgcctggattcaggccagacttcacagggtcacatgttgctctgg
agaaccgagaagggtgagcagacaccaatcatgaccccaaacagcaagaggacagaagtg
aaattgaaagtcatagagctctacaatcgtcaagctcttattagcagaccgttggaatgg
aaatctgttttagctactgtaacggttaatactgagtgtcaacttgattggattgaagga
cgcaaagtattgatcctggctggcatcaatgtgggcatcaaggtgtcctcctactaa

>gi568815597f:150172818_150379735|GENSCAN_predicted_peptide_2|97_aa
MDDFEEFKTSVEEGTADVVEIARGLDLEVELEDESELPQSHDKTLTDVELLLMDKQRKRF
LEMESTCEDSVNIVEMTTKDLKYYNNLVVKAVAGFED

>gi568815597f:150172818_150379735|GENSCAN_predicted_CDS_2|294_bp
atggatgactttgaggagtttaagacatcagtggaggaaggaactgcagatgtggtagaa
atagcaaggggactagatttagaagtggagcttgaagatgaatctgaattgcctcaatct
catgataaaactttaacagatgtggagttgcttcttatggataagcaaagaaagagattt
cttgagatggaatctacttgtgaagattctgtgaacatagtggaaatgacaacaaaggat
ttaaaatattacaacaacttagttgttaaagcagtggcagggtttgaggattga

>gi568815597f:150172818_150379735|GENSCAN_predicted_peptide_3|268_aa
MEMKKKINLELRNRSPEEVTELVLDNCLCVNGEIEGLNDTFKELEFLSMANVELSSLARL
PSLNKLRKLELSDNIISGGLEVLAEKCPNLTYLNLSGNKIKDLSTVEALQNLKNLKSLDL
FNCEITNLEDYRESIFELLQQITYLDGFDQEDNEAPDSEEEDDEDGDEDDEEEEENEAGP
PEGYEEEEEEEEEEDEDEDEDEDEAGSELGEGEEEVGLSYLMKEEIQDEEDDDDYVEEGE
EEEEEEEGGLRGEKRKRDAEDDGEEEDD

>gi568815597f:150172818_150379735|GENSCAN_predicted_CDS_3|807_bp
atggagatgaagaagaagattaacctggagttaaggaacagatccccggaggaggtgaca
gagttagtccttgataattgcctgtgtgtcaatggggaaattgaaggcctgaatgatact
ttcaaagaactagaatttctgagtatggctaatgtggaactaagttcgctggcccggctt
cccagcttaaataaacttcgaaaattggagcttagtgataatataatttctggaggcttg
gaagtcctggcagagaaatgtccaaatcttacctacctcaatctgagtggaaacaaaata
aaagatctcagtacagtagaagctctgcaaaatcttaaaaatttgaaaagtcttgacctg
tttaactgtgagatcacaaacctggaagattatagagaaagtatttttgaactactgcag
caaatcacatacttagatggatttgatcaggaggataatgaagcgccggactctgaagag
gaggatgatgaggatggcgatgaagatgatgaagaggaagaggaaaatgaagctggtcca
ccggaaggatatgaggaagaggaggaggaagaggaagaggaggatgaggatgaggatgaa
gatgaagatgaagcaggttcagagttgggagagggagaagaggaagtgggcctctcatac
ttaatgaaagaagaaattcaggatgaagaagatgatgatgactatgttgaagaaggggaa
gaagaggaagaagaggaagaaggaggtcttcgaggggagaagaggaaacgagatgctgaa
gacgatggagaggaagaagatgactag

>gi568815597f:150172818_150379735|GENSCAN_predicted_peptide_4|289_aa
MLFSALLLEVIWILAADGGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQPH
GYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINS
EATFAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHK
DQKTSVPPFNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGT
LFSTEEEPSKLLVQNYRALQPLNQRMVFASFIQAGSSYTTDILRDLSLG

>gi568815597f:150172818_150379735|GENSCAN_predicted_CDS_4|870_bp
atgttgttctccgccctcctgctggaggtgatttggatcctggctgcagatgggggccca
catggtcaggaccattggccagcctcttaccctgagtgtggaaacaatgcccagtcgccc
atcgatattcagacagacagtgtgacatttgaccctgatttgcctgctctgcagccccac
ggatatgaccagcctggcaccgagcctttggacctgcacaacaatggccacacagtgcaa
ctctctctgccctctaccctgtatctgggtggacttccccgaaaatatgtagctgcccag
ctccacctgcactggggtcagaaaggatccccaggggggtcagaacaccagatcaacagt
gaagccacatttgcagagctccacattgtacattatgactctgattcctatgacagcttg
agtgaggctgctgagaggcctcagggcctggctgtcctgggcatcctaattgaggtgggt
gagactaagaatatagcttatgaacacattctgagtcacttgcatgaagtcaggcataaa
gatcagaagacctcagtgcctcccttcaacctaagagagctgctccccaaacagctgggg
cagtacttccgctacaatggctcgctcacaactcccccttgctaccagagtgtgctctgg
acagttttttatagaaggtcccagatttcaatggaacagctggaaaagcttcaggggaca
ttgttctccacagaagaggagccctctaagcttctggtacagaactaccgagcccttcag
cctctcaatcagcgcatggtctttgcttctttcatccaagcaggatcctcgtataccaca
gatatactgcgggatctctccttaggataa

>gi568815597f:150172818_150379735|GENSCAN_predicted_peptide_5|265_aa
MGAAVFFGCTFVAFGPAFALFLITVAGDPLRVIILVAGAFFWLVSLLLASVVWFILVHVT
DRSDARLQYGLLIFGAAVSVLLQEVFRFAYYKLLKKADEGLASLSEDGRSPISIRQMAYV
SGLSFGIISGVFSVINILADALGPGVVGIHGDSPYYFLTSAFLTAAIILLHTFWGVVFFD
ACERRRYWALGLVVGSHLLTSGLTFLNPWYEASLLPIYAVTVSMGLWAFITAGGSLRSIQ
RSLLCRRQEDSRVMVYSALRIPPED

>gi568815597f:150172818_150379735|GENSCAN_predicted_CDS_5|798_bp
atgggggctgcggtgtttttcggctgcactttcgtcgcgttcggcccggccttcgcgctt
ttcttgatcactgtggctggggacccgcttcgcgttatcatcctggtcgcaggggcattt
ttctggctggtctccctgctcctggcctctgtggtctggttcatcttggtccatgtgacc
gaccggtcagatgcccggctccagtacggcctcctgatttttggtgctgctgtctctgtc
cttctacaggaggtgttccgctttgcctactacaagctgcttaagaaggcagatgagggg
ttagcatcgctgagtgaggacggaagatcacccatctccatccgccagatggcctatgtt
tctggtctctccttcggtatcatcagtggtgtcttctctgttatcaatattttggctgat
gcacttgggccaggtgtggttgggatccatggagactcaccctattacttcctgacttca
gcctttctgacagcagccattatcctgctccataccttttggggagttgtgttctttgat
gcctgtgagaggagacggtactgggctttgggcctggtggttgggagtcacctactgaca
tcgggactgacattcctgaacccctggtatgaggccagcctgctgcccatctatgcagtc
actgtttccatggggctctgggccttcatcacagctggagggtccctccgaagtattcag
cgcagcctcttgtgccgacggcaggaggacagtcgggtgatggtgtattctgccctgcgc
atcccacccgaggactga

>gi568815597f:150172818_150379735|GENSCAN_predicted_peptide_6|413_aa
MDVLFVAIFAVPLILGQEYEDEERLGEDEYYQVVYYYTVTPSYGGDVFHVEVNSDFGFPS
DSEREDKGAHGPRPDTVGQRGGSRPSPGPIRCRHRSKVSGNQHTPSHPKQRGSASPMAGS
GAKRSRDGELETSLNTQGCTTEGDLLFAQKCKELQGFIPPLTDLLNGLKMGRFERGLSSF
QQSVAMDRIQRIVGVLQKPQMGERYLGTLLQVEGMLKTWFPQIAAQKSSLGGGKHQLTKH
FPSHHSDSAASSPASPMEKMDQTQLGHLALKPKQPWHLTQWPAMNLTWIHTTPICNPPLS
SPGTISFSHGPLGTGTGIGVILFLQHGVQPFTHSAPTTPVPPTTASPVIPGEPMKLSGEG
PRCYSLPVTLPSDWSYTLSPPSLPTLARKMTIGHREQQRSHPPVAADAHLLNL

>gi568815597f:150172818_150379735|GENSCAN_predicted_CDS_6|1242_bp
atggatgtcctctttgtagccatctttgctgtgccacttatcctgggacaagaatatgag
gatgaagaaagactgggagaggatgaatattatcaggtggtctattattatacagtcacc
cccagttatggtggggatgtatttcatgtagaagtgaacagtgactttggcttcccctct
gatagtgagagggaggacaagggggcccatgggcccaggccagacactgttgggcagagg
ggaggttcacggcccagcccgggtcctatccgctgcaggcatcgatcgaaggtttccggt
aaccagcatacaccatctcatccgaaacagcggggttcggcttctcctatggcaggatct
ggggcgaaaagatcaagagatggtgaactggagaccagtctaaacacccaaggttgtacc
acagagggagacctgctgtttgcccagaagtgtaaagaactccaaggatttatacctcct
ctcacagacctactcaatgggctgaagatgggtcgttttgagagaggattaagcagtttt
cagcagagtgtggcaatggacaggatccagcgtattgtaggtgttttgcagaagccacag
atgggggaacgttacctaggaaccttgctacaggtagaagggatgttaaagacttggttt
ccacaaatagctgcccagaagtcatcattgggtggtggcaagcatcagctgaccaagcat
tttccaagccaccacagtgattcagctgcttcctctcctgcatctcctatggaaaagatg
gaccagacacagctaggacatctagctttaaaaccaaagcagccttggcacctcacacaa
tggccagctatgaacctcacctggatccacaccactccaatttgcaacccccctctcagc
tccccaggtactatctcctttagccatggtcctttaggcactggaaccggcattggcgtc
attcttttcctccagcatggagtgcaacccttcacccactctgccccaaccaccccagtc
ccacctactacagcatctcctgtcatccctggtgagcctatgaaactatctggagagggt
cctcgttgctacagtttgccagtaactctgccatcagactggagctataccctatcccct
cccagtctacccaccttggccagaaagatgaccataggacaccgggagcagcagagaagc
catcctccagttgctgctgatgctcatcttctcaacctctag

>gi568815597f:150172818_150379735|GENSCAN_predicted_peptide_7|87_aa
MAKHLKFIARTVMVQEGNVESAYRTLNRILTMDGLIEDIKHRRYYEKPCCRRQRESYERC
RRIYNMEMARKINFLMRKNRADPWQGC

>gi568815597f:150172818_150379735|GENSCAN_predicted_CDS_7|264_bp
atggcaaaacatctgaagttcatcgccaggactgtgatggtacaggaagggaacgtggaa
agcgcatacaggaccctaaacagaatcctcactatggatgggctcattgaggacattaag
catcggcggtattatgagaagccatgctgccggcgacagagggaaagctatgaaaggtgc
cggcggatctacaacatggaaatggctcgcaagatcaacttcttgatgcgaaagaatcgg
gcagatccgtggcagggctgctga

>gi568815597f:150172818_150379735|GENSCAN_predicted_peptide_8|655_aa
MALSKRELDELKPWIEKTVKRVLGFSEPTVVTAALNCVGKGMDKKKAADHLKPFLDDSTL
RFVDKLFEAVEEGRSSRHSKSSSDRSRKRELKEVFGDDSEISKESSGVKKRRIPRFEEVE
EEPEVIPGPPSESPGMLTKLQPKTPSSSQPERLPIGNTIQPSQAATFMNDAIEKARKAAE
LQARIQAQLALKPGLIGNANMVGLANLHAMGIAPPKVELKDQTKPTPLILDEQGRTVDAT
GKEIELTHRMPTLKANIRAVKREQFKQQLKEKPSEDMESNTFFDPRVSIAPSQRQRRTFK
FHDKGKFEKIAQRLRTKAQLEKLQAEISQAARKTGIHTSTRLALIAPKKELKEGDIPEIE
WWDSYIIPNGFDLTEENPKREDYFGITNLVEHPAQLNPPVDNDTPVTLGVYLTKKEQKKL
RRQTRREAQKELQEKVRLGLMPPPEPKVRISNLMRVLGTEAVQDPTKVEAHVRAQMAKRQ
KAHEEANAARKLTAEQRKVKKIKKLKEDISQGVHISVYRVRNLSNPAKKFKIEANAGQLY
LTGVVVLHKDVNVVVVEGGPKAQKKFKRLMLHRIKWDEQTSNTKGDDDEESDEEAVKKTN
KCVLVWEGTAKDRSFGEMKFKQCPTENMAREHFKKHGAEHYWDLALSESVLESTD

>gi568815597f:150172818_150379735|GENSCAN_predicted_CDS_8|1968_bp
atggcactgtcaaagagggagctggatgagctgaaaccatggatagagaagacagtgaag
agggtcctgggtttctcagagcctacggtggtcacagcagcattgaactgtgtggggaag
ggcatggacaagaagaaggcagccgatcatctgaaaccttttcttgatgattctactctc
cgatttgtggacaaactgtttgaggctgtggaggaaggccgaagctctaggcattccaag
tctagcagtgacaggagcagaaaacgagagctaaaggaggtgtttggtgatgactctgag
atctctaaagaatcatcaggagtaaagaagcgacgaataccccgttttgaggaggtggaa
gaagagccagaggtgatccctgggcctccatcagagagccctggcatgctgactaagctc
cagccaaagactccttcttcctcccaaccagaacgacttcctattggcaacactattcag
ccctcccaggctgccactttcatgaatgatgccattgagaaggcaaggaaagcagctgaa
ctgcaagctcgaatccaagcccagctggcactgaagccaggactcatcggcaatgccaac
atggtgggcctggctaatctccatgccatgggcattgctcccccgaaggtggagttaaaa
gaccaaacgaaacctacaccactgatcctggatgagcaagggcgcactgtagatgcaaca
ggcaaggagattgagctgacacaccgcatgcctactctgaaagccaatattcgtgctgtg
aagagggaacaattcaagcaacaactaaaggaaaagccatcagaagacatggaatccaat
accttttttgacccccgagtctccattgccccttcccagcgccagagacgcacttttaaa
ttccatgacaagggcaaatttgagaagattgctcagcgattacggacaaaggctcaactg
gagaagctacaggcagagatttcacaagcagctcgaaaaacaggcatccatacttcgact
aggcttgccctcattgctcctaagaaggagctaaaggaaggagatattcctgaaattgag
tggtgggactcttacataatccccaatggctttgatcttacagaggaaaatcccaagaga
gaagattattttggaatcacaaatcttgttgaacatccagcccagctcaatcctccagtt
gacaatgacacaccagttactctgggagtatatcttaccaagaaggaacagaaaaaactt
cggagacaaacaaggagggaagcacagaaggaactacaagaaaaagtcaggctgggcctg
atgcctcctccagaacccaaagtgagaatttctaatttgatgcgagtattaggaacagaa
gctgttcaagaccccacgaaggtagaagcccacgtcagagctcagatggcaaaaagacag
aaagcgcatgaagaggccaacgctgcccgaaaactcacagcagaacagagaaaggtcaag
aaaattaaaaagcttaaagaagacatttcacagggggtacacatatctgtatatagagtt
cgaaatttgagcaacccagccaagaagttcaagattgaagccaatgctgggcaactgtac
ctgacaggggtggtggtactgcacaaggatgtcaacgtggtagtagtggaagggggcccc
aaggcccagaagaaatttaagcgtcttatgctgcatcggataaagtgggatgaacagaca
tctaacacaaagggagatgatgatgaggagtctgatgaggaagctgtgaagaaaaccaac
aaatgtgtactagtctgggagggtacagccaaagaccggagctttggagagatgaagttt
aaacagtgtcctacagagaacatggctcgtgagcatttcaaaaagcatggggctgaacac
tactgggaccttgcgctgagtgaatctgtgttagagtccactgattga

>gi568815597f:150172818_150379735|GENSCAN_predicted_peptide_9|142_aa
SPPTLTSPEPLHPMIDDSTVVFFVLYTPRRQALNGLHGVGYRLEFSIQRGLQSPCRRGRR
GGGLTAASAAAGRHLPGLHKRCCSSGGGGGGGGGSGQNNHCECVKAVGGCSHRPQLLGGA
TRDKMAGSHLAAPQKMRMRQIQ

>gi568815597f:150172818_150379735|GENSCAN_predicted_CDS_9|429_bp
tcacccccaacactcacatctccggagccacttcatccaatgatagacgatagtactgtg
gtgttttttgttctctatacaccaagacgacaagccttgaatggactccatggtgttggt
taccgactggaattttcgatccaacgaggactccagagcccctgccgaagaggccgacga
ggaggaggccttactgctgcctccgccgccgccggccgccatcttcccggtttgcacaag
cgctgctgctcctctggcggcggcggcggcggcggcggcgggagcgggcaaaacaatcac
tgcgagtgcgtgaaagccgtagggggctgcagccatcgcccccaactcctcggaggagca
acgcgcgacaaaatggctggcagccatcttgcggcaccacaaaagatgcgcatgcgccag
atccagtga