GENSCAN 1.0 Date run: 8-Nov-116 Time: 04:51:38

Sequence gi568815597f:150224943_150452976 : 228034 bp : 44.16% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 Intr -   1853   1666  188  0  2   48   68   355 0.131  28.91
 1.04 Intr -   4295   4130  166  2  1   58   51   143 0.983   7.13
 1.03 Intr -   5751   5629  123  0  0   25  119    40 0.749   1.58
 1.02 Intr -   6984   6835  150  0  0   92  106    76 0.999  10.16
 1.01 Init -  10844  10791   54  2  0   93   78    79 0.952   8.68
 1.00 Prom -  26376  26337   40                              -4.36

 2.00 Prom +  32796  32835   40                              -4.26
 2.01 Init +  33187  33241   55  0  1   53   61    99 0.932   3.22
 2.02 Intr +  36517  36696  180  2  0   57   98   182 0.867  15.94
 2.03 Intr +  37216  37358  143  2  2   99   66    25 0.998   1.47
 2.04 Intr +  37583  37678   96  1  0  114   78    46 0.988   6.41
 2.05 Intr +  37862  37928   67  1  1  146  105     1 0.979   6.08
 2.06 Intr +  38100  38257  158  1  2   36  111   111 0.966   7.93
 2.07 Intr +  38357  38477  121  1  1   96   55    86 0.987   6.17
 2.08 Intr +  38717  38737   21  0  0  101  115    17 0.802   3.22
 2.09 Term +  40043  40071   29  0  2  103   43    23 0.294  -2.46
 2.10 PlyA +  40116  40121    6                              -1.75

 3.08 PlyA -  40480  40475    6                               1.05
 3.07 Term -  41252  41188   65  0  2  116   55    92 0.992   6.75
 3.06 Intr -  41714  41591  124  2  1  101   47   118 0.995   9.16
 3.05 Intr -  42260  42133  128  0  2   97  105    50 0.996   8.00
 3.04 Intr -  42536  42414  123  0  0   83  110    44 0.985   6.66
 3.03 Intr -  42847  42774   74  0  2  100   89    37 0.999   4.05
 3.02 Intr -  43185  43015  171  2  0   99   60   206 0.999  17.96
 3.01 Init -  43868  43756  113  2  2  100   96   155 0.954  15.39
 3.00 Prom -  46351  46312   40                             -10.74

 4.00 Prom +  46735  46774   40                              -4.36
 4.01 Init +  47876  47921   46  1  1   69  106    72 0.964   6.29
 4.02 Intr +  49145  49228   84  0  0  128  101    57 0.995  10.79
 4.03 Intr +  54776  54799   24  0  0   97  116    -2 0.166   1.50
 4.04 Intr +  58396  58691  296  2  2  114  116    68 0.532   8.93
 4.05 Intr +  58863  58938   76  2  1   83   74    51 0.991   2.29
 4.06 Intr +  59484  59562   79  1  1  131   98    52 0.995   9.11
 4.07 Intr +  59655  59766  112  0  1   52  116    21 0.788   1.68
 4.08 Term +  61488  62012  525  2  0  104   41   101 0.614   1.26
 4.09 PlyA +  66396  66401    6                               1.05

 5.00 Prom +  66421  66460   40                              -7.76
 5.01 Init +  69425  69507   83  1  2   65   93    98 0.736   8.44
 5.02 Term +  83106  83286  181  0  1   91   54   228 0.975  16.78
 5.03 PlyA +  83382  83387    6                               1.05

 6.00 Prom +  87278  87317   40                              -8.36
 6.01 Init + 100001 100145  145  1  1   65   96   132 0.997  11.98
 6.02 Intr + 100809 100939  131  1  2   90  100    84 0.989  10.21
 6.03 Intr + 103378 103524  147  0  0   74   92   141 0.975  13.53
 6.04 Intr + 108037 108257  221  0  2  116   86   252 0.995  24.90
 6.05 Intr + 109993 110299  307  1  1   73   65   297 0.994  22.35
 6.06 Intr + 113218 113384  167  0  2   81   65   158 0.858  11.56
 6.07 Intr + 115456 115535   80  1  2   46   95    29 0.981  -1.41
 6.08 Intr + 118367 118510  144  0  0   47   52    77 0.535   0.25
 6.09 Intr + 119220 119319  100  1  1   99  100    56 0.989   7.07
 6.10 Intr + 119492 119605  114  2  0   76   82    47 0.845   2.46
 6.11 Intr + 121076 121194  119  2  2   64   84   168 0.999  14.11
 6.12 Intr + 121466 121592  127  0  1   98   59    98 0.105   7.64
 6.13 Intr + 139625 139800  176  2  2   77   64   116 0.003   7.68
 6.14 Intr + 139834 139977  144  2  0   88   76    93 0.012   8.35
 6.15 Intr + 185014 185163  150  2  0   33   89    56 0.102   0.33
 6.16 Intr + 192654 192783  130  1  1   98  100    68 0.480   8.75
 6.17 Intr + 215981 216081  101  2  2  101   80    -6 0.085  -0.35
 6.18 Intr + 218289 218341   53  1  2   75   91    49 0.841   2.53
 6.19 Intr + 219315 219435  121  2  1   64   90    68 0.656   4.67
 6.20 Intr + 221284 221459  176  2  2   62  101   186 0.974  16.96
 6.21 Term + 225043 225054   12  0  0  115   36     8 0.586  -3.50
 6.22 PlyA + 227433 227438    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -   1853   1624  230  0  2   48   48   358 0.864  24.59
S.002 Term -  96636  96524  113  1  2   45   47   164 0.813   6.72
S.003 Intr + 121466 121549   84  0  0   98   23    97 0.838   3.99
S.004 Intr + 124215 124276   62  1  2   47   92    63 0.905   1.05
S.005 Term + 127891 128037  147  0  0   82   53   113 0.957   5.10

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:150224943_150452976|GENSCAN_predicted_peptide_1|227_aa
MEMKKKINLELRNRSPEEVTELVLDNCLCVNGEIEGLNDTFKELEFLSMANVELSSLARL
PSLNKLRKLELSDNIISGGLEVLAEKCPNLTYLNLSGNKIKDLSTVEALQNLKNLKSLDL
FNCEITNLEDYRESIFELLQQITYLDGFDQEDNEAPDSEEEDDEDGDEDDEEEEENEAGP
PEGYEEEEEEEEEEDEDEDEDEDEAGSELGEGEEEVGLSYLMKEEIQ

>gi568815597f:150224943_150452976|GENSCAN_predicted_CDS_1|681_bp
atggagatgaagaagaagattaacctggagttaaggaacagatccccggaggaggtgaca
gagttagtccttgataattgcctgtgtgtcaatggggaaattgaaggcctgaatgatact
ttcaaagaactagaatttctgagtatggctaatgtggaactaagttcgctggcccggctt
cccagcttaaataaacttcgaaaattggagcttagtgataatataatttctggaggcttg
gaagtcctggcagagaaatgtccaaatcttacctacctcaatctgagtggaaacaaaata
aaagatctcagtacagtagaagctctgcaaaatcttaaaaatttgaaaagtcttgacctg
tttaactgtgagatcacaaacctggaagattatagagaaagtatttttgaactactgcag
caaatcacatacttagatggatttgatcaggaggataatgaagcgccggactctgaagag
gaggatgatgaggatggcgatgaagatgatgaagaggaagaggaaaatgaagctggtcca
ccggaaggatatgaggaagaggaggaggaagaggaagaggaggatgaggatgaggatgaa
gatgaagatgaagcaggttcagagttgggagagggagaagaggaagtgggcctctcatac
ttaatgaaagaagaaattcag

>gi568815597f:150224943_150452976|GENSCAN_predicted_peptide_2|289_aa
MLFSALLLEVIWILAADGGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQPH
GYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINS
EATFAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHK
DQKTSVPPFNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGT
LFSTEEEPSKLLVQNYRALQPLNQRMVFASFIQAGSSYTTDILRDLSLG

>gi568815597f:150224943_150452976|GENSCAN_predicted_CDS_2|870_bp
atgttgttctccgccctcctgctggaggtgatttggatcctggctgcagatgggggccca
catggtcaggaccattggccagcctcttaccctgagtgtggaaacaatgcccagtcgccc
atcgatattcagacagacagtgtgacatttgaccctgatttgcctgctctgcagccccac
ggatatgaccagcctggcaccgagcctttggacctgcacaacaatggccacacagtgcaa
ctctctctgccctctaccctgtatctgggtggacttccccgaaaatatgtagctgcccag
ctccacctgcactggggtcagaaaggatccccaggggggtcagaacaccagatcaacagt
gaagccacatttgcagagctccacattgtacattatgactctgattcctatgacagcttg
agtgaggctgctgagaggcctcagggcctggctgtcctgggcatcctaattgaggtgggt
gagactaagaatatagcttatgaacacattctgagtcacttgcatgaagtcaggcataaa
gatcagaagacctcagtgcctcccttcaacctaagagagctgctccccaaacagctgggg
cagtacttccgctacaatggctcgctcacaactcccccttgctaccagagtgtgctctgg
acagttttttatagaaggtcccagatttcaatggaacagctggaaaagcttcaggggaca
ttgttctccacagaagaggagccctctaagcttctggtacagaactaccgagcccttcag
cctctcaatcagcgcatggtctttgcttctttcatccaagcaggatcctcgtataccaca
gatatactgcgggatctctccttaggataa

>gi568815597f:150224943_150452976|GENSCAN_predicted_peptide_3|265_aa
MGAAVFFGCTFVAFGPAFALFLITVAGDPLRVIILVAGAFFWLVSLLLASVVWFILVHVT
DRSDARLQYGLLIFGAAVSVLLQEVFRFAYYKLLKKADEGLASLSEDGRSPISIRQMAYV
SGLSFGIISGVFSVINILADALGPGVVGIHGDSPYYFLTSAFLTAAIILLHTFWGVVFFD
ACERRRYWALGLVVGSHLLTSGLTFLNPWYEASLLPIYAVTVSMGLWAFITAGGSLRSIQ
RSLLCRRQEDSRVMVYSALRIPPED

>gi568815597f:150224943_150452976|GENSCAN_predicted_CDS_3|798_bp
atgggggctgcggtgtttttcggctgcactttcgtcgcgttcggcccggccttcgcgctt
ttcttgatcactgtggctggggacccgcttcgcgttatcatcctggtcgcaggggcattt
ttctggctggtctccctgctcctggcctctgtggtctggttcatcttggtccatgtgacc
gaccggtcagatgcccggctccagtacggcctcctgatttttggtgctgctgtctctgtc
cttctacaggaggtgttccgctttgcctactacaagctgcttaagaaggcagatgagggg
ttagcatcgctgagtgaggacggaagatcacccatctccatccgccagatggcctatgtt
tctggtctctccttcggtatcatcagtggtgtcttctctgttatcaatattttggctgat
gcacttgggccaggtgtggttgggatccatggagactcaccctattacttcctgacttca
gcctttctgacagcagccattatcctgctccataccttttggggagttgtgttctttgat
gcctgtgagaggagacggtactgggctttgggcctggtggttgggagtcacctactgaca
tcgggactgacattcctgaacccctggtatgaggccagcctgctgcccatctatgcagtc
actgtttccatggggctctgggccttcatcacagctggagggtccctccgaagtattcag
cgcagcctcttgtgccgacggcaggaggacagtcgggtgatggtgtattctgccctgcgc
atcccacccgaggactga

>gi568815597f:150224943_150452976|GENSCAN_predicted_peptide_4|413_aa
MDVLFVAIFAVPLILGQEYEDEERLGEDEYYQVVYYYTVTPSYGGDVFHVEVNSDFGFPS
DSEREDKGAHGPRPDTVGQRGGSRPSPGPIRCRHRSKVSGNQHTPSHPKQRGSASPMAGS
GAKRSRDGELETSLNTQGCTTEGDLLFAQKCKELQGFIPPLTDLLNGLKMGRFERGLSSF
QQSVAMDRIQRIVGVLQKPQMGERYLGTLLQVEGMLKTWFPQIAAQKSSLGGGKHQLTKH
FPSHHSDSAASSPASPMEKMDQTQLGHLALKPKQPWHLTQWPAMNLTWIHTTPICNPPLS
SPGTISFSHGPLGTGTGIGVILFLQHGVQPFTHSAPTTPVPPTTASPVIPGEPMKLSGEG
PRCYSLPVTLPSDWSYTLSPPSLPTLARKMTIGHREQQRSHPPVAADAHLLNL

>gi568815597f:150224943_150452976|GENSCAN_predicted_CDS_4|1242_bp
atggatgtcctctttgtagccatctttgctgtgccacttatcctgggacaagaatatgag
gatgaagaaagactgggagaggatgaatattatcaggtggtctattattatacagtcacc
cccagttatggtggggatgtatttcatgtagaagtgaacagtgactttggcttcccctct
gatagtgagagggaggacaagggggcccatgggcccaggccagacactgttgggcagagg
ggaggttcacggcccagcccgggtcctatccgctgcaggcatcgatcgaaggtttccggt
aaccagcatacaccatctcatccgaaacagcggggttcggcttctcctatggcaggatct
ggggcgaaaagatcaagagatggtgaactggagaccagtctaaacacccaaggttgtacc
acagagggagacctgctgtttgcccagaagtgtaaagaactccaaggatttatacctcct
ctcacagacctactcaatgggctgaagatgggtcgttttgagagaggattaagcagtttt
cagcagagtgtggcaatggacaggatccagcgtattgtaggtgttttgcagaagccacag
atgggggaacgttacctaggaaccttgctacaggtagaagggatgttaaagacttggttt
ccacaaatagctgcccagaagtcatcattgggtggtggcaagcatcagctgaccaagcat
tttccaagccaccacagtgattcagctgcttcctctcctgcatctcctatggaaaagatg
gaccagacacagctaggacatctagctttaaaaccaaagcagccttggcacctcacacaa
tggccagctatgaacctcacctggatccacaccactccaatttgcaacccccctctcagc
tccccaggtactatctcctttagccatggtcctttaggcactggaaccggcattggcgtc
attcttttcctccagcatggagtgcaacccttcacccactctgccccaaccaccccagtc
ccacctactacagcatctcctgtcatccctggtgagcctatgaaactatctggagagggt
cctcgttgctacagtttgccagtaactctgccatcagactggagctataccctatcccct
cccagtctacccaccttggccagaaagatgaccataggacaccgggagcagcagagaagc
catcctccagttgctgctgatgctcatcttctcaacctctag

>gi568815597f:150224943_150452976|GENSCAN_predicted_peptide_5|87_aa
MAKHLKFIARTVMVQEGNVESAYRTLNRILTMDGLIEDIKHRRYYEKPCCRRQRESYERC
RRIYNMEMARKINFLMRKNRADPWQGC

>gi568815597f:150224943_150452976|GENSCAN_predicted_CDS_5|264_bp
atggcaaaacatctgaagttcatcgccaggactgtgatggtacaggaagggaacgtggaa
agcgcatacaggaccctaaacagaatcctcactatggatgggctcattgaggacattaag
catcggcggtattatgagaagccatgctgccggcgacagagggaaagctatgaaaggtgc
cggcggatctacaacatggaaatggctcgcaagatcaacttcttgatgcgaaagaatcgg
gcagatccgtggcagggctgctga

>gi568815597f:150224943_150452976|GENSCAN_predicted_peptide_6|954_aa
MALSKRELDELKPWIEKTVKRVLGFSEPTVVTAALNCVGKGMDKKKAADHLKPFLDDSTL
RFVDKLFEAVEEGRSSRHSKSSSDRSRKRELKEVFGDDSEISKESSGVKKRRIPRFEEVE
EEPEVIPGPPSESPGMLTKLQPKTPSSSQPERLPIGNTIQPSQAATFMNDAIEKARKAAE
LQARIQAQLALKPGLIGNANMVGLANLHAMGIAPPKVELKDQTKPTPLILDEQGRTVDAT
GKEIELTHRMPTLKANIRAVKREQFKQQLKEKPSEDMESNTFFDPRVSIAPSQRQRRTFK
FHDKGKFEKIAQRLRTKAQLEKLQAEISQAARKTGIHTSTRLALIAPKKELKEGDIPEIE
WWDSYIIPNGFDLTEENPKREDYFGITNLVEHPAQLNPPVDNDTPVTLGVYLTKKEQKKL
RRQTRREAQKELQEKVRLGLMPPPEPKVRISNLMRVLGTEAVQDPTKVEAHVRAQMAKRQ
KAHEEANAARKLTAEQRKVKKIKKLKEDISQGVHISVYRVRNLSNPAKKFKIEANAGQLY
LTGVVVLHKDVNVVVVEGGPKAQKKFKRLMLHRIKWDEQTSNTKGDGEWGLEGIKGESYG
RRAQRVHHPHPLASLPTYGFHALAVIVLPAPAAAAAAAARGAAALVQTGKMAAGGGGGSR
ALESSLDRKFQSVTNTMESIQGLSSWCIENKKHHSTIVYHWMKWLRRFLSKVVRNIFTEK
GTFEQDLKVVIERAMRKTGEESSRTVSLTDREARTHPAAYPHRLNLFYLANDVIQNCKRK
NAIIFRESFADVLPEAAALVKDPSVSKSVERIFKIWEDRNVYPEEMIVALREALTSTNPK
AALKSKIVAEFRALIEELLLYKRSEDQIELKEKQLSTMRVDVCSTETLKCLKDKTGGKKF
SKEFEEASSKLEEFVNGLDKQVKNGPSLTEALENAGIFYEAQYKEVKVVANIFY

>gi568815597f:150224943_150452976|GENSCAN_predicted_CDS_6|2865_bp
atggcactgtcaaagagggagctggatgagctgaaaccatggatagagaagacagtgaag
agggtcctgggtttctcagagcctacggtggtcacagcagcattgaactgtgtggggaag
ggcatggacaagaagaaggcagccgatcatctgaaaccttttcttgatgattctactctc
cgatttgtggacaaactgtttgaggctgtggaggaaggccgaagctctaggcattccaag
tctagcagtgacaggagcagaaaacgagagctaaaggaggtgtttggtgatgactctgag
atctctaaagaatcatcaggagtaaagaagcgacgaataccccgttttgaggaggtggaa
gaagagccagaggtgatccctgggcctccatcagagagccctggcatgctgactaagctc
cagccaaagactccttcttcctcccaaccagaacgacttcctattggcaacactattcag
ccctcccaggctgccactttcatgaatgatgccattgagaaggcaaggaaagcagctgaa
ctgcaagctcgaatccaagcccagctggcactgaagccaggactcatcggcaatgccaac
atggtgggcctggctaatctccatgccatgggcattgctcccccgaaggtggagttaaaa
gaccaaacgaaacctacaccactgatcctggatgagcaagggcgcactgtagatgcaaca
ggcaaggagattgagctgacacaccgcatgcctactctgaaagccaatattcgtgctgtg
aagagggaacaattcaagcaacaactaaaggaaaagccatcagaagacatggaatccaat
accttttttgacccccgagtctccattgccccttcccagcgccagagacgcacttttaaa
ttccatgacaagggcaaatttgagaagattgctcagcgattacggacaaaggctcaactg
gagaagctacaggcagagatttcacaagcagctcgaaaaacaggcatccatacttcgact
aggcttgccctcattgctcctaagaaggagctaaaggaaggagatattcctgaaattgag
tggtgggactcttacataatccccaatggctttgatcttacagaggaaaatcccaagaga
gaagattattttggaatcacaaatcttgttgaacatccagcccagctcaatcctccagtt
gacaatgacacaccagttactctgggagtatatcttaccaagaaggaacagaaaaaactt
cggagacaaacaaggagggaagcacagaaggaactacaagaaaaagtcaggctgggcctg
atgcctcctccagaacccaaagtgagaatttctaatttgatgcgagtattaggaacagaa
gctgttcaagaccccacgaaggtagaagcccacgtcagagctcagatggcaaaaagacag
aaagcgcatgaagaggccaacgctgcccgaaaactcacagcagaacagagaaaggtcaag
aaaattaaaaagcttaaagaagacatttcacagggggtacacatatctgtatatagagtt
cgaaatttgagcaacccagccaagaagttcaagattgaagccaatgctgggcaactgtac
ctgacaggggtggtggtactgcacaaggatgtcaacgtggtagtagtggaagggggcccc
aaggcccagaagaaatttaagcgtcttatgctgcatcggataaagtgggatgaacagaca
tctaacacaaagggagatggtgaatgggggttagaggggattaagggggagagctatggg
agaagagcccagcgcgtgcaccatccccaccccctagcttccctccccacctacggcttt
cacgcactcgcagtgattgttttgcccgctcccgccgccgccgccgccgccgccgccaga
ggagcagcagcgcttgtgcaaaccgggaagatggcggccggcggcggcggaggcagcagg
gctctggagtcctcgttggatcgaaaattccagtcggtaaccaacaccatggagtccatt
caaggcttgtcgtcttggtgtatagagaacaaaaaacaccacagtactatcgtctatcat
tggatgaagtggctccggagatttttaagtaaggtagtcaggaacatattcactgagaaa
ggaacatttgagcaagatttaaaggtggtgattgaacgagccatgagaaaaacgggggaa
gaaagttccagaacagtgagcttaacagaccgtgaagcaaggactcacccagctgcatat
ccccaccgtttgaatctcttttaccttgccaatgatgtcatacagaactgtaaaaggaaa
aatgcaatcatattccgtgaatcatttgctgatgtacttcctgaagcagctgctctagtg
aaggatccatctgtctctaagtctgtagaacgaatctttaaaatctgggaagatagaaat
gtatacccagaagaaatgattgtggcattgagagaagctttgacatctacaaatccaaaa
gctgctctcaagtctaagatagttgctgaatttcgagccctaattgaagagctgttgcta
tacaagcgctcagaagatcagatagaactgaaggaaaagcagttgtcaactatgagggtg
gatgtgtgcagcacagaaactctcaaatgcttaaaagataaaacaggtgggaagaagttc
tccaaagaatttgaagaggcaagctccaagctggaagaatttgtgaatggattagataag
caggtgaaaaacggaccctcattaacagaagcactggaaaatgctggaattttctatgaa
gcacaatacaaagaagtaaaagtggtggctaatatattctactaa