GENSCAN 1.0 Date run: 7-Nov-116 Time: 17:12:29 Sequence gi568815592f:20302233_20590427 : 288195 bp : 41.90% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3087 3093 7 2 1 97 98 0 0.010 2.90 1.02 Intr + 15722 15775 54 0 0 83 82 31 0.130 0.03 1.03 Intr + 16133 16310 178 0 1 31 64 177 0.033 7.66 1.04 Intr + 16889 17029 141 2 0 47 15 135 0.140 0.65 1.05 Term + 27030 27267 238 0 1 81 43 171 0.113 6.56 1.06 PlyA + 27647 27652 6 1.05 2.10 PlyA - 27694 27689 6 1.05 2.09 Term - 29293 29145 149 2 2 61 41 191 0.899 8.78 2.08 Intr - 38698 38554 145 1 1 91 82 60 0.645 4.63 2.07 Intr - 41682 41648 35 1 2 92 80 27 0.522 -0.68 2.06 Intr - 45400 45191 210 2 0 54 88 136 0.251 8.16 2.05 Intr - 54520 54472 49 1 1 83 75 53 0.201 0.83 2.04 Intr - 54840 54631 210 0 0 62 107 26 0.064 0.09 2.03 Intr - 58652 58490 163 1 1 36 47 148 0.090 4.56 2.02 Intr - 59295 59186 110 0 2 44 99 29 0.308 -2.24 2.01 Init - 68727 68677 51 0 0 38 84 86 0.314 4.41 2.00 Prom - 69700 69661 40 -5.45 3.03 PlyA - 72186 72181 6 1.05 3.02 Term - 75581 75368 214 1 1 69 29 223 0.694 10.22 3.01 Init - 80195 80173 23 2 2 46 113 10 0.710 -1.49 3.00 Prom - 81035 80996 40 -0.95 4.00 Prom + 81438 81477 40 -3.65 4.01 Init + 91338 91429 92 2 2 49 100 145 0.682 11.81 4.02 Intr + 92778 92974 197 0 2 -33 92 128 0.238 -0.76 4.03 Term + 97650 97744 95 1 2 75 33 90 0.278 -0.79 4.04 PlyA + 98058 98063 6 1.05 5.05 PlyA - 99159 99154 6 1.05 5.04 Term - 100466 100115 352 1 1 45 43 374 0.521 21.57 5.03 Intr - 100966 100689 278 1 2 109 37 123 0.686 4.69 5.02 Intr - 101662 101483 180 1 0 72 56 158 0.746 10.14 5.01 Init - 104570 104463 108 2 0 73 59 70 0.571 2.87 5.00 Prom - 104754 104715 40 -8.25 6.03 PlyA - 105339 105334 6 1.05 6.02 Term - 108057 107663 395 1 2 104 41 204 0.970 11.51 6.01 Init - 109488 109305 184 0 1 66 36 127 0.545 4.63 6.00 Prom - 115604 115565 40 -4.75 7.07 PlyA - 116123 116118 6 1.05 7.06 Term - 118523 118439 85 1 1 80 43 124 0.733 3.25 7.05 Intr - 126421 126332 90 0 0 35 107 76 0.293 2.39 7.04 Intr - 129268 129140 129 0 0 86 59 85 0.115 4.29 7.03 Intr - 140886 140847 40 1 1 82 72 37 0.006 -2.14 7.02 Intr - 148472 148278 195 0 0 16 103 118 0.601 4.56 7.01 Init - 153373 153262 112 1 1 67 81 99 0.311 7.52 7.00 Prom - 154583 154544 40 -8.05 8.00 Prom + 156874 156913 40 -7.05 8.01 Init + 162026 162160 135 1 0 40 96 64 0.081 2.59 8.02 Intr + 177614 177725 112 1 1 66 78 129 0.874 8.73 8.03 Intr + 178974 179193 220 1 1 82 76 139 0.730 8.54 8.04 Intr + 180530 180717 188 2 2 38 32 161 0.510 3.91 8.05 Intr + 184480 184571 92 2 2 25 94 95 0.166 2.59 8.06 Intr + 185881 186016 136 0 1 80 93 36 0.987 2.52 8.07 Term + 188011 188198 188 2 2 78 38 170 0.686 7.67 8.08 PlyA + 188368 188373 6 1.05 9.04 PlyA - 189017 189012 6 -0.45 9.03 Term - 189349 189126 224 2 2 76 44 138 0.806 4.30 9.02 Intr - 189778 189677 102 2 0 87 62 99 0.447 6.43 9.01 Init - 224517 224325 193 0 1 82 91 64 0.243 5.28 9.00 Prom - 229031 228992 40 -6.15 10.03 PlyA - 230887 230882 6 1.05 10.02 Term - 232435 232174 262 0 1 33 48 263 0.862 10.81 10.01 Init - 240555 240437 119 0 2 92 38 89 0.223 4.02 10.00 Prom - 240787 240748 40 -7.05 11.00 Prom + 241914 241953 40 -9.15 11.01 Init + 244119 244291 173 2 2 56 115 230 0.934 21.46 11.02 Intr + 274814 274925 112 2 1 89 87 98 0.218 9.26 11.03 Term + 283218 283289 72 2 0 75 48 78 0.073 -0.57 11.04 PlyA + 285099 285104 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 6626 6569 58 2 1 43 45 98 0.857 2.52 S.002 Init + 209882 209939 58 1 1 73 107 47 0.829 6.62 S.003 Intr + 246361 246473 113 1 2 74 95 61 0.920 4.50 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:20302233_20590427|GENSCAN_predicted_peptide_1|205_aa MQGISPNAIPPRSPHPTTSPAPDQEGQMGAEIQLALEYSGMWLPPKTETETYLRRICLRR GLLLVCTMGRCVVPVTLALRAWDVQPAMSEPPLPSPDGLLRGLRLPYERRSLLHGAPSTI DRPRAEKTMIAGKVQGKEIGDQWENTSGGASGMSSQLLRPPPPFPLPLAPLLLLGDSSQA IEVSVITRVFDDVFLPSYTIIFLKA >gi568815592f:20302233_20590427|GENSCAN_predicted_CDS_1|618_bp atgcaaggtatatctcctaatgctatccctccccgctccccccaccccacaacaagcccc gcacctgaccaagagggacaaatgggagctgaaatccagcttgcactcgaatattctggc atgtggctgccaccaaaaacagagacagaaacatacttgcgacgcatctgcctgcggaga ggacttcttctggtttgcacaatgggtcggtgtgtggtacctgtaaccctggcactgagg gcttgggacgtgcagcccgccatgtctgagcctcccctaccctctcctgacgggctcctg cgcggcctgaggctcccctacgagcgccgctccctgctccatggggcgcccagtaccatc gaccgcccaagagctgagaagacaatgatagcaggcaaggtgcaaggaaaggaaataggt gatcaatgggaaaatacgtcaggaggtgcttcaggaatgtcatctcagttactcagacca ccgccaccttttcctctcccactggcaccactgctccttcttggggactcctcacaggct atagaagtttcagtcattacccgtgtatttgatgatgtgtttctcccaagctatactatc attttcttgaaagcatag >gi568815592f:20302233_20590427|GENSCAN_predicted_peptide_2|373_aa MEHLLNAKRYAGYEATELLQEYYPLSHLPLSQCLLNNYYVLRDVSGAVEPKQMRYRGQLP DLAGVLVDLAQAIQCDLGLAVLGDLHNQMSKDSTDPSLAGMTSSSPFKFSLWVGLQTAQL AGDPITLSIERELILPLEFQIYNYDICLMAENDNFAEHSAYPTLRSVYGDQWGKSWVLRD ILKSSGQPERLDVKGNNDKKKDSVHVPYKHNPSSSNYIFHPRLVESVDVEPLDTECQLYL VIQQGSRQGSNKDDMKDRCQGTGQDELTLLRPAKYQFHDFPSGSFNFILSQTHFMTQFWI CSRRLDMTLSLYFHSAFATVAIALEKLLGRLKEQIRGGYASRAAFQNHTTGLAASGPHSH DASAPIRKLLVLE >gi568815592f:20302233_20590427|GENSCAN_predicted_CDS_2|1122_bp atggagcacctactgaatgccaagcgctatgctggatatgaggccacagagttactccag gagtattatcccctttctcatctcccattaagtcaatgtttgctgaacaattactatgtg ctaagagatgtgtcaggtgctgtggagccaaagcaaatgagatacaggggacaacttcca gacctggcaggagttctagtggacctggctcaggctatccagtgtgatttaggattagcg gttctgggtgacctgcacaaccaaatgagcaaagatagcactgaccctagtctagctgga atgacaagcagcagtcccttcaagttttcgctctgggtggggctgcaaacagctcagcta gctggagacccaatcaccttgtctatagaaagagaattaattctgcctctggaattccaa atctataattatgacatatgcctgatggctgaaaatgacaactttgcagaacattctgct taccccactttgagaagtgtctatggggaccaatgggggaagtcctgggttctgagagac atcctgaagtcttctggacagccagagagattagatgtaaaagggaataatgacaagaaa aaagattctgtacatgttccgtacaaacacaacccttctagttctaattacattttccat ccgaggttggtggaatccgtggatgtggaacccttagatacggagtgccaattgtacctg gtaattcagcagggaagccgacaggggagcaataaggatgacatgaaggacagatgccaa ggtacaggacaagatgagctgactcttttgcgaccagccaaatatcagtttcatgatttc ccatcaggctcttttaactttattctcagtcaaactcattttatgacccagttttggatt tgctctcgaagactcgacatgactctcagtctgtattttcattcagccttcgccacagta gccattgccctagagaaattgttgggaagattgaaagagcagattcgaggaggctatgct tcaagagcagctttccagaaccacaccacaggactggctgcttctgggccccacagccac gatgcctctgcaccaatcaggaagctgctggttttagaataa >gi568815592f:20302233_20590427|GENSCAN_predicted_peptide_3|78_aa MTPEKGRRLCEFTAHITAEAGPQAKFKVKVGCRTVVPVKMNSPFATYILFGGITIFDSSE LSAAFQIRPFTCTDKSKS >gi568815592f:20302233_20590427|GENSCAN_predicted_CDS_3|237_bp atgacacctgagaaaggtagaaggctgtgtgagttcacagctcacattacagcagaagca ggaccacaagccaaatttaaagtaaaagtgggctgcaggacagttgtacctgtaaaaatg aattctccttttgctacctacattctttttggtggcatcaccatctttgacagctcggaa ctgtctgcagcctttcagatacggccctttacctgcactgacaaaagtaaatcttag >gi568815592f:20302233_20590427|GENSCAN_predicted_peptide_4|127_aa MVQGECIKQEEEKEEDIKGIIEENEAINEIHTSEWGRATKCHKSCVASAVREKGNQVVLV GPENRRTPKQPIELHWDVSELTQGQPLKLRRVLSSGGISIEESNIDGTRAGKLLVYYTSS SCPLSTT >gi568815592f:20302233_20590427|GENSCAN_predicted_CDS_4|384_bp atggtccaaggagaatgtataaaacaggaagaagagaaggaagaagatattaagggaata atagaagaaaacgaagccatcaatgagatccatacaagtgagtggggacgagctaccaag tgccataagagctgtgtggcatcagcagtgagggagaaggggaaccaggtagtgttggtg ggacccgagaataggagaacccccaaacagccaatagagcttcactgggatgtcagtgag ctgacacagggacagcctctaaaactgagaagggtcctgtcctccggaggtatatccata gaggaaagcaacattgatggaacaagagcagggaagctgctggtatattacacttctagc tcatgccctttgagtaccacttaa >gi568815592f:20302233_20590427|GENSCAN_predicted_peptide_5|305_aa MVKIESSGLTKAAKHKLHPKVGRSVESWTEWNSGKKPPTESGAPAASAAPEPSRSLTCCC KGISETSLAPGGGGGPAGEGARRAEQTGGAGRGRAPPLCCRELQILKCVKRSPADPLRSE HAAGTPGRQRASKADWGLAPQQSPGFNPSEPPTPQFTTPSPTPFSIRGVTPSYARERGSF ELEGKGEWGAPTTPRGASPPPAGAGDGGEGGYYRRAAAAAAAAASQRWWLLQQPGSAGGS VRRGVEEAAGGLLRAGGTGEEGAGGYGAALEGGGTGGGGSVRGEDLDVRARGGGSGGGGG EAGAG >gi568815592f:20302233_20590427|GENSCAN_predicted_CDS_5|918_bp atggtgaagattgagagttcaggtctaacaaaagccgctaagcataaactgcacccaaaa gttggcaggtctgtggaatcatggacggagtggaactcaggcaaaaaacccccaacagaa tccggtgcccccgcggcgtcggcggcccccgagccgtcccggtcactaacctgctgctgt aagggcatttccgaaaccagcctggctccggggggaggtggggggccggctggagagggg gcccggagggccgaacagaccggcggcgcagggcgagggcgggcgccgcccctttgttgc cgggagttacagatcctaaaatgtgtcaagcgcagccccgccgatcccctgcgctccgag cacgccgcagggacccccggccgccaacgagcctccaaggccgactgggggctagcaccc caacagtcccccggtttcaacccctcagagccccccacaccccagttcactactccctcc cccacccccttctccatccggggcgtgaccccttcctacgcccgagaaagaggaagcttc gagttggaggggaagggagagtggggcgcgcccacgaccccgcgcggtgcgagcccacct cccgccggggctggggacggtggggagggagggtattaccggagggccgccgccgccgcc gctgccgccgcgtcccagcgctggtggctgctgcagcagcccggctctgctggagggtcc gtgcggcgtggtgtagaggaggctgccggcggtctgctccgcgccgggggcactggggag gagggggccggcggctacggcgccgctttggagggaggaggaacaggaggtggtggaagt gttcgtggtgaggatctggatgtacgcgcccggggcggcggcagcggcggcggcggcggc gaagccggggctggctag >gi568815592f:20302233_20590427|GENSCAN_predicted_peptide_6|192_aa MGQWEEGNKRSKVEREKLRANTPEGNLRCLQHHTVAAAECRGMGSKTEKARFHHPLMAGK GGQGAGYKHMSLLLSKGLPDAERHTGDSVGLRDAALVMYQKWSPQSIRNGEEQSGTKYTD PEFTAHHSQSPELPPFRIIKTRAGRSSPQQKVKRHSNPPLSKTNSTRQKTFQRKSASLIT TLISWYLMLTGC >gi568815592f:20302233_20590427|GENSCAN_predicted_CDS_6|579_bp atgggccagtgggaggagggcaacaaaaggagtaaggtggagagggagaagctacgagca aacacaccagagggaaatctgcggtgcctgcagcaccacactgtcgcagcagccgagtgc agaggaatgggaagcaagaccgaaaaggcaagatttcaccatcccttgatggcaggaaaa ggtggacagggtgcaggttataaacacatgagcctccttctctccaagggactgccagat gcagaaaggcacacaggagatagtgtgggcctcagggatgctgctctggttatgtatcag aaatggagcccccaaagcatcagaaatggggaagagcagagtggaacgaaatatacagac cctgagttcactgcccatcattctcagagtccagagctaccacccttcaggatcataaaa actagagcaggcagaagctctcctcagcagaaagtcaagaggcattcaaatcctccactc tctaagacaaactctactcgccaaaaaactttccagaggaaatcagcttcactgattact actctcatcagttggtacttaatgctcactggctgttag >gi568815592f:20302233_20590427|GENSCAN_predicted_peptide_7|216_aa MTRYRENCTAVSINKTIGFYDESSSSAVDKAISDGPKGTGKDFMTKTPKAIGTKAKIDKW DLIKLKSFCTAKETLNRVNRQHIEWEKIFANYASDKGLISSIAILEQTQLSVLTDRLQYV QSAPRETKKQKAVLLAPEAGHRQQKANLSSRDTGPWDASTYQRANLSDPVYIPNIGSHSA APFPVNPESNSLQCEGKTLHQQKDDSLKAQGIVGIL >gi568815592f:20302233_20590427|GENSCAN_predicted_CDS_7|651_bp atgactcgctacagggaaaattgtacagcagtgtcaataaataaaaccataggcttctat gatgaatcgtcaagttctgctgtggacaaggcaatatctgatgggccaaaaggaacgggc aaagatttcatgacgaagacaccaaaagcaattggaacaaaagcaaaaattgacaaatgg gatctaattaaactaaagagcttctgcacagcaaaagaaactctcaacagagtaaacaga caacatatagaatgggagaaaatttttgcaaactatgcatctgacaaaggtctaatatcc agcatcgcaattcttgagcagacccagctttctgtgttgacagacaggttgcaatatgtt cagtctgctcctagagagacaaaaaagcaaaaagcagtgctcctggcacctgaggcagga caccgtcagcagaaggcaaatctcagctcacgggacactgggccatgggatgcgagcacg tatcagagggctaacctttctgaccctgtctacattcccaacattggttctcattcagcc gctcccttcccagtcaaccctgagagcaacagtcttcagtgtgaaggcaaaaccctccac cagcaaaaagatgactcgctgaaggctcaggggattgttggcattttgtag >gi568815592f:20302233_20590427|GENSCAN_predicted_peptide_8|356_aa MHHKHSSSLSVNQRAVYQTLKNPAAPHTVPESGLRGRLSPPCIPQAKRRLELGESGHQYL SDGLKTPKGKGRAALRSPDSPKTPKSPSEKTRYDTSLGLLTKKFIQLLSQSPDGVLDLNK AAEVLKVQKRRIYDITNVLEGIHLIKKKSKNNVQWMGCSLSEDGGMLAQCQGLSKEVTEL SQEEKKLDELIQSCTLDLKLLTEDSENQRYPLCRQFSGDIRKISGLKDQTVIVVKAPPET RLEVPDSIESLQIHLASTQGPIEVYLCPEETETHSPMKTNNQDHNGNIPKPASKANLLQQ TEDQIPSNLEGPFVNLLPPLLQEDYLLSLGEEEGISDLFDAYDLEKLPLVEDFMCS >gi568815592f:20302233_20590427|GENSCAN_predicted_CDS_8|1071_bp atgcatcataaacacagcagctctttgtcagtcaatcagagagctgtttatcagacactg aagaatccagcagctcctcacacagttccagagtcaggcctaagggggaggttatctcct ccttgcattcctcaggcaaagcgaaggctggagctaggagaaagcggtcatcagtacctc tcagatggtttaaaaacccccaagggcaaaggaagagctgcactacgaagtccagatagt ccaaaaactccaaaatctccctcagaaaaaacgcggtatgatacgtctcttggtctgctc accaagaagttcattcagctcctgagccagtcacccgatggggtattggatttgaacaag gcagcagaagtgctaaaagtgcaaaagagaaggatttatgatatcaccaacgttctggaa ggcatccacctcattaagaagaagtctaaaaacaacgtccaatggatgggctgcagtctg tctgaggatgggggcatgctggcccagtgtcaaggcctgtcaaaagaagtgaccgagctc agtcaggaagagaagaaattagatgaactgatccaaagctgcaccctggacctcaaactg ttaaccgaggattcagagaatcaaagatatcctttgtgccgccagttctctggggatatt cgaaaaattagtggccttaaagaccaaactgttatagttgtgaaagcccctccagaaaca agacttgaagtgcctgactcaatagagagcctacaaatacatttggcaagtacccaaggg cccattgaggtttacttatgtccagaagagactgaaacacacagtccaatgaaaacaaac aaccaagaccacaatgggaatatccctaaacccgcttccaaagccaacctcttacagcag actgaggaccaaattccttccaacctagaaggaccgtttgtgaacttactgcctcccctg ctgcaagaggactatctcctgagcctcggggaggaggaaggcatcagcgatctcttcgat gcttacgatttggaaaagctcccactggtggaagacttcatgtgtagttga >gi568815592f:20302233_20590427|GENSCAN_predicted_peptide_9|172_aa MATKLQQIKKKQNQALQFLKMVVRVGLLRKRRLSKGSKNVTGSAKECMGEEHSRKRGHLV EKPSVLSIQGMDRRYYAIFPVGDGRCHLPEGKPFLDITGAREAQGWRDPCLLLLGAVTSS DLVLLSNGPPMSASPRTDSLQAPPPGNGSTMGQSSLQRLINLRSHRFRCSDQ >gi568815592f:20302233_20590427|GENSCAN_predicted_CDS_9|519_bp atggccacaaaattacagcagattaagaaaaagcagaatcaggccttgcagtttttaaag atggtggtcagagtaggcttattgagaaaaagacgtttaagtaaaggctcaaaaaatgtg acggggtcagccaaagaatgtatgggggaagagcattcaaggaaaaggggccacctagtg gaaaagccttcagtccttagcatccaaggaatggaccggagatactacgccatattccca gtgggagatggccgctgccacctgcccgagggaaagccattcttggacataactggggcc cgggaagcccaaggatggagggacccatgcctgctgctccttggggctgttacatcatct gacctggttctcctttccaatgggccacccatgtcagcaagccccaggactgactcgctc caagctccccctcctgggaatggaagcacgatgggacaaagctccttgcagaggctgata aaccttagatcacacaggtttagatgctctgatcagtag >gi568815592f:20302233_20590427|GENSCAN_predicted_peptide_10|126_aa MRNPCDDGNVLHPDCINVNILVVILYCIVVLQDVTNGRPGPRKTETGETRLGHKRSCGHT LPHMDPSATLTQKSGSKNQSKVRKSTRHSPEAAQTLDGERHDTYVPPRLFAHAQRRWPEE IKLHHC >gi568815592f:20302233_20590427|GENSCAN_predicted_CDS_10|381_bp atgaggaatccttgtgatgatggaaatgttttgcatcctgactgtatcaatgtcaatatc ctggttgtgatactgtattgtattgtagttttgcaagatgttaccaacgggagacctggc ccaagaaaaactgaaactggagaaacaaggcttggccataaaaggagctgtggccacact ctgccccacatggacccatcggccacccttacccagaagtcgggaagtaaaaaccagtcc aaagtgagaaaatcaacccgccactctccggaagctgcacagactttagatggagagcgc catgacacatacgtcccgccacgtttatttgcgcatgctcagcgacggtggcccgaagag atcaaattacatcattgctga >gi568815592f:20302233_20590427|GENSCAN_predicted_peptide_11|118_aa MPSASCDTLLDDIEDIVSQEDSKPQDRHFVRKDVVPKVRRRNTQKYLQEEENSPPSDRPP YTPITRGLLSLFHISLRCWRPNLGDRKRSVVRERAQNSLEGPAASLSVSSSLYGGPFT >gi568815592f:20302233_20590427|GENSCAN_predicted_CDS_11|357_bp atgccttctgcatcctgtgatacactactggatgacatcgaagatatcgtgtctcaggaa gattcaaaaccacaagataggcattttgtaagaaaggatgttgtcccgaaggtacgaagg cgaaatacccaaaaatatttgcaagaggaagaaaacagtccaccaagtgacaggcctcca tacactcctattacaagaggacttctgtctctgttccacatttcccttcgttgttggcga cctaaccttggtgaccgcaagcgatcggttgtgagagaaagagctcaaaattccttggaa ggacctgctgcttctctttctgtttctagttcactctacggaggcccttttacttga