GENSCAN 1.0    Date run: 8-Nov-116    Time: 12:08:27

Sequence gi568815596r:210088964_210325263 : 236300 bp : 36.60% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 Intr -  15338  15141  198  2  0   88  103   247 0.734  24.83
 1.03 Intr -  27235  27072  164  2  2   72   73   143 0.166   9.97
 1.02 Intr -  31093  31068   26  0  2   85   75    34 0.012  -1.45
 1.01 Init -  65619  64532 1088  0  2   92   84   607 0.025  54.62
 1.00 Prom -  67902  67863   40                              -3.95

 2.19 PlyA -  67911  67906    6                               1.05
 2.18 Term -  75659  75630   30  2  0  106   32    54 0.166  -1.52
 2.17 Intr -  81679  81590   90  1  0   89   70    34 0.150   0.97
 2.16 Intr -  82444  82285  160  0  1   39   39   262 0.068  15.47
 2.15 Intr - 103927 103841   87  0  0   53  106    60 0.517   2.47
 2.14 Intr - 106375 106248  128  1  2   47   64    70 0.584  -0.94
 2.13 Intr - 114481 114368  114  1  0   88  115    93 0.999  11.72
 2.12 Intr - 115719 115618  102  0  0   59   71   111 0.984   5.95
 2.11 Intr - 116833 116669  165  1  0   74  109   118 0.999  11.74
 2.10 Intr - 121299 121233   67  2  1   90   93    55 0.995   4.19
 2.09 Intr - 126190 126122   69  0  0   16   84   131 0.879   2.88
 2.08 Intr - 127548 127384  165  2  0   72  115   131 0.999  12.35
 2.07 Intr - 129139 129002  138  0  0   68   92   113 0.997   8.36
 2.06 Intr - 131839 131684  156  0  0   31   89   185 0.999  11.10
 2.05 Intr - 136086 135872  215  0  2   23   20   181 0.223   1.39
 2.04 Intr - 136644 136224  421  2  1   51  101   295 0.292  20.12
 2.03 Intr - 144602 144561   42  1  0  119   90    50 0.306   4.74
 2.02 Intr - 161754 161688   67  1  1   89  115    37 0.064   3.54
 2.01 Init - 183201 183192   10  0  1   91   93     4 0.095   1.95
 2.00 Prom - 185763 185724   40                              -4.05

 3.02 PlyA - 186243 186238    6                               1.05
 3.01 Sngl - 193181 192804  378  2  0   84   31   218 0.980  11.61
 3.00 Prom - 193780 193741   40                              -8.35

 4.07 PlyA - 194133 194128    6                               1.05
 4.06 Term - 195702 195581  122  1  2   50   49    79 0.082  -2.14
 4.05 Intr - 204837 204760   78  1  0   29   70   138 0.791   4.70
 4.04 Intr - 205455 205282  174  1  0   57   82   194 0.919  14.69
 4.03 Intr - 209600 209457  144  0  0   72   86    75 0.950   5.03
 4.02 Intr - 213829 213805   25  1  1   78  119    44 0.527   3.18
 4.01 Init - 226079 225948  132  2  0   61   88   127 0.663  10.48
 4.00 Prom - 229669 229630   40                               0.25

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:210088964_210325263|GENSCAN_predicted_peptide_1|492_aa
MTPALREATAKGISFSSLPSTMESDKMLYMESPRTVDEKLKGDTFSQMLGFPTPEPTLNT
NFVNLKHFGSPQSSKHYQTVFLMRSNSTLNKHNENYKQKKLGEPSCNKLKNILYNGSNIQ
LSKICLSHSEEFIKKEPLSDTTSQCMKDVQIILDSNITKDTNVDKVQLQNCKWYQENALL
DKVTDAEIKKGLLHCTQKKIVPGHSNVPVSSSAAEKEEEVHARLLHCVSKQKILLSQARR
TQKHLQMLLAKHVVKHYGQQMKLSMKHQLPKMKTFHEPTTILGNSLPKCTEIKPEVNTLT
AENKLWDDAKNGFARCTAAEIQRFAFSATGLLSHVEEGLDSDATDSSSDDDLDEYTLRKN
VAVFVEDQTVVEQSLPMATTAPGPQQVLPVYGQCLLKAQGFFSQLLVNAASPESLPSGQW
APLWTRGIVVLEECQLPKDILKKQMQFADQAASLNILGNPQVPQECQDPVPEQDFEMSPS
SPTLLLRNIEKQ

>gi568815596r:210088964_210325263|GENSCAN_predicted_CDS_1|1476_bp
atgaccccagctctgagggaggcaacagcaaagggtatcagcttttcatctttgccaagt
accatggagtctgacaagatgctctacatggaaagtcccagaactgtagatgaaaagcta
aagggagacaccttttctcagatgcttggatttccaactcctgaacctactcttaatact
aattttgtgaatttaaaacattttggctcccctcagtcttcaaaacattaccagactgtt
tttttaatgagatctaattctacattaaataaacacaatgagaattataaacaaaagaaa
ttaggggagcccagttgcaataagctgaaaaacatactgtataatggcagcaacattcag
ctcagtaaaatctgtctttctcattctgaagagttcatcaaaaaggagcctctatcagat
accacgagccagtgcatgaaagatgtacaaattattctggattcaaatataaccaaagac
actaatgtagataaagtacaactacaaaactgtaaatggtatcaagagaatgcacttttg
gataaagttactgatgctgagattaaaaagggtttattgcactgtactcaaaagaaaatt
gtacctggccactcaaatgtgcctgttagttcttcagctgctgaaaaagaggaggaagta
catgctcgtttacttcattgtgtaagcaaacagaaaattttacttagccaggctagaaga
actcagaaacatttgcagatgctcctggcaaagcatgttgttaagcactatggtcagcag
atgaaattgtctatgaaacatcaactccccaaaatgaagacatttcatgaacctaccaca
attttgggtaatagtttacctaaatgcactgaaattaagccagaagttaacacattgact
gcagagaataaattgtgggatgatgcaaaaaatggctttgcacggtgtacagctgcggaa
atccaaagatttgcattttctgctacagggctgttgtctcatgttgaagagggtttggat
tccgatgcaactgatagcagctctgatgacgatttggatgaatatacccttagaaaaaat
gtggcagtgtttgttgaagatcagacagttgtagagcagtcccttcccatggccaccact
gccccagggccacagcaagtactgcctgtctacggccaatgtttactaaaggcccaaggc
ttcttcagtcagcttttggtgaatgctgccagtcctgagtctctcccttcaggacagtgg
gctcccctctggaccagggggatagtggtcctagaagaatgtcagcttccaaaagatatt
ttgaaaaaacaaatgcaatttgcagaccaagcagcttcactaaacattctagggaatcct
caggttccccaggagtgtcaagatcctgtaccggaacaagattttgaaatgtcaccaagc
agccctactttacttcttcgaaacatcgaaaaacag

>gi568815596r:210088964_210325263|GENSCAN_predicted_peptide_2|741_aa
MQNSGIKEVVNYGAGFSLQAHRLLRRHFFIRFSIYLVNPGYLRVFGGSIAFIRYSKRFRT
LKRLRAASINKKECCKQRQQFRCKILLLTAKTGVFPRDQPVAWLGLRKGARERFFGRTPQ
VDASADRPPSRGPCPESPSSRRVASVAPPPLCFGHGRTPSPRVPTRPGRPPCAAPAARRA
SWPPFLDPADWVQPASGRRKKGCQPCRPENRPPSPRRGKPKDFSRSSFRSPNPTRLVQRP
AVGPGSLRFGRRCSHSGGEERLETPSAKKLTDIGIRRIFSPEHDIFRKSVRKFFQEEVIP
HHSEWEKAGEVSREVWEKAGKQGLLGVNIAEHLGGIGGDLYSAAIVWEEQAYSNCSGPGF
SIHSGIVMSYITNHGSEEQIKHFIPQMTAGKCIGAIAMTEPGAGRQQQLVGQQKANSAKK
ELKDAKEVDLQGIKTNAKKDGSDWILNGSKVFISNGSLSDVVIVVAVTNHEAPSPAHGIS
LFLVENGMKGFIKGRKLHKMGLKAQDTAELFFEDIRLPASALLGEENKGFYYIMKELPQE
RLLIADVAISASEFMFEETRNYVKQRKAFGKTVAHLQTVQHKLAELKTHICVTRAFVDNC
LQLHEAKRLDSATACMAKYWASELQNSVAYDCVQLHGGWGYMWEYPIAKPRQGSVPPSRT
ELRAAGGLLNRGGGGGGGGGGGGGGGRGGSGARSGDESAAPRENVDLGFRVRWDRAESRG
FLRDGASKFVSPVLRSSASLD

>gi568815596r:210088964_210325263|GENSCAN_predicted_CDS_2|2226_bp
atgcagaactctggtataaaagaagttgtcaattatggtgctggtttttctctacaagct
cacagacttctgagaagacacttttttattcgatttagcatctacctggttaatccaggg
tatttgcgcgtatttgggggctccatagctttcatccgttactcaaagcgcttcaggacg
ctaaaaaggctcagagctgcttcgataaacaagaaagaatgctgcaagcaaaggcagcag
tttcgctgtaaaatcctattgctgacagccaagacgggcgtattccctcgcgaccagcct
gtggcgtggttggggctccggaagggcgcgcgcgagcgcttttttgggaggacaccacag
gtggacgcctcagctgatcgtcctccctcccggggaccctgccccgagtcgccgagtagc
cgcagagtcgcctccgtcgccccgccgcccctgtgtttcggacatggccgcacgccttct
ccgagggtccctacgcgtcctgggcggccaccgtgcgccgcgccagctgcccgccgcgcg
agctggcctcctttcctggatcccgcggactgggtgcaaccagcatctgggcggaggaaa
aaggggtgtcagccgtgccgccccgagaacagaccgccgagcccacgtcggggaaagccg
aaagatttctccagaagttcattccggagccccaatccaacccgtctagttcagcggcct
gcagtgggacccgggagcctgcgttttgggaggagatgttctcattccggaggggaagaa
cgtctagaaactccttctgctaaaaaattaacagatataggaattcgaagaatcttttct
ccagagcatgacattttccggaaaagtgtaaggaagtttttccaagaagaagtgattcct
catcactcagaatgggagaaagctggagaagtaagtagggaggtttgggaaaaagctgga
aaacaaggactgcttggtgtcaatattgcagagcatcttggtggaattggaggggatctg
tactccgcagctattgtctgggaggagcaagcttattcaaattgttcaggcccaggtttt
agtattcattcaggtattgtcatgtcctatattacaaaccatggctcagaagaacagatt
aagcactttattccccagatgactgcaggcaaatgtattggtgcaatagcaatgacagag
cctggagctggaagacagcagcaacttgtgggtcaacagaaagcaaatagtgcaaagaaa
gaacttaaagatgccaaagaagttgacttacagggaataaaaacaaatgctaaaaaggat
ggaagtgactggattctcaatggaagcaaggtgttcatcagtaatgggtcattaagtgat
gttgtgattgtagttgcggtcacaaatcatgaagctccctcccctgcccatggtattagc
ctttttctggtggaaaatggaatgaaaggatttatcaagggacgaaagctacataaaatg
ggattaaaagcccaggataccgcagaactattctttgaagatatacggttgccagctagt
gccctacttggagaagagaataaaggcttctattacatcatgaaagagcttccacaggaa
aggctgttaattgctgatgtggcaatttcagctagtgaattcatgtttgaagaaaccagg
aactatgttaaacaaagaaaagcttttggcaaaacagttgctcacctacagacagtgcaa
cataaattagcagaattaaaaacacatatatgtgtaacccgagcatttgtggacaactgt
ctccagctgcatgaagcgaaacgtttggactccgccactgcttgcatggcgaaatattgg
gcatctgagttacaaaatagtgtagcttacgactgtgtacagctccatggaggttgggga
tacatgtgggagtacccaattgcaaaaccgcggcagggttccgtccccccgagccgcacg
gagctgcgggcagcgggcggactgttaaaccgcggcggcggcggcggcggcggcggcggc
ggcggcggcggcggcggccggggcgggagcggggcgcgctctggagacgagtcagcggcg
ccccgggaaaacgtagatttgggctttagagttagatgggatagagcagaatctagggga
tttttgagggacggtgcttccaagtttgtgtcaccggttcttcgaagttctgcatcatta
gactaa

>gi568815596r:210088964_210325263|GENSCAN_predicted_peptide_3|125_aa
MESIIKSLPSKKSPGPNHFHAEFYQKFKDELSPILRKFYQKTEEERILPNSFYDVGIALI
PKPDKDTRKENYRSISLMNMDPKILKMLANQIHQHIKKIIHHDQVEQIHLKIYTISIDAE
KKHLI

>gi568815596r:210088964_210325263|GENSCAN_predicted_CDS_3|378_bp
atggaatctataataaaaagtcttccatcaaagaaaagcccaggacctaatcacttccat
gctgaattctatcaaaaatttaaagacgaactaagtccaattcttcgcaaattctaccaa
aaaactgaagaggagagaattcttccaaactcattctatgatgtcggcattgccctgata
ccaaaaccagacaaggacacaagaaaagaaaactatagatcaatatccctgatgaacatg
gatccaaaaattctcaaaatgctagcaaaccaaattcaccaacacattaaaaagattatt
caccatgatcaagtggaacagatccacttgaaaatatataccatttcaatagatgcagaa
aaaaagcatttgatctaa

>gi568815596r:210088964_210325263|GENSCAN_predicted_peptide_4|224_aa
MAPKKDVKKPVAAAAAAPAPAPAPAPAPAPAKPKEEKIDLSAIKSFSADQIAEFKEAFLL
FDRTGDSKITLSQVGDVLRALGTNPTNAEVRKVLGNPSNEELNAKKIEFEQFLPMMQAIS
NNKDQATYEDFVEGLRVFDKEGNGTVMGAELRHVLATLGEKMKEEEVEALMAGQEDSNGC
INYEATLKTSIWNLVSLQLPVGCVRPDTKMGLLSYVSPLWRKLE

>gi568815596r:210088964_210325263|GENSCAN_predicted_CDS_4|675_bp
atggcaccaaagaaagacgtgaagaaacctgtggctgcggctgcggctgccccagccccg
gcaccggcacctgcacctgcccctgccccagccaaacccaaagaagaaaaaattgacctc
tctgccattaagtccttcagtgctgaccagattgctgaattcaaggaggcatttctcctg
tttgacagaacaggtgattccaagatcaccttaagccaggtcggtgatgtccttcgagct
ctgggcacaaatcccaccaatgcagaggtcaggaaagttctgggaaaccccagcaatgaa
gagctgaatgccaagaaaattgagtttgaacaatttctgcctatgatgcaagccatttcc
aacaacaaggaccaggccacctatgaagactttgttgagggtctgcgtgtctttgacaag
gaaggcaatggcacagtcatgggtgctgaactccgccatgttctagccaccctgggtgaa
aagatgaaagaggaagaagtggaagccctgatggcaggtcaagaagactccaatggctgc
atcaactacgaagccactctgaaaaccagtatctggaacctggtgtcactacaactgcct
gtaggctgtgtcagacctgacaccaagatgggtctcctcagttatgtctccccactgtgg
agaaaattagaatag