GENSCAN 1.0 Date run: 4-Nov-116 Time: 12:34:59 Sequence gi568815594r:119960121_120166734 : 206614 bp : 36.64% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 PlyA - 586 581 6 1.05 1.06 Term - 4780 4656 125 2 2 44 34 110 0.041 -1.23 1.05 Intr - 21352 21281 72 2 0 71 101 44 0.181 2.46 1.04 Intr - 36303 36185 119 2 2 5 109 145 0.003 7.49 1.03 Intr - 46316 46171 146 2 2 97 79 139 0.849 12.06 1.02 Intr - 51336 51221 116 1 2 49 75 126 0.964 6.65 1.01 Init - 51567 51429 139 0 1 70 89 84 0.628 7.06 1.00 Prom - 52120 52081 40 -5.55 2.13 PlyA - 52519 52514 6 1.05 2.12 Term - 56484 55798 687 0 0 64 42 199 0.001 5.52 2.11 Intr - 66880 66796 85 0 1 23 84 63 0.003 -1.70 2.10 Intr - 72651 72557 95 0 2 49 98 110 0.039 5.94 2.09 Intr - 90008 89858 151 1 1 47 27 125 0.020 1.54 2.08 Intr - 93527 93259 269 2 2 48 92 84 0.552 0.21 2.07 Intr - 100170 100040 131 1 2 69 33 139 0.063 5.99 2.06 Intr - 100857 100754 104 2 2 77 89 106 0.997 8.50 2.05 Intr - 101975 101855 121 0 1 46 99 104 0.947 5.93 2.04 Intr - 105698 105552 147 0 0 95 63 147 0.953 12.19 2.03 Intr - 106876 106542 335 0 2 10 84 214 0.229 7.49 2.02 Intr - 107370 107247 124 1 1 95 -21 170 0.236 5.52 2.01 Init - 141382 141241 142 1 1 64 81 64 0.359 3.64 2.00 Prom - 147575 147536 40 -3.65 3.03 PlyA - 148339 148334 6 1.05 3.02 Term - 157932 157516 417 0 0 13 43 221 0.702 4.39 3.01 Init - 158551 158345 207 1 0 39 47 136 0.271 3.47 3.00 Prom - 160792 160753 40 -4.65 4.00 Prom + 164300 164339 40 -8.15 4.01 Sngl + 166375 167391 1017 0 0 37 38 345 0.980 21.17 4.02 PlyA + 167572 167577 6 1.05 5.03 PlyA - 168369 168364 6 1.05 5.02 Term - 179022 178753 270 0 0 54 43 198 0.199 6.40 5.01 Intr - 182568 182397 172 2 1 53 68 88 0.025 2.32 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 33945 34143 199 2 1 72 107 74 0.831 6.82 S.002 Sngl - 56520 55798 723 0 0 61 42 222 0.932 10.87 S.003 Term - 100170 99998 173 1 2 69 55 185 0.927 10.31 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:119960121_120166734|GENSCAN_predicted_peptide_1|238_aa MLLPQEPGCSTSSSTLQEPFQEQPSPDIEGGFILKPRQPARGQQGSEEGGQYRQEANTCP ESDLEKPEMSEVKAQLTSRKISFMCVSIVQFLIENCLSIFGEDITSLLEKSSMTCDNSDA SGTLKNSVGFDEKQGLPRRSGSVNLQPLSREDLGRSQSESLGPEFQGLWEWLPGLFRAEL RLSALNSSSQNANCPYKYILTILKKNTGFGKRDHETLSRKTFMDQGLMAKTVITFTPT >gi568815594r:119960121_120166734|GENSCAN_predicted_CDS_1|717_bp atgctcctgccacaggaaccagggtgctccacctcttcttccacactccaggagcccttc caggagcagccatccccagacatagaaggcgggttcattctgaagcccaggcagccagcc agaggccagcaggggagtgaggaaggaggacagtataggcaggaagccaatacctgtcca gaatcagacttagagaaacctgaaatgtctgaagtcaaggcccaacttacttctaggaag atttctttcatgtgtgtttctattgtacaatttctcattgaaaactgcctcagcatattt ggagaagacatcacttccctcttggagaagagctcaatgacttgtgacaacagtgatgct tcaggtaccttgaaaaattcagttggttttgatgagaaacaggggcttccgaggcgatcg ggcagtgtcaatcttcagccgctaagccgagaagatctgggaaggagtcagtcagagagc cttgggccagagttccaggggctctgggagtggctgccaggtttgtttagagcagaactg aggctttcagcactaaattcaagctctcaaaatgccaactgcccatacaaatatattttg accatcctgaagaaaaatacaggatttggcaagcgtgaccatgaaactctttctagaaaa accttcatggatcaaggattaatggcaaaaactgtaattactttcacaccaacataa >gi568815594r:119960121_120166734|GENSCAN_predicted_peptide_2|796_aa MDMKRGDSDEGSERKEKSYRESIHLLREYTNNCKQNVGKNMDYKGHSGWAGWTERSLDFR TQPPTAPDSPTPLASEEGPKPRTDVEKGHVTPGTARPGPPTAPARTGRSRDITGRLRERA GPRCVGLGLKARRACAKDLTTCCVVTFETLGGEVLLEPLWLLSAEWKRVLLFVSLAMALQ LSREQGITLRGSAEIVAEFFSFGINSILYQRGIYPSETFTRVQKYGLTLLVTTDLELIKY LNNVVEQLKDWLYKCSVQKLVVVISNIESGEVLERWQFDIECDKTAKDDSAPREKSQKAI QDEIRSVIRQITATVTFLPLLEVSCSFDLLIYTDKDLVVPEKWEESGPQFITNSEEVRLR SFTTTIHKEEALHAELPRAKLESTVLELLQAMEKFLLCCLGWAERKSGERAPERYSSAVF CHQDWNRNPKSGVTSIPPAICQLFHLLVYYSVCSTGIITCLETGPERVTIISSYGQQRAA KGSGVRACSQGLGILHFVEAKIRNWEVENNSISKPDDSGEKKGKGKNDEGTILQRKSFRR YINLAKEKDHPKNIPLQIFTEDIKKCASETHLTHKDSHKLKVKGWKKAFHANGHQKQAGV AILTSDKTNFKATAVKRDKAGHYLLVEDLVPQENITTLNIYTPKTGAPKFIKQLLIDRRN EIDSNTIIVGDFSTPLTALDRSSRQKVNKETMDLNYALVQMDLTNIYRTFHPTNTEYTFY SAAYGTFSKIDHMIGHKMSLNKFQKIEIMSSALSDHKRIKLEVNSKRNLQNHANTWKLNN LLLNEHWVKNKIKMEI >gi568815594r:119960121_120166734|GENSCAN_predicted_CDS_2|2391_bp atggacatgaaaaggggagattctgatgagggatcagaaagaaaagagaagagctataga gaaagcatccatcttcttagagaatacacaaataattgtaaacagaatgttggcaaaaat atggattacaaaggccattctgggtgggcagggtggacagaacgcagcctggacttccgg acgcagccccctaccgccccggactcgcctacccctctggcctctgaggagggccctaag cccaggacagacgtggaaaaagggcatgtcacacccggcacagcgcgaccaggtccacca actgcgcctgcgcgaaccggccgctcccgcgacatcacaggaaggctgagagaaagggca ggtccccgctgcgtggggctgggcttaaaggctcgtcgcgcctgcgcaaaggacctgacg acgtgctgcgtcgttacttttgaaacgcttggcggggaagtgctgttggagccgctgtgg ttgctgtccgcggagtggaagcgcgtgcttttgtttgtgtccctggccatggcgctgcag ctctcccgggagcagggaatcaccctgcgcgggagcgccgaaatcgtggccgagttcttc tcattcggcatcaacagcattttatatcagcgtggcatatatccatctgaaacctttact cgagtgcagaaatacggactcaccttgcttgtaactactgatcttgagctcataaaatac ctaaataatgtggtggaacaactgaaagattggttatacaagtgttcagttcagaaactg gttgtagttatctcaaatattgaaagtggtgaggtcctggaaagatggcagtttgatatt gagtgtgacaagactgcaaaagatgacagtgcacccagagaaaagtctcagaaagctatc caggatgaaatccgttcagtgatcagacagatcacagctacggtgacatttctgccactg ttggaagtttcttgttcatttgatctgctgatttatacagacaaagatttggttgtacct gaaaaatgggaagagtcgggaccacagtttattaccaattctgaggaagtccgccttcgt tcatttactactacaatccacaaagaggaagccttgcatgctgaattgccaagggcaaaa ttagaaagcactgtactcgagctgctacaagccatggaaaaatttcttctttgctgcttg ggatgggctgagagaaaaagtggagagagggccccagaacgttactccagtgctgttttc tgtcatcaggattggaacagaaaccctaagagtggagtaacaagcatccctcctgctatt tgccagctttttcatctcttggtatattacagtgtgtgttccactggcattattacctgc cttgagacagggcctgaaagagtaaccatcatttccagttatggtcagcagagggcagca aagggcagtggagtacgtgcttgctctcagggcctgggcatccttcactttgtggaggca aaaatcaggaactgggaagtagagaataattccatcagcaaacctgatgactctggggaa aagaagggcaagggcaagaatgatgaagggactattcttcaacgcaagagctttagaagg tacataaatctagccaaagaaaaggaccacccgaagaacattccccttcaaatcttcact gaggacatcaagaaatgtgcctctgagactcacctaacacataaggattcacataaactt aaagtaaaagggtggaaaaaggcatttcatgcaaatggacaccaaaagcaagcaggggta gctattcttacatcagacaaaacaaattttaaagcaacagcagttaaaagagacaaagcg ggacattatttgctagtagaagatcttgtcccacaggaaaatatcacaaccctaaacata tacacacctaaaactggagctcccaaatttataaaacaattactaatagaccgaagaaat gagatagacagcaacacaataatagtgggggacttcagtactccactgacagcactagac aggtcatcaagacagaaagtcaacaaagaaacaatggatttaaactatgccttggtacaa atggacttaacaaatatatacagaacatttcatccaacaaacacagaatacacattctat tcagcagcatatggaactttttctaagatagaccatatgataggccataaaatgagcctc aataaatttcagaaaattgaaatcatgtcaagtgctctctcagatcacaagagaataaaa ttggaagtcaactccaaaaggaaccttcagaaccatgcaaatacatggaaattaaataac ctgctcctgaatgaacattgggtcaaaaacaaaatcaagatggaaatttaa >gi568815594r:119960121_120166734|GENSCAN_predicted_peptide_3|207_aa MDNEIQTEMVSDGEEKLVGNWSKGNSCYVLAKRLAAFCPCPKDLWNFELERDDLRYLVEE ISKWQNMQELPRGVEPASAQKSKIGVWEPLTRFQQMYGNAWMPRQKFAVGVGSSWRTYAR AMWKAYVGLVLPHGVPTGAPPNGTVRRGPPSSRPQNGRSTDSLHYAPGRAADTQCQPCEH SQEEGCSLQSHRGRAAQDHGNPPLALE >gi568815594r:119960121_120166734|GENSCAN_predicted_CDS_3|624_bp atggacaatgaaatccagactgagatggtctcagatggagaagagaaacttgttgggaac tggagcaaaggtaactcctgttatgttttagcaaaaagactggcagcattttgcccctgt cctaaagatttgtggaactttgaacttgagagagatgacttaaggtatctggtggaagaa atttctaagtggcaaaacatgcaagagcttccacgtggtgttgagcctgcaagtgcacag aagtcaaaaattggggtttgggaacctctgactagatttcagcagatgtatggaaatgca tggatgcccaggcagaagtttgctgtaggggtggggtcctcatggagaacctatgctagg gcaatgtggaaggcatatgtggggttggtgctgccacatggagtccctactggggcaccg cctaatggaactgtgagaagagggccaccatcctccagacctcagaatggtagatccact gacagcttgcactatgcacctggaagagctgcagacactcaatgccagccctgtgaacac agccaagaggaaggctgttccctgcaaagccacaggggcagagctgcccaagaccatggg aacccacctcttgcattagagtga >gi568815594r:119960121_120166734|GENSCAN_predicted_peptide_4|338_aa MNIDAKILNKILANQIQQHVKKLIHHDQVGFIPGMQGWFNICKSINVIQHINRTKDKNHI IISIDAEKAFDKIQQPFMLKTLSKLGIDGTYLKIIRAIYDKPTANIILNGQKLEAFPLNT GTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADEMIVYLENPIVSA QNLLKLISNFSKVSGYKINVQKSQAFLYTNNRLTESQIMSELPFRVASKRIKYLGIQLTR DVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRSNAIPIKLPMT FFTEFEKNYFRVHMEPKKSPHHQVNPKQKEQSWRHHTT >gi568815594r:119960121_120166734|GENSCAN_predicted_CDS_4|1017_bp atgaacattgatgcaaaaatcctcaataaaatactggcaaaccaaatccagcagcacgtc aaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaac atatgcaaatcaataaatgtaatccagcatataaacagaaccaaagacaaaaaccacatc attatctcaatagatgcagaaaaggcctttgacaaaattcaacaacccttcatgctaaaa actctcagtaaattaggtattgatgggacgtatctcaaaataataagagctatctatgac aaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaacact ggcacaagacagggatgccctctctcaccactcctgttcaacatagtgttggaagttctg gccagggcaattaggcaggagaaggaaataaaaggtattcaattaggaaaagaggaagtc aaattgtccctgtttgcagatgagatgattgtatatctagaaaaccccattgtctcagcc caaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgta caaaagtcacaagcattcttatacaccaataacagactaacagagagccaaatcatgagt gaactcccattcagagttgcttcaaagagaataaaatacctaggaatccaacttacaagg gatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggat acaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatg gccatactgcccaaggtaatttatagatccaatgccatccccatcaagctaccaatgact ttcttcacagaatttgaaaaaaactactttagagttcatatggaaccaaaaaagagccca catcaccaagtcaatcctaagcaaaaagaacaaagctggaggcatcacactacctga >gi568815594r:119960121_120166734|GENSCAN_predicted_peptide_5|147_aa XREQPLKPICGLFSKRSQPVVSIVQGPMGIDPWITGSLPITQKEVVLVSPTSLRMIVLPS TKIVRLLQPNGKFKLTSFLCKLPGPRYVFISSIKMDYSSKLVPIEWGIAEKIPKNVEATL ELGNRQRLEESGGLRRREKNVGKFGTS >gi568815594r:119960121_120166734|GENSCAN_predicted_CDS_5|444_bp nnaagggagcagcctctgaagcccatctgtggattattcagcaaaaggagccagcctgta gtgtcaattgtgcaaggccccatgggaattgacccctggattacaggttccctgccaatc acccaaaaagaggtggttcttgttagccccacatccctcagaatgatagtgctgccttcc accaagattgtgaggcttctccagccaaatggcaagttcaagttaacctctttcttatgt aaattgcccggtccccggtatgtctttatcagcagcataaaaatggattattccagtaaa ttggtacctatagagtggggcattgctgaaaagatacccaaaaatgtggaagcaactttg gaactgggtaacaggcagaggttggaagagtctggagggctcagaagaagagagaaaaat gtgggaaagtttggaacttcctag