GENSCAN 1.0 Date run: 4-Nov-116 Time: 04:09:24 Sequence gi568815591r:41589653_41800374 : 210722 bp : 39.46% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init - 2624 2317 308 2 2 60 105 148 0.295 10.51 1.00 Prom - 14638 14599 40 -5.45 2.00 Prom + 20631 20670 40 -4.45 2.01 Init + 31441 31524 84 0 0 79 115 58 0.675 8.47 2.02 Term + 37851 37919 69 2 0 77 54 65 0.206 -1.04 2.03 PlyA + 38089 38094 6 1.05 3.03 PlyA - 38881 38876 6 1.05 3.02 Term - 45705 45423 283 2 1 52 42 164 0.001 2.11 3.01 Init - 58952 58735 218 2 2 50 18 240 0.154 11.41 3.00 Prom - 63155 63116 40 -3.65 4.16 PlyA - 63930 63925 6 1.05 4.15 Term - 68293 68102 192 1 0 53 40 131 0.481 1.24 4.14 Intr - 69269 69241 29 0 2 96 50 38 0.137 -2.28 4.13 Intr - 77180 76968 213 0 0 24 73 183 0.352 8.16 4.12 Intr - 83013 82920 94 0 1 44 84 32 0.001 -2.98 4.11 Intr - 95099 94987 113 0 2 148 72 26 0.040 6.18 4.10 Intr - 100890 100012 879 1 0 112 15 929 0.074 78.37 4.09 Intr - 102718 102506 213 2 0 42 53 134 0.015 3.06 4.08 Intr - 110649 110335 315 1 0 28 77 354 0.370 23.41 4.07 Intr - 115819 115632 188 0 2 69 87 214 0.966 17.81 4.06 Intr - 116196 116078 119 0 2 75 36 118 0.837 3.64 4.05 Intr - 116552 116488 65 0 2 46 77 47 0.377 -3.08 4.04 Intr - 119353 119201 153 2 0 61 94 98 0.271 6.82 4.03 Intr - 127383 127123 261 1 0 -3 47 200 0.344 3.34 4.02 Intr - 130758 130653 106 0 1 52 72 71 0.108 0.77 4.01 Init - 137027 136956 72 2 0 94 27 84 0.459 4.02 4.00 Prom - 137206 137167 40 -7.25 5.02 PlyA - 140782 140777 6 1.05 5.01 Sngl - 143324 143121 204 2 0 100 44 301 0.818 21.84 5.00 Prom - 145410 145371 40 -7.45 6.00 Prom + 150757 150796 40 -3.45 6.01 Init + 156256 156369 114 0 0 58 102 37 0.200 2.37 6.02 Intr + 160968 161044 77 2 2 87 92 37 0.073 1.39 6.03 Term + 177559 177691 133 1 1 100 47 114 0.648 5.08 6.04 PlyA + 178411 178416 6 1.05 7.00 Prom + 180369 180408 40 -5.35 7.01 Init + 180494 180631 138 1 0 78 82 56 0.418 4.19 7.02 Intr + 187583 187748 166 1 1 73 87 76 0.437 4.61 7.03 Intr + 193883 194061 179 0 2 59 70 111 0.106 5.12 7.04 Intr + 199568 199693 126 1 0 1 54 136 0.076 1.46 7.05 Term + 201978 202136 159 2 0 32 43 145 0.112 1.36 7.06 PlyA + 203421 203426 6 1.05 8.02 PlyA - 204516 204511 6 1.05 8.01 Term - 208672 208525 148 0 1 96 42 106 0.600 3.19 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 100890 99998 893 1 2 112 40 933 0.922 82.50 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:41589653_41800374|GENSCAN_predicted_peptide_1|103_aa MACPGRVGGAAESDCNSFQRTGMYLLCTKSDVRRAAFSYEASQVKSCKCLWLGPSLANAY GCCLELDSLMTIVYFSNPTAMNVALVHILDDDDENHILKMAES >gi568815591r:41589653_41800374|GENSCAN_predicted_CDS_1|309_bp atggcttgtccaggcagggttgggggagcagctgaatctgactgcaactccttccagaga actggaatgtacttgctgtgcaccaagtctgatgtgaggagggcagccttctcctatgag gcctcccaggtaaagtcttgcaaatgcttatggttgggaccaagccttgcaaatgcttat ggttgttgcctagagctggactccttgatgacaattgtttatttttctaatcccactgct atgaatgtggctctagttcacatcttagatgatgatgatgagaaccatatactaaagatg gcagaaagn >gi568815591r:41589653_41800374|GENSCAN_predicted_peptide_2|50_aa MGSLLVLRALQAAMEPVRFPVHRMAGEMLWQPKMSSDIAKCPLKCEIAPS >gi568815591r:41589653_41800374|GENSCAN_predicted_CDS_2|153_bp atgggctccctcttggtcctaagagcactacaggcagcaatggagccagtgagatttcct gttcacaggatggcaggagagatgttgtggcaaccaaaaatgtcttcagatattgctaaa tgtcccctgaagtgtgaaattgcccccagttga >gi568815591r:41589653_41800374|GENSCAN_predicted_peptide_3|166_aa MLPLDAKTNSGDETKAGQELIVLRSLKLKNSGRDEYGKEKKQQAVEVDQVPEGRKISGNS ASRGAKQQLWQLGRVHGSWPDYRDKMACCRHYAEAEDMGHLTEISISQLGTVQNASPVST STTKGDFSAVSPSTYSKAMYRKWPLSFDLLSPPVCLLPTLVTQGPA >gi568815591r:41589653_41800374|GENSCAN_predicted_CDS_3|501_bp atgctgcctcttgatgccaaaaccaactctggagatgaaacaaaagcaggacaggagttg attgttctcagatctttaaaactgaaaaattcagggagagacgaatatggaaaggaaaag aaacagcaggcagtagaagtggatcaagtacccgagggaaggaaaattagtgggaattct gcctcaaggggtgcaaaacagcagctgtggcagctgggccgagttcatgggagctggcca gactacagagacaagatggcctgttgcaggcactacgctgaggcagaggacatgggccac cttacagagatctcaatttcacagcttggcacagttcagaatgcgagtccagtaagcact agtacaacaaaaggagacttctctgcagtctcaccatccacctactccaaggctatgtac aggaagtggccattgagtttcgaccttctgtccccgccagtatgcctgctgcccacactg gtcacacagggcccagcttaa >gi568815591r:41589653_41800374|GENSCAN_predicted_peptide_4|1003_aa MGNSLKPSQSLWSGNYDYNPTGYMGRLLLNLEQAMVGIWHMQRRKENELDMAKGGEEVVG WSPLLFYIEWSKKASAIWKQITRNSGSELRGCLMENDEIPGRGDVKHKGLGNARAHSVGR IARGITWEEDRDQGETEKKARSGMWPAIWEQVVGGSLWTGKSGCKSRLLLSYQSNGIYVL NHHVFKSPMELITQGDPGTAAATGKHHYRVGRGPKLGVTQAAPPGSLVAAPRGENRAMQM LQRGVGFVKSPVNQAASTSCQKGGKNQELLLKEVALAGAQGKNRGCRLDRLGPGSRHPAS WARAAAAFRRDPWKLPAGAAQGSEGHSAAPDCPSCALAALPKDVPNSQPEMVEAVKKHIL NMLHLKKRPDVTQPVPKAALLNAIRKLHVGKVGENGYVEIEDDIGRRAEMNELMEQTSEI ITFAESGPRVISMAGFPLTIEIQAGRSSQGVASSKMNFLKIVPRFVHGIHVQQTVVRRST VMEKRICFYMPPSGKNGRTARKTLHFEISKEGSDLSVVERAEVWLFLKVPKANRTRTKVT IRLFQQQKHPQGSLDTGEEAEEVGLKGERSELLLSEKVVDARKSTWHVFPVSSSIQRLLD QGKSSLDVRIACEQCQESGASLVLLGKKKKKEEEGEGKKKGGGEGGAGADEEKEQSHRPF LMLQARQSEDHPHRRRRRGLECDGKVNICCKKQFFVSFKDIGWNDWIIAPSGYHANYCEG ECPSHIAGTSGSSLSFHSTVINHYRMRGHSPFANLKSCCVPTKLRPMSMLYYDDGQNIIK KDIQNMIVEESRKLEMHGATRARDLYTWGKMYLVLLPGPTELEFGITEILIIDIFLKLAQ VIQLACGGHTNLEQKSIIWGPWVAGVYPSSSGHNVEPALDKTPSNLRVHSHQDTLDIPVS PTCTSLGYGKKLEGLEKTYNRHGENVGTPHVGIESDAENPELALVPWNGDEGREKATVLS AREQTRLTFPVDRCACSLAEKTVAFSLHPSPPNCLPQFQEWHH >gi568815591r:41589653_41800374|GENSCAN_predicted_CDS_4|3012_bp atgggtaattcactgaagccttctcaaagcctgtggagtgggaactacgattacaacccc actggctacatgggcagacttttactaaatttggaacaggcaatggtgggtatctggcac atgcagcgcagaaaggaaaatgaactggacatggccaagggaggagaggaagtggtggga tggagtccgttgctattttatattgagtggtccaaaaaggcttcagcgatctggaaacaa ataacaagaaattcagggagtgagctgcgagggtgtctgatggaaaatgatgaaattcct ggcagaggcgacgtcaagcacaaaggcttgggaaatgcacgagctcattctgttggaagg atagcaagggggatcacatgggaagaagacagagaccaaggagaaacagagaagaaggca cgttcggggatgtggccagcgatttgggaacaggttgtggggggatctttgtggacaggc aaaagtggatgcaaatcaagattgctgctctcttatcagtccaatggaatctacgtgcta aaccaccatgtcttcaagtcaccaatggaacttattacccaaggggatccagggacagct gctgccacaggtaaacaccactatagggttggccgtggccctaagctaggtgtcacccaa gctgctccaccgggttcgctagtggctgctcctcgaggcgagaacagagccatgcaaatg ctccagcgtggagttgggtttgtgaagtcgccagtaaatcaggccgcctccactagctgc caaaaggggggaaaaaatcaagagctgctcttaaaagaagttgcccttgctggtgctcag ggtaaaaatagaggctgccgcttagaccggcttggccctggctccaggcatcctgcgagc tgggctcgagcagcggccgcgttccggcgtgatccctggaagctgccagcaggtgctgct caaggatccgaggggcacagcgcggcccccgactgtccgtcctgtgcgctggccgccctc ccaaaggatgtacccaactctcagccagagatggtggaggccgtcaagaagcacatttta aacatgctgcacttgaagaagagacccgatgtcacccagccggtacccaaggcggcgctt ctgaacgcgatcagaaagcttcatgtgggcaaagtcggggagaacgggtatgtggagata gaggatgacattggaaggagggcagaaatgaatgaacttatggagcagacctcggagatc atcacgtttgccgagtcagggcctcgagtgataagtatggctggatttccgttaactata gaaatccaggcagggagatcctcccagggagttgcaagttctaagatgaacttcttgaag attgttcccagatttgtgcatgggatacatgtacagcagacagttgtaaggaggtccacg gtcatggaaaagaggatatgtttctatatgccaccaagtgggaaaaatggaagaacagcc aggaagacgctgcacttcgagatttccaaggaaggcagtgacctgtcagtggtggagcgt gcagaagtctggctcttcctaaaagtccccaaggccaacaggaccaggaccaaagtcacc atccgcctcttccagcagcagaagcacccgcagggcagcttggacacaggggaagaggcc gaggaagtgggcttaaagggggagaggagtgaactgttgctctctgaaaaagtagtagac gctcggaagagcacctggcatgtcttccctgtctccagcagcatccagcggttgctggac cagggcaagagctccctggacgttcggattgcctgtgagcagtgccaggagagtggcgcc agcttggttctcctgggcaagaagaagaagaaagaagaggagggggaagggaaaaagaag ggcggaggtgaaggtggggcaggagcagatgaggaaaaggagcagtcgcacagacctttc ctcatgctgcaggcccggcagtctgaagaccaccctcatcgccggcgtcggcggggcttg gagtgtgatggcaaggtcaacatctgctgtaagaaacagttctttgtcagtttcaaggac atcggctggaatgactggatcattgctccctctggctatcatgccaactactgcgagggt gagtgcccgagccatatagcaggcacgtccgggtcctcactgtccttccactcaacagtc atcaaccactaccgcatgcggggccatagcccctttgccaacctcaaatcgtgctgtgtg cccaccaagctgagacccatgtccatgttgtactatgatgatggtcaaaacatcatcaaa aaggacattcagaacatgatcgtggaggagtctagaaaactggaaatgcatggagcaact agagctagagatctatatacatggggaaaaatgtacttagtcttattaccgggaccaaca gagttggaatttggaattacagaaattttaattatcgatatatttcttaaacttgcccaa gtcattcagcttgcatgtggcggacatacgaatctagaacagaagtccattatctggggg ccttgggtggccggagtataccccagcagctcagggcacaatgtggaaccagccctggac aagacgccctccaatctcagggtacactcacaccaggacactttagacataccagttagc ccaacatgcacatctttgggatatgggaaaaaactggagggcctggagaaaacctacaac agacatggggagaatgtgggaactccacacgttggtatagaatctgatgcagaaaaccca gagctggctttggttccatggaatggggatgagggtagagaaaaagccacagtcctctct gccagagaacagacacggctgacctttccagtggacagatgtgcctgttctctggcagaa aagactgtggctttttctctacacccatctccgcccaactgcctgccccagtttcaggaa tggcaccattag >gi568815591r:41589653_41800374|GENSCAN_predicted_peptide_5|67_aa MAVTMERGNLVVRTGKVTAEYGKSEVERRGQGVAANEVTAERGDSGQSRGDQMTVNLRSE VPHPNES >gi568815591r:41589653_41800374|GENSCAN_predicted_CDS_5|204_bp atggcggtgacaatggaaagagggaacctggtggttcgcacaggaaaggtcacggcagaa tacgggaagtcagaggtggagcggagaggccaaggggtggctgcaaatgaagttacagca gagaggggggacagcggtcagagcagaggggatcaaatgaccgtgaacctgagaagtgaa gtgcctcatccaaatgaaagttga >gi568815591r:41589653_41800374|GENSCAN_predicted_peptide_6|107_aa MATPQAEQLLGLLNQGYLLLLLDYMTNKGELFRSFLRKKDVTGKVFQSRPQVRVLGSCTR KNSGPLGTPLAAGNILDRIRGQKGSQPICFIHAFNKPAPSHQNVMSL >gi568815591r:41589653_41800374|GENSCAN_predicted_CDS_6|324_bp atggctactccacaggcagagcagctgcttgggctgctcaatcaaggatacttattatta cttctcgattatatgacaaacaagggtgaattattcaggagttttctgagaaagaaagat gttactggaaaggtgttccagtccagaccccaagtgagggttcttggatcttgcacaaga aagaattctggcccgctgggcacccctttagcagctggtaacattcttgacagaattaga ggacagaaaggaagtcaaccaatctgcttcatccatgccttcaacaagcctgcaccaagc catcagaatgtgatgtccctctga >gi568815591r:41589653_41800374|GENSCAN_predicted_peptide_7|255_aa MEYYEAIKKNDHVLCRDMDEAGSHHSQQTNTGAENQTLHVLTHKWEPNMRFFYSSYDIQG ALTDVPNTVTVEFILQNWATLSDYLLPQRVTGLTERTVLQNSKELKTENEYKKVCREASA VTWVTDTGGLDCNSSCRCGEKTVDFRGQVHRVRLWVGLVEKERESKILESGSVDPEKRPV GWFAIEKKDRDRCRYIHDIMLTRVLQTKESTFAVHNGCDKNLVQKVPVKPNVLQAPSSSK PNVLQAPLSSILECS >gi568815591r:41589653_41800374|GENSCAN_predicted_CDS_7|768_bp atggaatactatgaagccataaaaaagaatgatcatgtcctttgcagggacatggatgaa gctggaagccatcattctcagcaaactaacacaggagcggaaaaccaaacactgcatgtt ctcactcataagtgggagccaaatatgagattcttttatagcagctatgacattcaagga gctcttacagatgtccccaatactgtcacagtggagtttatcttacagaattgggccact ctaagtgactacctgttgcctcaaagggtaacaggtctaacagaaagaactgtactccag aattcaaaggagctgaagacagagaacgaatacaagaaagtctgtagggaggcaagtgca gtgacctgggtgacagatactggtggcctggactgcaacagcagctgtagatgtggagag aagacggtagatttcagaggacaggtccacagggttcgactgtgggttggtttagtggaa aaggaaagggaatctaaaatactagaatctggcagtgttgacccagagaagaggcccgtg gggtggtttgctatagagaagaaagacagagacaggtgtcgctatatccatgacatcatg ctcaccagggtcttacaaactaaggagagcacatttgcagtgcacaatgggtgtgacaaa aacttggttcaaaaggttccagtgaagcctaatgtcctacaggctccatcatcctcaaag cctaatgtcctacaggctccattgtcctcaattcttgagtgttcctga >gi568815591r:41589653_41800374|GENSCAN_predicted_peptide_8|49_aa XSLTTLDCIAWVPVGSSSIRWGTDKECEAFLLHSRQLGGCLAVGSCALA >gi568815591r:41589653_41800374|GENSCAN_predicted_CDS_8|150_bp nnaagcttgacgactttggattgcattgcctgggttccagtgggaagcagcagtataagg tgggggacagataaagagtgtgaggcatttcttctccattcccgccagcttggaggctgc cttgctgttggcagctgtgcccttgcatga