GENSCAN 1.0 Date run: 5-Nov-116 Time: 10:15:04 Sequence gi568815588r:119475668_119688248 : 212581 bp : 46.24% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 18410 18769 360 1 0 101 37 141 0.721 6.37 1.02 PlyA + 18926 18931 6 1.05 2.06 PlyA - 20558 20553 6 1.05 2.05 Term - 24592 24446 147 1 0 125 52 153 0.993 13.30 2.04 Intr - 39985 39842 144 1 0 64 91 207 0.922 19.08 2.03 Intr - 50451 50365 87 0 0 114 68 0 0.045 0.77 2.02 Intr - 51776 51639 138 2 0 32 9 177 0.039 4.86 2.01 Init - 56110 56102 9 1 0 110 81 28 0.134 3.31 2.00 Prom - 56310 56271 40 -2.96 3.17 PlyA - 56448 56443 6 1.05 3.16 Term - 66753 66312 442 2 1 60 52 200 0.003 8.33 3.15 Intr - 67020 66923 98 0 2 87 115 50 0.003 6.41 3.14 Intr - 80166 80052 115 2 1 106 82 -6 0.137 1.05 3.13 Intr - 86687 86445 243 1 0 99 75 37 0.016 0.11 3.12 Intr - 101083 100944 140 1 2 103 89 51 0.095 5.96 3.11 Intr - 101536 101413 124 0 1 63 91 77 0.136 6.09 3.10 Intr - 101868 101784 85 1 1 102 92 -12 0.994 -0.52 3.09 Intr - 102069 101974 96 1 0 77 87 18 0.595 0.58 3.08 Intr - 103167 103059 109 0 1 104 69 95 0.973 9.06 3.07 Intr - 104343 104268 76 2 1 105 92 44 0.971 6.02 3.06 Intr - 106342 106255 88 2 1 55 97 44 0.978 1.03 3.05 Intr - 106556 106502 55 2 1 85 100 -7 0.976 -1.25 3.04 Intr - 106890 106792 99 0 0 98 116 55 0.817 9.61 3.03 Intr - 121268 121148 121 1 1 94 -2 129 0.001 5.00 3.02 Intr - 121422 121344 79 1 1 108 51 42 0.015 1.11 3.01 Init - 124077 124011 67 0 1 81 43 76 0.019 3.63 3.00 Prom - 127431 127392 40 -2.96 4.00 Prom + 128433 128472 40 -0.86 4.01 Init + 143013 143088 76 2 1 68 72 125 0.955 10.16 4.02 Term + 150487 150668 182 2 2 46 39 92 0.132 -2.13 4.03 PlyA + 152535 152540 6 1.05 5.00 Prom + 154039 154078 40 -3.66 5.01 Sngl + 162759 163415 657 2 0 75 35 728 0.912 62.38 5.02 PlyA + 163445 163450 6 1.05 6.00 Prom + 174813 174852 40 -3.56 6.01 Init + 176009 176188 180 1 0 84 105 339 0.522 34.38 6.02 Intr + 184580 184696 117 1 0 109 90 54 0.832 8.36 6.03 Intr + 190872 191048 177 2 0 14 -3 181 0.466 1.72 6.04 Intr + 194184 194324 141 2 0 96 55 90 0.755 7.05 6.05 Intr + 194352 194510 159 2 0 61 94 50 0.654 3.08 6.06 Intr + 196588 196989 402 0 0 104 96 224 0.882 19.42 6.07 Intr + 200797 200970 174 0 0 84 75 126 0.630 11.04 6.08 Term + 201052 201615 564 0 0 103 38 604 0.863 51.29 6.09 PlyA + 201754 201759 6 1.05 7.00 Prom + 203450 203489 40 -5.86 7.01 Init + 208140 208238 99 2 0 47 105 41 0.115 2.11 7.02 Intr + 211514 211604 91 1 1 68 76 70 0.120 3.37 7.03 Intr + 211666 211760 95 2 2 59 78 79 0.161 3.68 7.04 Intr + 212206 212269 64 0 1 75 96 65 0.130 4.29 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 38154 38040 115 0 1 89 97 40 0.805 5.02 S.002 Init - 41639 41592 48 2 0 93 81 42 0.901 3.85 S.003 Term - 101083 100945 139 0 1 103 40 165 0.862 10.64 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:119475668_119688248|GENSCAN_predicted_peptide_1|119_aa MEMAGMDTVKLYFGEESRGLSDDQMSKCRTEETLRPLPAAPTQWPCGNQEKGSSKQACST GHPCKTLTLRPSLASAKLLKPLILTQRPGQFWTCKACSVCGNPRGALGSAQGLGTISTQ >gi568815588r:119475668_119688248|GENSCAN_predicted_CDS_1|360_bp atggagatggcaggcatggacacggtgaagctatattttggagaagaatctagaggactt tctgatgatcagatgtcgaagtgcaggacagaggagactctcaggccacttccagctgcc cctacccagtggccttgtgggaatcaggaaaagggctcctctaaacaagcatgcagcaca gggcacccgtgcaagaccctcaccctgagaccctcccttgcaagtgcgaagttgctcaag cccctgattctcactcagagacctggtcaattttggacttgcaaagcttgtagtgtctgt ggcaatccaaggggagccctgggctctgcccaaggtttagggaccatttcaacacagtga >gi568815588r:119475668_119688248|GENSCAN_predicted_peptide_2|174_aa MALSELCFADIHDSDGSSSSSHQSLKSTAKWAASLENLLEDPEGVKRFREFLKKEFSEEN VLFWLACEDFKKMQDKTQMQEKAKEIYMTFLSSKASSQVNVEGQSRLNEKILEEPHPLMF QKLQDQIFNLMKYDSYSRFLKSDLFLKHKRTEEEEEDLPDAQTAAKRASRIYNT >gi568815588r:119475668_119688248|GENSCAN_predicted_CDS_2|525_bp atggccctgtctgaactttgctttgcagacatccacgacagcgatggcagttccagcagc agccaccagagcctcaagagcacagccaaatgggcggcatccctggagaatctgctggaa gacccagaaggcgtgaaaagatttagggaatttttaaaaaaggaattcagtgaagaaaat gttttgttttggctagcatgtgaagattttaagaaaatgcaagataagacgcagatgcag gaaaaggcaaaggagatctacatgacctttctgtccagcaaggcctcatcacaggtcaac gtggaggggcagtctcggctcaacgagaagatcctggaagaaccgcaccctctgatgttc cagaaactccaggaccagatctttaatctcatgaagtacgacagctacagccgcttctta aagtctgacttgtttttaaaacacaagcgaaccgaggaagaggaagaagatttgcctgat gctcaaactgcagctaaaagagcttccagaatttataacacatga >gi568815588r:119475668_119688248|GENSCAN_predicted_peptide_3|678_aa MVAARENEDDAKAETPDKTIRSPRRLPGRRTSSRMTTLPRSLLFTSQRGTTIPRDFGFTT FLFLPDLSRSWLGFGRPEAHSDDVIRRERHTSNDPYCFVEFYEHRDAAAALAAMNGRKIL GKEVKVNWATTPSSQKKDTSNHFHVFVGDLSPEITTEDIKSAFAPFGKISDARVVKDMAT GKSKGYGFVSFYNKLDAENAIVHMGGQWLGGRQIRTNWATRKPPAPKSTQENNTKQLRFE DVVNQSSPKNCTVYCGGIASGLTDQLMRQTFSPFGQIMEIRVFPEKGYSFVRFSTHESAA HAIVSVNGTTIEGHVVKCYWGKESPDMTKNFQQVDYSQWGQWSQVYGNPQQYGQYMANGW QVPPYGVYGQPWNQQGFGVERFTPTAGQILQEIRDGLPYRRRGVRTLIVICAERKDVNTD TSYSTIFLSLKELTDTGLKGVQNESRNEPFKCDCSYCLYHSISQQLSPTLFCQFSPMVFG SFQKPIRSTKYCSRNVRENPPRRRAAPSSSSGARGDVQPRREPAEQEAAAVSRDGSAELL REPRGGSLAERGTAAVFFLVMVWEKKERPKSPNFGGDMGLQIPKAERRAGFSLRDALRLR CGRGWKDAERPGEHGWTRASDWAPGPGASGAFASAARSQKVGSRGWRNPRMEKGTSAVRA SEVLTLRRIRLGRHFGPA >gi568815588r:119475668_119688248|GENSCAN_predicted_CDS_3|2037_bp atggtggcagcaagagaaaatgaggatgatgcaaaagcggaaacccctgataaaaccatc agatctccccgccgacttcctggtcgtcgcacgtcctcacgtatgactacactacccaga agtctcctcttcacgtcccagcgcgggaccacaattcccagagacttcggcttcacgacg tttctctttttgcccgatctctcccggagctggctgggcttcggccggccagaggcccac agcgacgacgtgatccgtcgtgagcggcatacaagcaatgacccatattgctttgtggaa ttttatgaacacagagatgcagctgctgcattagctgctatgaatgggagaaaaattttg ggaaaggaggtcaaagtaaactgggcaaccacaccaagtagccagaaaaaagatacttcc aatcacttccatgtgtttgttggggatttgagtccagaaattacaacagaagatatcaaa tcagcatttgccccctttggtaaaatatcggatgcccgggtagttaaagacatggcaact ggaaaatccaaaggctatggttttgtatctttttataacaaactggatgcagaaaatgcg attgtgcatatgggcggtcagtggttgggtggtcgtcaaatccgaaccaattgggccact cgtaaaccacctgcacctaaaagtacacaagaaaacaacactaagcagttgagatttgaa gatgtagtaaaccagtcaagtccaaaaaattgtactgtgtactgtggaggaattgcgtct gggttaacagatcagcttatgagacagacattctcaccatttggacaaattatggaaata agagttttcccagaaaagggctattcatttgtcagattttcaacccatgaaagtgcagcc catgccattgtttcggtgaacggtactacgattgaaggacatgtggttaaatgctattgg ggtaaagaatctcctgatatgactaaaaacttccaacaggttgactatagtcaatggggc caatggagccaagtgtatggaaacccacaacagtatggacagtatatggcaaatgggtgg caagtaccgccttatggagtatacgggcaaccatggaatcaacaaggatttggagtagaa cggttcactccaacagcaggacagatcttacaggagatcagagatggcttgccctacagg aggagaggagtaaggacattgattgtgatctgtgcagaaagaaaggatgttaatacagat acctcttactcaacaatttttctcagcttgaaagaactaacagacactggacttaaggga gttcagaatgagtcaagaaatgagccattcaaatgtgactgcagttattgcctttatcac agcatttcacaacagttatcacccaccttattctgccaattttcaccaatggtttttggc agtttccaaaaaccaattaggtccacaaaatattgttctagaaatgtccgagagaatccg ccgcgccgccgggctgctccttcttcctcctcgggcgcccgcggcgatgttcaaccgcgc cgtgagccggctgagcaggaagcggccgccgtcagccgcgacgggagcgcagagctcctc cgggagccccggggaggaagtttggctgaaagaggaacagccgcagttttctttttagtg atggtctgggagaaaaaggagcgccccaaatctccaaactttggaggtgacatggggctc cagattccgaaagcggaacggcgcgcgggcttctccctccgtgacgctctccgcctccgc tgcgggcgtgggtggaaggatgccgagcgcccgggggagcacggctggacccgggcatcc gactgggcccctgggcctggggcgtcgggggccttcgcttctgccgcgaggagccaaaag gtgggatcgcgagggtggcggaaccccaggatggagaagggcacttctgcggtccgagcc agcgaagttctgacgttacgaaggattcgccttggccgtcactttgggccagcctga >gi568815588r:119475668_119688248|GENSCAN_predicted_peptide_4|85_aa MGPTFSSLWFSESHRMSAALQVLSTEAVIQETTAESAVSFMICLRSHNHLVWNILLVYLI QCGKIQYQKAMVMEPSWRLATTKGN >gi568815588r:119475668_119688248|GENSCAN_predicted_CDS_4|258_bp atggggcccacgttcagctccctgtggttctcagagtctcaccggatgtctgccgcgctg caggtgctcagcacagaggcagtgattcaagagaccacagcagaatctgcagtgtctttt atgatttgtcttcgaagtcacaaccatctcgtctggaatatcttattggtctaccttatt cagtgtgggaagatacaataccagaaggcgatggtcatggagccatcttggaggctggct accacaaaagggaactga >gi568815588r:119475668_119688248|GENSCAN_predicted_peptide_5|218_aa MGPLSSQRRVMGISQDNWHKRRKTGSKRKPYDKKRKYELGHLAANTKIGPHHIHTVRVRG GNNKYGALRRDMGNFSWGSECCTRKTRITDVVYDAPNSKLVRTKTLVENCFVLTDSTPYH QWYESHYALPLGCKKGAKLTPEEEKTLNKKRSKKIQKKYDEREKNAKISRLLGEQFQQGK LLACVASRLGQCGQAHVYVPGGKEMEFYLRKIKARKGK >gi568815588r:119475668_119688248|GENSCAN_predicted_CDS_5|657_bp atggggcctctttccagccagcgccgagtgatgggcatctctcaggacaactggcacaag cgccgcaagactggcagcaagagaaagccctacgacaagaagcggaagtatgagttgggg cacctggctgccaacaccaagattggcccccaccacatccacacagtccgtgtgcgggga ggtaacaataaatacggtgccctgaggcgggacatggggaatttctcctggggttcagag tgttgtactcgcaaaacaaggatcactgatgttgtctacgatgcgcccaatagcaagctg gtccgtaccaagaccctggtggagaactgcttcgtgctcactgacagcacaccgtaccac cagtggtatgagtcccactatgcgctgcccctgggctgcaagaagggagccaaactgact cctgaggaagaaaagactttaaacaaaaaacgatctaaaaaaattcagaagaaatacgat gaaagggaaaagaatgccaaaatcagccgtctcctgggggagcagttccagcagggcaag cttcttgcatgcgtcgcttcaaggctgggacagtgtggccaagcccatgtctatgtgcca gggggcaaggagatggagttctatcttaggaaaatcaaggcccggaaaggcaaataa >gi568815588r:119475668_119688248|GENSCAN_predicted_peptide_6|637_aa MSAATHSPMMQVASGNGDRDPLPPGWEIKIDPQTGWPFFVDHNSRTTTWNDPRVPSEGPK ASGRCVKEPQSVRLGALLSQEFVCRAGLFPRAAAQPVGEAQNVLSEMHEVLSEMHEVLSE MHEVLSSWLAQGLSGSRGFPQEELHLLKACLRTPSPGRETPSSANGPSREGSRLPPAREG HPVYPQLRPGYIPIPVLHEGAENRQPGMQRFRTEAAAAAPQRSQSPLRGMPETTQPDKQC GQVAAAAAAQPPASHGPERSQSPAASDCSSSSSSASLPSSGRSSLGSHQLPRGYISIPVI HEQNVTRPAAQPSFHQAQKTHYPAQQGEYQTHQPVYHKIQGDDWEPRPLRAASPFRSSVQ GASSREGSPARSSTPLHSPSPIRVHTVVDRPQQPMTHRETAPVSQPENKPESKPGPVGPE LPPGHIPIQVIRKEVDSKPVSQKPPPPSEKSVATEERAAPSTAPAEATPPKPGEAEAPPK HPGVLKVEAILEKVQGLEQAVDNFEGKKTDKKYLMIEEYLTKELLALDSVDPEGRADVRQ ARRDGVRKVQTILEKLEQKAIDVPGQVQVYELQPSNLEADQPLQAIMEMGAVAADKGKKN AGNAEDPHTETQQPEATAAATSNPSSMTDTPGNPAAP >gi568815588r:119475668_119688248|GENSCAN_predicted_CDS_6|1914_bp atgagcgccgccacccactcgcccatgatgcaggtggcgtccggcaacggtgaccgcgac cctttgccccccggatgggagatcaagatcgacccgcagaccggctggcccttcttcgtg gaccacaacagccgcaccactacgtggaacgacccgcgcgtgccctctgagggccccaag gcctcagggagatgtgtcaaggaaccacagtcagtgcggctgggcgccctgctcagccag gagtttgtctgcagagcaggcctcttccccagagcagcggcccagcctgttggagaggca caaaatgtgctcagcgagatgcacgaggtgctcagcgagatgcacgaggtgctcagcgag atgcatgaggtgctcagctcgtggctggctcaagggctttctggatccaggggcttcccc caggaggagctgcatctcctcaaggcctgtctgaggactccaagccctggaagggagact ccatcctctgccaatggcccttcccgggagggctctaggctgccgcctgctagggaaggc caccctgtgtacccccagctccgaccaggctacattcccattcctgtgctccatgaaggc gctgagaaccggcagcctgggatgcagcgattccgaactgaggcggcagcagcggctcct cagaggtcccagtcacctctgcggggcatgccagaaaccactcagccagataaacagtgt ggacaggtggcagcggcggcggcagcccagcccccagcctcccacggacctgagcggtcc cagtctccagctgcctctgactgctcatcctcatcctcctcggccagcctgccttcctcc ggcaggagcagcctgggcagtcaccagctcccgcgggggtacatctccattccggtgata cacgagcagaacgttacccggccagcagcccagccctccttccaccaagcccagaagacg cactacccagcgcagcagggggagtaccagacccaccagcctgtgtaccacaagatccag ggggatgactgggagccccggcccctgcgggcggcatccccgttcaggtcatctgtccag ggtgcatcgagccgggagggctcaccagccaggagcagcacgccactccactccccctcg cccatccgtgtgcacaccgtggtcgacaggcctcagcagcccatgacccatcgagaaact gcacctgtttcccagcctgaaaacaaaccagaaagtaagccaggcccagttggaccagaa ctccctcctggacacatcccaattcaagtgatccgcaaagaggtggattctaaacctgtt tcccagaagcccccacctccctctgagaagagtgtggctacagaagagagggcagccccc agcactgcccctgcagaagctacacctccaaaaccaggagaagccgaggctcccccaaaa catccaggagtgctgaaagtggaagccatcctggagaaggtacaggggctggagcaggct gtagacaactttgaaggcaagaagactgacaaaaagtacctgatgatcgaagagtatttg accaaagagctgctggccctggattcagtggaccccgagggacgagccgatgtgcgtcag gccaggagagacggtgtcaggaaggttcagaccatcttggaaaaacttgaacagaaagcc attgatgtcccaggtcaagtccaggtctatgaactccagcccagcaaccttgaagcagat cagccactgcaggcaatcatggagatgggtgccgtggcagcagacaagggcaagaaaaat gctggaaatgcagaagatccccacacagaaacccagcagccagaagccacagcagcagcg acttcaaaccccagcagcatgacagacacccctggtaacccagcagcaccgtag >gi568815588r:119475668_119688248|GENSCAN_predicted_peptide_7|117_aa METDPKVPDRAWSSGTLFLLNAKLIYAYRAHRKDQDRNNVNRKNMLPRMRLQYFTLYEKG KMKALPGGSNDQNREYPTSFMLGGLELPPPCRMGKERGLDPDHKRRFLDLEQEGIQX >gi568815588r:119475668_119688248|GENSCAN_predicted_CDS_7|351_bp atggaaacggatccaaaagtgccggatcgagcttggagctctggtaccctctttctcctg aatgccaaacttatctatgcctacagggcacacagaaaggaccaggatcgaaataatgtt aacaggaaaaacatgcttcctcgtatgagactgcagtatttcaccttgtatgagaaagga aaaatgaaggctcttcctggtggcagtaatgatcagaaccgggagtacccaacttctttc atgcttgggggactagagctgccccctccatgcagaatgggcaaagaaaggggtcttgat ccagaccacaagagaaggttcttggatctggagcaagaaggaattcaggnn