GENSCAN 1.0 Date run: 4-Nov-116 Time: 22:28:55 Sequence gi568815584r:77618091_77855122 : 237032 bp : 43.97% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 26360 26502 143 1 2 49 44 92 0.223 0.51 1.02 Term + 34842 35046 205 0 1 57 38 142 0.218 2.94 1.03 PlyA + 38530 38535 6 1.05 2.07 PlyA - 40270 40265 6 1.05 2.06 Term - 56151 55722 430 2 1 60 53 171 0.700 5.57 2.05 Intr - 57704 57566 139 0 1 74 109 143 0.982 14.52 2.04 Intr - 61880 61795 86 1 2 102 70 50 0.959 4.06 2.03 Intr - 76777 76648 130 2 1 84 107 73 0.837 8.55 2.02 Intr - 86387 86279 109 2 1 110 105 92 0.953 12.96 2.01 Init - 89914 89732 183 1 0 89 87 336 0.667 32.72 2.00 Prom - 91314 91275 40 -4.86 3.17 PlyA - 91517 91512 6 1.05 3.16 Term - 100196 99998 199 1 1 69 48 196 0.999 10.57 3.15 Intr - 100440 100277 164 0 2 98 50 130 0.984 8.97 3.14 Intr - 102738 102621 118 2 1 55 82 154 0.895 11.97 3.13 Intr - 105214 105091 124 2 1 13 92 69 0.532 -0.56 3.12 Intr - 113039 112898 142 2 1 42 86 61 0.775 1.23 3.11 Intr - 114511 114395 117 1 0 101 81 133 0.987 14.56 3.10 Intr - 116922 116857 66 0 0 122 86 15 0.933 3.80 3.09 Intr - 117916 117847 70 0 1 91 61 10 0.994 -2.22 3.08 Intr - 118985 118881 105 1 0 91 108 61 0.989 7.73 3.07 Intr - 120794 120688 107 2 2 91 76 69 0.998 5.01 3.06 Intr - 120971 120876 96 2 0 65 78 100 0.932 6.91 3.05 Intr - 133390 133229 162 1 0 53 84 203 0.977 16.57 3.04 Intr - 137030 136877 154 1 1 93 73 21 0.669 1.17 3.03 Intr - 143193 143024 170 0 2 21 94 77 0.002 0.34 3.02 Intr - 170672 170482 191 0 2 92 72 82 0.338 6.30 3.01 Init - 182243 182147 97 2 1 72 50 88 0.135 1.88 3.00 Prom - 187505 187466 40 -3.86 4.00 Prom + 198158 198197 40 -7.26 4.01 Init + 200889 201023 135 2 0 113 90 171 0.769 17.97 4.02 Intr + 204345 204428 84 2 0 83 115 109 0.999 13.12 4.03 Intr + 207786 207938 153 2 0 110 103 38 0.926 7.77 4.04 Intr + 213023 213137 115 1 1 31 72 45 0.271 -2.78 4.05 Intr + 216583 216707 125 2 2 40 110 93 0.475 7.00 4.06 Term + 226948 226956 9 0 0 114 54 0 0.020 -2.61 4.07 PlyA + 229990 229995 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 69727 69616 112 0 1 103 44 80 0.801 3.13 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:77618091_77855122|GENSCAN_predicted_peptide_1|115_aa MLYNAHTINYENSLGITVDTVSFSQAPDLPPVILANTEHLLCAKHYANHSKLIESCDPRR YNEFSNKMKAQVLGQKATHRVSSKHRKPWQLLSLIGMPALTTTSFLADGKMDGCV >gi568815584r:77618091_77855122|GENSCAN_predicted_CDS_1|348_bp atgctgtacaatgcacacaccataaattatgagaacagtctgggaattacggttgacact gtcagcttctcgcaagcccctgatcttcctcctgttatattagctaacactgaacactta ctctgtgccaagcactatgctaaccacagcaaacttatagagagctgcgacccccgcaga tacaatgagttctcaaacaagatgaaagcacaggtgcttggccagaaggccacgcatcgg gtttcgtcaaagcaccggaagccatggcagttactgtccttgataggtatgccagcctta acaacaacgtccttcttggctgatgggaaaatggacggctgtgtttga >gi568815584r:77618091_77855122|GENSCAN_predicted_peptide_2|358_aa MGKMAAAVGSVATLATEPGEDAFRKLFRFYRQSRPGTADLEGVIDFSAAHAARGKGPGAQ KVIKSQLNVSSVSEQNAYRAGLQPVSKWQAYGLKGYPGYQWHWVKQCLKLYSQKPNVCNL DKHMSKEETQDLWEQSKEFLRYKEATKRRPRSLLEKLRWVTVGYHYNWDKQVAAACGFED FRAEAGILNYYRLDSTLGIHVDRSELDHSKPLLSFSFGQSAIFLLGGLQRDEAPTAMFMH SGDIMIMSGFSRLLNHAVPRVLPNPEGEGLPHCLEAPLPAVLPRDSMVEPCSMEDWQVCA SYLKTARVNMTVRQVLATDQNFPLEPIEDEKRDISTEGFCHLDDQNSEVKRARINPDS >gi568815584r:77618091_77855122|GENSCAN_predicted_CDS_2|1077_bp atggggaagatggcagcggccgtgggctctgtggcgactctggcgactgagcccggggag gacgcctttcggaaacttttccgcttctaccgtcagagccggcccgggaccgcagacctg gaaggggtcatcgacttctcggcggcccacgcagcccgtggcaagggtcctggtgcccaa aaggtgatcaaatctcagctaaatgtgtcttctgtcagtgagcagaatgcatatagagca ggtcttcagcccgtcagcaagtggcaagcctatggactcaaaggctatcctggttaccag tggcactgggtgaaacagtgccttaagttatattcccagaaacctaatgtatgtaacctg gacaaacacatgtctaaagaagagacccaagatctgtgggaacagagcaaagagttcctg aggtataaagaagcgactaaacggagaccccgaagtttactggagaaactgcgttgggtg accgtaggctaccattataactgggacaagcaagtagccgctgcctgtggatttgaggat ttccgagctgaagcagggatcctgaattactaccgcctggactccacactgggaatccac gtagacagatctgagctagatcactccaaacccttgctgtcattcagctttggacagtcc gccatctttctcctgggtggtcttcaaagggatgaggcccccacggccatgtttatgcac agtggtgacatcatgataatgtcgggtttcagccgcctcttgaaccacgcagtccctcgt gtccttccaaatccagaaggggaaggcctgcctcactgcctagaggcacctctccctgct gtcctcccgagagattcaatggtagagccttgttctatggaggactggcaggtgtgtgcc agctacttgaagaccgctcgtgttaacatgactgtccgacaggtcctggccacagaccag aatttccctctagaacccatcgaggatgaaaaaagagacatcagtacagaaggtttctgc catctggatgaccagaatagcgaagtaaaacgggccaggataaaccctgacagctga >gi568815584r:77618091_77855122|GENSCAN_predicted_peptide_3|693_aa MGLSLPAPRAAAQPEQRPTPGVSMPVPSRSRRGDLWQCLETFVIVMTWEMLLASSGQRSS VLLDILQCTELAFTTKNCPVPNVNSITVEKPWSIYQNTNGTPVQRTTNPRRPFRPERLRI AHSFLSAPHHLERPAQRCCRSRWKKRKKMALTSFLPAPTQLSQDQLEAEEKARSQRSRQT SLVSSRREPPPYGYRKGWIPRLLEDFGDGGAFPEIHVAQYPLDMGRKKKMSNALAIQVDS EGKIKYDAIARQGQSKDKVIYSKYTDLVPKEVMNADDPDLQRPDEEAIKEITEKTRVALE KSVSQKVAAAMPVRAADKLAPAQYIRYTPSQQGVAFNSGAKQRVIRMVEMQKDPMEPPRF KINKKIPRGPPSPPAPVMHSPSRKMTVKEQQEWKIPPCISNWKNAKGYTIPLDKRLAADG RGLQTVHINENFAKLAEALYIADRKAREAVEMRAQVERKMAQKEKEKHEEKLREMAQKAR ERRAGIKTHVEKVKYFSTVHTEDGEARERDEIRHDRRKERQHDRNLSRAAPDKRSKLQRN ENRDISEVIALGVPNPRTSNEVQYDQRLFNQSKGMDSGFAGGEDEIYNVYDQAWRGGKDM AQSIYRPSKNLDKDMYGDDLEARIKTNRFVPDKEFSGSDRRQRGREGPVQFEEDPFGLDK FLEEAKQHGGSKRPSDSSRPKEHEHEGKKRRKE >gi568815584r:77618091_77855122|GENSCAN_predicted_CDS_3|2082_bp atggggctgtccctcccggcgcccagggcggctgcacaaccggagcagaggccgactccc ggcgtgtccatgccggttccctcgcgctccaggcgcggggatctttggcaatgtctggag acatttgtgattgtcatgacttgggagatgctactggcatctagtgggcagaggtcaagt gtgttgttggacatcttacaatgcacagaactggccttcacaacaaagaactgtccagtt ccaaatgtcaacagtatcactgtggagaaaccctggtcaatctatcagaatacaaacggt actcctgttcaaagaactacaaatcccagaaggccttttcggccagagcgcctgcgcatc gcgcactccttcctttccgctcctcatcatctggaaagacccgcccagcggtgctgtcgc tcgcgctggaagaagcggaagaagatggcgctcaccagctttttacctgcacctactcag ctatctcaggaccagcttgaggctgaagaaaaggcaagatcccagagatcacggcagacc tcactggtctcctcccgaagagaacctcccccgtacggataccggaaaggctggatacct cggttattagaggattttggagatggaggtgcttttccagagatccatgtggcccagtat ccactggatatgggacgaaagaaaaaaatgtcgaatgcgctggccattcaggtggattct gaaggaaaaattaaatatgatgcaattgctcgacaaggacagtcaaaagacaaggtcatt tatagcaaatacactgacctggttccaaaggaggttatgaatgcagatgatccagacctg caaaggcccgatgaagaagctattaaagagataacagaaaagacaagagtagccttagaa aaatctgtatcacagaaggtcgccgcagccatgccagttcgagcagctgacaaattggct cctgctcagtatatccgatacacaccatctcagcaaggagtggcattcaactctggagct aaacagagggttattcggatggtagaaatgcagaaagatccaatggagcctccaaggttc aagattaataagaaaattccccggggaccaccttctcctcctgcgcctgtcatgcattct cctagccgaaagatgactgtaaaggaacaacaagagtggaagattcctccttgtatttct aactggaaaaatgcaaagggttatacaattccattagacaaacgtctggctgctgatgga agaggactacagacagtacacataaatgaaaatttcgccaaattggcagaagccctctac attgctgatcggaaggctcgtgaagctgtggaaatgcgtgcccaagtagagagaaaaatg gctcagaaagaaaaggaaaaacatgaagagaaacttagagaaatggcccagaaagccagg gagagaagagctgggatcaaaactcatgtggaaaaagtgaagtacttttcaactgttcac acagaggatggggaggcacgtgagagggatgaaatccggcatgacaggcgaaaagagaga cagcatgaccggaatctttccagggcagctcctgataagaggtcgaaacttcagagaaat gaaaatcgggatatcagtgaagttattgctctcggtgttcctaatcctcggacttccaat gaagttcagtatgaccaaaggctcttcaaccaatccaagggtatggacagtggatttgca ggtggagaagatgaaatttataatgtttatgatcaagcctggagaggtggtaaagatatg gcccagagtatttataggcccagtaaaaatctggacaaggacatgtatggtgatgaccta gaagccagaataaagaccaacagatttgttcccgacaaggagttttctggttcagaccgt agacagagaggccgagaaggaccagtgcagtttgaggaagatccttttggtttggacaag tttttggaagaagccaaacagcatggtggctctaaaagaccctcagatagcagccgcccc aaggaacacgagcatgaaggcaagaagaggaggaaggaatag >gi568815584r:77618091_77855122|GENSCAN_predicted_peptide_4|206_aa MARKALKLASWTSMALAASGIYFYSNKYLDPNDFGAVRVGRAVATTAVISYDYLTSLKSV PYGSEEYLQLRSKLTGAMGLGIHWCNILRRACRLGHRGKASALRNLQRRQAMGVKKTVAC HVSHADMAGQASQLGDCLLEEEGAENQCFADKQIPNREERGRAHSSSRVHVCSSLTYVGC LDTWVGFLLLQSCPPVAPTTWFYRMS >gi568815584r:77618091_77855122|GENSCAN_predicted_CDS_4|621_bp atggccagaaaggctctcaagcttgcttcgtggaccagcatggctcttgctgcctctggc atctacttctacagtaacaagtacttggaccctaatgactttggcgctgtcagggtgggc agagcagttgctacgacggctgtcatcagttacgactacctcacttccctgaagagtgtc ccttatggctcagaggagtacttgcagctgagatctaagctaaccggtgctatgggcctt ggcatccactggtgtaacattctacggagagcttgcaggctcggacacaggggcaaggcc tctgctttacggaatctgcagaggaggcaggcgatgggtgtcaaaaagacagttgcctgt cacgtgtcacatgcagacatggctggacaagcctctcagttgggggactgcctcttagag gaggaaggagctgaaaatcaatgctttgcagataagcagatcccaaacagagaggagaga ggaagggcacacagttcctctcgtgtgcatgtctgcagctccctcacctatgtgggctgc ctggacacatgggtgggcttcctgctacttcagagctgccctccagttgcccctaccacc tggttctacaggatgagctga