GENSCAN 1.0 Date run: 4-Nov-116 Time: 15:46:40 Sequence gi568815581f:58592644_58834219 : 241576 bp : 42.76% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.18 Intr - 1018 912 107 1 2 50 80 98 0.045 3.29 1.17 Intr - 7023 6110 914 1 2 84 91 751 0.093 65.05 1.16 Intr - 9313 9163 151 1 1 105 47 190 0.917 15.41 1.15 Intr - 9947 9742 206 0 2 115 -73 145 0.494 -0.90 1.14 Intr - 12486 12335 152 2 2 80 76 100 0.843 6.89 1.13 Intr - 18696 18518 179 0 2 76 86 96 0.425 6.00 1.12 Intr - 20901 20778 124 2 1 106 3 100 0.298 2.97 1.11 Intr - 22702 22589 114 0 0 100 94 69 0.371 7.34 1.10 Intr - 23662 23532 131 1 2 57 108 141 0.916 11.57 1.09 Intr - 24976 24895 82 0 1 77 116 31 0.805 3.52 1.08 Intr - 29143 29007 137 1 2 57 84 142 0.378 9.15 1.07 Intr - 30369 30204 166 2 1 92 110 235 0.959 25.14 1.06 Intr - 37911 37797 115 1 1 96 89 78 0.969 7.29 1.05 Intr - 39551 39440 112 2 1 34 12 191 0.842 5.13 1.04 Intr - 55469 55349 121 1 1 84 55 94 0.816 5.28 1.03 Intr - 56032 55866 167 1 2 47 80 55 0.638 -1.56 1.02 Intr - 59359 59143 217 0 1 77 86 173 0.470 13.58 1.01 Init - 62231 62158 74 2 2 68 80 26 0.319 0.39 1.00 Prom - 66896 66857 40 -4.85 2.03 PlyA - 67800 67795 6 1.05 2.02 Term - 69051 67934 1118 1 2 69 33 1261 0.457 109.90 2.01 Init - 75645 75555 91 0 1 81 78 69 0.200 5.90 2.00 Prom - 88804 88765 40 -3.65 3.00 Prom + 94072 94111 40 -5.05 3.01 Init + 100001 100145 145 1 1 62 92 117 0.894 9.98 3.02 Intr + 100229 100316 88 0 1 73 25 107 0.338 1.01 3.03 Intr + 104050 104216 167 1 2 52 110 175 0.654 14.78 3.04 Intr + 110553 110686 134 1 2 84 86 78 0.995 6.64 3.05 Intr + 113846 113965 120 1 0 85 95 17 0.781 1.87 3.06 Intr + 117216 117347 132 2 0 85 87 84 0.836 7.92 3.07 Intr + 128103 128169 67 2 1 112 80 31 0.232 2.26 3.08 Term + 148953 149056 104 1 2 65 42 124 0.507 2.96 3.09 PlyA + 149167 149172 6 1.05 4.00 Prom + 162052 162091 40 -3.55 4.01 Sngl + 163355 163819 465 1 0 104 33 836 0.999 75.59 4.02 PlyA + 165365 165370 6 1.05 5.03 PlyA - 165385 165380 6 1.05 5.02 Term - 199003 198830 174 1 0 50 53 132 0.308 2.68 5.01 Intr - 230762 230711 52 1 1 58 108 42 0.137 1.19 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 7023 6101 923 1 2 84 48 758 0.902 62.76 S.002 Init + 85887 85947 61 2 1 83 50 43 0.900 1.46 S.003 Term + 86811 87043 233 1 2 36 42 159 0.934 1.75 S.004 Sngl - 174571 174404 168 1 0 61 54 146 0.844 3.01 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:58592644_58834219|GENSCAN_predicted_peptide_1|1090_aa MRSSFLFSQSNNEIQTGKEGSLLARMSRAVRLPVPCPVQLGTLRNDSLEAQLHEYVKQGN YVKVKKILKKGKHHVEMADAPWYTFLNSVQRKRNVIKAREMRPESQPAVQLKYVSLPQAA TRVTHQEAAVFPGVEEARRLLSVSYLEQSLMGGSGVATNSVEIRGRVEKQGGELTEHCCI LQCGGGKKLESAWGLLASDHQSKPGSFPDGFWRSGALYSRQRYAWRKMGRGIYVDAVNSL GQTALFVAALLGLRKFVDVLVDYGSDPNHRCFDGSTPVHAAAFSGNQWILSKLLDAGGDL RLHDERGQNPKTWALTAGKERSTQIVEFMQRCASHMQAIIQGFSYDLLKKIDSPQRLVYS PSWCGGLVQGNPNGSPNRLLKAGVISAQNIYSFGFGKFYLTGATQMAYLGSLPVIGEKEV IQADDEPTFSFFSGPYMVMTNLVWNGSRVTVKELNLPTHPHCSRLRLADLLIAEQEHSSK LRHPYLLQLMAVCLSQDLEKTRLVYERITIGTLFSVLHERRSQFPVLHMEVIVHLLLQIS DALRYLHFQGFIHRSLSSYAVHIISPGEARLTNLEYMLESEDRGVQRDLTRVPLPTQLYN WAAPEVILQKAATVKSDIYSFSMIMQEILTDDIPWKGLDGSVVKKAVVSGNYLEADVRLP KPYYDIVKSGIHVKQKDRTMNLQDIRYILKNDLKASPEVDFTGAQRTQPTESPRVQRYGL HPDVNVYLGLTSEHPRETPDMEIIELKEMGSQPHSPRVHSLFTEGTLDPQAPDPCLMARE TQNQDAPCPAPFMAEEASSPSTGQPSLCSFEINEIYSGCLILEDDIEEPPGAASSLEADG PNQVDELKSMEEELDKMEREACCFGSEDESSSKAETEYSFDDWDWQNGSLSSLSLPESTR EAKSNLNNMSTTEEYLISKCVLDLKIMQTIMHENDDRLRNIEQILDEVEMKQKEQEERMS LWATSREFTNAYKLPLAVGPPSLNYIPPVLQLSGGQKPDTSGNYPTLPRFPRMVRDFQGG HFGKRSRKRNGCGWKPFTQVSKVNTGDQSEISHQLPTLCDPGKQNTDEQFQCTQGAKDSL ETSRIQNTSS >gi568815581f:58592644_58834219|GENSCAN_predicted_CDS_1|3270_bp atgaggtccagcttcttgttctcacagtccaataatgagatccagactgggaaagaaggg agtttattggccaggatgtctcgggctgttcgtcttccagtcccctgtcctgttcaactt ggtaccttaagaaatgactccctggaagctcagcttcatgagtatgtcaaacaagggaac tatgtgaaagtgaagaaaattcttaagaaaggtaagcaccatgttgagatggcagatgca ccttggtacacatttctgaatagcgttcagaggaaaaggaatgtcataaaggcaagagag atgagacctgagtcccagcctgctgtacagttgaaatatgtcagcctaccccaagctgcc accagagtgacacaccaagaagctgcagtgttccctggagttgaggaggcaagaagactg ctatctgtctcctacttggagcagtccttgatgggtggctctggagttgccacaaacagc gttgagatcagaggaagagtggagaagcaaggaggagaattaacagagcattgctgcatc ctgcagtgcggaggagggaagaaattggaatcggcctggggccttcttgcttccgatcat cagtccaaaccgggcagtttccctgacggtttctggcgctccggcgcgctttatagtcgt cagcgttacgcgtggaggaaaatggggcggggaatttatgttgatgcagttaactccttg ggccaaacagcactttttgttgcggcgttattgggccttaggaaattcgttgatgttctg gtggattatggatcagatccaaatcaccgctgctttgatgggagcacccctgtccatgca gcagcattttcgggcaatcagtggatccttagcaaactgctggatgcaggaggtgacctg cgactccacgatgagaggggtcaaaacccgaagacttgggctttgacagcaggaaaggag cgtagcacccagatagtggagttcatgcagcgctgtgcctcacacatgcaggccatcatc cagggcttctcttacgacctcctgaagaagatagactccccgcagcggcttgtctacagc ccgtcctggtgtgggggcctcgtgcagggaaaccctaatggctctcctaaccgactgctt aaagctggagtcatttctgctcaaaatatctacagctttggttttgggaagttttatctt actggggcgacacagatggcctatctaggatctcttccggtcattggagaaaaggaagtg attcaagctgatgatgagcccaccttctctttcttcagcggcccctacatggtcatgacc aacctagtgtggaatgggagcagggtcacagtgaaagagctgaatctccccacccaccca cactgcagcaggctgcggctggccgacttgttaattgccgagcaggaacacagcagcaag ctgcggcacccctacttgctacagttgatggctgtgtgtctctcccaggacctagagaaa acccgccttgtgtacgagcgcatcactatcggcacattgttcagtgtccttcatgaacga cggtcccagttcccagtgctgcacatggaggtgattgtgcacctgctgctccagatatct gatgccctgagatacctgcatttccaggggtttatccaccgctccctcagctcctatgct gtccatatcatctccccaggtgaagcgaggctgaccaacctggagtacatgttggaaagc gaggacagaggtgtacagagggacctgactcgagtgccccttcctacgcagctatacaac tgggccgcaccagaagtgatcttacagaaggcagccacagtgaaatcagacatctacagc ttttctatgatcatgcaggagattttaacagatgacataccctggaagggcttagatggc tcagttgttaaaaaagccgtagtctcggggaattatttagaagctgatgtcaggcttccg aaaccttactatgatattgttaagtcaggcatccacgtcaagcagaaagaccgaactatg aaccttcaagatatccggtatattctgaagaatgacttaaaggcaagccctgaagttgat tttactggagcccagagaactcaaccaaccgagagccccagagtgcagagatacggactc catcccgatgtcaatgtctatctaggactgacttcagaacaccccagagagacacctgac atggaaatcatagaactaaaggaaatgggcagtcaacctcattcaccaagggttcactct ttattcactgaggggacactagatcctcaggccccagatccatgtctgatggccagggag actcagaatcaagatgctccttgccctgctccatttatggcagaagaggccagcagcccc agcacaggtcagccaagcctctgcagtttcgaaatcaacgagatctactcaggctgcttg attttggaagatgacatagaagagcctccaggagctgcttcatctttggaggcagacgga cctaaccaggtagatgaactgaaatccatggaagaagagctggataagatggagagagag gcgtgttgttttggcagtgaggatgagagctcttcaaaagctgagacagagtactctttt gatgactgggactggcaaaacggttcactcagttcactcagccttcctgagtcaaccaga gaagccaagagcaatttgaacaacatgtccacgactgaggagtatctcatcagtaagtgt gtgctggatctaaagattatgcagacaataatgcacgagaatgatgataggctgaggaat atcgagcagatattagatgaagtcgagatgaaacagaaggaacaggaagagcgcatgtct ttatgggccacttcaagagagtttacaaatgcctacaagttacctctggccgtgggccct ccatctttaaactatattcctcctgtcctacagctttcagggggtcagaagccagacacc agtggcaactacccaaccctaccaagatttccaagaatggtaagagactttcaaggggga cattttggcaaaagatccaggaaaagaaatggttgtggctggaaacctttcacccaagtc tccaaagtaaacactggggatcagtcagagataagccatcagctgccgactctttgtgac cctggaaaacagaacacagatgaacaatttcagtgcactcaaggagccaaggacagtttg gaaacaagcaggatccaaaataccagtagn >gi568815581f:58592644_58834219|GENSCAN_predicted_peptide_2|402_aa MGSCSGGERLGTSPKYSMGKWEFVAKEQCGAPDSDWLLEIYARIFTCVQKGKESTFRISL PKMAAAEDEFLPPPRLPELFDSSKQLLDEVEGATEPTGSRIVQEKVFKGLDLLDKVAKML SQLDLFSRNEDLEEITSTDLKYLMVPAFQGALTMKQVNPRKRLDHLQQAREHFIKYLTQC HYYRVAEFELPQTKTNSAENHGAITSTAYPSLVAMASQRQAKIERYKQKKVLEHKLSTMK SAVESGQADNERVREYYLLHLQRWIDISLEEIESIDQEIKILGEKDSSREASTSNSCHQK RPPMKPFILTRNMAQAKVFGAGYPSLASMTVSDWYDQHQKHGVLPDQGIAKATPEEFRKA TQQQEDQEKEEEDDEQTLQRAREWDDWKDTHPRGYGNRQNMG >gi568815581f:58592644_58834219|GENSCAN_predicted_CDS_2|1209_bp atggggtcttgtagtggaggagaaagactgggcacaagtccaaagtacagcatgggcaaa tgggaatttgtagccaaggagcagtgtggagcgcccgattcagactggttgttagaaatc tacgccagaatctttacgtgtgtccagaaaggaaaggaatcaacatttcgcatcagcctt cccaagatggcagctgctgaagacgagttcctgccgccgccgcggctccccgagctgttc gattccagcaaacagcttctggacgaagtcgaaggagcgactgaacccaccggttcccga atagtccaggaaaaggtgttcaagggcctcgacctccttgacaaggttgccaaaatgtta tcgcagcttgacttgttcagccgaaatgaagatttggaggagattacttccaccgacctg aagtacctgatggtgccagcgtttcaaggagccctcaccatgaaacaagtcaacccccgt aagcgtctagatcatttgcagcaggctcgcgaacactttataaaatacttaactcagtgc cattactatcgtgtggccgagtttgagctgccccaaaccaagaccaactcagctgaaaat cacggtgctattacctccacggcttatcctagcctcgttgctatggcatctcaaagacag gctaaaatagagcgatacaagcagaagaaggtgttggagcataagttgtctacaatgaaa tctgctgtggaaagtggtcaagcagataatgagcgtgttcgtgaatattatcttcttcac cttcagaggtggattgatatcagcttagaagagattgagagcattgatcaggaaataaag atcctgggagagaaagactcttcaagagaggcatccacttctaactcatgtcaccagaag aggcctccaatgaaacccttcattctcactcggaacatggcgcaagccaaagtatttggc gctggctatccaagtctggcttccatgacagtgagtgactggtatgatcaacatcagaaa catggagtgttaccagatcagggaatagccaaggcaacaccagaagaattcagaaaagcc actcagcaacaggaagatcaagaaaaggaggaagaggatgatgaacaaacactccaaaga gctcgagagtgggatgactggaaggacacccaccctaggggctacggcaaccgacagaac atgggctaa >gi568815581f:58592644_58834219|GENSCAN_predicted_peptide_3|318_aa MRGKTFRFEMQRDLVSFPLSPAVRVKLVSAGFQTAEELLEVKPSELSKAQSPLDSASSHV HVYSVKELLDSTYKLSECMQLAVDVQIPECFGGVAGEAVFIDTEGSFMVDRVVDLATACI QHLQLIAEKHKGEEHRKALEDFTLDNILSHIYYFRCRDYTELLAQVYLLPDFLSEHSKAH CSSRITLLLCGVALPLPLSARAPPSETKDDIALPPGLMVRLVIVDGIAFPFRHDLDDLSL RTRLLNGLAQQMISLANNHRLAVILTNQMTTKIDRNQALLVPALAVALDSHSPNRIVNCA YKESRLHAPYDNLMPDDL >gi568815581f:58592644_58834219|GENSCAN_predicted_CDS_3|957_bp atgcgcgggaagacgttccgctttgaaatgcagcgggatttggtgagtttcccgctgtct ccagcggtgcgggtgaagctggtgtctgcggggttccagactgctgaggaactcctagag gtgaaaccctccgagcttagcaaagcccagtctccgttagattctgcttcctcccacgtc catgtttacagcgtgaaagagctcctcgactccacttacaagttgtctgaatgtatgcag ttggcagtagatgtgcagataccagaatgttttggaggagtggcaggtgaagcagttttt attgatacagagggaagttttatggttgatagagtggtagaccttgctactgcctgcatt cagcaccttcagcttatagcagaaaaacacaagggagaggaacaccgaaaagctttggag gatttcactcttgataatattctttctcatatttattattttcgctgtcgtgactacaca gagttactggcacaagtttatcttcttccagatttcctttcagaacactcaaaggcccac tgcagtagcagaatcaccctactgctctgtggggtggccctgccccttcctctctctgca agggccccaccctctgaaaccaaggacgacatagcattgccccctgggcttatggttcga ctagtgatagtggatggtattgcttttccatttcgtcatgacctagatgacctgtctctt cgtactcggttattaaatggcctagcccagcaaatgatcagccttgcaaataatcacaga ttagctgtaattttaaccaatcagatgacaacaaagattgatagaaatcaggccttgctt gttcctgcattagcagtggcattagattctcatagcccgaaccgtatcgtgaactgtgca tacaaggaatctaggttgcatgctccttatgataatctaatgcctgatgatctgtga >gi568815581f:58592644_58834219|GENSCAN_predicted_peptide_4|154_aa MAGCIPEEKTYRRFLELFLGEFRGPCGGGEPEPEPEPEPEPEPESEPEPEPELVEAEAAE ASVEEPGEEAATVAATEEGDQEQDPEPEEEAAVEGEEEEEGAATAAAAPGHSAVPPPPPQ LPPLPPLPRPLSERITREEVEGESLDLCLQQLYK >gi568815581f:58592644_58834219|GENSCAN_predicted_CDS_4|465_bp atggccggctgcatccctgaggagaaaacttaccggcgcttcctggagctattcctgggc gagtttcgcggaccgtgcggcggcggcgagccggagccggaacccgaacccgaacccgaa cccgaacccgagtccgagcccgagcccgaacctgaactggtagaagctgaggcggccgag gcttcggtagaggaacccggggaggaggcggccacggtagccgcgacggaggagggggac caggagcaagacccggagcccgaggaggaggcggcggttgagggtgaggaggaggaggag ggcgcggcgacggcggcggcagccccggggcactcggccgtgccgccgccgccgccccag ctgccgcctttgcccccgctcccgcgaccgctgtcagagcgcatcacccgcgaggaggtg gagggcgaaagcctggacctgtgcctgcagcagctctacaaatag >gi568815581f:58592644_58834219|GENSCAN_predicted_peptide_5|75_aa XEQRYHNPEKPFLCLEFQVFSPENLEFCAWVGCGIPIQVDKEESFAVNKVILNLFGVTDF SETLIKGLRYYFRRY >gi568815581f:58592644_58834219|GENSCAN_predicted_CDS_5|228_bp nnagaacagcgttatcacaatcctgagaaaccattcctgtgcctagaatttcaagtattc agtccagaaaacctggaattttgtgcttgggtaggatgtggtattcctatacaagtggat aaagaggaaagttttgctgttaataaagtaattcttaacctttttggagttacggacttc tctgagacactaataaaaggtcttcgttattacttcagaagatattga