GENSCAN 1.0 Date run: 8-Nov-116 Time: 15:08:27 Sequence gi568815597f:170432149_170644846 : 212698 bp : 37.68% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1268 1343 76 1 1 95 78 57 0.429 6.70 1.02 Term + 2436 2587 152 1 2 26 49 130 0.549 -0.01 1.03 PlyA + 5185 5190 6 1.05 2.04 PlyA - 8238 8233 6 1.05 2.03 Term - 21624 21023 602 1 2 73 38 136 0.686 0.80 2.02 Intr - 22874 22726 149 1 2 21 84 56 0.612 -2.54 2.01 Init - 25109 24877 233 2 2 50 113 186 0.650 15.08 2.00 Prom - 28538 28499 40 -6.95 3.00 Prom + 29747 29786 40 -5.35 3.01 Sngl + 32754 33143 390 2 0 99 54 399 0.964 33.57 3.02 PlyA + 33168 33173 6 1.05 4.00 Prom + 34169 34208 40 -6.15 4.01 Sngl + 34262 34939 678 1 0 45 48 295 0.735 17.13 4.02 PlyA + 34976 34981 6 1.05 5.00 Prom + 37402 37441 40 -3.65 5.01 Init + 57947 58022 76 1 1 95 89 22 0.268 4.30 5.02 Term + 71593 71711 119 2 2 99 42 107 0.548 4.92 5.03 PlyA + 72944 72949 6 1.05 6.03 PlyA - 73312 73307 6 1.05 6.02 Term - 74566 74319 248 2 2 52 48 127 0.053 0.07 6.01 Init - 81363 81270 94 0 1 103 77 131 0.856 14.09 6.00 Prom - 82275 82236 40 -8.85 7.00 Prom + 96418 96457 40 -3.55 7.01 Init + 100076 100259 184 1 1 76 59 87 0.228 3.93 7.02 Intr + 107062 107423 362 2 2 60 83 303 0.612 21.01 7.03 Intr + 107953 107993 41 0 2 97 9 52 0.739 -5.70 7.04 Intr + 110343 110444 102 0 0 131 100 131 0.971 16.97 7.05 Intr + 112557 112697 141 0 0 49 109 183 0.950 15.05 7.06 Intr + 119867 120282 416 2 2 66 90 577 0.013 48.52 7.07 Intr + 120537 120562 26 1 2 76 106 -12 0.000 -3.77 7.08 Term + 138129 138365 237 2 0 77 54 178 0.839 8.48 7.09 PlyA + 138447 138452 6 1.05 8.00 Prom + 139486 139525 40 -6.15 8.01 Sngl + 141797 142498 702 1 0 74 35 227 0.516 11.96 8.02 PlyA + 142840 142845 6 1.05 9.06 PlyA - 143249 143244 6 1.05 9.05 Term - 167489 167406 84 2 0 80 42 103 0.859 1.57 9.04 Intr - 169711 169357 355 0 1 5 39 297 0.581 10.87 9.03 Intr - 169858 169759 100 2 1 12 76 49 0.551 -5.65 9.02 Intr - 170505 169918 588 1 0 10 47 361 0.381 15.87 9.01 Init - 171542 171503 40 2 1 73 105 71 0.388 7.70 9.00 Prom - 176573 176534 40 -4.15 10.03 PlyA - 177331 177326 6 1.05 10.02 Term - 179014 178453 562 0 1 18 41 236 0.761 4.76 10.01 Init - 181506 181361 146 0 2 68 98 152 0.758 13.84 10.00 Prom - 195590 195551 40 -3.65 11.04 PlyA - 196858 196853 6 1.05 11.03 Term - 199012 198866 147 1 0 95 52 106 0.311 4.62 11.02 Intr - 202900 202729 172 0 1 76 75 25 0.050 -0.98 11.01 Intr - 204903 204836 68 0 2 71 94 70 0.079 2.68 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 74741 74319 423 2 0 50 48 192 0.855 7.44 S.002 Term + 119867 120314 448 2 1 66 45 613 0.987 48.50 S.003 Term - 122699 122634 66 2 0 132 53 20 0.924 -0.14 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:170432149_170644846|GENSCAN_predicted_peptide_1|75_aa MDRAGEHSSNQTYTGTENQIPRIHICDSAAAGGPETTRENHWTREILYIIDTLVSLDTEW ISEVKCNFADLDTMV >gi568815597f:170432149_170644846|GENSCAN_predicted_CDS_1|228_bp atggatagagctggagagcattcgtctaaccaaacttacacaggaacagaaaaccaaata ccacgtattcacatttgtgattctgctgctgctggtggtccagagaccacacgtgagaac cactggactagggagatcttgtacatcatagacacattggtttctctggacacagaatgg atctcagaagtcaagtgtaattttgcagatttggacactatggtttaa >gi568815597f:170432149_170644846|GENSCAN_predicted_peptide_2|327_aa MNINKKDVQRKTPSEGHQHQRPKLDKSTKMKKNQCKKVENSKNQNASSPPKDHNSSPARE QSWMENEFDELTEVGFRRTTWKLNNLLLNDYWVNNKIKTEKNKFFETNENKDTTYQNLWD PAKAVLRGLEVLAWTIRKEKEIKGIHIGREKVKLSLFADDMIVYLENPIISAKNLLKLIS NFDKVSRYKVNMQKPQAFLYTNNSQITSELPFTTDTKRIKYLGIQLTKDVKDLFKENYKP LLKEIREDTNKWKSIPCSWIRRISFVKMAILQPNVIDRFNAIPMKLPLTFFTELEKNYLK FHMEPKKSSYSQDNPKQKEQSWRHHTT >gi568815597f:170432149_170644846|GENSCAN_predicted_CDS_2|984_bp atgaacatcaacaaaaaggacgtccaaagaaaaaccccatccgaaggtcaccaacatcaa agaccaaagctagataaatccacaaagatgaagaaaaaccagtgtaaaaaggttgaaaat tccaaaaaccagaatgcctcttctcctccaaaggatcacaactcctcgccagcaagagag caaagctggatggagaatgagtttgacgaactgacagaagtaggcttcagaagaacaaca tggaaactgaacaacttgctcctgaatgactactgggtaaataacaaaattaagacagaa aaaaataagttctttgaaaccaatgaaaacaaagacacaacgtaccagaatctctgggac ccagctaaagcagtgttgagaggattggaagttttggcctggacaatcaggaaagagaaa gaaataaagggtattcatataggaagagagaaagtcaaattgtctctgtttgcagatgac atgattgtgtatttagaaaaccccatcatctcagccaaaaatctccttaagttgataagc aactttgacaaagtctcaagatacaaagtcaatatgcaaaaaccacaggcattcctatac accaataatagccaaatcacgagtgaactcccattcacaactgatacaaagagaataaaa tatctaggaatacaacttacaaaggatgtgaaggacctcttcaaggagaactacaaacca ctgctcaaggaaataagagaagacacaaacaaatggaaaagcattccatgctcatggata agaagaatcagtttcgtgaaaatggccatactgcagcccaatgtaattgatagattcaat gctattcccatgaagctaccattgactttcttcacagaattagaaaaaaactaccttaaa tttcatatggaaccaaaaaagagctcatatagccaagacaatcctaaacaaaaagaacaa agctggaggcatcacaccacctga >gi568815597f:170432149_170644846|GENSCAN_predicted_peptide_3|129_aa MGKKQSRKTENSKNQSASPPPKECSSSPAVEQSWTENDFDKLQQEGFRRSNFSELKEEVR THGKEVKNVEKRLDEWLTRITNAEKSLKDLKELKTTTRDLRDECTSLSSQFNQLEETVSV MEDQMNEMK >gi568815597f:170432149_170644846|GENSCAN_predicted_CDS_3|390_bp atggggaaaaaacagagcagaaaaactgaaaattctaaaaatcagagtgcctctcctcct ccaaaggaatgcagctcctcaccagcagtggaacaaagctggacggagaatgactttgac aagttgcaacaagaaggcttcagacgatcaaacttctctgagctaaaggaggaagttcga actcatggcaaagaagttaaaaacgttgaaaaaagattagacgaatggctaactagaata accaatgcagagaagtccttaaaggacctgaaggagctgaaaaccacaacacgagatcta cgtgatgaatgcacaagcctcagtagccaattcaatcaactggaagaaacggtatcagtg atggaagatcaaatgaatgaaatgaagtga >gi568815597f:170432149_170644846|GENSCAN_predicted_peptide_4|225_aa MGDFNTPLPTLDRSTRHKVNKDIQELNLALHQADLIDIYRTLHPISTEYTFFSAPHCTYS KIDHIVGNKALLSKCKRTEMITNCLSDHSAIKLEWRIKKLTQNSSSTWKLNNLLLNDYWV HNEMKAEIKMFFETNENKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSRLK ELEKQEQTHSKASRRQEITKIGAELKEIETQKTLQKNQRIQELVF >gi568815597f:170432149_170644846|GENSCAN_predicted_CDS_4|678_bp atgggagactttaacaccccactgccaacattagacaggtcaacaagacataaagttaac aaggatatccaggaattgaacttagctctgcaccaagcagacctaatagacatctacaga actctccaccccatatcaacagaatatacattcttctcagcaccacattgcacttattcc aaaattgaccacatagttggaaataaagcactcctcagcaaatgtaaaagaacagaaatg ataaccaactgtctctcagaccacagtgcaatcaaactagaatggaggattaagaaactc actcaaaacagctcaagtacatggaaactgaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacaca acataccagaatctctgggacacattcaaagcagtttgtagagggaaatttatagcacta aatgcccacaaaagaaagcaggaaagatctaaaattgacaccctaacatcacgattaaaa gaactagagaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaag atcggagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaaatcaacgaatc caggagctggtattttga >gi568815597f:170432149_170644846|GENSCAN_predicted_peptide_5|64_aa MDGARGHYHKQINAGTENQILHVITYVLSREQLMTRQQLHEVYLTVEMLNSLAPGPYIKL RRPL >gi568815597f:170432149_170644846|GENSCAN_predicted_CDS_5|195_bp atggatggagctagaggccattatcataagcaaattaatgcaggaacagaaaaccaaata ctgcatgttatcacttatgttttatcccgtgaacagctcatgactagacaacaactgcat gaggtgtacttaactgttgagatgctgaattcattggcaccaggtccctacatcaaactc agaagacctctttaa >gi568815597f:170432149_170644846|GENSCAN_predicted_peptide_6|113_aa MTTKATGTRKGSGNQNPRTWWLLLAEVVVVEVLEVLARAIRKEKEIKGIQIGKEEVKLSL FADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINMQNSQAFLHTNNRLTAKS >gi568815597f:170432149_170644846|GENSCAN_predicted_CDS_6|342_bp atgaccaccaaggccacaggaacaaggaaaggaagtggtaaccagaatccaagaacttgg tggctgttgcttgctgaagttgtagttgtagaagtattggaagttctggccagggcaatc aggaaagagaaagaaataaagggtattcaaataggaaaagaggaagtcaaattgtctctg tttgcagatgacatgattgtatatttagaaaacccaatcgtctcagcccaaaatctcctt aagctgataagcaacttcagcaaagtctcaggatacaagattaatatgcaaaattcacaa gcattcctacacaccaataacagactaacagccaaatcatga >gi568815597f:170432149_170644846|GENSCAN_predicted_peptide_7|502_aa MAQGWAGFSEEELRRLKQTKGYKMGLQWFNSWVLRGRGGDAALAGKGGVEAATFVLAEGA VNPFEPQRRLPAKKSRQQLQREKALVEQSQKLGLQDGSTSLLPEQLLSAPKQRVNVQKPP FSSPTLPSHFTLTSPVGDGQPQGIESQPKELGLENSHDGHNNVEILPPKPDCKLEKKKVE LLTISIGPEIGTCGETQEKSRWEVLQQEQRLMEEKNKRKKALLAKAIAERSKRTQAETMK LKRIQKELQALDDMVSADIGILRNRIDQASLDYSYARKRFDRAEAEYIAAKLDIQRKTEI KEQLTEHLCTIIQQNELRKAKKLEELMQQLDVEADEETLELEVEVERLLHEQEVESRRPV VRLERPFQPAEESVTLEFAKENRKCQEQAVSPKVDDQCGNSSSIPFLSPNCPNQEVFRLP FLLKDQNSSPAREQNWTENEFDKWTEVVFRRWVITNSTELKEHVLTQCKEAKNLDKRLEE LLTRITSLEKNINDLMDLKNTA >gi568815597f:170432149_170644846|GENSCAN_predicted_CDS_7|1509_bp atggcgcaaggttgggcaggattctctgaggaggaactgaggagactaaagcagactaaa ggttacaagatgggtttacagtggtttaattcttgggtcttaaggggaagaggcggggat gcagctttggcggggaaagggggcgtggaagcggctacgtttgtgttagcggaaggcgct gtaaatccatttgaaccacagcgacgtctccccgcgaagaaaagtcgacaacaacttcag cgagaaaaagcccttgtagagcaaagccaaaaacttgggcttcaagatggatcaacctca ttacttccagagcagctgctttcagcaccaaaacagagagttaacgttcaaaaaccacct ttttcttcccctactcttccgagtcatttcactctcacctcccccgttggtgatggacaa ccacagggcattgaaagtcagccaaaggaactgggacttgagaattcccatgatggtcac aacaatgttgagattctacctccaaagccagattgcaaattggagaaaaagaaagtggaa ttgttaactatttccattggcccagaaattggaacttgtggagagacgcaagaaaaatct cgttgggaagtcctccaacaagaacaacggctaatggaagagaaaaataaacgtaaaaaa gctcttttggctaaagctattgcagaaagatccaaaagaactcaggcagagaccatgaaa ctaaagcggatccagaaggagttgcaggctttagatgacatggtgtcagctgacattgga attctcaggaaccggattgatcaggccagcttagactattcatacgctcggaagcggttt gacagggctgaagcagagtacattgcagcaaagctagatatacagcgcaagactgagata aaagagcaactcactgaacacctttgtacgatcatacagcaaaatgagctccgaaaggcc aagaagttggaggagttgatgcaacaactagatgtagaagccgatgaagagactttggag cttgaggtggaggtcgagagattgctacacgaacaagaagtagaatcaaggagaccagtg gttcgtttagagaggccatttcagcctgcggaggagagtgtgacattagaatttgctaaa gagaacagaaagtgtcaagaacaagctgtttccccaaaggtagatgaccagtgtggaaat tccagtagcatcccctttcttagtccaaactgcccaaatcaagaagtgttccgtttaccc tttttactgaaagatcaaaactcctcaccagcaagggaacaaaactggacggaaaatgaa tttgacaaatggacagaagtagtcttcagaaggtgggtaataacaaattccaccgagtta aaggagcatgttctaacccaatgcaaggaagctaagaaccttgataaaaggctagaggaa ttgctaactagaataaccagtttagagaagaacataaatgacctgatggacctgaaaaac acagcatga >gi568815597f:170432149_170644846|GENSCAN_predicted_peptide_8|233_aa MLPDFKLCYKATVTKTACYWHQNRYTDQWNRTETSEITPHIYSHLIFDKPDKNKQWGKDS LFNKWCWENWLAICRKLKLDPLLTPYTKINSRWIKDLNIRPKSIKTLEENLGNTIQDIGM GKDFMTETPKAVATKAKNDKWDPIKLKSFCTAKETIIRVNRQPTEWEKIFAIYPSDKGLI SRICKELQQIYKKKTNNPIKKWVKDLDRRFSKEDIYAANKHLKKCSSLFIREM >gi568815597f:170432149_170644846|GENSCAN_predicted_CDS_8|702_bp atgctacctgacttcaaactatgctacaaggctacagtaaccaaaacagcatgttactgg caccaaaacagatatacagaccaatggaacagaacagaaacctcagaaataacaccacac atctacagccatctgatctttgacaaacctgacaaaaacaagcaatggggaaaggattcg ctatttaacaaatggtgttgggaaaactggctagccatatgcagaaaactgaaactggac cccttgctcacaccttatacaaaaattaactcaagatggattaaagatttaaatataaga cctaaaagcataaaaaccctagaagaaaatctaggcaataccattcaggacataggcatg ggcaaagacttcatgactgaaacaccaaaagcagtggcaacaaaagccaaaaatgacaaa tgggatccaattaaactaaagagcttctgcacagcaaaagaaactatcatcagagtgaac aggcaacctacagaatgggagaaaatttttgcaatctatccatctgacaaagggctaata tccagaatctgcaaagaacttcaacaaatttacaagaaaaaaacaaacaaccccatcaaa aagtgggtgaaggatttggacagacgattttcaaaagaagacatttatgcagccaacaaa catttgaaaaaatgctcatcacttttcattagagaaatgtaa >gi568815597f:170432149_170644846|GENSCAN_predicted_peptide_9|388_aa MKENDAREMGDKQGRGWNSLDGSEEDRKMWESLELPRDLLNGFDQNADNDMDNEIQSEMV SDGDEELVGNWSKGDSCYVLAKRLVAFCPFPRDLWDFGLERDDLGYLVEEISKQQCIQEV TRVLLKAFSFIRETDHKSSENLQPDNAIENKIAFSKKKFKPVAEICISNKEPNVNPQDNG EMSPGHVRDLHGSPSQHRPRGLGGKNGFTAAPAMAEGSNIELWLWLQRVQAPNHGSFHVV LSLVQKMYGNTWMTRQKLAAGAGSSWRTSARAAQKGNVGLEPPSTVPTGVPPSGAVRRRP PSSRPQNGRSTDSLHHAPGKATSTQCQPLKAAMREAVPCKATEAELPKTMGTHLLHQHDL DLIEHLDVEPADMEGHCDAKIMTKTVPN >gi568815597f:170432149_170644846|GENSCAN_predicted_CDS_9|1167_bp atgaaggaaaatgatgctcgagagatgggagacaaacaaggcagaggttggaacagtttg gatggctcagaagaagacaggaaaatgtgggaaagtttggaactccctagagacttgtta aatggctttgatcaaaatgctgataatgatatggacaatgaaatccagtctgagatggtc tcagatggagatgaggaacttgttgggaactggagcaaaggtgactcttgttatgtttta gcaaagagactggtggcattttgtcccttccctagagatttgtgggactttggacttgag cgggatgatttggggtacctggtggaagaaatttctaagcagcaatgcattcaagaggtt actcgtgtgctgttaaaggcattcagttttataagggaaacagaccataaaagttcagaa aatttgcagcctgacaatgccatagaaaataaaatagcattttctaagaagaaattcaag ccagttgcagaaatttgcataagtaataaggagccgaatgttaatccccaagacaatggt gaaatgtctccaggccatgtcagagaccttcatggcagcccctcccaacacaggcccaga ggcctaggaggaaaaaatggtttcacagctgctccagctatggctgaagggtccaacata gagctctggctgtggcttcagagggtgcaagccccaaaccatggcagcttccatgtggtg ttgagcctagttcagaagatgtatggaaacacctggatgaccaggcagaagcttgctgca ggagcaggttcctcatggagaacctctgctagggcagcacagaagggaaatgtggggttg gaacccccaagcacagtccctactggagtaccacccagtggagctgtgagaagaaggcca ccatcctccagaccccagaatggtagatccactgacagcttgcaccatgcacctggaaaa gccacaagcactcaatgccagcccctgaaagcagctatgagggaggcggtaccctgcaaa gccacagaggcagagctgcccaagaccatgggaacccacctcttgcatcagcatgacctg gatttgattgaacacttagatgtagaaccagcagatatggagggacactgtgatgccaaa ataatgaccaaaacagttccaaattga >gi568815597f:170432149_170644846|GENSCAN_predicted_peptide_10|235_aa MRKIQHKKAQNSKNQNASSPSKNHNFLPAKEQNWTENEFDEMTEVGFRRSWFFDKTNKID RLLARPIRKKRQNNQIDTVKNDKGDITTDPTEIQTTIREYYKYFYTNKLENAEEMDNFLD TSTLQRLNQEEDESLNRPITSSEIEAVIDSLSTKIRLGTDGFTTEFYQRYNEELVPFLLK LFQTIEKEGLLSNSFYEASIILIPKPGKDTTNKENFRPISLMNIDAKILNKILAN >gi568815597f:170432149_170644846|GENSCAN_predicted_CDS_10|708_bp atgaggaaaatccagcacaaaaaggctcaaaattccaaaaaccagaatgcctcttctcct tcaaagaatcacaacttcttgccagcaaaggaacaaaactggaccgagaatgagtttgat gaaatgacagaagtgggcttcagaaggagctggttttttgacaagactaacaaaatagac agactgctagccagaccaataaggaagaaaagacagaataatcaaatagacacagtaaaa aatgataaaggggatatcaccactgatcccacagaaatacaaactaccatcagagaatac tataaatacttctacacaaataaactagaaaatgcagaagaaatggataatttcctggac acatctaccctccaaagactaaatcaggaagaagatgaatccctgaatagaccaataaca agttctgaaattgaggcagtaattgatagcctatcaacgaaaatacgcttaggtacagat ggattcacaactgaattctaccagaggtacaatgaggagctggtaccattccttttgaaa ctattccaaacaatagaaaaagagggactcctctctaactcattttatgaggccagcatc atcctgataccaaaacctggcaaagacacaacaaacaaagaaaatttcaggccaatatcc ctgatgaacatcgatgcaaaaatcctcaataaaatactggcaaactga >gi568815597f:170432149_170644846|GENSCAN_predicted_peptide_11|128_aa GSGILPLASVLEAKEQLTEGSVSILHNQSFPLKLGLLETMDRMTLICVCFLPCPVSSFLS NDRRTSNLQCGEGKVENRKRPNTPAGLRSECIFRPNSDPSFLSGRGFPAGSPITPARASG TEFGSPWA >gi568815597f:170432149_170644846|GENSCAN_predicted_CDS_11|387_bp ggtagcggcatcctgcctttggcttcagtgcttgaagccaaagaacaattaacagaaggt agtgtcagcatactccacaaccaatcatttcctctgaagcttgggcttttagaaacaatg gacagaatgactttgatctgtgtttgcttccttccttgccctgtatcatcattcctgagt aatgacaggaggactagcaaccttcaatgtggagaagggaaggtggaaaatagaaaaagg cccaacacaccagctggcctgcggtcagagtgcatcttcaggcctaattctgacccatcc ttcctcagtgggcggggctttcctgcaggatctccaataactccagccagagcctcaggg acagaatttggatctccctgggcctga