GENSCAN 1.0 Date run: 21-Sep-117 Time: 11:32:15

Sequence gi568815593f:118873796_119074122 : 200327 bp : 39.25% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   1900   2089  190  1  1   49   48   176 0.613   5.64
 1.02 PlyA +   2725   2730    6                               1.05

 2.02 PlyA -   6892   6887    6                               1.05
 2.01 Sngl -   9084   8551  534  0  0   71   48   303 0.864  20.42
 2.00 Prom -  29108  29069   40                              -3.55

 3.00 Prom +  32229  32268   40                              -5.35
 3.01 Init +  35468  35520   53  1  2   93   36    57 0.223   1.69
 3.02 Intr +  49953  50010   58  0  1   54   53    68 0.335  -2.13
 3.03 Intr +  50464  50605  142  0  1   53   45   130 0.366   4.21
 3.04 Intr +  52455  52610  156  1  0   42   78    83 0.687   1.76
 3.05 Intr +  53152  53354  203  2  2   57   74    99 0.765   3.48
 3.06 Term +  72500  72643  144  1  0   82   43   166 0.637   8.43
 3.07 PlyA +  72741  72746    6                               1.05

 4.00 Prom +  72938  72977   40                              -4.75
 4.01 Init +  73549  73620   72  0  0   74   53   113 0.336   7.52
 4.02 Intr +  78206  78340  135  1  0   98   77    57 0.119   5.44
 4.03 Term +  84984  85247  264  2  0   52   48   163 0.144   3.22
 4.04 PlyA +  85321  85326    6                               1.05

 5.00 Prom +  86181  86220   40                              -3.95
 5.01 Init +  88802  88885   84  1  0   25  103    43 0.235   0.37
 5.02 Intr +  95556  95730  175  2  1   30   64   137 0.215   4.09
 5.03 Term +  96806  97029  224  0  2   22   50   171 0.462   2.80
 5.04 PlyA +  97476  97481    6                               1.05

 6.00 Prom +  97644  97683   40                              -6.15
 6.01 Sngl + 100001 100330  330  1  0   81   42   720 0.999  62.47
 6.02 PlyA + 100978 100983    6                               1.05

 7.04 PlyA - 100996 100991    6                               1.05
 7.03 Term - 101542 101031  512  2  2  -68   42   305 0.022   3.85
 7.02 Intr - 109553 109435  119  1  2   -1   43   137 0.007  -0.51
 7.01 Init - 114980 114499  482  2  2   55   60   440 0.148  32.48
 7.00 Prom - 118618 118579   40                              -3.65

 8.03 PlyA - 118815 118810    6                              -1.95
 8.02 Term - 120168 118826 1343  1  2   67   39   418 0.946  25.47
 8.01 Init - 120398 120320   79  2  1   42   40    80 0.320  -0.13
 8.00 Prom - 122300 122261   40                              -4.95

 9.00 Prom + 129932 129971   40                              -5.75
 9.01 Init + 136033 136107   75  0  0  105   84   116 0.911  12.85
 9.02 Term + 136256 136405  150  1  0    7   48   133 0.611  -1.87
 9.03 PlyA + 136661 136666    6                               1.05

10.07 PlyA - 137068 137063    6                               1.05
10.06 Term - 138064 137903  162  1  0    9   44   162 0.026   0.85
10.05 Intr - 148303 148255   49  0  1  106   89    -3 0.089  -0.64
10.04 Intr - 150842 150690  153  1  0   73   60    83 0.642   2.27
10.03 Intr - 151508 151372  137  2  2   16   84   149 0.047   5.75
10.02 Intr - 169080 169024   57  0  0   93   71    67 0.287   3.66
10.01 Init - 197912 197238  675  2  0   80   48   446 0.008  33.01

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 151436 151372   65  2  2   82   84    83 0.900   7.97
S.002 Sngl - 197912 197193  720  2  0   80   47   457 0.956  34.77

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:118873796_119074122|GENSCAN_predicted_peptide_1|63_aa
XSIQIYKASSQRPSRRLRLPHNNSGDFNTPRTILHRSRQKTNKDIKDLNSALDRTDPTGI
HSL

>gi568815593f:118873796_119074122|GENSCAN_predicted_CDS_1|192_bp
nngagcatccagatttataaagcaagttctcagagaccttcaaggagacttagacttcca
cacaataatagtggagactttaacaccccacggacaatcttacacagatcaagacaaaaa
actaacaaagatattaaggacctgaactcagcactggatcgaacggacccaacgggtatc
cactctctgtga

>gi568815593f:118873796_119074122|GENSCAN_predicted_peptide_2|177_aa
MGKVSPGHVRYLRGRLSYEWFCGLAPGPCCSVQPQGLVPCAPKLHLQPLLKGRLRYSSEH
CFRGCKPQALAISMCLGPAGVQKSRVELWEPRFQRMYGNDWMYRQKSAAGAEPSWRTSAK
AVQKGNVGLEPPHRVPSGALPSGALKRGPLSSRPQNVDPLTACTMHLEKPQILNASP

>gi568815593f:118873796_119074122|GENSCAN_predicted_CDS_2|534_bp
atggggaaagtgtctccagggcatgtcagatatcttcgtggcagactttcctatgaatgg
ttttgtgggctggccccagggccctgctgctctgtgcagcctcagggcttggtgccctgt
gccccgaagctccatctccagccattgctaaagggtaggctaaggtacagctcagaacat
tgtttcagagggtgcaagccccaagctttggcaatttccatgtgccttgggcctgcaggt
gtgcagaagtcaagagttgagttatgggagcctagatttcaaaggatgtatggaaacgac
tggatgtacaggcagaagtctgctgcaggtgcagagccctcatggagaacctctgctaag
gcagtgcagaagggaaatgtgggtttggagcccccacacagagtccccagtggagcactg
cctagtggagctctgaaaagagggccactgtcctcaagaccccagaatgtagatccactg
acagcttgcaccatgcacctggaaaagccacagatactcaatgccagcccatga

>gi568815593f:118873796_119074122|GENSCAN_predicted_peptide_3|251_aa
MEKLNSTALGVTADAPQFLSLRRQGLKLLQTRTDVLLIISSTGKKSIHKRKSDQNKEKHT
PKTELPENQIQQSLKPLLLTLNLTATWMELEAIILSEVTQEWKTKFRYVLTYKWELSYED
AKMYIINFGDSGWKEGVLEAGKSESEHLHGHGPGESSLPGLQMEVFLLCPHMQRETEKER
ERDKERRRETESKKEVSLMSLLLKEYEEIFEMESLSVVTGVNTKEGQTPTDETGCNDKQI
EDCYKKGFLTQ

>gi568815593f:118873796_119074122|GENSCAN_predicted_CDS_3|756_bp
atggagaaactcaactcaacagcccttggggtcactgcagatgccccacagtttctttct
ctgaggaggcaaggactgaagttgctgcagacccgcacagatgtgctgctgataatttcc
agtacaggaaagaagtccattcataagagaaaatcagaccaaaacaaagagaagcatacc
ccaaaaacagaactgcctgagaaccagatccagcaaagcctgaagcctctcctactcact
ctgaacctcaccgcaacttggatggaactggaggccattattctaagtgaagtaacccag
gaatggaaaaccaaattccgttatgttctcacttataagtgggagctaagctatgaggat
gcaaagatgtacataataaattttggggactctgggtggaaggagggagttctagaagct
gggaagtctgaatcagagcacctgcatggtcacggtcctggtgagagctctcttcctggc
ttgcaaatggaagtcttcttgctgtgtcctcacatgcagagagagacagagaaagagaga
gagagagacaaagagagacggagagagacagagagcaaaaaggaagtatctctcatgtct
cttcttctaaaggaatatgaggagatttttgagatggaaagtctttctgtggtcacagga
gtaaacacaaaagagggacagacaccaactgatgaaacagggtgcaatgacaaacagatc
gaggactgctataagaaaggattcctcactcagtag

>gi568815593f:118873796_119074122|GENSCAN_predicted_peptide_4|156_aa
MEDFIADESGSKREGELEEGWSRKKKKGTEIKGRERLKGGAEIERRKGLRDSERTEKTVK
RRRLPNLKLDTRMVQYTQINKCDSSHKENNKNHMIISIDAEKAFNKIQHPFMLKILNKLG
TKGTYLKIIRAIYAKPTANIILNGQKPKAFPLRTGT

>gi568815593f:118873796_119074122|GENSCAN_predicted_CDS_4|471_bp
atggaggattttattgccgatgaaagtggctctaagcgggaaggagagctggaagaggga
tggagcaggaagaaaaagaaaggaactgaaattaagggaagggagagattgaaaggtggc
gcagaaattgaaaggagaaaggggttgagggatagtgagaggacagagaagacagtaaaa
agacgccgcttacccaatttaaaattggatacaagaatggttcaatatacgcaaatcaat
aaatgtgattcatcacacaaagagaacaacaaaaaccacatgatcatatcaatagatgca
gaaaaggctttcaataaaatccagcatcccttcatgttaaaaatcctcaacaaactaggc
actaaaggaacttacctcaaaataataagagccatctatgccaaacccacagccaacatc
atactgaacgggcaaaaaccaaaagcattccccttgagaactggaacatga

>gi568815593f:118873796_119074122|GENSCAN_predicted_peptide_5|160_aa
MEKKKEVGLGSKNKRGNEKEKGEEKKGQVPDPVPPDWVRTPSRGSRHLLQEHSGQHLFSV
PLGQSSEKKEEAVTFAVLQPSPVISPARQANIQIQEIQGTPERYSTRRSTPRHVINRFSK
VKMKEKMLRAAREKRQVTYKEKSIRLKAKPSVETLLASRD

>gi568815593f:118873796_119074122|GENSCAN_predicted_CDS_5|483_bp
atggagaagaaaaaggaagtgggattagggagtaaaaataaaagagggaatgaaaaggag
aagggagaggaaaagaagggacaggtccctgatcctgttccccctgactgggtgagaact
cccagtaggggctccagacaccttctacaggagcattcaggccaacatctgttcagtgtg
cccctgggacaaagctccgagaagaaagaggaggctgtcacctttgctgttttgcagcct
tcaccggtaatatctccagcaagacaagccaacattcaaattcaggaaattcagggaacc
ccagaaagatactccacgagaagatctaccccaagacacgtaatcaacagattctccaag
gtcaaaatgaaagaaaaaatgttaagggcagccagagagaaacgccaggtaacctacaaa
gagaagtccatcagactaaaagcaaaaccctcagtggaaaccctattagccagcagagat
tag

>gi568815593f:118873796_119074122|GENSCAN_predicted_peptide_6|109_aa
MSDAAVDTSSEITTKDLQEKKEVVEEAENGRDAPANGNANEENGEQEADNEVDEEEEEGG
EEEEEEEGDGEEEDGDEDEEAETATGKRAAEDDEDDDVDTKKQKTDEDD

>gi568815593f:118873796_119074122|GENSCAN_predicted_CDS_6|330_bp
atgtcagacgcagccgtagacaccagctccgaaatcaccaccaaggacttacaggagaag
aaggaagttgtggaagaggcagaaaatggaagagacgcccctgctaacgggaatgctaat
gaggaaaatggggagcaggaggctgacaatgaggtagatgaagaagaggaagaaggtggg
gaggaagaggaggaggaagaaggtgatggtgaggaagaggatggagatgaagatgaggaa
gctgagacagctacgggcaagcgggcagctgaagatgatgaggatgacgatgtcgatacc
aagaagcagaagaccgacgaggatgactag

>gi568815593f:118873796_119074122|GENSCAN_predicted_peptide_7|370_aa
MTLLNLTSTPTLPTLAYPVLGERSHWARTNGLVPSQRQGGTSSLGFGDPPPVRGVRALRY
AARAGFYSAWRPPAAAGLQCHGLTGVSAMESQKEARTLQEPVARPSGASSSQTPNDKERR
EGGAVPAAAALGAEADDDSADGLWELPVEPAERRPECTRCRFLAFVTVGRSDSFRDHLPE
QQLVKLSIIGCARILGMISPAYTSISSQIDQAEERISEIEDKLNEIKREDKITEKRMKRN
EQSLQEIWDYVKRPNLCLIGVRESDGENGTKLENTLQDIIQENFPNLARQANIQIQEIQR
TPQRYSLRRATPRHIIIRFTKVEMKEKMLRAAREKGWVTHKGKPIRLTADLSAETLQARR
EWGPIFNILK

>gi568815593f:118873796_119074122|GENSCAN_predicted_CDS_7|1113_bp
atgacgctacttaacctcaccagcactccaacactgcccacacttgcctacccagttcta
ggtgaacggtcccactgggcacggaccaatgggctcgttccttcccagcggcagggtggg
acctccagcctgggcttcggagacccgccccccgtacgtggcgtgcgggcattgcgctac
gcggcgcgggccggtttctacagcgcgtggcgccccccggcggcagccgggcttcaatgc
cacggcctgaccggagtgtccgccatggagtcgcagaaagaggcacgaacactccaggag
cccgttgcgcggccttctggggcctcaagctctcagacgccgaacgacaaggagcggcgg
gagggcggcgcagtgccggcggcggctgccctgggcgcagaggcggacgacgacagtgcg
gacgggctgtgggagctgccggtggagccggccgagcggaggcctgagtgcacccgctgc
agatttctggcttttgtgacagtgggaagatctgacagcttcagagaccaccttcctgag
cagcagttggttaagctcagtattattggctgtgcccgtatattgggcatgatctctcca
gcatacacaagtatcagtagccaaatcgatcaagcggaagaaaggatatcagaaattgaa
gataaactcaatgaaataaagcgtgaagacaagattacagaaaaaagaatgaaaaggaac
gaacaaagcctccaagaaatatgggactatgtgaaaagaccaaacctatgtttgattggt
gtacgtgaaagtgatggggagaatggaaccaagttggaaaacactcttcaggatattatc
caggagaacttccccaacctagcaagacaggccaacattcaaattcaggaaatacagaga
acaccacaaagatactccttgagaagagcaaccccaagacacataatcatcagattcacc
aaggttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggttgggttacccac
aaagggaagcctatcagactaacagcagatctctcagcagaaactctacaagccagaaga
gagtgggggccaatattcaacattcttaaataa

>gi568815593f:118873796_119074122|GENSCAN_predicted_peptide_8|473_aa
MIISIDAEKAFDKIQQLFMLKTLNKLVLEVLARAIRQEKEIKGIELGKEEVKLSLFADDM
IVYLENPIISAQNLLKLISNFSKVSGYKINVQKSQAFLYINNRQTESQIMSELPFTIASK
RIKYLEIQLTRDVKDLFKENYKPLLNEIKEDKNKWKNIPCSWIGRINIVKMAILPKVIYR
FNAIPIKLQMTFFTELEKTTLKFIWNQKRAHIAKTILSKKNKAGGIMLPEFKLYYKATVT
KTAWYWYQNRDIDQWNRTESSEIIPHIYNHLTFDKPDKNKQWGKDSLFNKWCWENWLAIC
RKLKLDLFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKDFMSKTPKAMAT
KAKIDKWDLIKLKSFCTAKGTTSRVNRQPTEWEKISAIYPSDKGLVSRIYKELKQIYKKK
IKQPHQKVGKGYEQTLLKRRHLCSQKTHEKMLIITGHQRNGNQNHNEIPSHIS

>gi568815593f:118873796_119074122|GENSCAN_predicted_CDS_8|1422_bp
atgattatctcaatagatgcagaaaaggcctttgacaaaattcaacagctcttcatgcta
aaaactctcaataaactagtgttggaagttctggccagggcaattaggcaggagaaagaa
ataaagggtattgaattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatg
attgtatatttagaaaaccccatcatctcagcccaaaatctccttaagctgataagcaac
ttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagcgttcctatacatc
aataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttcaaag
agaataaaatacctagaaatccaacttacaagggatgtgaaggacctcttcaaggagaac
tacaaaccactgctcaatgaaataaaagaggacaaaaacaaatggaagaacattccatgt
tcatggataggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttacaga
ttcaatgccatccccatcaagctacaaatgactttcttcacagaattggaaaaaactact
ttaaagttcatatggaaccaaaaaagagcccacattgccaagacaatcctaagcaaaaag
aacaaagctggaggcatcatgctacctgagttcaaactatattacaaagctacagtaacc
aaaacagcatggtactggtaccaaaacagagatatagaccaatggaacagaacagagtcc
tcagaaataataccacacatctacaaccatctgacctttgacaaacctgacaaaaacaag
caatggggaaaggattccctatttaataaatggtgctgggaaaactggctagccatatgt
agaaagctgaaactggatctcttccttacaccttatacaaaaattaattcaagatggatt
aaagacttaaacgttagacctaaaaccataaaaaccctagaagaaaacctaggcaatacc
attcaggacataggcatgggcaaggacttcatgtctaaaacaccaaaagcaatggcaaca
aaagccaaaattgataaatgggatctaattaaactaaagagcttctgcacagcaaaagga
actaccagcagagtgaacaggcaacctacagaatgggagaaaatttctgcaatctaccca
tctgataaagggctagtatccagaatctacaaagagcttaaacaaatttacaagaaaaaa
atcaaacaaccccatcaaaaagtgggcaaaggatatgaacagacacttctcaaaagaaga
catttatgcagccaaaaaacacatgaaaaaatgctcatcatcactggccatcagagaaat
ggaaatcaaaaccacaatgagataccatctcacatcagttag

>gi568815593f:118873796_119074122|GENSCAN_predicted_peptide_9|74_aa
MGAMGMSHMLASFFDLDSMLIDMARDFNWEEAVQETKDGAANRKLTGECHFLWKSTRLQL
MTLAENIKVMLTEL

>gi568815593f:118873796_119074122|GENSCAN_predicted_CDS_9|225_bp
atgggggctatggggatgagccacatgctggcctctttctttgacctggacagcatgctc
atcgacatggccagggacttcaattgggaagaagcagtccaggaaacaaaagatggtgcc
gccaatagaaaattgactggagaatgtcacttcctttggaaatctacacgtttacagctt
atgacactagcagaaaatatcaaagtcatgctcaccgaactttga

>gi568815593f:118873796_119074122|GENSCAN_predicted_peptide_10|410_aa
MTGKGRATGDARASLTHREALVANAAHGEAVVARVHSPGQHLVQVHVGALVLQRTPHPAH
ATGQRLLPSLLPSGAAAALRWRRQKQRQAPGAGAAQLGPATHPEPRRARPGGGGSARGRS
AALLRAPVTPRASPGEAQGPLTGHDPGPTHRPGRPRCPHTSSGPRAAAPDPARVAQAVRA
RTPRLTCPFPAPVPATPTVLRHSLRASGSLRPKYRETSLGAAATVWVNDVAMAMPVLIAL
ESGQIKDLKPKTIKTLEENLGSTIQDISMGKDFTTKTPKAIATKAKIDNWDMDEAGNHHP
QQTNTGTENQTPHVLTHKWELNNENTWTQGGEYHTPGPVRGFLQRTAIIWRDVVNKSQFY
FNHQYDHIQKDEEDDEVCEERTLKDPSGPQRKRWPLSAAHMKPAQQPLGM

>gi568815593f:118873796_119074122|GENSCAN_predicted_CDS_10|1233_bp
atgacgggcaaaggccgggccacgggcgacgcgagggcctccctcactcaccgtgaagcg
ctggtcgccaatgctgcccacggagaagcagtggtcgccagggttcacagccccggtcag
cacctggtgcaggttcatgtcggcgccctagtcctgcaacggacaccgcatccagctcat
gccacgggtcagcggctgcttccctccctccttccctcaggggcggcggccgctcttcgg
tggcgacggcagaagcagcgacaggcgcctggagccggagccgctcagctggggcccgcg
actcatcccgagccccggagggcgaggcccggagggggcgggtcggcgagagggcggagc
gccgcgctccttcgcgcacctgtcactccccgcgctagcccaggtgaagctcagggtccc
ctcaccggccacgacccggggccgactcaccggccaggaaggccgcgctgtccacacact
tcctctggacctcgggcagccgctcctgacccagcccgagtggcgcaagcagtacgcgct
cgcactccccgcctgacctgtcccttcccggctccggtgcccgccacccccacagtcctc
cgccactcactacgcgcctctggatccctgcggccgaaatatcgcgagacttcgctcggg
gcagctgctactgtgtgggtcaacgatgttgctatggcaatgccagtactcatagcacta
gaaagtggccagattaaagacttaaaacctaaaaccataaaaaccctagaagaaaaccta
ggcagtaccattcaggacatcagcatgggcaaagactttacgactaaaacaccaaaagca
attgcaacaaaagccaaaattgacaactgggacatggatgaagctggaaaccatcatcct
cagcagactaacacaggaacagaaaaccaaacaccacatgttctcactcataagtgggag
ttgaacaatgagaacacatggacacagggaggggaatatcacacaccggggcctgtcagg
ggatttcttcagagaacagctataatttggagggatgtagtaaacaagtcacaattctat
ttcaaccatcaatacgatcatattcaaaaagatgaggaagatgacgaagtttgtgaagag
aggacactcaaggacccatccggcccacaaagaaagaggtggcctctgagtgcagcgcac
atgaagcctgctcaacagcctctgggcatgtga