GENSCAN 1.0 Date run: 6-Nov-116 Time: 06:25:56 Sequence gi568815586f:120203562_120413033 : 209472 bp : 49.94% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.14 PlyA - 27 22 6 1.05 1.13 Term - 9019 8753 267 1 0 103 43 564 0.960 48.89 1.12 Intr - 10429 10281 149 2 2 69 90 300 0.999 28.25 1.11 Intr - 10656 10575 82 0 1 105 102 75 0.998 9.81 1.10 Intr - 11437 11264 174 1 0 112 76 334 0.999 34.74 1.09 Intr - 11712 11542 171 0 0 111 64 188 0.996 18.84 1.08 Intr - 13555 13280 276 1 0 124 38 198 0.471 16.11 1.07 Intr - 16530 15595 936 0 0 134 -11 200 0.464 6.08 1.06 Intr - 18197 18062 136 1 1 69 44 208 0.993 15.07 1.05 Intr - 19129 18988 142 2 1 73 67 170 0.510 12.81 1.04 Intr - 19438 19302 137 0 2 61 91 154 0.998 13.21 1.03 Intr - 20272 20157 116 1 2 78 84 154 0.871 13.25 1.02 Intr - 20816 20590 227 0 2 119 94 223 0.999 23.70 1.01 Init - 27424 27274 151 1 1 66 47 91 0.261 3.01 1.00 Prom - 27702 27663 40 -4.56 2.00 Prom + 30125 30164 40 -3.16 2.01 Init + 31190 31240 51 1 0 55 85 43 0.431 1.86 2.02 Term + 45012 45182 171 2 0 105 39 122 0.207 6.83 2.03 PlyA + 45578 45583 6 1.05 3.00 Prom + 51834 51873 40 -4.16 3.01 Init + 62067 62379 313 2 1 65 80 117 0.393 4.09 3.02 Term + 78686 79155 470 0 2 36 48 609 0.945 46.84 3.03 PlyA + 79274 79279 6 1.05 4.00 Prom + 90297 90336 40 -1.06 4.01 Init + 100007 100497 491 1 2 64 61 283 0.244 17.88 4.02 Intr + 108859 109189 331 1 1 72 70 234 0.224 15.63 4.03 Term + 109323 109475 153 2 0 112 47 54 0.971 1.62 4.04 PlyA + 109660 109665 6 1.05 5.06 PlyA - 109964 109959 6 1.05 5.05 Term - 118756 118632 125 2 2 114 42 185 0.996 15.05 5.04 Intr - 121500 121373 128 2 2 101 42 161 0.996 13.12 5.03 Intr - 122459 122300 160 0 1 81 108 349 0.884 35.25 5.02 Intr - 132706 132521 186 2 0 67 74 60 0.485 2.16 5.01 Init - 133600 133546 55 1 1 92 87 0 0.368 1.89 5.00 Prom - 134592 134553 40 -6.06 6.15 PlyA - 134987 134982 6 1.05 6.14 Term - 139544 139509 36 2 0 85 52 64 0.495 -0.16 6.13 Intr - 142728 142574 155 1 2 106 84 252 0.953 26.39 6.12 Intr - 143953 143885 69 2 0 94 99 92 0.994 10.05 6.11 Intr - 144127 144101 27 2 0 99 63 34 0.510 0.09 6.10 Intr - 149818 149738 81 2 0 97 131 91 0.991 13.91 6.09 Intr - 153458 153341 118 2 1 35 59 121 0.687 3.94 6.08 Intr - 154337 154255 83 0 2 92 101 107 0.981 11.76 6.07 Intr - 155492 155444 49 2 1 66 105 123 0.916 10.05 6.06 Intr - 159574 159482 93 1 0 112 98 150 0.994 18.56 6.05 Intr - 161194 161153 42 1 0 97 100 14 0.737 1.94 6.04 Intr - 164531 164447 85 1 1 53 79 84 0.608 3.82 6.03 Intr - 164712 164631 82 1 1 35 99 134 0.986 7.90 6.02 Intr - 164926 164856 71 0 2 66 40 64 0.626 -1.77 6.01 Init - 165420 165272 149 0 2 121 84 97 0.680 10.17 6.00 Prom - 179698 179659 40 -4.76 7.04 PlyA - 180809 180804 6 1.05 7.03 Term - 185463 185385 79 2 1 77 38 106 0.816 1.74 7.02 Intr - 189797 189736 62 2 2 77 107 32 0.705 1.53 7.01 Init - 192237 192184 54 0 0 92 90 59 0.799 5.78 7.00 Prom - 208604 208565 40 -1.96 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 124192 124159 34 1 1 64 96 58 0.861 2.51 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:120203562_120413033|GENSCAN_predicted_peptide_1|987_aa MASNKTDKAPALWSLILGEETVDKNPNWQGRPLKETEFLVSVQALGPWAGDALLADLEST TSHISKRPVFLSEETPYSYPTGNHTYQEIAVPPPVPPPPSSEALNGTILDPLDQWQPSSS RFIHQQPQSSSPVYGSSAKTSSVSNPQDSVGSPCSRVGEEEHVYSFPNKQKSAEPSPTVM STSLGSNLSELDRLLLELNAVQHNPPGFPAETNSPLGGKAGPLTKEKPKRNGGRGLEDVR PSVESLLDELESSVPSPVPAITVNQGEMSSPQRVTSTQQQTRISASSATRELDELMASLS DFKTSSSTVALSAPGLSSSAPSSYCSLPPSPPPMPSVFLPPTTIPSPRGQGHTPEFPCTE QSGRGLLPPVAPSWLDLAGLGVMPDTFNSRSPSVEGSLWAVGTESQGRDWRHLPTITSEL SGAPRCHTVPCAGSTALQEPGEPQGPPASPSCPEEALAATWEQPWASEVFGPERMPPSGA ARSFQEVTEPAVVAVDRQAIFPDTWTLTEEHGLQQERPRPEPGRLGSSSPASVTTEQLGA KMTERGSVARPTQGPETPRSPEGTTEAATQDGKEQPELPCAMAMGTPSTTERISTSGQAG TSSHGCWAWPLAGLSIRSVIRRSWESGHAHPMSREPSPRRRLDPATLSRTPSQEQLIAEL QGRLGIQPEAEEPAEAAGPSAQDWLTEGVIITVQPRGKRAGGQLVEKFMAQGKTGSSSPP GGPPKPGSQLDSMLGSLQSDLNKLGVATVAKGVCGACKKPIAGQVVTAMGKTWHPEHFVC THCQEEIGSRNFFERDGQPYCEKDYHNLFSPRCYYCNGPILDKVVTALDRTWHPEHFFCA QCGAFFGPEGFHEKDGKAYCRKDYFDMFAPKCGGCARAILENYISALNTLWHPECFVCRE CFTPFVNGSFFEHDGQPYCEVHYHERRGSLCSGCQKPITGRCITAMAKKFHPEHFVCAFC LKQLNKGTFKEQNDKPYCQNCFLKLFC >gi568815586f:120203562_120413033|GENSCAN_predicted_CDS_1|2964_bp atggcatcgaacaagaccgacaaggcccctgctctctggagcttaattcttggagaggag acagttgacaagaacccaaattggcagggaaggcccctcaaggagactgagttcctagtc agtgtccaggccctaggtccctgggctggggacgccctgctggcggacttggagtctacc acctcccacatctccaaacggcctgtgttcttgtcggaggagaccccctactcataccca actggaaaccacacataccaggagattgccgtgccaccccccgtccccccacccccgtcc agcgaggccctcaatggcacaatccttgaccccttagaccagtggcagcccagcagctcc cgattcatccaccagcagcctcagtcctcatcacctgtgtacggctccagtgccaaaact tccagtgtctccaaccctcaggacagtgttggctctccgtgctcccgagtgggtgaggag gagcacgtctacagcttccccaacaagcagaaatcagctgagccttcacccaccgtaatg agcacgtccctgggcagcaacctttctgaactcgaccgcctgctgctggaactgaacgct gtacagcataacccgccaggcttccctgcagagactaacagccccttgggaggcaaagct gggcccctgacgaaagagaagcctaagcggaatgggggccggggcctggaggacgtgcgg cccagtgtggagagtctcttggatgaactggagagctccgtgcccagccccgtccctgcc atcactgtgaaccagggcgagatgagcagcccgcagcgcgtcacctccacccaacagcag acacgcatctcggcctcctctgccaccagggagctggacgagctgatggcttcgctgtcg gatttcaagaccagctcctccactgtggctctgagtgccccggggctgtccagctctgct ccgtcctcatactgctcccttcctccttctcctcctcccatgccatctgtatttctgcca cccaccactataccctcccctcgaggccagggccacactccggagttcccttgtactgag cagagtggacgaggccttctacctcctgtagcccccagctggcttgatttggctggtctt ggggtgatgcctgacaccttcaactcaaggtctccctctgtggagggttctctgtgggca gtgggcacagagagtcagggtcgagattggaggcacctgccgaccatcacaagtgagctc tctggggctccccgctgccacactgtaccctgtgctgggagcacagctctccaagagcct ggggagccccaggggccaccagccagcccttcgtgcccagaggaggccttggctgccaca tgggagcagccatgggcttcggaggtattcgggcctgagagaatgcccccctctggagct gctcgaagcttccaagaagtaacagagccagctgtagtggcagtggaccggcaggccatc ttcccagatacctggactctcacggaggaacatggcctacagcaggagaggccaaggcca gagccagggaggctgggaagcagctcccctgcctcagttaccacggagcagctaggtgca aagatgaccgagaggggaagtgtggccaggccaacccagggacctgaaaccccaaggagc ccagagggcaccacggaagctgccacccaggacgggaaggaacagccagagcttccatgt gccatggccatgggcacacccagcaccacggagaggatttccacctctggccaggcaggc accagctcgcatggctgctgggcatggccattggcgggtctgagtatccgatctgtgatc aggaggagctgggagtctggccacgcacaccccatgtcccgggagccctcccctcgccgc cggctggaccctgccaccttgagcaggaccccatcccaggaacagctcatcgcggagctg caggggcggctgggcatccagcctgaggcagaggagccggcggaggcggcggggccctct gcccaggactggctgaccgagggcgtcatcatcactgtgcagccacgtgggaagcgggcc ggggggcagctcgtagagaagttcatggcccaggggaagacagggagcagctcaccccct ggggggcccccgaagcccgggagccagctggacagcatgctggggagcctgcagtctgac ctgaacaagctgggggtcgccacagtcgccaaaggagtctgcggggcctgcaagaagccc atcgccgggcaggttgtgaccgccatggggaagacgtggcaccccgagcacttcgtctgc acccactgccaggaggagatcggatcccggaacttcttcgagcgggatggacagccctac tgtgaaaaggactaccacaacctcttctccccgcgctgctactactgcaacggccccatc ctggataaagtggtgacagcccttgaccggacgtggcaccctgaacacttcttctgtgca cagtgtggagccttctttggtcccgaagggttccacgagaaggacggcaaggcctactgt cgcaaggactacttcgacatgttcgcacccaagtgtggcggctgcgcccgggccatcctg gagaactatatctcagccctcaacacgctgtggcatcctgagtgctttgtgtgccgggaa tgcttcacgccattcgtgaacggcagcttcttcgagcacgacgggcagccctactgtgag gtgcactaccacgagcggcgcggctcgctgtgttctggctgccagaagcccatcaccggc cgctgcatcaccgccatggccaagaagttccaccccgagcacttcgtctgtgccttctgc ctcaagcagctcaacaagggcaccttcaaggagcagaacgacaagccttactgtcagaac tgcttcctcaagctcttctgctag >gi568815586f:120203562_120413033|GENSCAN_predicted_peptide_2|73_aa MDEKECESHPGGWDKINWHRVASPSLASRNQNRVYNVSKGLYATVTESNPSSTTNQQLTS GKLLTSLCFSVVS >gi568815586f:120203562_120413033|GENSCAN_predicted_CDS_2|222_bp atggatgaaaaagaatgtgagagccatcctggtggctgggataaaataaactggcaccgc gtggcatcaccctcattggcgagcaggaatcagaacagagtctacaatgtcagcaaaggg ctgtatgcgacagtcactgaatccaatcccagctccaccactaatcagcaactaacctcg ggcaagttgctcacctctctgtgcttcagtgtcgtcagctag >gi568815586f:120203562_120413033|GENSCAN_predicted_peptide_3|260_aa MAGPRAPAQGRASCPSRGRSSMPRNFSAASLGPERNAPPPRAPAPAPAARPAPGRPLCTR APPPDTPASLLAPPPAHAGPAPAKSPGSGRRWAGRWRPLAAKRAASRTTANLERTFITIR PDSMQCGLVGKIIKRFEQKGFRLVAMKFLPASEEHLKQHYIDLKDRPFFPGLVKYMNSGP VVAMVWEGLNVVKTGRVMLGETNPADSKPGTIRGDFCIQVGRNIIHGSDSVKSAEKEISL RFKPEELVDYKSCAHDWVYE >gi568815586f:120203562_120413033|GENSCAN_predicted_CDS_3|783_bp atggccggaccacgggcgccggctcagggtcgcgctagctgcccgtcccggggccgctcg tctatgccccgcaacttttccgccgcgagcctcggcccggaacggaacgcgccgccgccg cgcgcgcccgcgcccgcgcccgccgcgcgccccgcccccggccgccccctgtgcacgcgc gccccgcccccggacacccccgcgagcttgctggccccgccccctgcgcacgctggtccc gcccccgccaagagcccgggcagtgggcgtcgctgggcggggcggtggcgccccctcgcg gctaagcgggcagcttcccggaccacggccaacctcgagcgcaccttcatcaccatcagg ccggacagcatgcagtgcggcctggtgggcaagatcatcaagcgcttcgagcagaagggg ttccgcctcgtggccatgaagttcctcccggcctctgaagaacacctgaagcagcactac attgacctgaaggaccgcccattcttccctgggctggtgaagtacatgaactcagggccg gtcgtggccatggtctgggaggggctgaacgtcgtgaagacaggccgagtgatgcttggg gagaccaatccagcagattctaagccaggcaccattcgtggggacttttgcattcaggtt ggcaggaacatcattcatggcagtgattcagtaaaaagtgctgaaaaagaaatcagccta cggtttaagcctgaagaactggttgactacaagtcttgtgctcatgactgggtctatgaa taa >gi568815586f:120203562_120413033|GENSCAN_predicted_peptide_4|324_aa MSFALTFRSAKGRWIANPSQPCSKASIGLFVPASPPLDPEKVKELQRFITLSKRLLVMTG AGISTESGIPDYRSEKVGLYARTDRRPIQHGDFVRSAPIRQRYWARNFVGWPQFSSHQPN PAHWALSTWEKLGKLYWLVTQNVDALHTKAGSRRLTELHGCMDRAYCSVSVFLGSRVLCL DCGEQTPRGVLQERFQVLNPTWSAEAHGLAPDGDVFLSEEQVRSFQVPTCVQCGGHLKPD VVFFGDTVNPDKVDFVHKRVKEADSLLVVGSSLQVYSGYRFILTAWEKKLPIAILNIGPT RSDDLACLKLNSRCGELLPLIDPC >gi568815586f:120203562_120413033|GENSCAN_predicted_CDS_4|975_bp atgagctttgcgttgactttcaggtcagcaaaaggccgttggatcgcaaaccccagccag ccgtgctcgaaagcctccattgggttatttgtgccagcaagtcctcctctggaccctgag aaggtcaaagagttacagcgcttcatcaccctttccaagagactccttgtgatgactggg gcaggaatctccaccgaatcggggataccagactacaggtcagaaaaagtggggctttat gcccgcactgaccgcaggcccatccagcatggtgattttgtccggagtgccccaatccgc cagcggtactgggcgagaaacttcgtaggctggcctcaattctcctcccaccagcctaac cctgcacactgggctttgagcacctgggagaaactcggaaagctgtactggttggtgacc caaaatgtggatgctttgcacaccaaggcggggagtcggcgcctgacagagctccacgga tgcatggacagggcatactgttcagtcagcgtcttccttggttccagggtcctgtgcttg gattgtggggaacagactccccggggggtgctgcaagagcgtttccaagtcctgaacccc acctggagtgctgaggcccatggcctggctcctgatggtgacgtctttctctcagaggag caagtccggagctttcaggtcccaacctgcgttcaatgtggaggccatctgaaaccagat gtcgttttcttcggggacacagtgaaccctgacaaggttgattttgtgcacaagcgtgta aaagaagccgactccctcttggtggtgggatcatccttgcaggtatactctggttacagg tttatcctcactgcctgggagaagaagctcccgattgcaatactgaacattgggcccaca cggtcggatgacttggcgtgtctgaaactgaattctcgttgtggagagttgctgcctttg atagacccatgctga >gi568815586f:120203562_120413033|GENSCAN_predicted_peptide_5|217_aa MVPCIPATPAPAVAKRSQETRECRNKDTRQRDKRKDSWARGTTTTKSRRPVVALNVWLHC YLLDTKQKGQGKECESSPMILAAADSGISPRAVWQFRKMIKCVIPGSDPFLEYNNYGCYC GLGGSGTPVDELDKCCQTHDNCYDQAKKLDSCKFLLDNPYTHTYSYSCSGSAITCSSKNK ECEAFICNCDRNAAICFSKAPYNKAHKNLDTKKYCQS >gi568815586f:120203562_120413033|GENSCAN_predicted_CDS_5|654_bp atggtgccctgcatcccagccactccagctccagctgtggctaaaaggagccaagagacg agagagtgtagaaataaagacacaagacaaagagataaaagaaaagacagctgggcccgg ggaaccactaccaccaagtcacggagaccggtagtggccctgaatgtctggctgcactgt tatttattggatacaaagcaaaaggggcagggtaaagagtgcgagtcatctccaatgata ctggccgccgccgacagcggcatcagccctcgggccgtgtggcagttccgcaaaatgatc aagtgcgtgatcccggggagtgaccccttcttggaatacaacaactacggctgctactgt ggcttggggggctcaggcacccccgtggatgaactggacaagtgctgccagacacatgac aactgctatgaccaggccaagaagctggacagctgtaaatttctgctggacaacccgtac acccacacctattcatactcgtgctctggctcggcaatcacctgtagcagcaaaaacaaa gagtgtgaggccttcatttgcaactgcgaccgcaacgctgccatctgcttttcaaaagct ccatataacaaggcacacaagaacctggacaccaagaagtattgtcagagttga >gi568815586f:120203562_120413033|GENSCAN_predicted_peptide_6|379_aa MADPGPAGPPRSPGPRPLRPGARRSRGPFVSLLLPQQDVHRGTQLADYAGPARPSASRGP GGRQEAQRERGEGEGLREYFGQFGEVKECLVMRDPLTKRSRGFGFVTFMDQAGVDKVLAQ SRHELDSKTIDPKVAFPRRAQPKMVTRTKKIFVGGLSVNTTVEDVKQYFEQFGKVDDAML MFDKTTNRHRGFGFVTFESEDIVEKVCEIHFHEINNKMVECKKAQPKEVMSPTGSARGRS RVMPYGMDAFMLGIGMLGYPGFQATTYASRSYTGLAPGYTYQFPGQDTDGVAQAIPLTAY GPMAAAAAAAAVVRGTGSTPSRTGGFLGTTSPGPMAELYGAANQDSGVSSYISAASPAPS TGFGHSLGRPSLQLTEDHE >gi568815586f:120203562_120413033|GENSCAN_predicted_CDS_6|1140_bp atggccgatccgggtccggccgggcctccccggagcccgggcccgcgccccctgcgccct ggcgcccggcgctcacgcgggccctttgtgtctctcctcctcccgcagcaagatgttcat cgggggactcagttggcagactacgcaggcccagctcggccctcggcttcccggggcccc ggtgggcgccaggaggctcagcgggaacggggcgagggcgaagggctgcgcgaatacttc ggccagttcggggaggtgaaggagtgtctggtgatgcgggaccccctgaccaagagatcc aggggtttcggcttcgtcactttcatggaccaggcgggggtggataaagtgctggcgcaa tcgcggcacgagctcgactccaaaacaattgaccctaaggtggccttccctcggcgagca cagcccaagatggtgactcgaacgaagaagatctttgtgggggggctgtcggtgaacacc acggtggaggacgtgaagcaatattttgagcagtttgggaaggtggacgacgccatgctg atgtttgacaaaaccaccaaccggcaccgagggttcgggtttgtcacgtttgagagtgag gacatcgtggagaaagtgtgtgaaattcattttcatgaaatcaacaacaaaatggtggaa tgtaagaaagctcagccaaaggaggtgatgtcgccaacgggctcagcccgggggaggtct cgagtcatgccctacggaatggacgccttcatgctgggcatcggcatgctgggttaccca ggtttccaagccacaacctacgccagccggagttatacaggcctcgcccctggctacacc taccagttccccggtcaggacacagatggtgtggcccaagccattcctctcactgcctac ggaccaatggcggcggcagcggcggcagcggctgtggttcgagggacaggttcgactccc agccgcacagggggcttcctggggaccaccagccccggccccatggccgagctctacggg gcggccaaccaggactcgggggtcagcagttacatcagcgccgccagccctgcccccagc accggcttcggccacagtcttgggcgccccagcctgcagctgactgaggaccacgagtga >gi568815586f:120203562_120413033|GENSCAN_predicted_peptide_7|64_aa MGRARWLTPVIPALLEAEVPIMHPLSACSLCHPVNALVRSRDADWLRAGPWAQRCVSIIV RDCE >gi568815586f:120203562_120413033|GENSCAN_predicted_CDS_7|195_bp atgggccgggcacggtggctcacacctgtaatcccagcacttttggaggctgaggtcccc atcatgcacccgctcagtgcttgttctctctgccatcctgtcaatgcccttgtgagatca cgtgatgccgactggctccgagctgggccctgggctcagcgctgtgtgagcatcattgta cgggactgtgaatag