GENSCAN 1.0 Date run: 6-Nov-116 Time: 12:58:33 Sequence gi568815577f:6460783_6663961 : 203179 bp : 48.14% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5236 5626 391 1 1 67 116 186 0.255 14.03 1.02 Intr + 7642 7714 73 0 1 57 71 81 0.648 2.28 1.03 Intr + 10546 10774 229 2 1 59 64 202 0.804 11.83 1.04 Intr + 12434 12522 89 2 2 77 72 70 0.656 3.91 1.05 Intr + 14602 14651 50 2 2 75 80 24 0.235 -1.30 1.06 Term + 18478 18612 135 0 0 117 44 69 0.394 3.42 1.07 PlyA + 19307 19312 6 1.05 2.10 PlyA - 20716 20711 6 1.05 2.09 Term - 24134 23987 148 1 1 93 52 103 0.681 4.57 2.08 Intr - 25448 25356 93 1 0 60 81 123 0.983 7.88 2.07 Intr - 25673 25540 134 2 2 87 84 234 0.999 22.34 2.06 Intr - 26423 26325 99 2 0 82 97 174 0.987 18.01 2.05 Intr - 26630 26581 50 0 2 118 78 51 0.973 5.50 2.04 Intr - 32328 32262 67 0 1 113 93 11 0.695 2.58 2.03 Intr - 38041 37921 121 0 1 -23 41 144 0.405 -0.90 2.02 Intr - 38442 38347 96 2 0 57 89 207 0.686 16.82 2.01 Init - 51880 50754 1127 1 2 60 53 298 0.510 17.17 2.00 Prom - 53138 53099 40 -4.66 3.00 Prom + 67928 67967 40 -5.26 3.01 Init + 71225 71361 137 1 2 69 84 113 0.925 8.61 3.02 Intr + 73089 73218 130 0 1 67 41 75 0.835 1.40 3.03 Term + 74736 74816 81 2 0 96 33 96 0.770 2.59 3.04 PlyA + 76567 76572 6 1.05 4.00 Prom + 83544 83583 40 -5.66 4.01 Init + 88361 88428 68 1 2 69 86 68 0.409 5.25 4.02 Intr + 91667 91872 206 2 2 109 75 67 0.698 6.34 4.03 Intr + 96807 96913 107 1 2 90 65 53 0.359 3.13 4.04 Intr + 99911 100189 279 1 0 52 94 464 0.603 41.07 4.05 Intr + 101419 101541 123 0 0 61 99 333 0.764 32.48 4.06 Term + 102973 103182 210 0 0 125 47 337 0.999 30.59 4.07 PlyA + 103683 103688 6 1.05 5.03 PlyA - 105584 105579 6 -3.74 5.02 Term - 106555 106377 179 2 2 27 42 194 0.666 6.55 5.01 Init - 112632 112536 97 0 1 104 106 29 0.243 6.87 5.00 Prom - 115964 115925 40 -5.46 6.00 Prom + 117193 117232 40 -6.76 6.01 Init + 117517 117554 38 0 2 59 94 36 0.255 0.78 6.02 Intr + 118414 118518 105 1 0 101 64 73 0.390 5.53 6.03 Intr + 174805 174871 67 1 1 91 99 43 0.325 4.61 6.04 Intr + 182856 182962 107 2 2 68 70 28 0.036 -1.99 6.05 Intr + 185274 185424 151 0 1 45 -6 169 0.243 3.36 6.06 Term + 201370 201567 198 0 0 36 37 150 0.114 2.10 6.07 PlyA + 201585 201590 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 85131 84655 477 0 0 71 43 191 0.843 7.00 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577f:6460783_6663961|GENSCAN_predicted_peptide_1|322_aa XLGAKAASNDPGGWCFQGLGTTDLSENPKALDLLFPRKMHPLSEVRVPRFSFYPNLVRSR RRERSSTFLRGSRESPTGRDSPSGTGPVWPGARMLRRRIFQKAVDPRPRQTQAAQRLAFG PSPRPGALSRKRKTRTVPISSGQRNVLRLRSVPGSEYNTYAESAHVMSGQRINRYPTGGR AEGRGLGLHHPLVAGRLGPQNWRRLEEEEEEEEGKEEEEEGKEEEEEGKEERIFLTLKEG HDLTIPLQKRKTSVQEQGAELGTGPASLHAVWAQADPQEICPHCSTRQDVPSSSSFRTYL LAAAAHLYLHHYLLLYSLLFPD >gi568815577f:6460783_6663961|GENSCAN_predicted_CDS_1|969_bp nnactgggggccaaagcggcttcaaacgaccctggagggtggtgctttcaaggtcttggg accacggacttgtctgaaaatccgaaggcgttggatcttctcttccccagaaaaatgcac ccgctttcagaggttcgtgtcccacgcttttctttctatcccaaccttgtaagaagccgc cgccgtgagcggagcagcaccttcctccgcgggtcgcgggagtcacctacgggaagggac tctccgtctggcacaggccctgtctggcctggggcgcgcatgctccgccgccgaatcttc cagaaagccgtggacccgaggccccggcagacgcaggcggcccagcgccttgcttttggc ccctcgcctcgccctggagccctctctcgcaagcgaaagacacggacggtgcccatcagc tctgggcagaggaacgtgctgcgcctccgcagcgtccccggctcagagtataacacgtat gcagaaagtgcacatgtcatgagtggacagaggataaaccgctaccccactggggggcga gcagaggggagaggcctgggcctgcaccatcccctggtggccggaaggctgggccctcag aactggaggagattggaggaggaggaggaggaagaggaagggaaggaggaggaagaggaa gggaaggaggaggaagaggaagggaaggaggagaggatcttcctaacactgaaggagggg cacgacctgaccatccctttacagaagagaaagacgagtgtgcaggaacagggagcagag ctgggcacagggccagcgtccctgcatgctgtgtgggcccaagctgacccgcaggaaatc tgccctcactgctccacgcgccaggacgtgccctcctcctcctctttccgcacctatctc ctggctgcggctgctcacttgtacctgcaccattacttactgctgtattctttgcttttt ccagactga >gi568815577f:6460783_6663961|GENSCAN_predicted_peptide_2|644_aa MSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSCIGRINIV KMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRTHIAKTILSQKNKAGGITLP DFKLYYKATVTKTAWYWYQNRDIDKWNRTEPSETIPHIYNHLIFDKPDKNKKWGKDSLFN KWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKD FMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKIFAIYSSDKGLISRI YKELKQTYKKKTNNPIKKWAKDMNRHFSKEDIYAANRHMKKCLSSLAIREMQIKTIMRYH LTPVRMAIIKKSGNNRVGGVGSSVDGSGGGGWEMAEYLASIFGTEKDNRRLPEAASSRAE SCLEHPGLLDVDTFVPTFGCECPVEQALTILIQNIYRNPQNSAQTADGSHCAVSDVEMQE HYDEFFEEVFTEMEEKYGEVEEMNVCDNLGDHLVGNVYVKFRREEDAEKAVIDLNNRWFN GQPIHAELSPVTDFREACCRQYEMGECTRGGFCNFMHLKPISRELRRELYGRRRKKHRSR SRSRERRSRSRDRGRGGGGGGGGGGGGRERDRRRSRDRERSGRF >gi568815577f:6460783_6663961|GENSCAN_predicted_CDS_2|1935_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt acaagagatgtgaaggacctcttcaaggagaactacaaaccactgctcaaagaaataaaa gaggacacaaacaaatggaagaacattccatgctcatgcataggaagaatcaatatcgtg aaaatggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctacca atgactttcttcacagaattggaaaaaactactttaaagtttatatggaaccaaaaaaga acccacattgccaagacaatcctaagccaaaagaacaaagctggaggcatcacactacct gacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac agagatatagacaaatggaatagaacagagccctcagaaacaataccacacatctacaac catctgatctttgacaaacctgacaaaaacaagaaatggggaaaggattccctatttaat aaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttcctt acaccttatacaaaaattaattcgagatggattaaagacttaaatgttagacctaaaacc ataaaaaccctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggac ttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta attaaactaaagagcttctgcacagcaaaagagactaccatcagagtgaacaggcaacct acagaatgggagaaaatttttgcaatctactcatctgacaaagggctaatatccagaatc tacaaagaactcaaacaaacttacaagaaaaaaacaaacaaccccatcaaaaagtgggca aaggatatgaacagacacttctcaaaagaagacatttatgcagccaacagacacatgaaa aaatgcttatcatcactggccatcagagaaatgcaaatcaaaaccataatgagataccat ctcacaccagttagaatggcgatcattaaaaagtcaggaaacaacagggtcggcggcgtc ggcagcagtgtcgacggcagcggcggcggcgggtgggaaatggcggagtatctggcctcc atcttcggcaccgagaaagacaatcgccggctcccggaagccgcctcgagccgcgctgaa agttgcctggagcatccgggccttttggacgtggacacgtttgttccaactttcggctgt gaatgtcctgtcgagcaagctctgaccatcttgattcaaaacatctatcgtaatccccaa aacagtgcacagacggctgacggctcacactgtgccgtgagcgatgtggagatgcaggaa cactatgatgagttttttgaggaggtttttacagaaatggaggagaagtatggggaagta gaggagatgaacgtctgtgacaacctgggagaccacctggtggggaacgtgtacgtcaag tttcgccgtgaggaagatgcggaaaaggctgtgattgacttgaataaccgttggtttaat ggacagccgatccacgccgagctgtcacccgtgacggacttcagagaagcctgctgccgt cagtatgagatgggagaatgcacacgaggcggcttctgcaacttcatgcatttgaagccc atttccagagagctgcggcgggagctgtatggccgccgtcgcaagaagcatagatcaaga tcccgatcccgggagcgtcgttctcggtctagagaccgtggtcgtggcggtggcggtggc ggtggtggaggtggcggcggacgggagcgtgacaggaggcggtcgagagatcgtgaaaga tctgggcgattctga >gi568815577f:6460783_6663961|GENSCAN_predicted_peptide_3|115_aa MIYPIFGMSVSDGGGSGGGEITESMSAKHTLLGAPPPTAQLKDKRPQGDSRKGDGVTGQD PSYEWELDQELSVLSHLLPVLGPLSETLQAMELTILPLAAVVLFRGTVIAFISIL >gi568815577f:6460783_6663961|GENSCAN_predicted_CDS_3|348_bp atgatctacccaatatttgggatgtccgtgagtgatggcggtggtagtggtggtggtgaa atcacggaatcaatgtctgcaaagcacacgctcttgggagcacctcctcccaccgcacag ctgaaagacaagcgaccacagggagacagcagaaagggagatggagtcaccggccaggac cccagctatgagtgggagctggaccaggagttgagtgtcctgagccacctcctccctgtg ctgggtccactctcagaaacactccaggccatggagctgaccatcctgcctctggcagct gtggtgctgtttaggggaacggtcattgccttcatctccattttatag >gi568815577f:6460783_6663961|GENSCAN_predicted_peptide_4|330_aa MPTTLVRPEITEESEHPGDAACPSPMQPTLGPAQPLGRRGWSPCAVPSAAHPAERPQAAG LHTCDRPEAPAAGPTRLQFPSRQPMALRPLPDSGAGSLRSPLQARPILWQSLDVPRRPLS RQALQRVGGLGLGSTLRCPEAPLTPASLQVPVVPKLNMDVTIQHPWFKRTLGPFYPSRLF DQFFGEGLFEYDLLPFLSSTISPYYRQSLFRTVLDSGISEVRSDRDKFVIFLDVKHFSPE DLTVKVQDDFVEIHGKHNERQDDHGYISREFHRRYRLPSNVDQSALSCSLSADGMLTFCG PKIQTGLDATHAERAIPVSREEKPTSAPSS >gi568815577f:6460783_6663961|GENSCAN_predicted_CDS_4|993_bp atgcccaccacgctcgtgaggccggagatcaccgaggaatccgagcacccgggggacgct gcatgcccgagccccatgcagcccacactgggtcctgcccagcccctgggccggagggga tggtccccctgcgccgtccccagcgctgcccaccctgcagaacgtcctcaggcggccggg ctccacacctgcgacaggcccgaggccccggctgcaggccccactcggctgcagttcccg tcgaggcagccaatggccttgaggcctctcccagactcaggcgctggctcactcagaagc cccctgcaggcccggccaatcctgtggcagagcctcgacgtcccacggcggcctctgagc cgccaggccctacagcgtgtgggagggctcggccttggctccacactgcgctgcccagag gccccgctgactcctgccagcctccaggtccccgtggtaccaaagctgaacatggatgtg accatccagcacccctggttcaagcgcaccctggggcccttctaccccagccggctgttc gaccagtttttcggcgagggcctttttgagtatgacctgctgcccttcctgtcgtccacc atcagcccctactaccgccagtccctcttccgcaccgtgctggactccggcatctctgag gttcgatccgaccgggacaagttcgtcatcttcctcgatgtgaagcacttctccccggag gacctcaccgtgaaggtgcaggacgactttgtggagatccacggaaagcacaacgagcgc caggacgaccacggctacatttcccgtgagttccaccgccgctaccgcctgccgtccaac gtggaccagtcggccctctcttgctccctgtctgccgatggcatgctgaccttctgtggc cccaagatccagactggcctggatgccacccacgccgagcgagccatccccgtgtcgcgg gaggagaagcccacctcggctccctcgtcctaa >gi568815577f:6460783_6663961|GENSCAN_predicted_peptide_5|91_aa MGSPKIQSLLSSSNHGLPWLTLDICHQGPTNVVLAEPVLLEDPTGGAEEIRQPTSQGKFC LGWPWCSVRLSKEPDRGSGEPPTGLADPAKA >gi568815577f:6460783_6663961|GENSCAN_predicted_CDS_5|276_bp atgggcagccccaaaatacaaagtctcctatccagttccaaccacggcctgccctggctg acactggacatttgccaccaggggcccacaaacgtggtccttgctgagcctgtgctgctg gaagacccaactgggggagcagaggagatcagacagcccaccagccaggggaagttctgc ctgggctggccctggtgctccgtgcgactctccaaggagcctgatcgaggatctggtgaa ccacccacgggccttgcagaccctgccaaggcctag >gi568815577f:6460783_6663961|GENSCAN_predicted_peptide_6|221_aa MQPAKEDSSAQVRVLRPHSQVPCEEEVEREERGLGPTLTDLECGPWARHQSLLTPTIEGA FVTVRRYRALLLQQSSVGTRNRAGCNHLQAITNSMAQCRSTMEQISIILKPKLGKDTPEK ENFRPVSLMNIDAKILNKILANRIQQHIKKLIHHDLASIIQIPKPNIDTTTTTTTTTTHH ANVFGKHCAKILNKILANQIQQHIKKFICNNGVGFVPRMQG >gi568815577f:6460783_6663961|GENSCAN_predicted_CDS_6|666_bp atgcaacctgcaaaagaggacagcagtgcccaagtcagggtcctgcggccccactcacag gtgccctgtgaggaggaggtggagcgagaggagaggggactgggcccgacactcactgat cttgaatgcggcccatgggccagacatcaaagtctgctcacaccaaccatagaaggagcc tttgtcactgtcagaagatacagagctttgctgctccagcaaagctcagtgggcaccaga aacagagctggctgtaaccacctccaggccatcactaactctatggcccaatgcaggagc actatggaacaaatcagcatcatcctgaaaccaaaacttggcaaagacacaccagaaaaa gaaaatttcaggcccgtatccctgatgaatatcgatgcgaaaatcctcaataaaatactg gcaaaccgaatccagcagcacatcaaaaagcttatccaccacgatctagccagcatcatc cagataccaaaacctaacatagatactacaacaacaacaacaacaacaacaacacatcat gccaatgtctttggtaaacactgtgcaaaaatcctcaataaaatactggcaaaccaaatc cagcagcatattaaaaagttcatctgcaacaatggagttggctttgtccccaggatgcaa ggttga