GENSCAN 1.0	Date run: 6-Nov-116	Time: 23:06:09

Sequence gi568815575f:119474712_119683252 : 208541 bp : 45.40% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  11583  11708  126  2  0   63   74    66 0.314   2.86
 1.02 Intr +  13156  13319  164  0  2   63   89    70 0.754   3.37
 1.03 Intr +  14638  14723   86  1  2   73   53    73 0.595   1.76
 1.04 Intr +  29063  29208  146  0  2  121   45    64 0.036   5.40
 1.05 Term +  45476  45598  123  1  0   64   42    66 0.156  -2.02
 1.06 PlyA +  45941  45946    6                               1.05

 2.08 PlyA -  48466  48461    6                               1.05
 2.07 Term -  65078  65016   63  2  0  106   41    66 0.767   1.59
 2.06 Intr -  66709  66617   93  1  0   95  105   148 0.998  17.36
 2.05 Intr -  67883  67794   90  2  0  109   87   180 0.996  20.19
 2.04 Intr -  69780  69642  139  2  1   68   86    99 0.998   8.17
 2.03 Intr -  70793  70752   42  1  0   90  106    46 0.939   3.96
 2.02 Intr -  85674  85557  118  1  1  100   94   129 0.874  14.22
 2.01 Init -  90644  90521  124  2  1   93   92   109 0.997  12.13
 2.00 Prom -  94570  94531   40                              -5.96

 3.16 PlyA -  95316  95311    6                               1.05
 3.15 Term -  97592  97522   71  0  2  114   50    32 0.076   0.10
 3.14 Intr - 102060 102025   36  1  0  125   98    11 0.126   4.03
 3.13 Intr - 116523 114775 1749  1  0   56   89  1194 0.498 103.62
 3.12 Intr - 117802 117672  131  0  2   43  101    73 0.130   4.44
 3.11 Intr - 131324 131020  305  2  2   14   94   248 0.204  13.39
 3.10 Intr - 131819 131690  130  1  1   75   77   110 0.365   9.30
 3.09 Intr - 154797 154607  191  0  2   69  106   313 0.942  29.58
 3.08 Intr - 158781 158649  133  2  1  -19   84   145 0.466   4.05
 3.07 Intr - 162484 162316  169  2  1   52   86   385 0.679  33.70
 3.06 Intr - 166077 165981   97  0  1  104   96   104 0.621  12.38
 3.05 Intr - 173937 173881   57  0  0   68   86    39 0.623   0.78
 3.04 Intr - 175387 175226  162  1  0  143   92   273 0.999  33.37
 3.03 Intr - 178329 178143  187  2  1  117   75   245 0.999  25.79
 3.02 Intr - 188966 188771  196  0  1   93  103   179 0.971  18.37
 3.01 Intr - 200957 200843  115  2  1   66   92   104 0.905   8.62

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575f:119474712_119683252|GENSCAN_predicted_peptide_1|214_aa
MGSFAHKVGWVLDDEHRCGQWVRQSTQMWCEVWVEVEEEETQSSALGWSMGLGAVEQGVV
LVGEAWAAQEPMEWVGGSGMESCRSRALPYGKTAKARCFSYLNKHKEDINAIHFPCPISR
SCNVRDLSACQVCIPESMQVSLALFHPFPGKKRPPTHGKATVVNLSPELSLKVAKNAHLF
TSTQNIFNFKELGDSLKLIPEAHGAQVKSFYCVG

>gi568815575f:119474712_119683252|GENSCAN_predicted_CDS_1|645_bp
atggggagcttcgcacacaaagtagggtgggtgttggacgatgaacacagatgtggacaa
tgggttagacaatcaacacagatgtggtgtgaggtgtgggttgaagtggaggaggaggag
acacagtcctcagcccttgggtggtcgatgggactgggcgccgtggagcagggtgtggtg
ctcgttggggaggcttgggccgcacaggagcccatggagtgggtgggaggctcaggcatg
gagagctgcaggtcccgagccctgccctacgggaagacagctaaggctcggtgcttcagc
tatctgaacaagcacaaagaggatatcaatgccatccatttcccctgccctattagcagg
agctgcaacgtcagagacttgagtgcttgccaggtgtgcattccagagtcaatgcaagtg
tccctggctctcttccaccccttcccaggcaaaaagaggccacccacccatggaaaagcc
actgttgtcaatctgtcgccagaactctcgctcaaggtggcaaaaaatgcccatctgttc
acaagcacacagaacatatttaatttcaaggagcttggggactccctgaagctcatccct
gaagcccatggagcccaggttaagagcttctattgtgtaggctag

>gi568815575f:119474712_119683252|GENSCAN_predicted_peptide_2|222_aa
MPKVVSRSVVCSDTRDREEYDDGEKPLHVYYCLCGQMVLVLDCQLEKLPMRPRDRSRVID
AAKHAHKFCNTEDEETMYLRRPEGIERQYRKKCAKCGLPLFYQSQPKNAPVTFIVDGAVV
KFGQGFGKTNIYTQKQEPPKKVMMTKRTKDMGKFSSVTVSTIDEEEEEIEAREVADSYAQ
NAKVIEKQLERKGMSKRRLQELAELEAKKAKMKGTLIDNQFK

>gi568815575f:119474712_119683252|GENSCAN_predicted_CDS_2|669_bp
atgccgaaagtagtgtctcggtcagtagtctgctctgacactcgggaccgggaggaatat
gacgacggcgagaagcccctccatgtttactactgtttgtgcggccagatggtcctagtg
ctggactgccagttagagaaattgcccatgaggccccgggaccggtcccgtgtgattgat
gctgccaaacatgcccataagttttgtaacacagaagatgaggagactatgtatctgcgg
agacctgaaggcattgaacgacagtacaggaagaaatgtgcaaagtgtggactgccgctc
ttctaccaatcccagccaaagaatgctcctgttaccttcattgtggatggagcagtagtc
aagtttggccagggctttgggaaaacgaacatatatactcagaaacaagagcctcctaag
aaggtgatgatgaccaaacggaccaaagacatgggcaagttcagttctgtcaccgtgtct
accattgatgaagaggaagaggagattgaggctagggaagttgctgactcatatgcacag
aatgccaaagtgattgaaaaacagctggagcgcaaaggcatgagcaagaggcgactgcaa
gagctggctgaattggaagccaagaaagcgaaaatgaaggggaccttgattgacaaccag
ttcaaataa

>gi568815575f:119474712_119683252|GENSCAN_predicted_peptide_3|1242_aa
GEGCRTVPLAGHVGFDSLPDQLVNKSVSQGFCFNILCVGETGLGKSTLMDTLFNTKFEGE
PATHTQPGVQLQSNTYDLQESNVRLKLTIVSTVGFGDQINKEDSYKPIVEFIDAQFEAYL
QEELKIRRVLHTYHDSRIHVCLYFIAPTGHSLKSLDLVTMKKLDSKVNIIPIIAKADAIS
KSELTKFKIKITSELVSNGVQIYQFPTDDESVAEINGTMNYTRDVFVITITILDKSGSRA
HLPFAVIGSTEELKIGNKMMRARQYPWGTVQVENEAHCDFVKLREMLIRVNMEDLREQTH
TRHYELYRRCKLEEMGFKDTDPDSKPFSLQETYEAKRNEFLGELQKKEEEMRQMFVQRVK
EKEAELKEAEKELHEKFDRLKKLHQDEKKKLEDKKKSLDDEVNAFKQRKTAAELLQSQGS
QAGGSQTLKRDKEKKNASKARSRHWREKSTSREILRRDLEVREQPRRKRTGLTTPDGTRS
ASSLGSLLGGGEDGWRTSAVGGRLPVAPPLPPLPPPPLPPLPPPPPEPVLEQWRYSHESD
WQWALRRSFICRHLHSYPGAALDQLLALSAAWTNHVFLGCRYSPRLMEKILQMAEGIDIG
EMPSYDLVLSKPSKGQKRHLSTCDASSSKDERQEDPYGPQTKEVNEQTHFASMPRDIYQD
YTQDSFSIQDGNSQYCDSSGFILTKDQPVTANMYFDSGNPAPSTTSQQANSQSTPEPSPS
QTFPESVVAEKQYFIEKLTATIWKNLSNPEMTSGSDKINYTYMLTRCIQACKTNPEYIYA
PLKEIPPADIPKNKKLLTDGYACEVRCQNIYLTTGYAGSKNGSRDRATELAVKLLQKRIE
VRVVRRKFKHTFGEDLVVCQIGMSSYEFPPALKPPEDLVVLGKDASGQPIFNASAKHWTN
FVITENANDAIGILNNSASFNKMSIEYKYEMMPNRTWRCRVFLQDHCLAEGYGTKKTSKH
AAADEALKILQKTQPTYPSVKSSQCHTGSSPRGSGKKKDIKDLVVYENSSNPVCTLNDTA
QFNRMTVEYVYERMTGLRWKCKVILESEVIAEAVGVKKTVKYEAAGEAVKTLKKTQPTVI
NNLKKGAVEDVISRNEIQGRSAEEAYKQQIKEDNIGNQLLRKMGWTGGGLGKSGEGIREP
ISVKEQHKREGLGLDVERVNKIAKRDIEQIIRNYARSESHTDLTFSRELTNDERKQIHQI
AQKYGLKTLKSNDIYPGGKDFGIICIIKVSGFGAFRILDFQI

>gi568815575f:119474712_119683252|GENSCAN_predicted_CDS_3|3729_bp
ggtgaaggttgccgaactgtccccctggctggacatgtggggtttgacagcttgcctgac
cagctggtgaataagtccgtcagccagggcttctgcttcaacatcctgtgcgtgggagag
acaggtttgggcaagtccaccctcatggacaccctgttcaacaccaaattcgaaggggag
ccagccacccacacacagccgggtgtccagctccagtctaatacctatgacctccaagag
agcaacgtgaggctaaagctcacgatcgttagcacagttggctttggggaccagatcaac
aaagaggacagctacaagcctatcgtggaattcatcgatgcacaattcgaggcctacctg
caggaagagctaaagatccgaagagtgctacacacctaccatgactcccgaatccatgtc
tgcttgtatttcattgcccccacgggtcattccctgaagtctctggacctagtgactatg
aagaagctggacagtaaggtgaacatcatccccatcattgccaaagcagatgccatttcg
aagagtgagctaacaaagttcaaaatcaaaatcaccagcgagcttgtcagcaacggagtc
cagatctatcagtttcctacagatgatgagtcggtggcagagatcaatggaaccatgaac
tacacccgagatgtgtttgttatcaccatcaccatcttagacaaatctggctcaagggcc
cacctgccgtttgctgtcattggcagcacagaagaactgaagataggcaacaagatgatg
agggcgcggcagtatccttggggcactgtgcaggttgaaaacgaggcccactgcgacttt
gtgaagctgcgggagatgctgattcgggtcaacatggaggatctgcgggagcagacccac
acccggcactatgagctgtatcgccgctgtaagctggaggagatgggcttcaaggacacc
gaccctgacagcaaacccttcagtttacaggagacatatgaggccaaaaggaacgagttc
ctaggggaactccagaaaaaagaagaggagatgagacagatgttcgtccagcgagtcaaa
gagaaagaagcggagctcaaagaggcagagaaagagctgcacgagaagtttgaccgtctg
aagaaactgcaccaggacgagaagaagaaactggaggataagaagaaatccctggatgat
gaagtgaatgctttcaagcaaagaaagacggcggctgagctgctccagtcccagggctcc
caggctggaggctcacagactctgaagagagacaaagagaagaaaaatgcgtcaaaggcc
cgtagtcgtcactggagggaaaaaagtacgtcgcgcgagattctgcgacgggatttggaa
gttagggaacagccgcggcgcaagcgcactggcctcacaaccccggacggcacgcggtcg
gcttcgtccctggggtccctgcttgggggcggagaagatggctggaggacgtctgctgtt
ggggggcgacttcctgtcgcgccgccgctgccccccctcccgccgccgccgctgccgccc
ctcccgccgcccccgcccgagccagtgctggagcagtggcgctatagccacgaaagtgac
tggcagtgggctctgcggcgcagcttcatctgtcggcacctgcacagctatcccggggct
gccctcgaccagctcctcgcgctctccgccgcctggaccaaccacgtcttcctgggctgc
aggtacagcccacgcttgatggaaaaaattctccaaatggctgaaggtattgatattggg
gagatgccttcatatgatctggtgctgtccaaaccttccaaaggtcaaaaacgccacctc
tcaacatgtgatgctagtagttcaaaagatgaaagacaggaagatccttatggccctcaa
acaaaagaggtaaatgaacaaacacattttgccagcatgccaagagacatctaccaagat
tatactcaagactctttcagtatacaagatgggaattctcagtattgtgattcatcagga
ttcattctcacaaaagaccagcctgtaacagccaacatgtattttgacagtgggaaccct
gccccaagcaccacatcacagcaggcaaactctcagtcaactcctgagccttcaccatca
cagacatttcccgagtctgtggtagccgagaagcagtattttattgaaaaattaacggcg
acaatctggaagaacctttctaatccagaaatgacttctggatctgataaaattaattat
acatatatgttaactcgttgtattcaggcgtgtaagacaaatcctgagtatatatatgct
cctttaaaggaaattcctcctgccgacatccccaaaaataaaaaacttctaactgatggc
tatgcttgtgaagttagatgccaaaatatctacttaactacaggttatgctggcagcaag
aatgggtccagggatcgagctacagagctagctgtaaaactcttgcagaaacgtattgaa
gttagagttgtccggcggaaattcaagcatacatttggagaggacctcgtggtgtgtcag
attggcatgtcctcctatgaatttcctccagctctgaagccaccagaagacctggtggtg
ctgggtaaagatgcttccgggcagccaatttttaatgcttctgccaaacactggaccaat
tttgtcattacagaaaatgcaaatgatgcaattggtatccttaacaattctgcctcattc
aacaagatgtcaattgaatacaaatatgagatgatgccaaatcgcacatggcgttgtcga
gtgtttttacaagatcactgcttagctgaaggttatggaaccaagaaaacaagtaaacat
gcagctgccgacgaggctttgaaaattcttcaaaaaacacagcccacttatccatctgtc
aaaagttcacaatgccatacaggctcttcacccagaggatctggaaagaagaaagatata
aaggatcttgtagtttatgagaattcttcaaatcccgtgtgcacgctgaacgacacagct
cagtttaaccgaatgacagttgagtatgtctatgaaaggatgacaggcctccgctggaaa
tgcaaagtgattctagagagtgaagtaattgcagaagcagttggggtgaagaaaactgtc
aaatatgaagctgctggggaagctgtgaaaaccctcaaaaagacccagccaactgtcatt
aacaacttgaagaaaggagctgttgaagatgtgatttcaagaaatgaaattcagggccgc
tcagcagaggaggcttacaaacagcaaatcaaagaagataatattggaaatcagctgctg
agaaagatgggttggactggtggtggtttaggtaaatctggtgagggcatacgggagcct
atctcagtgaaagagcagcataagcgggaagggcttggtctggatgtagagagggtgaat
aaaattgccaagagagatattgaacagatcatcagaaactacgcccgctccgagagccac
acagatttgactttctctagagagctgactaatgatgaacggaagcaaatacatcagatt
gcccagaagtatggtcttaagaccctgaagagtaatgatatttacccagggggtaaagat
tttggaattatttgcattataaaagtttcaggttttggagcatttcggattttagatttt
cagatttga