GENSCAN 1.0	Date run: 5-Nov-116	Time: 06:31:41

Sequence gi568815593r:83541499_83773523 : 232025 bp : 37.25% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +     80    770  691  1  1   34  110   669 0.744  58.96
 1.02 Intr +   4039   4152  114  2  0   56   91    88 0.891   5.40
 1.03 Intr +   6473   6586  114  0  0   72  116    40 0.960   4.70
 1.04 Intr +  11866  12024  159  2  0  114   44   168 0.976  14.04
 1.05 Intr +  13458  13540   83  1  2   72   99   114 0.905   9.34
 1.06 Intr +  30918  31062  145  2  1   64   95   113 0.186   8.53
 1.07 Intr +  38482  38664  183  2  0   48  103   115 0.924   7.84
 1.08 Intr +  50468  50626  159  0  0   68   64   100 0.098   4.64
 1.09 Term +  80941  81005   65  2  2  122   46    59 0.418   2.17
 1.10 PlyA +  82297  82302    6                               1.05

 2.11 PlyA -  83620  83615    6                               1.05
 2.10 Term - 100287  99998  290  1  2  139   42   297 0.996  24.75
 2.09 Intr - 103167 102865  303  1  0  115   95   168 0.999  15.84
 2.08 Intr - 111326 110955  372  0  0   61   95   311 0.909  23.21
 2.07 Intr - 132051 131926  126  1  0   53  116    20 0.015   1.03
 2.06 Intr - 149739 149675   65  2  2   92   82    51 0.035   2.34
 2.05 Intr - 150874 150771  104  1  2  124   84    -9 0.042   0.35
 2.04 Intr - 179821 179751   71  2  2   92   48    98 0.523   4.18
 2.03 Intr - 180868 180498  371  0  2  103   66   150 0.889   8.12
 2.02 Intr - 181093 180897  197  1  2   87   28   107 0.949   1.89
 2.01 Init - 182032 181976   57  1  0   79  107    16 0.911   3.96
 2.00 Prom - 185809 185770   40                              -6.95

 3.00 Prom + 186392 186431   40                              -6.95
 3.01 Init + 186571 186799  229  0  1   83   88    98 0.824   7.88
 3.02 Intr + 193460 193776  317  0  2  -22   30   276 0.190   5.66
 3.03 Intr + 193836 193893   58  2  1   61   87    21 0.019  -3.16
 3.04 Intr + 205160 205512  353  0  2  114  -48   440 0.006  27.12
 3.05 Intr + 212797 212984  188  0  2   20  100   166 0.363   8.57
 3.06 Term + 213134 213383  250  2  1  -41   39   243 0.628   0.69
 3.07 PlyA + 213412 213417    6                               1.05

 4.00 Prom + 214433 214472   40                              -6.15
 4.01 Sngl + 216701 217420  720  1  0   74   50   214 0.807  12.17
 4.02 PlyA + 217791 217796    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:83541499_83773523|GENSCAN_predicted_peptide_1|570_aa
MSPQDSFKEIHVNIEATFKPSSEEYLHITEPPSLSPDTKLEPSEDDGKPELLEEMEASPT
ELIAVEGTEILQDFQNKTDGQVSGEAIKMFPTIKTPEAGTVITTADEIELEGATQWPHST
SASATYGVEAGVVPWLSPQTSERPTLSSSPEINPETQAALIRGQDSTIAASEQQVAARIL
DSNDQATVNPVEFNTEVATPPFSLLETSNETDFLIGINEESVEGTAIYLPGPDRCKMNPC
LNGGTCYPTETSYVCTCVPGYSGDQCELDFDECHSNPCRNGATCVDGFNTFRCLCLPSYV
GALCEQDTETCDYGWHKFQGQCYKYFAHRRTWDAAERECRLQGAHLTSILSHEEQMFVNR
VGHDYQWIGLNDKMFEHDFRWTDGSTLQYENWRPNQPDSFFSAGEDCVVIIWHENGQWND
VPCNYHLTYTCKKGTVACGQPPVVENAKTFGKMKPRYEINSLIRYHCKDGFIQRHLPTIR
CLGNGRWAIPKITCMNLPCLGVGNWLLTLEIDRKAPLNILNATERCRVDLWRVIGARQDC
QTTANKGRAGVLPYTWRIGMLTYRGKEESD

>gi568815593r:83541499_83773523|GENSCAN_predicted_CDS_1|1713_bp
atgtccccacaggattcttttaaggaaattcatgtaaatattgaagcgactttcaaacca
tcaagtgaggaataccttcacataactgagcctccctctttatctcctgacacaaaatta
gaaccttcagaagatgatggtaaacctgagttattagaagaaatggaagcttctcccaca
gaacttattgctgtggaaggaactgagattctccaagatttccaaaacaaaaccgatggt
caagtttctggagaagcaatcaagatgtttcccaccattaaaacacctgaggctggaact
gttattacaactgccgatgaaattgaattagaaggtgctacacagtggccacactctact
tctgcttctgccacctatggggtcgaggcaggtgtggtgccttggctaagtccacagact
tctgagaggcccacgctttcttcttctccagaaataaaccctgaaactcaagcagcttta
atcagagggcaggattccacgatagcagcatcagaacagcaagtggcagcgagaattctt
gattccaatgatcaggcaacagtaaaccctgtggaatttaatactgaggttgcaacacca
ccattttcccttctggagacttctaatgaaacagatttcctgattggcattaatgaagag
tcagtggaaggcacggcaatctatttaccaggacctgatcgctgcaaaatgaacccgtgc
cttaacggaggcacctgttatcctactgaaacttcctacgtatgcacctgtgtgccagga
tacagcggagaccagtgtgaacttgattttgatgaatgtcactctaatccctgtcgtaat
ggagccacttgtgttgatggttttaacacattcaggtgcctctgccttccaagttatgtt
ggtgcactttgtgagcaagataccgagacatgtgactatggctggcacaaattccaaggg
cagtgctacaaatactttgcccatcgacgcacatgggatgcagctgaacgggaatgccgt
ctgcagggtgcccatctcacaagcatcctgtctcacgaagaacaaatgtttgttaatcgt
gtgggccatgattatcagtggataggcctcaatgacaagatgtttgagcatgacttccgt
tggactgatggcagcacactgcaatacgagaattggagacccaaccagccagacagcttc
ttttctgctggagaagactgtgttgtaatcatttggcatgagaatggccagtggaatgat
gttccctgcaattaccatctcacctatacgtgcaagaaaggaacagtcgcttgcggccag
ccccctgttgtagaaaatgccaagacctttggaaagatgaaacctcgttatgaaatcaac
tccctgattagataccactgcaaagatggtttcattcaacgtcaccttccaactatccgg
tgcttaggaaatggaagatgggctatacctaaaattacctgcatgaaccttccttgttta
ggagttgggaactggttacttacactcgaaatagacaggaaagcccccttgaatatcctg
aatgccactgaaagatgccgagttgacctgtggagggtgattggggcaaggcaggattgt
cagaccacagccaacaaaggcagagcaggtgtcctgccttatacctggaggataggaatg
ttaacatacagaggcaaagaagagtcggactag

>gi568815593r:83541499_83773523|GENSCAN_predicted_peptide_2|651_aa
MGKGRLEIGEDQEGQDDSESPGGGSPRGLGFIFKTIAPLAATRATRIGHPGGRTPRAGSS
AHRPPALSARAPVPAASPAAWLPLRTPWTRPSSCPTSSSTYDSLSPYGPRNPLPNPRHSP
SGGGGLKKPAVPPAPPRRGRRLEPGARPRIPARPQRSATPGPARCNSGTACVSPPPWPRA
PRACLLVLSPRASPAPSSRLLQQPFASCDHSTDPLSHPFWTDPGLMRVRVREIFSLPLVL
SSFIIEGFGLIFCMFLVLEYLLKFLGLNRQFSELLALNASAQVFEQALVKKILWAIKMKS
LLLLVLISICWADHLSDNYTLDHDRAIHIQAENGPHLLVEAEQAKVFSHRGGNVTLPCKF
YRDPTAFGSGIHKIRIKWTKLTSDYLKEVDVFVSMGYHKKTYGGYQGRVFLKGGSDSDAS
LVITDLTLEDYGRYKCEVIEGLEDDTVVVALDLQGVVFPYFPRLGRYNLNFHEAQQACLD
QDAVIASFDQLYDAWRGGLDWCNAGWLSDGSVQYPITKPREPCGGQNTVPGVRNYGFWDK
DKSRYDVFCFTSNFNGRFYYLIHPTKLTYDEAVQACLNDGAQIAKVGQIFAAWKILGYDR
CDAGWLADGSVRYPISRPRRRCSPTEAAVRFVGFPDKKHKLYGVYCFRAYN

>gi568815593r:83541499_83773523|GENSCAN_predicted_CDS_2|1956_bp
atggggaagggaaggctggaaataggggaggatcaggagggacaggatgactcagagagt
ccaggaggaggcagtccccgtggcttaggctttatctttaagacaatagcgccgctcgcc
gccacccgcgcgactcggatcgggcatcccggcggccgcaccccgcgcgctggctcatct
gcacaccggccacctgcattgtcggccagagcccccgtcccggcggcttccccagcagct
tggctgcccctcaggacgccctggacccgcccatcctcctgccccactagctcatcgact
tacgactccctcagtccctacggcccacggaaccctctccccaacccgcgccacagcccg
agcggcggcggcggccttaagaagcccgcagtcccgcccgcgccgccccgccggggccgg
aggttggagccgggagcacggccgcgaatccccgcacgtccccagcgctccgcgacaccg
ggccctgctcgctgcaactcggggacggcctgcgtctcgcctccaccctggccgcgggcc
cctcgagcttgtttgcttgtcctcagtcccagggccagccccgctccctcctcccggctg
ctccaacagccgtttgcttcctgcgatcattccacagaccccttgtcccatcccttctgg
acagacccaggactgatgagggttcgggttcgggagattttctctttaccactggtttta
agcagtttcattattgagggttttggtttaattttctgcatgtttcttgtgctagaatat
ttgctgaaattcttgggtctgaaccgtcaattttcagaactcttggctctcaacgcctca
gctcaagtttttgaacaggctctagtgaagaagattctttgggctataaagatgaagagt
ctacttcttctggtgctgatttcaatctgctgggctgatcatctttcagacaactatact
ctggatcatgacagagctattcacatccaagcagaaaatggcccccatctacttgtggaa
gcagagcaagccaaggtgttttcacacagaggtggcaatgttacactgccatgtaaattt
tatcgagaccctacagcatttggctcaggaatccataaaatccgaattaagtggaccaag
ctaacttcggattacctcaaggaagtggatgtttttgtttccatgggataccacaaaaaa
acctatggaggctaccagggtagagtgtttctgaagggaggcagtgatagtgatgcttct
ctggtcatcacagacctcactctggaagattatgggagatataagtgtgaggtgattgaa
ggattagaagatgatactgttgtggtagcactggacttacaaggtgtggtattcccttac
tttccacgactggggcgctacaatctcaattttcacgaggcgcagcaggcgtgtctggac
caggatgctgtgatcgcctccttcgaccagctgtacgacgcctggcggggcgggctggac
tggtgcaatgccggctggctcagtgatggctctgtgcaatatcccatcacaaagcccaga
gagccctgtggggggcagaacacagtgcccggagtcaggaactacggattttgggataaa
gataaaagcagatatgatgttttctgttttacatccaatttcaatggccgtttttactat
ctgatccaccccaccaaactgacctatgatgaagcggtgcaagcttgtctcaatgatggt
gctcagattgcaaaagtgggccagatatttgctgcctggaaaattctcggatatgaccgc
tgtgatgcgggctggttggcggatggcagcgtccgctaccccatctctaggccaagaagg
cgctgcagtcctactgaggctgcagtgcgcttcgtgggtttcccagataaaaagcataag
ctgtatggtgtctactgcttcagagcatacaactga

>gi568815593r:83541499_83773523|GENSCAN_predicted_peptide_3|464_aa
MDEAGNHHSQQTYTGTENQTLHVLTHKWQLNNENTWTQGGKHHTPGPVRGWGSRGGITLG
EMPNVDDGLMGAANHHETGSILLQSVELRDLWNFELERDNSGYLAENISKRQSIEEEAEH
KSLGNLQPGDAIEKKNPFSGEKFKPAEEICIDNKQPNVNLQDNGENVSRACQRPSRQPLV
SQPRDLVSCFLAAPTVAKRGQGRWDAVKHLEAVQSSLASLGLVGQHALHSTPEDAAGDLE
VVGASRRVGVHPLVEESQVLQLVSAETARNVDAFAAYDHRLPAQQYLFSHDGRQAAQEMA
STIKHQDLHLRHLRQPLGKPPLVIPKQTESGVDLQQTPADLQKRGLTVRRKTNKQKATAS
TSTKRMTMQKPYLKVTNIKDQRRRIVTNSSKLKEHVLTQCKEAKNLNKRLQELLTRITSL
EKNINDLMELKATARELCEADTSMNNRIIKWKKEYQRLKMNLMK

>gi568815593r:83541499_83773523|GENSCAN_predicted_CDS_3|1395_bp
atggatgaagctggaaaccatcattctcagcaaacttacacaggaacagaaaaccaaaca
ctgcatgttctcactcataagtggcagctgaacaatgagaacacatggacacagggaggg
aagcatcacacaccggggcctgtcagggggtgggggtctagaggagggataacattagga
gaaatgcctaatgtagatgacggattgatgggtgcagcaaaccaccatgagactggcagc
attttgctccagtctgtggaactgagagatctgtggaactttgaacttgagagagataat
tcagggtatctagcggaaaatatttctaagaggcaaagcattgaagaggaagcagaacat
aaaagtttgggaaatttgcagcctggtgatgcaatagaaaagaaaaacccattttctggg
gagaaattcaagcctgctgaagagatttgcatagataacaagcagcctaatgttaatctc
caagacaatggggaaaatgtctccagggcatgtcagagaccttcaaggcagccccttgta
tcacagcctagggacttggtttcctgctttctagcagctccaactgtggctaaaaggggc
caagggcggtgggatgccgtcaaacaccttgaggcggtccagagcagcctggcctcactt
ggtcttgtggggcagcatgccttgcacagtacgccagaagatgcggctggggacctggaa
gtggtaggggcgtcgagaagggttggtgttcatccgcttgtggaggaaagccaggtactt
caacttgtttctgcagaaactgccagaaatgttgatgccttcgcagcttacgaccaccgc
cttccggcccagcagtacctgtttagccacgatggccgccaggcggcccaggagatggcc
tcaaccatcaagcaccaggacctgcacctccgccatcttcggcagccgcttgggaaacct
ccactggtgatacccaagcaaacagagtctggagtggacctccagcaaactcctgcagac
ctgcagaagaggggcctgactgttagaaggaaaactaacaaacagaaagcaacggcatca
acatcaacaaaaaggatgaccatgcaaaaaccttatctgaaggtcaccaacatcaaagac
caaagaaggaggatagtaacaaactcttccaagctaaaggagcatgttctaacccagtgc
aaggaagctaagaacctgaataaaaggttacaggaactgctaactagaataaccagttta
gagaagaacataaatgacctgatggagctgaaagccacagcacgagaactttgtgaagca
gatacaagtatgaataacagaatcatcaagtggaagaaagaatatcagagattgaagatg
aacttaatgaaataa

>gi568815593r:83541499_83773523|GENSCAN_predicted_peptide_4|239_aa
MLPDFKLYYKATVTKKAWYWYQNRYIDQYNRTEASEITLHIYNHLICDKPDKSKQWRKDF
LFNKWCWKNLLAICRKLKLDPFLTPYTKSNSRWIKDLNLRPKTIKTLEENLGNTIQDIGM
GKDFMTKTPKAIATKARIDKWDLIKLKSFCTAKETIIRVNRQPTQWEKMFAIYPSDKGLI
STIHKQLKQIYKEKTNNPIKKWAKDMKRHVSQEDIYVANKHMKKAHYHWSLEKRKSKPL

>gi568815593r:83541499_83773523|GENSCAN_predicted_CDS_4|720_bp
atgctacctgacttcaaactatactacaaggctacagtaaccaaaaaagcatggtactgg
taccaaaacagatatatagaccaatacaacagaacagaagcctcagaaataacactgcac
atctacaaccatctgatctgtgacaaacctgacaaaagcaagcaatggagaaaagatttc
ctatttaataaatggtgttggaaaaacttgctagccatatgcagaaaactgaaattggac
cccttccttacaccttatacaaaaagtaactcaagatggattaaagatttaaacctacga
cctaaaaccataaaaaccctagaagaaaacctaggcaataccattcaggacataggcatg
ggcaaagacttcatgactaaaacaccaaaagcaattgcaacaaaagccagaattgacaaa
tgggatctaattaaactaaagagcttctgcacagcaaaagaaactatcatcagagtaaac
aggcaacctacacaatgggagaaaatgtttgcaatctatccatctgacaaagggctaata
tccacaatccacaagcaacttaaacaaatttacaaggagaaaacaaacaaccccattaaa
aagtgggcaaaggatatgaagagacacgtctcacaagaagacatttatgtggccaacaaa
cacatgaaaaaagctcattatcactggtcattagagaaacgcaaatcaaaaccactatga