GENSCAN 1.0 Date run: 7-Nov-116 Time: 21:40:17 Sequence gi568815583f:84503928_84706572 : 202645 bp : 43.96% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 178 338 161 2 2 101 44 86 0.605 3.60 1.02 PlyA + 2359 2364 6 1.05 2.10 PlyA - 2865 2860 6 1.05 2.09 Term - 6493 6389 105 1 0 25 37 92 0.280 -3.79 2.08 Intr - 7540 7430 111 1 0 79 64 174 0.627 14.68 2.07 Intr - 7932 7807 126 0 0 -83 78 218 0.447 4.58 2.06 Intr - 8906 7996 911 0 2 90 34 1079 0.721 93.70 2.05 Intr - 9375 9288 88 0 1 104 94 41 0.999 5.84 2.04 Intr - 10120 10040 81 1 0 77 99 76 0.993 7.43 2.03 Intr - 10365 10306 60 0 0 82 94 64 0.975 5.33 2.02 Intr - 11107 10988 120 1 0 113 76 84 0.999 10.39 2.01 Init - 12822 12775 48 0 0 68 93 46 0.903 4.05 2.00 Prom - 38215 38176 40 -3.56 3.00 Prom + 44803 44842 40 -1.56 3.01 Init + 49723 49781 59 0 2 79 93 40 0.460 4.38 3.02 Intr + 58464 58622 159 0 0 17 67 164 0.114 6.30 3.03 Intr + 66501 66807 307 0 1 40 105 117 0.010 5.05 3.04 Intr + 74021 74236 216 1 0 84 101 125 0.103 12.10 3.05 Intr + 86291 86414 124 1 1 55 37 64 0.003 -1.84 3.06 Intr + 96887 97071 185 0 2 86 52 116 0.078 7.31 3.07 Term + 100055 100618 564 1 0 33 43 393 0.003 23.69 3.08 PlyA + 102933 102938 6 1.05 4.00 Prom + 106871 106910 40 -3.76 4.01 Sngl + 116719 118113 1395 0 0 47 45 1510 0.597 138.66 4.02 PlyA + 119768 119773 6 1.05 5.00 Prom + 124562 124601 40 -7.16 5.01 Init + 127973 128371 399 1 0 76 83 296 0.976 24.57 5.02 Intr + 128670 128796 127 2 1 -52 91 83 0.936 -5.15 5.03 Intr + 129897 130412 516 1 0 107 -4 675 0.545 53.13 5.04 Intr + 133420 133493 74 2 2 135 109 14 0.813 7.33 5.05 Intr + 134469 134550 82 2 1 63 86 54 0.867 1.91 5.06 Term + 137228 137346 119 0 2 78 42 94 0.506 2.50 5.07 PlyA + 137931 137936 6 1.05 6.10 PlyA - 139174 139169 6 1.05 6.09 Term - 141909 141455 455 1 2 99 44 149 0.768 6.92 6.08 Intr - 142421 142257 165 0 0 85 86 139 0.946 13.33 6.07 Intr - 144698 144494 205 2 1 101 84 13 0.677 0.97 6.06 Intr - 148875 148787 89 1 2 108 88 67 0.989 8.39 6.05 Intr - 149772 149702 71 2 2 100 19 95 0.449 2.63 6.04 Intr - 151958 151789 170 2 2 20 25 77 0.250 -6.66 6.03 Intr - 153421 153249 173 2 2 90 99 52 0.767 6.16 6.02 Intr - 153615 153502 114 1 0 101 5 71 0.376 0.52 6.01 Init - 154225 154069 157 1 1 103 131 289 0.999 32.77 6.00 Prom - 156473 156434 40 -2.36 7.06 PlyA - 158306 158301 6 1.05 7.05 Term - 162480 162475 6 0 0 102 44 0 0.197 -5.13 7.04 Intr - 166855 166798 58 0 1 79 116 60 0.871 6.79 7.03 Intr - 176905 176786 120 0 0 67 98 61 0.923 4.61 7.02 Intr - 183847 183698 150 0 0 113 53 74 0.739 5.68 7.01 Intr - 187717 187608 110 1 2 111 92 15 0.423 3.28 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 65712 65578 135 1 0 33 110 129 0.917 10.14 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:84503928_84706572|GENSCAN_predicted_peptide_1|53_aa XTMIAASVRLKAGSLAPAEALLLASRLTHLGLLLLGVACAIQPIVLLAQLSAA >gi568815583f:84503928_84706572|GENSCAN_predicted_CDS_1|162_bp ngtaccatgatagctgctagcgtgcgactgaaggcagggtccctggcccctgctgaagca ctactgctggccagcaggctcacgcaccttggcctgttgcttctaggggttgcctgtgct attcagccaatagtgctgctggcccagctgagcgccgcctag >gi568815583f:84503928_84706572|GENSCAN_predicted_peptide_2|549_aa MSEKTQQGKLAAAKKKLKAYWQRKSPGIPAGANRKKKINGSSPDTATSGGYHSPGDSATG IYGEGRASSTTLEDLESQYQELAVALDSSSAIISQLTENINSLVRTSKEEKKHEIHLVQK LGRSLFKLKNQTAEPLAPEPPAGPSKVEQLQDETNHLRKELESVGRQLQAEVENNQMLSL LNRRQEERLREQEERLREQEERQREQEDRLHEQEERLREQEERLCEQEERLCEQEERLRE HEERLCEQEERLCEQEERLREQEERLHEQEERLREQEERLCEQEERLREHEERLCEQEER LREQEERLCEQEERLREQEERLCEQEERLREQEERLCEQEKLPGQERLLEEVEKLLEQER RQEEQERLLERERLLEEVEKLLEQERQQEEQERLLERERLLEEVEKLLEQERRQEEQERL LERERLLDEVEELLDETLQELERLRELERLRELERMLELGWEALYEQRAEPRSGFEELNN ENKSTLQLEQQVKELKKSGELKETVTSDPSKKMWEESGSQPPHVLMPRVFRQVELKSQEA PSLQQQPDQ >gi568815583f:84503928_84706572|GENSCAN_predicted_CDS_2|1650_bp atgtcagaaaaaacacaacaggggaaattggccgcagccaagaaaaagttaaaagcatat tggcagaggaagagccctggcattccagcaggagctaacaggaaaaagaaaatcaatggc agtagccctgacacagccacttctggtggttaccactcacctggggattcagcaacaggt atctacggggagggccgtgcatcctctactaccctggaggatctggagagccagtaccaa gaactagcagtggccctggattcaagctccgcaataatcagtcaactcactgaaaacatc aattcactggttcgcacatctaaggaggagaagaagcatgagatacatctggtacagaag cttgggaggagcttgttcaaactcaaaaaccagacggctgaacccctggccccagagccc ccagcagggccatctaaggtagagcagctacaagatgagaccaaccacctaaggaaggag ctagagagtgtgggaagacagctccaggctgaggtggaaaacaatcagatgttgagtctc ctgaacaggagacaggaggagaggctacgtgaacaggaggagaggctacgtgaacaggag gagaggcaacgtgaacaggaggataggctacatgaacaggaggagaggctacgtgaacag gaggagaggctgtgtgaacaggaggagaggctgtgtgaacaggaggagaggctacgtgaa catgaggagaggctgtgtgaacaggaggagaggctatgtgaacaggaggagaggctacgt gaacaggaggagaggctacatgaacaggaggagaggctacgtgaacaggaggagaggctg tgtgaacaggaggagaggctacgtgaacatgaggagaggctgtgtgaacaggaggagagg ctacgtgaacaggaggagaggctgtgtgaacaggaggagaggctacgtgaacaggaggag aggctgtgtgaacaggaggagaggctacgtgaacaggaggagaggctgtgtgaacaggag aagctgccagggcaggagaggctgctggaagaggtggagaagctgttagaacaggagagg cggcaggaggagcaggagaggctgctggagagggagaggctgctggaagaggtggagaag ctgttagaacaggagaggcagcaggaggagcaggagaggctgctggagagggagaggctg ctggaagaggtggagaagctgttagaacaggagaggcggcaggaggagcaggagaggctg ctggagagggagaggctgctggacgaggtggaggagctcctggacgagactctgcaggag ctggagaggctgcgggagctggagaggctgcgggagctggagaggatgctggagctgggg tgggaagccctgtacgagcagcgggccgagccacgcagcggcttcgaggagctgaacaac gagaacaagagcacactgcagttggagcagcaagtaaaggagctgaagaagtcgggtgag ctgaaagagactgtaacctccgacccatccaagaagatgtgggaggaatctggaagccag ccaccacatgtcctcatgcccagggtcttccggcaggtggagctgaagagccaagaggct ccgagtctgcagcagcagccagaccagtag >gi568815583f:84503928_84706572|GENSCAN_predicted_peptide_3|537_aa MRLHHHDWLEDNSTDFEILREYSKSNNITSEMRLQGDYDACFGHPSPALSIAPSDGSQLP CDEVPYGEAHMASLPQSRRSGPGQSREKGARSPGPGNRGLSWAGSPLSRDSGLFLSALAL PCEAAAGRLTVMLQWSLSCRGGSWFLWKIEDNNFSLALNPDTDILLSPLAGGARKLQHIM SDEICVQVTDLYLAENNNGATGGQPNTQNSRSLLESTYPQKAKQLMSDEKCFKVRVAHVV KDLAGDNTQPSSSLEDSAFNVPSSKQRAALTRLQTCWYLDLGLPSHQNCTAQAVSPKPYS GDRGQVRLRADYISHNALGPASLLPPGWVPSPPTERQNFQGRSRQPMSAAVPQEEDRQEE EVTTMILEDDSWVQEAVLQEDGPESEPFPQSAGKGGPQEEVTRGPQGALGRLRELCRRWL RPEVHTKEQMLTMLPKEIQAWLQEHRPESSEEAAALVEDLTQTLQDSGETQNLIGRGREH PSKVEECGVSEEEKVVSKAEWGASAIPLLCLQAVSVFISLLVSSLCAKSAPEVLGGP >gi568815583f:84503928_84706572|GENSCAN_predicted_CDS_3|1614_bp atgaggcttcaccaccatgactggttagaagacaacagcacagactttgaaattctaaga gaatacagcaaaagtaataacatcacttctgagatgaggctacaaggagactacgatgcc tgctttggtcacccttctcctgctctttccattgctccctctgatggaagccagttgcca tgtgatgaggtgccctatggagaggcccacatggcaagcctcccgcagtcccggcgatcg gggccaggccagtcgcgggagaaaggtgcgcgctcacccggcccggggaaccggggcctc tcctgggcaggttcccctttgtcccgggactccgggctcttcctctccgccctcgccctg ccgtgtgaagccgccgctgggcgcctcaccgtgatgttgcagtggagcctgagctgccgc ggcggctcctggttcttgtggaaaatagaggacaacaacttcagcttggccttgaaccct gacacggacattttactctcacctctggcgggaggggcgcggaagctgcagcatatcatg agtgatgagatctgtgtacaggtgactgacctttacctggcagaaaataataatggggcc accggaggccagccgaacacacagaactcaagaagcctcctggagtcaacgtatccacag aaagccaagcagctaatgtcagatgagaaatgctttaaggtgagagttgctcatgtagtc aaagaccttgctggtgacaatacacagccttcctcatccctggaggactcagcattcaac gtgccttcttcgaagcagagagcagccctcactagactacaaacctgctggtaccttgat cttggacttcccagccaccagaactgcactgcccaggctgtcagccccaaaccctactcc ggggaccgcggtcaggttcgtctccgggcggactacatctcccacaatgccttgggccca gcctccctcctgccgcccggctgggtgccgtctccaccaacagaaaggcagaatttccag ggccgttctcggcagccaatgagcgcggcggtgcctcaagaggaagatagacaggaggag gaggtcaccaccatgatcctggaggatgactcctgggtgcaagaagctgtgctgcaggag gatggccctgagtctgagccctttccccagagtgctggcaagggcggcccccaggaggag gtgaccaggggaccacagggtgcactcggccgcctccgagagctctgccggcgctggctg agaccagaggtacacaccaaggagcagatgttaaccatgctgccaaaggaaattcaggct tggctgcaagagcatcggcctgaaagcagtgaggaggcagcggccctggtggaagacttg acccagacccttcaggacagtggtgagacgcagaacctcatagggagagggcgggagcac ccttccaaggtagaggagtgtggtgtttcggaggaggagaaggtggtgtccaaggcagag tggggggctagcgccatccctctgctctgtctgcaggcagtcagcgtgttcatcagcctt ttagtgtcctcactgtgtgcaaagtcagctccagaagtgctaggagggccttag >gi568815583f:84503928_84706572|GENSCAN_predicted_peptide_4|464_aa MFENESRKIFSEMPEGESAQHSDGESDFERDAGIQRLQGHSPGEDHGEVVSQDREVGQLI GLQGTYLGEKPYECPQCGKTFSRKSHLITHERTHTGEKYYKCDECGKSFSDGSNFSRHQT THTGEKPYKCRDCGKSFSRSANLITHQRIHTGEKPFQCAECGKSFSRSPNLIAHQRTHTG EKPYSCPECGKSFGNRSSLNTHQGIHTGEKPYECKECGESFSYNSNLIRHQRIHTGEKPY KCTDCGQRFSQSSALITHRRTHTGEKPYQCSECGKSFSRSSNLATHRRTHMVEKPYKCGV CGKSFSQSSSLIAHQGMHTGEKPYECLTCGESFSWSSNLLKHQRIHTGEKPYKCSECGKC FSQRSQLVVHQRTHTGEKPYKCLMCGKSFSRGSILVMHQRAHLGDKPYRCPECGKGFSWN SVLIIHQRIHTGEKPYKCPECGKGFSNSSNFITHQRTHMKEKLY >gi568815583f:84503928_84706572|GENSCAN_predicted_CDS_4|1395_bp atgtttgagaatgaatcacgtaagatattctcggaaatgcctgaaggtgaaagtgctcag cactccgatggggaaagtgactttgagagagatgctggcatccagaggctccagggacac agcccaggtgaggaccacggggaggtggtttctcaggacagggaagttggccagctcata ggcctgcagggcacctacctaggggagaagccctacgaatgtccccagtgtgggaagacc ttcagccggaaatcccacctcatcacacacgagaggacccacacaggagagaaatactac aaatgtgatgaatgtggaaaaagctttagtgatggttcaaattttagtagacaccaaacc actcacaccggggagaagccctacaaatgcagagactgtgggaagagctttagccggagt gccaacctcataacccaccagaggatccacacgggggaaaagcccttccagtgtgccgag tgtggcaagagcttcagcaggagtcccaacctcattgcacatcagcgcacccacacagga gagaaaccctactcgtgccccgagtgtggaaagagctttggcaaccgatccagccttaac acgcatcaggggatccacactggagaaaagccctacgaatgtaaagaatgcggcgaaagc tttagttacaactccaatctaatcagacaccagagaatccacacaggagagaaaccctac aaatgtaccgactgtgggcagaggttcagccagagttcagccctcatcacccaccggaga acccacacaggagagaaaccctaccagtgcagcgagtgtgggaaaagcttcagccgcagc tctaacctggccacacaccggagaacccacatggtggagaagccctataagtgtggggtg tgtgggaagagcttcagccagagctccagtctgattgcacaccagggcatgcacacaggg gagaaaccctacgagtgcctgacatgtggggagagcttcagctggagctccaacctcctc aagcaccagaggatccacacgggagagaaaccctacaaatgcagcgagtgtgggaaatgc ttcagccagcgctcccagctcgtagtgcaccagcggacccacacgggcgagaagccctac aaatgcctcatgtgcggcaagagcttcagccggggctccattctggtcatgcaccagaga gcccatttgggagacaagccctacaggtgccctgagtgtgggaaaggctttagctggaac tcagtcctcattatacatcagcgaatccacactggggagaagccctacaaatgccccgag tgtggcaaaggcttcagcaacagctctaactttatcacacatcagagaactcacatgaaa gagaaactttattga >gi568815583f:84503928_84706572|GENSCAN_predicted_peptide_5|438_aa MAVAVDQQIQTPSVQDLQIVKLEEDSHWEQEISLQGNYPGPETSCQSFWHFRYQEASRPR EALLQLQKLCCQWLRPEKCTKEQILELLVLEQFPTVLLQEIQIWVRQQHPESGEEAVALV EDLQKEPGRQRLENSSHSVAASQRQAASVVFVDIIEPIAKHLNYPMSFCLGFGKAASGEA VRPHPSCSAAVAMANDSCGPGEPSSSERDRQYCELCGKMENLLRCSRSSFCCKERQRQDW KKHKLVCQGSEGALGHGGGPHQDSGPAPPAAAPPSRDRALEARKAARRRDSASGDAAKAK AKSAADPAAAASPPRASPGRTKAMAACYPVNGTGYVRHVDNPNGDGRPLPDVALGIPAEK SRGGQEIVSSSQEAWRDPEPGLKNHLEITQQNSENEVTLGLPVPQPDGVTMLQKSEELWN EDLPDFKEIQKTSSAGGK >gi568815583f:84503928_84706572|GENSCAN_predicted_CDS_5|1317_bp atggctgtagctgtggaccaacaaatccagactccttcagtacaagatctccaaatagtt aaactggaagaagattcccactgggagcaggaaatttcccttcaagggaattaccctgga ccagagacatcctgccagagcttttggcatttccgttaccaagaagcatcacgaccccga gaggccctcctccagctccagaagctctgttgtcagtggctaaggccagagaagtgtaca aaagagcagatcctggagttgctggtcctagaacagttcccgactgtccttctccaggag atccagatctgggtcagacagcagcatccggagagtggagaggaggcagtggccctggtg gaagacttgcagaaagaacctggaagacagaggctggagaacagctctcactctgtagca gccagccagaggcaagcggcttcggtggtctttgtcgacattattgagcctattgcaaaa cacctgaattatcctatgtctttttgcctagggtttggcaaggcagcctcgggcgaggcc gtccggccgcacccctcctgctcagctgcggtcgccatggccaatgacagctgcgggccc ggcgagccgagctcgagcgagcgagaccggcagtactgcgagctgtgcgggaagatggag aacctgctgcgctgcagccgcagctccttctgctgcaaggagcgccagcgccaggactgg aagaagcacaagctcgtgtgccagggcagcgagggcgccctcggccacggagggggccct caccaggactccggccccgcgccgcccgctgcagcgccgccgtccagggaccgggccctg gaggccaggaaggcagcgaggcgccgggacagcgcctccggggacgcagccaaggcaaag gccaagtccgcggccgaccccgcggcggccgcgtccccgcctcgcgcgtccccgggccgg acaaaagccatggctgcttgttatccggtcaatggaacgggttatgtacgtcatgttgat aatccaaatggagacggaagacccctgcctgatgtggctctgggaattcctgcagagaag agcaggggtggccaggagatagtgagcagctcccaggaggcatggagggatccagaacct ggactgaagaaccacctagaaataactcagcagaattctgaaaatgaggtcacgctggga cttccagttccccagccagatggggtcacaatgctgcagaaaagtgaagagctttggaat gaagatctcccggacttcaaggagattcagaaaacgtccagtgcaggtgggaaatga >gi568815583f:84503928_84706572|GENSCAN_predicted_peptide_6|532_aa MARRAGGARMFGSLLLFALLAAGVAPLSWDLPEPRSRASKIRVHSRGNLWATGKAEGAGF ATTQTPLWLLGEVEAPRGGRLSVFPLQERKRHFMGKKSLEPSSPSPLGTAPHTSLRDQRL QLSHDLLGILLLKKALGVSLSRPAPQIQNQQPLRTSDVRPRSLGWLSLKGNNVTSNALGF EALTHADQLENLRSSSAYVWGFRVRYQDFYAFDLSGATRVLEWIDDKGGVFVAGYESLKK NEILHLKLPLRLSVKENKGLFPERDFKVRHGGFSDRSIFDLKHVPHTRYGQFCDPAIHTG WDGMAANAWGYSASSSPFPPPQHTGTYVIKAVSTIAVHEKEESLWPRVAVFSTLAPGVLH GARLRSLQVVDLESRKTTYTSDVSDSEELSSLQVLDADTFAFCCASGRLGLVDTRQKWAP LENRSPGPGSGGERWCAEVGSWGQGPGPSIASLGSDGRLCLLDPRDLCHPVSSVQCPVSV PSPDPELLRVTWAPGLKNCLAISGTAEQDFVLLSDLFLPGDCWVLRKEPPAP >gi568815583f:84503928_84706572|GENSCAN_predicted_CDS_6|1599_bp atggcccggcgggcggggggcgctcggatgttcggcagcctcctgctcttcgccctgctc gctgccggcgtcgccccgctcagctgggatctcccggagccccgcagccgagccagcaag atccgagtgcactcgcgaggcaacctctgggccaccggaaaagctgagggagcaggcttt gccaccacccagacacctttgtggctccttggtgaggtggaagcaccaagaggaggaagg ttaagtgtcttcccgctacaagaacggaaacgtcacttcatgggcaagaagagtctggag ccttccagcccatccccattggggacagctccccacacctccctgagggaccagcgactg cagctgagtcatgatctgctcggaatcctcctgctaaagaaggctctgggcgtgagcctc agccgccccgcaccccaaatccagaaccagcagcccctgaggacctcagatgtaaggcct aggagcttgggctggctgagtctgaagggaaacaatgtcacctctaatgcccttggtttt gaagctctgacacatgcagaccaactagagaatctcagaagcagcagtgcctacgtctgg ggcttcagagtgaggtaccaggatttctatgcattcgacctgtcaggagccactcgagtc cttgaatggattgatgacaaaggtggagtctttgttgctggctatgaaagcctgaaaaag aatgaaattcttcatctgaaattacctctcagactttctgtaaaggaaaacaagggctta ttcccagaaagagatttcaaagtgcgccatggaggattttcagacaggtctatctttgat ctaaagcatgtgccacataccaggtatggtcaattttgtgatccagccatccacacagga tgggatgggatggctgcaaatgcctgggggtattctgcctcctcatctccattcccaccc cctcagcacacaggcacatatgtcattaaagctgtcagcaccattgctgtgcatgagaaa gaggagagtctctggcctagggtggccgtcttctccacattggcacccggagtcctccat ggggcgaggctccgaagtctgcaggtcgttgatctggagtcccggaagaccacgtacacc tcagatgtcagtgacagtgaggagctgagtagcctgcaggtcctagatgcagacaccttt gccttctgctgtgcttcaggccggctggggcttgttgacacccggcagaagtgggcaccg ttggagaatcgcagccctggccctgggtctggtggagagagatggtgtgctgaagttggg agctggggccagggccctgggcccagcattgccagccttggctcagatgggcgtctttgt cttcttgacccccgggatctctgccatcctgtgagctcagtccagtgcccagtatccgta cctagccctgacccagagctgctgcgagtgacttgggccccaggcctgaagaattgcttg gccatctcaggtactgccgagcaggatttcgtcttgttatctgatctctttcttccagga gactgttgggtgctcaggaaggagcctccagcaccatga >gi568815583f:84503928_84706572|GENSCAN_predicted_peptide_7|147_aa LYYQVLNFGMIVSSALMIWKGLMVITGSESPIVVVLSGSMEPAFHRGDLLFLTNRVEDPI RVGEIVVFRIEGREIPIVHRVLKIHEKQNGHIKFLTKGDNNAVDDRGLYKQGQHWLEKKD VVGRARGFVPYIGIVTILMNDYPKFKA >gi568815583f:84503928_84706572|GENSCAN_predicted_CDS_7|444_bp ctctattatcaagtcctaaattttggaatgattgtctcatcggcactaatgatctggaag gggttaatggtaataactggaagtgaaagtccgattgtagtggtgctcagtggcagcatg gaacctgcatttcatagaggagatcttctctttctaacaaatcgagttgaagatcccata cgagtgggagaaattgttgtttttaggatagaaggaagagagattcctatagttcaccga gtcttgaagattcatgaaaagcaaaatgggcatatcaagtttttgaccaaaggagataat aatgcggttgatgaccgaggcctctataaacaaggacaacattggctagagaaaaaagat gttgtggggagagccaggggatttgttccttatattggaattgtgacgatcctcatgaat gactatcctaaatttaaggcttag