GENSCAN 1.0 Date run: 4-Nov-116 Time: 10:19:55 Sequence gi568815597f:169006830_169231552 : 224723 bp : 39.76% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.09 PlyA - 1648 1643 6 1.05 1.08 Term - 3255 3124 132 0 0 102 42 71 0.694 1.01 1.07 Intr - 4122 3918 205 2 1 25 39 94 0.391 -3.52 1.06 Intr - 6204 5932 273 2 0 77 116 174 0.416 14.83 1.05 Intr - 7156 7065 92 1 2 59 69 18 0.182 -5.13 1.04 Intr - 10013 9937 77 0 2 41 81 88 0.000 1.72 1.03 Intr - 12280 12076 205 1 1 81 48 86 0.001 1.75 1.02 Intr - 13055 12876 180 2 0 90 28 111 0.002 4.44 1.01 Init - 25347 25066 282 0 0 79 82 192 0.999 14.79 1.00 Prom - 28381 28342 40 -2.15 2.10 PlyA - 28454 28449 6 1.05 2.09 Term - 35176 35123 54 1 0 124 54 19 0.092 -1.32 2.08 Intr - 38620 38472 149 2 2 102 66 61 0.122 4.33 2.07 Intr - 41680 41480 201 2 0 85 11 99 0.119 0.24 2.06 Intr - 45903 45762 142 0 1 12 46 107 0.447 -2.09 2.05 Intr - 46190 46077 114 2 0 106 46 90 0.576 6.32 2.04 Intr - 50377 50273 105 1 0 109 76 16 0.381 1.99 2.03 Intr - 53528 53391 138 2 0 107 10 79 0.270 1.74 2.02 Intr - 53690 53583 108 2 0 103 109 58 0.994 8.96 2.01 Init - 55391 55185 207 2 0 83 65 83 0.721 4.37 2.00 Prom - 66777 66738 40 -5.85 3.00 Prom + 75612 75651 40 -2.95 3.01 Init + 81689 81849 161 1 2 22 56 128 0.577 2.34 3.02 Intr + 82446 82498 53 0 2 111 111 14 0.798 3.63 3.03 Intr + 84569 84850 282 0 0 34 86 173 0.937 8.17 3.04 Intr + 93508 93625 118 2 1 71 121 52 0.951 5.40 3.05 Term + 94177 94312 136 1 1 77 48 125 0.928 3.91 3.06 PlyA + 95004 95009 6 -0.45 4.07 PlyA - 95460 95455 6 -0.45 4.06 Term - 98231 98034 198 2 0 23 35 154 0.481 0.02 4.05 Intr - 100305 99733 573 0 0 81 48 304 0.027 17.71 4.04 Intr - 101172 101021 152 1 2 104 -90 283 0.040 11.06 4.03 Intr - 104726 104623 104 1 2 42 89 86 0.524 3.00 4.02 Intr - 105690 105611 80 0 2 7 88 79 0.336 -2.77 4.01 Init - 107065 106916 150 1 0 63 82 67 0.287 3.69 4.00 Prom - 108835 108796 40 -5.55 5.00 Prom + 111956 111995 40 -6.75 5.01 Init + 112270 112314 45 0 0 91 99 18 0.018 3.93 5.02 Intr + 118066 118210 145 0 1 -31 115 138 0.000 3.53 5.03 Intr + 120395 120453 59 0 2 75 52 106 0.626 3.38 5.04 Intr + 123181 123261 81 0 0 45 98 93 0.641 4.92 5.05 Term + 124463 124726 264 1 0 56 43 386 0.740 25.42 5.06 PlyA + 125172 125177 6 1.05 6.04 PlyA - 125300 125295 6 1.05 6.03 Term - 129470 129255 216 2 0 71 40 111 0.121 0.76 6.02 Intr - 162725 162618 108 2 0 62 94 89 0.271 6.46 6.01 Init - 223987 223889 99 1 0 71 95 116 0.455 10.91 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 101172 100982 191 1 2 104 33 249 0.935 17.63 S.002 Intr + 118055 118210 156 0 0 98 115 151 0.999 17.86 S.003 Sngl - 180321 179851 471 0 0 30 38 193 0.899 4.37 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:169006830_169231552|GENSCAN_predicted_peptide_1|481_aa MVPLHWVIQPDLVGRKKEREREREREKEEEGEGEGEEEEERRRRRRRKKKEGRKERKRKK ERKKKKETRKKEKRKEIMKTMESKERRLRLLGPEPIPKWCWDMQEEPKAMSRAQVLRELS SDQRMQFRTHLGGSETPKLDCKDMESSAQRMDGVVGYRFGLLDVSQGGLCGRTPGRMIAG NFSLPWRAQMPPCWVRTGFLGGPGSAVASKPSVWRFPAPVAGGQLAVPAWSSAGALLPLA LGQHADFKAPSPNTVTFPVTGHQSVHVQRWRAHSSAHGRSQVPAIMSDKWLVHKQCLING CRISLFLAELPLKVAGVSLSGINSSQNLLHLPGDNVLEKASVSCQNPTGMGRTPWKRRIP VCVLQQETVRSQCPGEVSRQGIRCDKWVIESVSIGQQWNSCTSLLTWVSQKREPEAKGFC TATISVVILRKKECWKKSKTTAKPGHKEHTDSGVQMCLEREDYRTNMLYKTGTNKFLSQG F >gi568815597f:169006830_169231552|GENSCAN_predicted_CDS_1|1446_bp atggtgccactgcactgggtgatacagccagatcttgttggaagaaagaaagagagagag agagagagagagagagagaaagaagaagaaggagaaggagaaggagaagaagaagaagaa agaaggagaagaagaagaaggaagaagaaggaaggaaggaaagaaagaaaaagaaagaaa gagagaaagaaaaagaaagaaacaagaaagaaagaaaaaagaaaagaaataatgaaaact atggaaagcaaagaaaggaggctgcgccttcttggtccagagcccattccaaagtggtgc tgggatatgcaggaggagccaaaggcaatgagtcgagcccaggtgctcagagagctgagt tctgatcagagaatgcagttcagaacacacctggggggctcagagaccccgaagctagac tgcaaggatatggagagctcagctcagaggatggatggggtggtaggttatagatttggc cttttggatgtctcacaaggaggtctttgtggcagaacacctggaaggatgattgcaggc aatttctccctcccctggagagcacagatgccaccctgttgggtgcggactgggttctta ggaggcccaggaagtgctgttgcttctaaaccctctgtttggcggtttcctgcacctgtt gcaggagggcagttggctgtccctgcatggtcgagtgctggggctttacttcctctggct ctgggccagcatgcagacttcaaggccccatctccaaatacagtcacattcccagttact gggcatcagagcgtccacgtacagagatggagggctcacagttcagcccatggcagatca caggttcctgccatcatgtctgacaaatggttagtgcacaagcagtgtttgatcaatgga tgccggatttccctgttcttggccgaattgcctttgaaagtagcaggagtcagtctctca ggcattaatagttcccagaaccttctccatctccctggagataatgtcttggagaaggct tctgtcagctgccagaaccccacaggaatggggaggactccatggaagagaagaattcca gtctgtgtgcttcagcaagaaactgtaaggtcacagtgcccaggagaggtcagcagacaa ggaatcagatgtgataagtgggttattgaatctgtgtccataggacagcagtggaattct tgtacttctcttttaacctgggtgtcccagaaaagagaacctgaggcaaagggcttctgt actgctactatatcagttgtcatcctgaggaaaaaagagtgttggaaaaagagtaaaacc actgcaaaacctggccataaggagcatacagactctggtgttcagatgtgcttggagagg gaagactatagaacaaatatgttatataagacaggaacaaataagttcctgtctcagggc ttttga >gi568815597f:169006830_169231552|GENSCAN_predicted_peptide_2|405_aa MPYRVLVCSSSSDLVVVRTFTWQPRAPKASVLGNKEEAVSPFMTKPWTSATLLVNAVSSL LTFKWVSWEPALADGEEPHYLKTTHSGDVLYVIGLPCENPWLFFQALLGKICVCGLSPEH TLVFSGGKHSQGRKRFCNVDSVRCSSLRNLSTQPPLHPGEINSLVALHMDACDTSQGWRV CNTESRDYSRHLNRTASQDGLFLLSYASIIGSKLVQTLGSLDTMNSESEPFILSCSRTTR RQNSSDTELKKEGVYLAGASARLLSQELSFPIQLFGASYHVHFQFKVRLRQPLARLVPHM IPTSQVPASRSNLWTELAKVHEITVANRGEVKARGSILGTVLGFKVGIRAGLLVKGGGQQ AQRAQVALGKPGEGCEVPSLRQAREGHMLPIYYLHEIQFLAYTCD >gi568815597f:169006830_169231552|GENSCAN_predicted_CDS_2|1218_bp atgccctacagagtgcttgtatgcagctcatccagtgatcttgtggtagttagaaccttt acatggcagcccagggctccaaaagcaagtgttctaggaaacaaagaggaagctgtaagc ccttttatgacaaagccttggacatcagctactctgttggtgaatgcagtctcaagccta ctcacattcaagtgggtgtcatgggagccagctcttgctgatggtgaggagcctcattat cttaagacaacacacagtggagatgtgctttacgtaattggcctgccctgtgaaaacccc tggctgttttttcaggctcttctagggaaaatttgcgtctgtggccttagtccagaacac accctagtcttctctggagggaaacacagtcagggaaggaagagattttgtaatgtggac agtgttcggtgttcatcgctgaggaatctgagtactcagcccccactgcacccaggtgaa ataaacagccttgttgctcttcacatggatgcgtgtgacactagccagggatggagagtg tgtaatactgagagtagggattattctaggcacctgaatagaacagccagccaggatggc ttgttcctgctctcctatgcttctatcattggcagtaaacttgtgcaaacactgggatcc ctggataccatgaattcagagtctgagcctttcatcttgagctgtagcaggaccacccgc agacaaaactcctcagacaccgagttaaagaaggaaggggtttatttggccggggcatcg gcaagactcctgtctcaagagctgagcttcccgattcaattatttggcgcttcctaccat gtacattttcaatttaaagtgaggctacgccaacccctggcaaggcttgtgccccacatg ataccaaccagccaagtacctgcttccagaagcaatctgtggacagagcttgcaaaggtc catgagatcactgttgcaaaccggggagaggtcaaggcaagaggaagtatcttgggaaca gtgctgggtttcaaagtaggcatcagggcagggctgctagttaaagggggtggtcagcag gcccagagggcccaggttgccctggggaagcctggagaaggctgtgaagttccaagctta aggcaggcaagggaaggccatatgctaccaatctactatcttcacgagatccagttttta gcttacacatgtgactga >gi568815597f:169006830_169231552|GENSCAN_predicted_peptide_3|249_aa MSLPLESRIGGCERERENAQTQTGSRDLLHNLQSPVQNENAGHLPENLLRISKRSNNLGF LDEDIRPTFKKPFRYEEQVPIQANSRKEGFNVGIQKSHRNPRTGLLGTEGPVRRGTEKMF RTLAEAASGLPVFILLFFLVIEPCVLYAGTWLPVAHYSPQPALQQGRNLHQGSPGLGFPA TCDLAGKCELCQSDSLCLESRLEARTRVPALKGHEPMRNHADEGSKYNERVGKFLTQVLK KYEIFYSKK >gi568815597f:169006830_169231552|GENSCAN_predicted_CDS_3|750_bp atgtctctgcctttggaaagccggattggaggatgtgaaagggagagagaaaatgctcaa acacagacagggagtcgggacctgctacataatttgcagagtccagtgcaaaatgaaaat gcagggcacttgccagaaaatttattaagaatttcaaaaaggtctaacaatttaggattc cttgatgaagacataaggcctacctttaagaaaccatttcgttacgaagaacaggttccc attcaggctaattctagaaaagagggttttaatgtcgggatacagaaatctcacaggaat ccaaggacagggctgcttggtacagagggccccgtgagaagaggaactgagaagatgttc agaactctggcagaagctgctagtggtcttccagtgttcattcttcttttcttcttagta atagaaccctgtgtgctttatgcgggcacatggctgccggttgcacactacagtccccag cctgccttgcagcaagggaggaacctacaccaaggcagtccaggcttaggtttcccagca acctgtgacctggctggaaaatgtgaactgtgccaatccgattccctttgcctggaatcc agactagaagcaagaaccagagtacctgctttaaagggacatgagccaatgagaaatcat gcagatgaaggcagcaagtataatgagagggttggaaagtttctcacacaagtactgaag aaatacgagattttttactcgaaaaagtga >gi568815597f:169006830_169231552|GENSCAN_predicted_peptide_4|418_aa MRKRRDRNYQAKEQGRWRTEDFSCDSDHRTGMHFSKMSHQHSFGEVRVCQLLLNLCQGKA EKGQVIAIPQQQLQGHSKSDTHAISEGNEAIHVDFTWRGHSVLICGLKFTDDPPKQEHQS ASVQDSLNDCSGGASGVGGDGHDDDDDDDDVGGGGGSDAAGGPRPCAPLWGHRIPHRTAS PGRRDPAASTTLAPATPASAVGAEPAADPAQGMRRPGKIRRGGRELRTPYLHQLPPVLPR NSFFSEFQMNFFQLPSSLAFPRAMAMAGQQLPRSGGWVAAARALASSAPLGFSGLALCRQ DAPLPLRLSECETRGGGAKAGAGGGGGGGGGGGGAGAEGGTAQAARLRAPHLLLACRTCA NLNIPEIRGRSDHLSTKPQWGRLQGSVKGIREREQLIEKGSHLSFSTCNVYVGKTPAW >gi568815597f:169006830_169231552|GENSCAN_predicted_CDS_4|1257_bp atgaggaaaagaagagacagaaattaccaggccaaagaacaaggaagatggaggactgag gacttttcctgtgactcagaccacaggacaggaatgcatttttccaagatgagccaccag cattcctttggagaagtcagggtttgtcagctgttactaaacctttgccaagggaaagca gagaagggacaagtcattgccatccctcagcagcagcttcaaggtcattcaaaaagtgac acccatgctatctctgaaggaaatgaagctattcatgtggattttacctggcggggccac tcggtcctgatatgtgggcttaaattcactgatgatcctcccaagcaggaacaccaatcg gcctctgttcaggactccctgaatgactgcagtggtggtgccagtggtgttggtggtgat ggtcatgatgatgatgatgatgatgatgatgttggaggaggagggggcagtgatgcagca gggggaccccgaccctgcgccccactctggggacaccgcattccgcacaggactgcaagc ccaggccggcgcgacccagcagcgtccaccaccctcgcccccgccaccccggccagcgcg gtgggtgcggaaccggccgcggaccctgcgcaggggatgcgccgcccgggaaagatcaga cgcggcggccgggagctgcggaccccgtacttacaccaactgccaccggtcctgcccaga aactccttcttctctgagttccagatgaatttcttccagctgccctcctccttggctttc ccgcgggccatggcgatggcgggtcagcagctgccgcggtccggagggtgggtggctgcg gcgcgcgccctggcgtcctctgcgccgctcggcttctccggcctggctctctgcaggcag gacgcgccgctgccgctgcggctctcggagtgcgagacgcgcggcggaggagccaaggca ggagcaggaggagggggaggcggcggcggcggcggcggcggggcgggggcggagggaggg acggcgcaggctgcgcggctccgagccccgcacctattgctcgcctgcaggacctgtgca aacctaaacattcctgaaattcgggggagaagcgaccacctatccaccaagccccagtgg gggaggctgcagggctctgtcaagggcatccgagagagagaacagcttattgagaaagga agccacctctcgtttagcacctgtaatgtttatgttggaaaaaccccagcttggtag >gi568815597f:169006830_169231552|GENSCAN_predicted_peptide_5|197_aa MEDLPYFSRDGDQRQIPQIQKTEISFRPNDPKSYEAYVLNIVRFLEKYKDSAQRDDMIFE DCGDVPSEPKERGDFNHERGERKPPKNESLETYPVMKYNPNVLPVQCTGKRDEDKDKVGN VEYFGLGNSPGFPLQYYPYYGKLLQPKYLQPLLAVQFTNLTMDTEIRIECKAYGENIGYS EKDRFQGRFDVKIEVKS >gi568815597f:169006830_169231552|GENSCAN_predicted_CDS_5|594_bp atggaggatcttccttatttctcaagagatggggaccaaaggcagattcctcagatccag aagactgaaatttcctttcgtcctaatgatcccaagagctatgaggcatatgtactgaac atagttaggttcctggaaaagtacaaagattcagcccagagggatgacatgatttttgaa gattgtggcgatgtgcccagtgaaccgaaagaacgaggagactttaatcatgaacgagga gagcgaaagcctcccaagaatgagtccttggagacttacccagtgatgaagtataaccca aatgtccttcccgttcagtgcactggcaagcgagatgaagataaggataaagttggaaat gtggagtattttggactgggcaactcccctggttttcctctgcagtattatccgtactat ggcaaactcctgcagcccaaatacctgcagcccctgctggccgtacagttcaccaatctt accatggacactgaaattcgcatagagtgtaaggcgtacggtgagaacattgggtacagt gagaaagaccgttttcagggacgttttgatgtaaaaattgaagttaagagctga >gi568815597f:169006830_169231552|GENSCAN_predicted_peptide_6|140_aa MVTEMYSGPCVAMEIQQNNATKTFREFCGPADPEIARHLRPGTLRAIFGKTKIQNAVHCT DLPEDGLLERLQSRLIRTSPRPNLYLLVISEPLLLIKRKGSYVLFLQDCSILLEALQKTD QVMTISGTGCARYMWLAVLN >gi568815597f:169006830_169231552|GENSCAN_predicted_CDS_6|423_bp atggtgacagaaatgtattctggcccttgtgtagcaatggagattcaacagaataatgct acaaagacatttcgagaattttgtggacctgctgatcctgaaattgcccggcatttacgc cctggaactctcagagcaatctttggtaaaactaagatccagaatgctgttcactgtact gatctgccagaggatggcctattagagaggctccaaagcagactcatcagaacctcccca aggcctaatctttacctgctggttatttcagaacccctgctccttatcaaaaggaaagga tcctatgtcctattcctgcaggattgttccatactccttgaagcactccaaaagacagac caagtaatgactatatcgggcactggctgtgcccgatatatgtggctggctgttttaaat taa