GENSCAN 1.0 Date run: 5-Nov-116 Time: 23:35:31 Sequence gi568815581f:30871074_31099188 : 228115 bp : 44.29% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5301 5477 177 1 0 57 116 25 0.292 2.09 1.02 Intr + 8350 8414 65 2 2 66 119 41 0.330 3.44 1.03 Term + 22221 23081 861 2 0 67 41 161 0.063 2.03 1.04 PlyA + 23113 23118 6 1.05 2.05 PlyA - 23663 23658 6 1.05 2.04 Term - 28533 28096 438 0 0 49 39 224 0.262 8.88 2.03 Intr - 33364 32993 372 1 0 -12 71 178 0.124 1.36 2.02 Intr - 35547 35402 146 1 2 89 62 92 0.775 6.70 2.01 Init - 39099 39051 49 0 1 86 58 20 0.750 -2.09 2.00 Prom - 39735 39696 40 -3.76 3.00 Prom + 50501 50540 40 -4.66 3.01 Init + 50942 51035 94 1 1 95 110 226 0.999 26.04 3.02 Intr + 51867 51997 131 1 2 119 60 138 0.980 14.51 3.03 Intr + 55754 55845 92 1 2 76 76 117 0.503 8.09 3.04 Intr + 60816 60895 80 0 2 93 103 39 0.631 5.09 3.05 Intr + 63112 63224 113 2 2 64 110 79 0.948 7.80 3.06 Intr + 73834 73980 147 0 0 136 100 134 0.995 19.83 3.07 Intr + 78214 78297 84 0 0 99 83 43 0.876 4.92 3.08 Intr + 82215 82277 63 2 0 95 77 37 0.717 2.21 3.09 Intr + 83130 83193 64 2 1 91 40 37 0.469 -2.51 3.10 Intr + 83361 83482 122 1 2 58 99 103 0.677 8.61 3.11 Intr + 85168 85396 229 0 1 139 46 288 0.757 27.14 3.12 Term + 86712 87013 302 1 2 120 47 85 0.495 2.98 3.13 PlyA + 88096 88101 6 1.05 4.00 Prom + 90115 90154 40 -4.26 4.01 Init + 100001 100372 372 1 0 91 53 405 0.919 32.26 4.02 Intr + 113505 113687 183 2 0 22 102 218 0.504 16.58 4.03 Intr + 116871 117033 163 2 1 30 84 122 0.538 5.55 4.04 Intr + 127589 127699 111 0 0 94 31 61 0.324 1.35 4.05 Intr + 137350 137730 381 2 0 12 94 278 0.054 15.68 4.06 Intr + 145822 145930 109 2 1 83 45 74 0.100 1.94 4.07 Intr + 150837 150969 133 0 1 100 62 89 0.434 8.15 4.08 Term + 163488 163670 183 2 0 87 32 113 0.448 3.14 4.09 PlyA + 164556 164561 6 1.05 5.07 PlyA - 168290 168285 6 1.05 5.06 Term - 176591 176088 504 2 0 82 48 242 0.941 14.04 5.05 Intr - 177323 176746 578 0 2 86 54 181 0.231 6.94 5.04 Intr - 178915 177572 1344 2 0 53 39 598 0.302 39.63 5.03 Intr - 179687 179477 211 2 1 109 106 172 0.958 19.59 5.02 Intr - 181739 181646 94 1 1 60 97 18 0.097 -0.13 5.01 Init - 206325 206228 98 0 2 103 89 59 0.249 7.28 5.00 Prom - 209792 209753 40 -3.86 6.00 Prom + 220447 220486 40 -4.96 6.01 Init + 224237 224296 60 1 0 95 91 127 0.349 14.95 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:30871074_31099188|GENSCAN_predicted_peptide_1|367_aa XKISSPKKVVTSPRKVPPPSPKSSGPKRALPPKTLANYFKVSPKPKNNEEIGMLLENNKD PTFSLMFDGCFEEIKFSTPSLHKITMKEEWHKFIQLLTEFQMRNVDFLYSNLEFILPLPV DTIPETKNFCGPSVTVDASAATKSMNCLARKHSEREQPLKKSQKKKQKKTLVILDDSDLF DTDLDFPDQSISLSSVSSSSNAEESKTGDEESKARDKGNNPETKKSIPCPPKTTAGKKCS ALVSHCLNSLSEFMDNMSFLDALLTDVREQNKYGRNDFSWTNGKVTSGLCDEFSLESNDG WTSQSSGELKAAAEALSFTKCSSAISKALETLNSCKKLGRDPTNDLTFYVSQKRNNVYFS QSAANLE >gi568815581f:30871074_31099188|GENSCAN_predicted_CDS_1|1104_bp naaaaaataagctcccctaagaaagttgttacatcaccaagaaaagttcctccaccatca ccaaaaagtagtggaccaaagcgagcacttcctcccaaaaccttggcaaattattttaaa gtatctcccaaacctaaaaataatgaagaaataggaatgcttctggaaaataataaagac ccaacatttagtttaatgtttgatggctgctttgaagaaatcaagttcagtactccttcc ctgcacaaaatcacaatgaaggaagaatggcataaattcatccagcttcttacagaattc caaatgcggaatgtagattttttatatagtaatcttgagtttattctaccattaccagtt gataccattccagaaactaaaaacttttgtggcccatcagtaactgtggatgccagtgca gcaacaaaaagtatgaattgtcttgctaggaaacactctgaaagagaacagccattgaaa aagtcccagaaaaagaaacaaaagaaaacattggtaatattagatgatagtgatctattt gacactgacttggactttcctgatcaatctattagcctgtcctctgtatcatcttcctca aatgcagaagaaagcaaaaccggagacgaagaaagcaaagccagagacaaaggaaacaat ccagagacaaagaaatctattccttgtcctcctaaaacaactgcaggaaaaaaatgttct gcccttgtttctcattgtttaaattctctctctgagttcatggataacatgtccttctta gatgcacttttaactgatgtaagggaacaaaacaaatacggtagaaatgactttagttgg acaaatggaaaggttacaagtggactttgtgatgagtttagtcttgagagtaatgatgga tggacttctcaaagctctggagaattaaaggcagctgcagaagctctcagctttactaaa tgttcttctgctatttcaaaagcattggaaaccttgaattcttgcaagaaattaggaaga gatccaaccaacgatcttactttttatgtttcacaaaagcgcaataatgtatactttagt cagtcagcagctaatttagagtaa >gi568815581f:30871074_31099188|GENSCAN_predicted_peptide_2|334_aa MGFHHVGQAGLELLTSGWAPTGAPQSKCLGETQGVSTKEALYPLQTKTGLGIAFLEIRPS GSQESKITPNVTFCDENAKEPENALDKLFSSEQQASILHVLNTASTKELEAFRLLRGRRS INIVEHRENFGPFQNLESLMNVPLFKYKSTVQVCNSILCPKTGREKRKSPENRFLRKLLK PDIERERLKISSIISKMPKADFYVLEKTGLSIQNSSLFPILLHFHIMEAMLYALLNKTFA QDGQHQVLSMNRNAVGKHFELMIGDSRTSGKELVKQFLFDSILKADPRVFFPSDKIVHYR QMFLSTELQRVEELYDSLLQAIAFYELAVFDSQP >gi568815581f:30871074_31099188|GENSCAN_predicted_CDS_2|1005_bp atggggtttcaccatgttggccaagctggtctcgaacttctgacctcaggctgggcccca acaggagctcctcaatctaagtgtctaggagaaacccagggagtctccacgaaggaggca ctttacccactacagactaaaactggcctcggaatcgccttcctggagattcggcccagc ggctctcaggagtcgaaaattactcccaatgttactttttgtgatgaaaatgcaaaggag cccgaaaatgcacttgacaagctcttctcttcagaacagcaggcttccatcttgcatgtg ttgaatacagcatctactaaagaacttgaagctttccgattgcttcgtggaagaaggtcc atcaatatcgtagagcacagagaaaactttgggccatttcagaatttagagagtttaatg aatgtgcccttgtttaagtataaaagtacagttcaagtttgtaactccatactttgtcca aagactggacgggaaaaaagaaagtcaccggaaaaccggttcctgagaaagctcctcaaa ccagacatagaaagagaaagacttaagatttcctcgatcatttcaaagatgcctaaagca gatttctatgttctggaaaaaacaggactttccattcagaactcatctctgtttccaata ctgttacattttcatatcatggaagccatgctgtatgccttattaaataaaacttttgcc caggatgggcagcatcaggtgctgagcatgaatcgaaatgcagtggggaagcattttgaa ctgatgattggtgactcccggactagtggaaaagagctagtgaagcagtttctcttcgat tctatactgaaggcggatcctcgggtgttcttcccatcagataaaatagttcactacaga cagatgtttttatctactgaactacaaagagtagaagagctttatgattcattattacaa gctattgccttctatgaattagcagtgtttgactctcagccttag >gi568815581f:30871074_31099188|GENSCAN_predicted_peptide_3|506_aa MGDRERNKKRLLELLRAPDTGNAHCADCGAADPDWASYKLGIFICLNCCGVHRNFPDISR VKSVRLDFWDDSIVEFMIHNGNLRVKAKFEARVPAFYYIPQANDCLVLKEQWIRAKYERR EFMADGETISLPGNREGFLWKRGRDNSQFLRRKFVLLAREGLLKYFTKEQGKSPKAVISI KDLNATFQTEKIGHPHGLQITYRRDGHTRNLFVYHESGKEIVDWFNALRAARLQYLKMAF PELPESELVPFLTRNYLKQGFMEKTGPKPEKEEGASSKLRQQGEAGKGYETDLVICGKKF PLLQQKEPFKKRWFALDCHERRLLYYKNPLDAFEQGQVFLGNKEQGYEAYEDLPKGIRGN RWKAGLTIVTPERRFVLTCPSEKEQQEWLESLRGVLSSPLTPLNRLRQHWHGHLPQPSSF PLQLHQQRVAAAAGDPLTEELAATEHLELLVGRSLHLGPGCPPSVPRSQQPFLAVNSART EAVALSISSLGLPRNPPRGSEDLVHR >gi568815581f:30871074_31099188|GENSCAN_predicted_CDS_3|1521_bp atgggcgatcgcgagcgcaacaagaagcggctgctggagctgctgcgggcgccggacaca ggcaacgcgcactgcgccgactgcggggcggcagatcccgactgggcctcttacaagctg gggatcttcatctgtctcaactgctgcggcgtccaccgtaacttccctgacatcagcaga gttaaatctgtgcgacttgacttctgggacgacagtattgtggagtttatgatccacaat ggaaacctccgtgtgaaggccaagttcgaagccagagtcccagctttctactacatcccc caggccaacgactgcctggtcttaaaggaacaatggattcgagctaagtatgagagacgg gaatttatggctgatggggaaaccatctcgctcccaggtaaccgagaaggattcctgtgg aagcgaggaagggacaactcacagtttctgagaaggaagtttgtacttctggcaagagaa ggcctcctgaagtacttcacaaaggaacagggtaaaagccccaaagctgtcatcagcatt aaggacttgaatgccaccttccagacagagaagatagggcacccccatgggctgcagatc acctacaggagagatggccacaccaggaacctgtttgtgtatcatgaaagtgggaaggag atagtggactggttcaatgccctccgtgcagcccgtctgcagtacctaaaaatggccttt cctgaactcccagagtctgagctcgtgccattcctcaccaggaactacctcaaacaaggc ttcatggaaaagactgggccaaagcctgaaaaagaggaaggtgcctcttccaagttgagg cagcagggtgaagctggcaaagggtatgaaactgacctggtcatctgtggtaagaagttc cctcttttgcagcagaaagaacctttcaagaaaaggtggttcgccctggattgccatgag cggaggctgctctattacaagaacccactggatgccttcgagcagggccaggtttttctt gggaacaaggagcagggatatgaagcctacgaagacctgcccaagggcatccgaggaaat cgctggaaagccggactcaccattgtcaccccagagcggagatttgtcctcacttgcccc agtgagaaggaacagcaggaatggctggaaagtttgcggggtgtcctgtccagccccttg acgcccctcaaccggcttagacagcattggcatggccacctccctcagccctcttcattt cccttgcagctgcatcaacagagagtggccgcagcagcaggtgacccattaactgaggaa ctggctgccactgaacacctggaactccttgtgggaagaagtttgcacctcggccctggc tgcccaccatcagtgccccgcagtcagcagccattcctggcagtgaactctgccaggact gaagctgtggctttatccatcagctccctgggccttccccgcaacccacctcggggatct gaggatctggtgcatagatga >gi568815581f:30871074_31099188|GENSCAN_predicted_peptide_4|544_aa MAGLGLGSAVPVWLAEDDLGCIICQGLLDWPATLPCGHSFCRHCLEALWGARDARRWACP TCRQGAAQQPHLRKNTLLQDLADKYRRAAREIQAGSDPAHCPCPGSSSLSSAAARPRRRP ELQRFYRTQDLNFAILKVAVEKSITEVAQELTELVEHLVDIVRSLQNQRPLSESGPDNEL SILGKAFSSGVDLSMASPKLVTSDTAAGKIRDILHDLEEIQEKLQESVTWKEAPEAQMQG AIHPTFNLKSLSCSLEVSKDSRTVTVSHRPQPYRWSSDQRRRLLGLAQTRGGKGRRGRED AAEARAAGCRLVVQPRPRKEKWEERGARRPASSAAQDDYARHVRDPPAVAVAAPKARASA PDAVVTPPRLAVAPVPVLPAASAAGDMSNPGSRRNGPVKLHLTGLPDPFAKVVVDGSGQC HSIDTVKNMLDPKWNQHYDLYIEKSDSVTISVWNHKKIHKKQGAGFLRCVHLFPVPSTTS KTLVCNKLAVYLDLYLIFLLLRFSLRIRFFLHLALMVQVSKKMALRPRVNNKIFKTWLQA IQMP >gi568815581f:30871074_31099188|GENSCAN_predicted_CDS_4|1635_bp atggcgggcctgggcctgggctccgccgttcccgtgtggctggccgaggacgacctcggc tgcatcatctgccaggggctgctggactggcccgccacgctgccctgcggccacagcttc tgccgccactgcctggaggccctgtggggcgcccgcgacgcccgccgctgggcctgcccc acttgccgccagggcgccgcgcagcagccgcacctgcggaagaacacgctactgcaggac ctggccgacaagtaccgccgcgccgcacgcgagatacaggcgggctccgaccctgcccac tgcccctgcccgggctccagttccctctccagcgcggccgcgaggccccggcgccgcccg gaactgcagcggttttatagaacccaggacctgaactttgctattttgaaggtggcagta gagaagagcatcacagaagttgctcaggagctgacagagctggtggaacatcttgtagac attgtcagaagcctgcagaatcagaggcccctatcagaatctggaccagacaacgaactg agcatcctgggcaaggctttttcttctggggtggatctttccatggcttctccaaagctg gtgacttccgacacagctgcagggaaaatcagagatattctccatgacctagaagaaatt caggaaaaattacaagaaagcgtcacctggaaagaggctcctgaagcacaaatgcagggg gccatccatccaacctttaacttgaagagcctttcctgcagcctggaggtgtccaaggat tcccgtacagtgactgtgtctcaccgcccacaaccctatcgctggagctccgaccagcgc cgtcgactactgggcctagcccagacgagaggagggaaaggccgcagaggccgcgaggat gctgcggaggcgcgggcggccggctgccgcctagttgtgcagcccagaccaaggaaagag aagtgggaggagcggggagctcggcgtcccgcgtcctccgcggctcaggacgactacgct cggcacgtccgggaccctccggccgtggcggttgcagcgccaaaagctcgggcctcagcc cctgacgctgtggtgactccgccgcgcctcgccgtcgcccctgtccccgtcctgcccgcc gcctccgccgctggggacatgtccaaccccggaagccggaggaacgggcccgtcaagctg cacctgacaggacttcctgatccatttgctaaggtggtggttgatggatctgggcaatgc cattctatagatactgtgaagaatatgctcgatccaaagtggaatcagcattatgacctg tatattgaaaagtctgattcagttacaatcagtgtatggaatcacaagaagatccataag aaacaaggtgctggatttctccgttgtgttcatctttttccagtgccatcaaccacctca aagacactggtttgcaacaagctagcggtttacttggacctctacctcatctttcttctt ttgcgcttcagcctgcgcattcgcttcttcctccacttggctctcatggtgcaggtttcc aagaaaatggcgctaaggccgagagttaataataaaatatttaaaacatggctccaggcc atccagatgccttaa >gi568815581f:30871074_31099188|GENSCAN_predicted_peptide_5|942_aa MPAFYLLLLKAMGFGVTGLDLNPDLPDDVGHLNSYKVSQPTPLTPCQSLSFIEGVFKSKP GSYPVSALPQESNENLVPFLDTDSGGELPLGPEQFLAAHQDLNDKRTPQERLPEVVPLLD RDQNQTLVQLPRLKIKPADVEVTMTSEPKNETESSQAQQEAPIQPPEEVAPSATQQEAPT EPPGPPMEPELSPSEQEQPAQPSESSGEVESSPAQHETPAQPPEEMEPSAFQEEAPTEPP RPPIEPELSPSEQEQLAQPSESSGEVESSPTQQETPAQPPEHHEVPVSPSGYHETQHSDF PNVSVKPPDVQLTIATEPSAEVETSPVHQEATAQLSGPGNVEPPAIQHGCPPLPPESSEE AGHLQIQQETSVQSPEPINKENTTPTQQEAAAEHPQTAEEGESSPIQQEAPAQPLMSPEQ FQHLKDQQDIIIQQLNRPENYEPPPVHKEPTTQPPTQLSSDFVSSMDDEAIGSPPDVSYL DLDRELTKPTAVTMWVEPSPVQQDNPSIPTEQADFSLTQPDLPSPPLHSPEKIESPVHQE ATAQTPDPPKEAEPSPVQQEFPAEPPEPPKEVEPSATQQEASGFPQEVEPSVTQQEVPAQ IPEPPVEAEPSLTQQEATVQAAESPREVESSRQQMVPVQLPEPPKEVAAQPPAHYEVTVP TLGQDPAQNSTLPSVTVQPLDLGLTIIPESTTEVELSPTMQETPTQPPKKALQQLVLYQE VKIPTPGQDQAQHPMSPSITVQPLDLGLTITPEPTMEVEHSTPLKRTIVPPKHLKNTLRC SPSDKGQAQLSHLTEATVQPLDLELSITTEPTTEVKPSPTTEETSAQPLDPGLAITPEPT TEIGHSTALEKTRAPHPDQVQTLHRSLTEVTGPPTKLESSQDSLVQSETAPEEQKASTGT NICELCTCGDETLSCVGLSPKQRLRQVPVPEPYTYNGIFTTL >gi568815581f:30871074_31099188|GENSCAN_predicted_CDS_5|2829_bp atgccagccttttaccttttattgcttaaggccatgggctttggtgttacaggtttggat ttgaatccagacctaccagatgatgttgggcacctcaactcctacaaagtatcacagccc acccccctcaccccctgccaaagcctttccttcatagaaggtgtcttcaaatccaagcca ggctcttatccggtgtcagccctgccccaggaatcaaatgaaaatttggttccattcctg gatacggattcaggtggagagctgcccctggggccagagcagttcttggctgcacatcag gatttaaatgacaagcggactccacaagaaaggctcccagaggtggttccactgctggac agggatcagaaccagaccctagttcagcttcctcgcctcaaaatcaaacctgcagatgtg gaagttaccatgacttcagagcccaaaaatgagacagaatcttcccaagcccagcaggag gccccaattcagcctccagaggaggtggcaccttctgcaacccaacaggaggccccaact gagcctccaggtcctcctatggagcctgaactttcccccagtgaacaggagcagccagct cagccttctgagtcttctggagaggttgaatcttctccagcccagcacgagaccccagct cagcctccagaagagatggaaccatctgcattccaagaggaggccccaactgagcctcca cgtcctcctatagagcctgaactttcccccagtgagcaggagcagctagctcagccttct gagtcttctggggaggttgaatcttctccaacccagcaggagaccccagctcagccccca gaacatcatgaagtcccagtttcaccttcaggttaccatgaaactcagcattcagatttt cccaatgtctctgttaagcctccagacgtgcagctcactatagcaacagagcctagtgca gaggtggaaacttctccagtccaccaggaggctacagctcagctctcaggtccaggtaat gtagaacctcccgccatccagcacgggtgcccacctctgcctccagagtcatcagaagag gctggacatttacaaattcaacaggaaacttcagttcaatctccagaacctattaataaa gagaacaccactccaacccagcaggaggctgcagctgagcatccacagaccgctgaggag ggtgagtcttctccaatccagcaggaggccccagctcagcccttaatgtcccctgagcag tttcaacatttgaaagaccagcaagacattataattcagcagctaaatagacctgaaaat tatgaacctcctccagtccataaagagcctacaactcagcctccaactcagctctcctca gactttgtaagttcaatggatgatgaagcaataggttcacctccagatgtgtcatatcta gatctagatagggagcttaccaaacctacagcagtcactatgtgggtagaaccttctcca gtccagcaggacaacccttctattcccactgagcaggctgacttttctttaacccagcct gatctcccttccccacctctgcattctcctgagaagattgaatctccagtccaccaagag gccacagctcagactccagatccccctaaggaggcagaaccttctccagtccagcaagag ttcccagctgagccaccagagccccctaaggaggttgaaccatctgcaacccagcaggaa gcctcagggttccctcaggaggtagaaccatctgtaactcagcaggaggttccagctcag attccagagccccctgtggaggcagaaccttctctgacccagcaggaggccacagttcag gctgcagagtcccctagggaggtagaatcttcaaggcagcaaatggtcccagttcagctt ccagagccacctaaggaagttgcagctcaacctccagctcattatgaggtgacagtccca acactaggccaggatccagctcagaattcaacattgcccagtgtcactgttcaacctttg gacctgggacttaccatcattccagaatcaacgacagaggttgaactttctccaaccatg caggagaccccaactcagcctcctaagaaagctctacaacaacttgtactatatcaagag gtaaaaattccaacaccaggtcaggatcaagctcagcatccaatgtcacccagcattaca gttcaacctttggacctgggacttaccatcactccagaacccactatggaggttgaacat tctacacccctgaagagaactatagttcctccaaagcaccttaagaacaccctgaggtgt tcaccttcagacaagggtcaggctcagctttcacacctgaccgaagccacagttcaacct ctggacctggagcttagcataactacagagcctactacagaggttaaaccatctccaacc acggaggaaacctcagctcagcctctagacccggggcttgccataactccagaacctact acagagattggacattccacagccctggagaagactagagctcctcatccagaccaggtt cagactctgcatcgaagcctgactgaagtcacaggtccacctacaaagttagaatcttcg caggattcattggtgcagtctgaaactgcaccagaggaacagaaggcctccacaggcacc aacatatgtgagctctgcacctgcggagatgagactctgtcatgtgttggtctcagccca aagcagaggctccgccaagtgcctgtgccagagccctacacctacaatggcatcttcacc accttgtaa >gi568815581f:30871074_31099188|GENSCAN_predicted_peptide_6|20_aa MAAHRPVEWVQAVVSRFDEQ >gi568815581f:30871074_31099188|GENSCAN_predicted_CDS_6|60_bp atggccgcgcacaggccggtggaatgggtccaggccgtggtcagccgcttcgacgagcag