GENSCAN 1.0 Date run: 3-Nov-116 Time: 13:01:14 Sequence gi568815589f:90743899_90995597 : 251699 bp : 44.38% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.11 PlyA - 1095 1090 6 -0.45 1.10 Term - 1537 1482 56 2 2 117 55 20 0.064 -0.68 1.09 Intr - 7607 7325 283 2 1 74 86 163 0.198 11.69 1.08 Intr - 8170 7974 197 2 2 58 -42 182 0.149 1.33 1.07 Intr - 10532 10437 96 0 0 114 75 40 0.570 5.28 1.06 Intr - 12526 12263 264 2 0 62 100 107 0.124 6.78 1.05 Intr - 15352 15056 297 2 0 72 -54 251 0.024 6.15 1.04 Intr - 15861 15760 102 1 0 -11 94 102 0.617 1.05 1.03 Intr - 18434 18356 79 2 1 84 -9 81 0.042 -2.88 1.02 Intr - 24982 24760 223 0 1 93 108 52 0.019 5.83 1.01 Init - 31862 31834 29 2 2 65 101 32 0.011 1.29 1.00 Prom - 46179 46140 40 -2.96 2.00 Prom + 54574 54613 40 -2.16 2.01 Init + 57622 57995 374 0 2 58 96 167 0.639 8.95 2.02 Intr + 63556 63637 82 1 1 51 84 51 0.135 0.64 2.03 Term + 76579 76704 126 0 0 97 44 72 0.182 1.98 2.04 PlyA + 76977 76982 6 1.05 3.00 Prom + 82559 82598 40 -1.96 3.01 Init + 100001 100417 417 1 0 82 80 705 0.820 65.53 3.02 Term + 101536 101700 165 0 0 79 48 180 0.972 11.02 3.03 PlyA + 102759 102764 6 -0.45 4.00 Prom + 103221 103260 40 -1.26 4.01 Init + 104994 105143 150 2 0 81 70 77 0.950 5.24 4.02 Term + 107428 107619 192 0 0 113 47 68 0.848 2.62 4.03 PlyA + 108028 108033 6 1.05 5.00 Prom + 109501 109540 40 -2.46 5.01 Init + 117818 117909 92 1 2 63 113 24 0.813 2.37 5.02 Intr + 118308 118446 139 0 1 108 80 290 0.999 30.67 5.03 Intr + 120691 120769 79 0 1 119 96 20 0.880 5.02 5.04 Intr + 129007 129180 174 2 0 46 63 75 0.393 0.71 5.05 Intr + 130322 130393 72 0 0 43 100 62 0.430 2.28 5.06 Intr + 130774 130951 178 2 1 81 106 386 0.987 38.68 5.07 Intr + 133673 133882 210 2 0 75 72 333 0.990 28.43 5.08 Intr + 134866 135055 190 1 1 31 115 203 0.994 16.89 5.09 Intr + 143851 143991 141 0 0 90 46 129 0.719 9.45 5.10 Intr + 148884 149032 149 2 2 26 77 46 0.136 -3.67 5.11 Intr + 157479 157545 67 0 1 151 70 12 0.128 4.61 5.12 Intr + 159408 159614 207 2 0 84 89 26 0.319 1.57 5.13 Intr + 159850 160189 340 0 1 94 92 237 0.872 19.75 5.14 Intr + 160436 160968 533 0 2 3 46 384 0.129 18.35 5.15 Intr + 165404 165425 22 1 1 94 109 22 0.130 2.02 5.16 Intr + 169845 170010 166 1 1 77 55 98 0.042 4.42 5.17 Intr + 172050 172068 19 0 1 80 105 4 0.144 -2.09 5.18 Intr + 172498 172536 39 0 0 68 94 58 0.174 2.82 5.19 Term + 174290 174361 72 1 0 71 45 102 0.173 2.11 5.20 PlyA + 177807 177812 6 1.05 6.00 Prom + 181473 181512 40 -3.36 6.01 Init + 193294 193429 136 0 1 28 105 177 0.528 13.71 6.02 Intr + 193556 193717 162 0 0 36 52 159 0.694 7.05 6.03 Term + 194436 194548 113 1 2 54 43 81 0.588 -1.08 6.04 PlyA + 200289 200294 6 1.05 7.03 PlyA - 202792 202787 6 1.05 7.02 Term - 213526 213257 270 1 0 69 54 209 0.573 11.08 7.01 Init - 235406 235290 117 2 0 94 77 59 0.687 5.61 7.00 Prom - 245124 245085 40 -3.06 8.02 PlyA - 246015 246010 6 1.05 8.01 Sngl - 250376 250041 336 2 0 71 44 216 0.833 11.53 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 15352 15048 305 2 2 72 44 258 0.907 15.13 S.002 Init - 97121 97033 89 2 2 62 64 87 0.851 3.81 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:90743899_90995597|GENSCAN_predicted_peptide_1|541_aa MVTASYIAEMVTHPKKVSQDQCLECPDESPKLTGPWVRINVPTMNKRAVDNETVLTVPTD REKKLSINSRANLLPLAPMSCFAQHTRNNITEEVYTPCDIGSNIIRFFRGYYTRINITGW VYTYCDIERNIMLSPSLDIRSNITARLRPKTGIQMPLREQRYTGIDEDGHVVERRVFGYQ PFTCVDLLNWKTNTPPYTKKPQALIDLLQTVIQTHNHTWADWHLLLMFLFNSEERRRVPK QQLKSPVQPDCVEVLDSVDSSTPDLRDQPCTSVDWELYVDGSSFFNPQGERDAGYAVVTL DTVVEARLLPQATSAQKAELIAFTRALELSEDIRKNVTGDVNSLAILGVISSSLPLHIGN NITVLRNLLVILAVSSDSHLHTPMYFFLSNPCWADIGFTSATVPKMTVDMQSHIRVISYA SCLTRMSFLDFIVRWEYKAFSTCGSHLAVVCLFYGTGIGMYLTSAVAPPPRNGVVASVMY AVVTPMLNPFIYSLRNRDIQSALWRLRSRTVESHDLFHPFSCVALDFLKGVINSGFGFHS Y >gi568815589f:90743899_90995597|GENSCAN_predicted_CDS_1|1626_bp atggttacagcatcctacattgcagaaatggttacccatcccaagaaagtgtcacaagat cagtgtctggaatgcccagatgagagccccaaactgactggcccctgggtcagaatcaat gtgcctactatgaacaaaagggctgttgacaatgagactgtcctaactgtccctactgac agagagaaaaaactctccatcaattctagagctaaccttcttccactagccccaatgagc tgctttgctcaacatactaggaacaatatcacagaagaggtgtacactccctgcgatatt gggagtaatatcatacgcttcttccgtggatattatactcggatcaatatcactggctgg gtgtacacctactgcgatattgaacgtaatatcatgctctctccctccctggacattagg agcaatatcacagctcgtttacgacccaaaacggggatacaaatgcccctgagagagcag cggtatactgggatagatgaggatggtcacgtggtggagaggcgtgtttttgggtaccag cccttcacctgcgtcgaccttctcaactggaaaaccaatacaccgccctataccaaaaag ccacaagccctaattgatttgctccaaactgttatccagacccacaaccacacctgggct gattggcacctgttgctcatgttcctctttaacagcgaagaaaggcggagagtccccaag cagcaactaaagagccctgtccagcctgattgtgtagaagtgttggactcagttgactct agcacacctgacctccgggaccagccttgcacatcagtagactgggaactatatgtggat gggagcagcttcttcaacccccaaggagagagagatgcagggtatgcagtggtaaccctg gacactgttgttgaagccagattgttgccccaggccacttcagcccagaaagcggaactc attgctttcactcgggccttagaactcagtgaagatattaggaaaaatgtcactggggat gtgaacagccttgcaatattgggagtaatatcatcctctctccccttgcatattgggaac aacatcacagtgctgaggaacctgctcgtcatcctggctgtcagctctgactcccacctc cacacccccatgtacttcttcctctccaacccgtgctgggctgacatcggtttcacttcg gccacggttcccaagatgactgtggacatgcagtcacatatcagagtcatctcttatgcg agctgcctgacacggatgtctttcttggatttcatcgtcagatgggagtataaagccttc tccacctgtggctctcacctggcagttgtttgcttattttatggaacaggcattggcatg tacctgacttcagctgtggcaccaccccccaggaatggtgtggtggcgtcagtgatgtac gctgtggtcacccccatgctgaaccctttcatctacagcctgagaaacagggacattcaa agcgccctgtggaggctgcgcagcagaacagtcgaatctcatgatctgttccatcctttt tcttgtgtggctctcgacttcctcaaaggagtcataaattcggggtttggcttccattcc tattga >gi568815589f:90743899_90995597|GENSCAN_predicted_peptide_2|193_aa MRSMSRAAPPSLPAKGPCPAGGERRAPPGLLYTCRRLGRFRGPRPALQPIPAQLRAHGRG QQGGPGRRGATLGGSAGRLPGRVKEVAQNEEEPRARRLRPPRRRLESEEERVAPRCARPR LTWRSSSFVNGAVTGGFTLIPNGDRACSARYLAQHHVEAAKAWGFHPLKPQPELYGGPFQ PQLEQLGHRAPSP >gi568815589f:90743899_90995597|GENSCAN_predicted_CDS_2|582_bp atgcggtccatgtcccgggcagccccaccttctctgcctgcgaagggcccttgtccggcg ggaggagagaggcgcgccccacccgggctcctctacacctgccgccgcctgggccgattc cgcgggcctcgcccggcgcttcagccgattcccgcccagctccgggctcatgggcgcggt cagcagggcgggccagggcggcggggcgcgacactgggaggaagtgcgggccgcctgccc gggcgcgttaaggaagttgcccaaaatgaggaagagccgcgggcccggcggctgaggcca ccccggcggcggctggagagcgaggaggagcgggtggccccgcgctgcgcccgccctcgc ctcacctggcgcagctcttcttttgtaaatggagcagtcacagggggcttcactctgata cctaatggagaccgtgcctgcagtgctcggtacctggctcaacaccacgtggaagctgcc aaagcttggggctttcaccctctgaagccacagcctgagctctacggtggcccctttcag ccacagctggagcagctgggacacagggcaccaagtccctag >gi568815589f:90743899_90995597|GENSCAN_predicted_peptide_3|193_aa MASSGMADSANHLPFFFGNITREEAEDYLVQGGMSDGLYLLRQSRNYLGGFALSVAHGRK AHHYTIERELNGTYAIAGGRTHASPADLCHYHSQESDGLVCLLKKPFNRPQGVQPKTGPF EDLKENLIREYVKQTWNLQGQALEQAIISQKPQLEKLIATTAHEKMPWFHGKISREESEQ IVLIGSKTNGKFL >gi568815589f:90743899_90995597|GENSCAN_predicted_CDS_3|582_bp atggccagcagcggcatggctgacagcgccaaccacctgcccttctttttcggcaacatc acccgggaggaggcagaagattacctggtccaggggggcatgagtgatgggctttatttg ctgcgccagagccgcaactacctgggtggcttcgccctgtccgtggcccacgggaggaag gcacaccactacaccatcgagcgggagctgaatggcacctacgccatcgccggtggcagg acccatgccagccccgccgacctctgccactaccactcccaggagtctgatggcctggtc tgcctcctcaagaagcccttcaaccggccccaaggggtgcagcccaagactgggcccttt gaggatttgaaggaaaacctcatcagggaatatgtgaagcagacatggaacctgcagggt caggctctggagcaggccatcatcagtcagaagcctcagctggagaagctgatcgctacc acagcccatgaaaaaatgccttggttccatggaaaaatctctcgggaagaatctgagcaa attgtcctgataggatcaaagacaaatggaaagttcctgtga >gi568815589f:90743899_90995597|GENSCAN_predicted_peptide_4|113_aa MDLAVREAIEKRPEGSEEPSPPREWQIRMPCGRSVPGEQGTHRGQCHWQRSLPTPFRSDF EKVVTGPLHMCLKNNEYKQRLFVEGLGSVGQWLSLTAACGISGGDVHHLAFAI >gi568815589f:90743899_90995597|GENSCAN_predicted_CDS_4|342_bp atggaccttgctgtgagggaggccattgagaagaggcctgaaggcagcgaggaaccatcc ccacccagggaatggcaaattcgaatgccctgtggcaggagcgtgcctggggagcagggc acccacagaggccagtgtcactggcagagaagcttacccacaccattcaggagtgacttt gagaaggtagttacagggcctttgcacatgtgcttgaagaataacgaatataaacagagg ctctttgtggaggggctgggcagtgtgggtcagtggttgtccctcacagccgcctgtggc atttctggaggtgatgttcatcacctggcctttgctatataa >gi568815589f:90743899_90995597|GENSCAN_predicted_peptide_5|962_aa MPTKHPHPHSKLGGRAVHLGSNPNAGPGEERIRARDNNGSYALCLLHEGKVLHYRIDKDK TGKLSIPEGKKFDTLWQLVEHYSYKADGLLRVLTVPCQKIGTQVTSVNQFSRSAEILCDK TDPIDIIFTNKTRCPLALSTGLGHGEQCLPLKCSADRVKEEGNRQESTVSFNPYEPELAP WAADKGPQREALPMDTEVYESPYADPEEIRPKEVYLDRKLLTLEDKELGSGNFGTVKKGY YQMKKVVKTVAVKILKNEANDPALKDELLAEANVMQQLDNPYIVRMIGICEAESWMLVME MAELGPLNKYLQQNRHVKDKNIIELVHQVSMGMKYLEESNFVHRDLAARNVLLVTQHYAK ISDFGLSKALRADENYYKAQTHGKWPVKWYAPECINYYKFSSKSDVWSFGVLMWEAFSYG QKPYRSQPSSGGLCRMEKFPGLSRFSESQGGSRCGSNSCLKINKPKIKHEFPGPWAYRFN LSSKEKEETMTALWYRKPCCRALNFPSIRPGNEALKPVPLQPDYPSLTFVSPSANQAPLK PQGPLKPVPQLPLQPGPQAPEGPAQQHKISHSPLTLLSRRLTPRLLKIYSLRIRSLSPRI RLLRQIISPHNCLLLWHSQFQVYIQAAVQPDPIHPGQVWLRPATWESFSFKFLKDFKASV KQYGTNSPFVHSTLKALAEDAWDNIQNDGKICPSFTAVTQGQQEPYPDFIAHLQDATEKT IPDSHGQRLVVELKAYEQTNADCQVAIRPIKGKIPPGGDILFYIKACEGVGELYIQQWSW HKPWPLLECLDDSLANAFNAANQGIVEKIVLGFQAAILFNANNKTLYSNKSCPPVCAHDA VREITGLLSAAQNLTLMPLESEYTDSMGFSGTGPMKLPTWRHGAVLGPGEARRAMLGSET ALLSAEKPGGPCENNTRRPTREAKPKLTDKAAYGQHQTEFCHDVDITGDAGAEVRATVVK GE >gi568815589f:90743899_90995597|GENSCAN_predicted_CDS_5|2889_bp atgcccaccaaacacccccacccacactccaagctaggtggtagagctgtccatcttgga tctaacccaaatgcaggccctggtgaagaaaggatccgagccagagacaacaacggctcc tacgccctgtgcctgctgcacgaagggaaggtgctgcactatcgcatcgacaaagacaag acagggaagctctccatccccgagggaaagaagttcgacacgctctggcagctagtcgag cattattcttataaagcagatggtttgttaagagttcttactgtcccatgtcaaaaaatc ggcacacaggttacttctgttaatcagttcagtagaagtgctgagatactctgtgacaaa acagatcccatcgacatcatttttacaaataagaccagatgtcctttagccttgtcaaca ggcttgggccatggtgaacaatgtctgcctctgaagtgctcagcagaccgtgtgaaagaa gaagggaaccggcaagagagtactgtgtcattcaatccgtatgagccagaacttgcaccc tgggctgcagacaaaggcccccagagagaagccctacccatggacacagaggtgtacgag agcccctacgcggaccccgaggagatcaggcccaaggaggtttacctggaccgaaagctg ctgacgctggaagacaaagaactgggctctggtaattttggaactgtgaaaaagggctac taccaaatgaaaaaagttgtgaaaaccgtggctgtgaaaatactgaaaaacgaggccaat gaccccgctcttaaagatgagttattagcagaagcaaatgtcatgcagcagctggacaac ccgtacatcgtgcggatgatcgggatatgcgaggccgagtcctggatgctggttatggag atggcagaacttggtcccctcaataagtatttgcagcagaacagacatgtcaaggataag aacatcatagaactggttcatcaggtttccatgggcatgaagtacttggaggagagcaat tttgtgcacagagatctggctgcaagaaatgtgttgctagttacccaacattacgccaag atcagtgatttcggactttccaaagcactgcgtgctgatgaaaactactacaaggcccag acccatggaaagtggcctgtcaagtggtacgctccggaatgcatcaactactacaagttc tccagcaaaagcgatgtctggagctttggagtgttgatgtgggaagcattctcctatggg cagaagccatatcgatcccagccctccagtgggggcctctgccgaatggagaaatttcct gggctctccaggttctctgagagtcagggaggctctagatgcggctcaaattcctgttta aagatcaacaagcccaagataaaacatgaattcccaggaccttgggcatatagattcaat ctttcctccaaagaaaaagaagaaacgatgacagcactttggtatagaaagccctgctgc agggccctcaactttccatctattcggcctggcaatgaggccctcaaacctgtccctctg cagcctgattatccatctctcacttttgtttctccttcagcaaatcaggctcccttgaag cctcagggtcccttgaagcctgtcccgcagttaccactgcagcctggacctcaggcccct gaaggacctgcccagcagcacaaaatcagccacagcccgctgaccctgctcagcaggagg ctgaccccgcggctcctcaagatctacagcctgagaatcaggtccctcagcccgagaatc aggctcctcaggcaaataatcagcccccacaactgcctgctgctgtggcacagccagttc caggtatacatccaggcagcagtccaaccagatcctattcatccaggccaggtttggcta cgccctgctacttgggaaagtttttcttttaaattcctcaaagatttcaaagcatcagtg aagcagtacggcaccaactccccgtttgttcattccacattaaaggccctagcagaagat gcttgggataacattcagaatgatggtaaaatatgtccatcttttacggctgttacacag ggacagcaggaaccctatccagactttattgcccatcttcaagatgcgacagagaaaacc atccctgatagccacggccaacgacttgttgtagaacttaaggcttatgaacaaacaaat gcagactgccaagtggctattcgccccattaaaggcaaaattccaccaggaggtgatata ctcttctacattaaagcctgtgaaggcgtgggggaactctacatacagcaatggtcctgg cacaagccatggccactattagaatgcctggacgattctctggcaaatgctttcaatgcg gccaatcagggcattgtagaaaaaattgtcctcggctttcaggctgccattcttttcaat gccaacaacaaaaccctatacagcaacaaatcttgccctccagtctgtgcccacgatgcc gtaagggaaatcactgggctgctcagtgccgcgcaaaatttgacattgatgcctttggag agtgagtacacagactccatgggcttttcagggacaggacccatgaagctgcccacatgg aggcatggagctgtgctgggacccggggaggccaggagggcaatgctgggctctgagact gccctgctgtctgctgaaaaacctggtggtccctgtgaaaacaacacccgacgcccaaca cgtgaagcaaagccaaagctaactgacaaggcggcctatggacagcaccaaacagagttt tgtcatgatgtagacatcactggagacgctggagctgaagttcgggccacggttgtcaaa ggagagtag >gi568815589f:90743899_90995597|GENSCAN_predicted_peptide_6|136_aa MWKVSLESPVGKVDLVSPVGEVDLVSPVGEVDLVSPVGKVDLDGEVGKVDLDGEGEPGVT GGEGGPGVTGQEDEPGFSGGEDEPGASGAECEPGVTSGGDPLARGYFCWGVAWFFGMNYP SIAQGSGPTSQHLRTR >gi568815589f:90743899_90995597|GENSCAN_predicted_CDS_6|411_bp atgtggaaggtgagcctggagtcaccagtggggaaggtggacctggtgtctccagtgggg gaggtagacctggtgtctccagtgggggaggtggacctggtgtctccagtggggaaggtg gacctggatggggaagtggggaaggtggacctggatggggaaggtgagcctggagtcact ggtggggaaggtggacctggtgtcaccggtcaggaagatgagcctggtttctctggtggg gaagatgaacctggtgcctctggtgcggaatgtgaacctggagtcaccagtgggggagac cctcttgcccgtggctacttctgctggggtgttgcctggttctttggcatgaattacccc agtattgctcagggtagtgggcccacatcacagcatctgagaactcgctag >gi568815589f:90743899_90995597|GENSCAN_predicted_peptide_7|128_aa MPSTGFDTVPSTLPTTAILVGDGGENVVEVWSRENRKWMTMEMMLDKKPIRVIFFFEFKM DHKAAETTCNVNNAFGSGTTNECTVQWWFKKFCKGDENLEAEEHSGRPLAVDKGQLRATI KADPPTAT >gi568815589f:90743899_90995597|GENSCAN_predicted_CDS_7|387_bp atgcccagcactggatttgacactgtgccatcaacattacctaccacagcaatcttagtg ggtgatggaggtgagaacgtggtggaagtctggtcgagagaaaacaggaaatggatgact atggaaatgatgttagacaaaaagccaattcgagtgattttcttttttgagttcaaaatg gatcataaagcagcagagacaacttgcaacgtcaacaatgcatttggctcaggaactact aatgaatgtacagtgcagtggtggttcaaaaagttttgcaaaggagacgagaaccttgaa gctgaggagcatagtggccggccattggcagttgacaagggccagttgagagcaaccatc aaagctgatcctcctacagctacatga >gi568815589f:90743899_90995597|GENSCAN_predicted_peptide_8|111_aa MGKDFMTKTPKAMATKAKIDKWDLIKLKSFCTVKETIIRVLRQPTEWEKIFAIYSSDKGL ISRIYKELKQIDKKKTNNPIKKWAKDIHRHFSKEDIYAANKHDKKLIIAGH >gi568815589f:90743899_90995597|GENSCAN_predicted_CDS_8|336_bp atgggcaaagacttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaattgac aaatgggatctaattaaactaaagagcttctgcacagtaaaagaaactatcatcagagtg ctcaggcaacctacagaatgggagaaaatttttgcaatctactcatctgacaaagggcta atatccagaatctacaaagaacttaaacaaattgacaagaaaaaaacaaacaaccccatc aaaaagtgggcaaaggatatacacagacacttctccaaagaagacatttatgcggccaac aaacatgacaaaaagctcatcatcgctggtcattag