GENSCAN 1.0 Date run: 3-Nov-116 Time: 03:10:57 Sequence gi568815586f:108095833_108348340 : 252508 bp : 46.84% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5103 5126 24 1 0 124 77 37 0.742 4.20 1.02 Intr + 5844 5905 62 1 2 110 105 20 0.698 4.35 1.03 Term + 27078 27185 108 2 0 119 51 22 0.497 0.11 1.04 PlyA + 27297 27302 6 1.05 2.00 Prom + 30233 30272 40 -4.46 2.01 Init + 33333 33414 82 2 1 58 66 97 0.262 3.64 2.02 Term + 37772 37851 80 0 2 109 44 64 0.266 2.03 2.03 PlyA + 38112 38117 6 1.05 3.04 PlyA - 38822 38817 6 1.05 3.03 Term - 39485 39387 99 2 0 79 42 48 0.068 -2.57 3.02 Intr - 41053 40883 171 1 0 112 55 58 0.273 5.04 3.01 Init - 45753 45679 75 0 0 60 116 44 0.741 5.49 3.00 Prom - 46482 46443 40 -6.06 4.00 Prom + 46831 46870 40 -8.16 4.01 Init + 50119 50172 54 0 0 73 63 72 0.431 4.48 4.02 Intr + 56739 56816 78 2 0 85 94 76 0.542 7.65 4.03 Term + 74476 74568 93 0 0 -6 36 166 0.425 -0.17 4.04 PlyA + 74815 74820 6 1.05 5.03 PlyA - 75550 75545 6 1.05 5.02 Term - 80830 80604 227 2 2 70 48 142 0.955 5.34 5.01 Init - 82468 82360 109 1 1 103 41 93 0.904 6.48 5.00 Prom - 90370 90331 40 -4.46 6.00 Prom + 95024 95063 40 -6.26 6.01 Init + 100001 100382 382 1 1 88 94 313 0.886 28.93 6.02 Intr + 110457 110571 115 1 1 90 66 205 0.966 18.01 6.03 Intr + 114289 114477 189 1 0 17 83 315 0.630 22.60 6.04 Intr + 114634 114824 191 1 2 123 40 81 0.878 6.03 6.05 Intr + 128907 129028 122 1 2 96 105 188 0.887 21.51 6.06 Intr + 131158 131325 168 0 0 93 -24 142 0.618 3.64 6.07 Term + 131828 132112 285 1 0 69 51 514 0.711 41.20 6.08 PlyA + 132295 132300 6 1.05 7.00 Prom + 134375 134414 40 -5.26 7.01 Init + 135331 135337 7 0 1 89 78 0 0.927 0.31 7.02 Intr + 136899 137063 165 1 0 114 100 209 0.896 24.63 7.03 Intr + 140521 140552 32 2 2 73 81 4 0.418 -3.95 7.04 Intr + 141052 141087 36 0 0 113 80 2 0.289 0.36 7.05 Intr + 144475 144712 238 0 1 53 87 479 0.670 41.49 7.06 Term + 152159 152511 353 0 2 153 44 549 0.998 51.65 7.07 PlyA + 158997 159002 6 1.05 8.02 PlyA - 160059 160054 6 1.05 8.01 Sngl - 186532 185942 591 1 0 61 45 326 0.777 21.79 8.00 Prom - 192239 192200 40 -5.46 9.06 PlyA - 195081 195076 6 1.05 9.05 Term - 197127 196009 1119 0 0 114 42 1281 0.555 118.13 9.04 Intr - 214565 214526 40 1 1 96 66 31 0.006 -0.07 9.03 Intr - 228402 228268 135 2 0 64 99 73 0.155 5.68 9.02 Intr - 230008 229960 49 2 1 105 46 12 0.015 -3.56 9.01 Init - 237574 237454 121 1 1 63 60 117 0.465 5.75 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:108095833_108348340|GENSCAN_predicted_peptide_1|64_aa XAYTKAFSQKGDCIDLHLQFLWNLSRYGPPPPPMKVSSSKSFVLLIPRWYLLCRGLSGLS VLQK >gi568815586f:108095833_108348340|GENSCAN_predicted_CDS_1|195_bp ngtgcctacaccaaagccttttcccagaaaggagactgtattgacttgcaccttcagttt ctttggaacctgtccaggtatggcccccctccacctccaatgaaagtttcctccagtaaa tcttttgtgctcctaattccacgttggtacctgctttgcagaggactgagtggattgagt gtcctgcagaagtga >gi568815586f:108095833_108348340|GENSCAN_predicted_peptide_2|53_aa MLSRPGTGYARLPGRLREGQVACKELSGLNCACAVHLPSVDSAVGASSEQSVG >gi568815586f:108095833_108348340|GENSCAN_predicted_CDS_2|162_bp atgctcagccggcctggcaccggctatgcccggctccccgggaggctgcgggagggccag gtggcatgcaaggagttaagcggactgaactgtgcttgtgccgtgcaccttccgagtgtt gattcagcagttggtgccagctctgagcagagtgttggatga >gi568815586f:108095833_108348340|GENSCAN_predicted_peptide_3|114_aa MTEADIRVAWGYEPRNMGSPPMLEKACGDGPGLVHAISRLTNERERKGKASSEEFLKPCI TPTWERQSGEAAGGVAENTGFEIVSSLKTELGSLISVSSVPSTEYDAEKILSKH >gi568815586f:108095833_108348340|GENSCAN_predicted_CDS_3|345_bp atgacggaagcagacatcagagtggcttggggctatgaaccaaggaatatgggcagccct ccaatgctggaaaaggcctgtggggatggacctggcctggtgcatgccatttccaggtta acaaatgagagggagagaaaagggaaagcttcctccgaggagttcctgaaaccgtgcatc acccccacctgggaaagacagagcggggaggcagcgggaggagtagctgagaacacgggc tttgagattgtaagctcattgaagacagagctggggtctctcatttctgtatcttcagtg cccagcacagagtatgatgctgagaagattctgagcaaacactga >gi568815586f:108095833_108348340|GENSCAN_predicted_peptide_4|74_aa MLWWKHKNKEEITTIVLKSLLWDKDESQELTGAIPEARGAFGGQRNGKISDNDDINDGDD KDNDKTTMMMMVMM >gi568815586f:108095833_108348340|GENSCAN_predicted_CDS_4|225_bp atgctttggtggaaacataaaaacaaggaagaaattaccaccatcgtcctgaagagcctg ctctgggataaagatgagtcccaggaattaaccggggccatcccagaagcccgaggggcc tttggaggccagagaaacgggaaaataagtgataatgatgacatcaatgatggtgatgac aaagacaatgacaagacaacaatgatgatgatggtgatgatgtaa >gi568815586f:108095833_108348340|GENSCAN_predicted_peptide_5|111_aa MPFHKCPYKRRKSGQIHPEGRRYEDTEKRAIYAKERGICPREMKTHAQAKICMQIFIATL FVIAKNESKQMVNEQTVVPPYNGVLLSHEKESATDDTTALINLKGIMHYAE >gi568815586f:108095833_108348340|GENSCAN_predicted_CDS_5|336_bp atgccctttcataagtgtccttacaaaaggaggaaatcaggacagatacatccagaggga agacgctatgaagacacagagaagagggccatctatgccaaggagaggggtatttgccca agagaaatgaaaacccatgctcaggcaaaaatctgcatgcaaatatttatagcaacttta tttgtaattgccaaaaatgaaagcaaacaaatggtgaatgaacaaactgtggtgcctcca tacaacggagtactactcagccatgaaaaggaatcagctactgatgacacaacagcactg ataaacctcaaaggcatcatgcattatgccgagtga >gi568815586f:108095833_108348340|GENSCAN_predicted_peptide_6|483_aa MAKLWFKFQRYFRRKPVRFFTFLALYLTAGSLVFLHSGFVGQPAVSGNQANPAAAGGPAE GAELSFLGDMHLGRGFRDTGEASSIARRYGPWFKGKDGNERAKLGDYGGAWSRALKGRVV REKEEERAKYIGCYLDDTQSRALRGVSFFDYKKMTIFRCQDNCAERGYLYGGLEFGAECY CGHKIQATNVSEAECDMECKGERGSVCGGANRLSVYRLQLAQESARRCTALCPSSFEWSC MRFSLPPTELRLVPAVSLILTSLPPSETELEPEPRLPILIFSEQTVSNDDDGDGSAVFRG CFRRPDNLSLALPVTAAMLNMSVDKCVDFCTEKEYPLAALAGTACHCGFPTTRFPLHDRE DEQLCAQKCSAEEFESCGTPSYFIVYQTQRNPHLLLSPLLSPGATRHDDNDGDDDDDDDD GDDYDDGDDHNDGDDYDDGDDDNGDDSDKSDADSADDSDDESDDDDDGDEKEEEEEAATL GVS >gi568815586f:108095833_108348340|GENSCAN_predicted_CDS_6|1452_bp atggccaagctctggttcaaattccagcggtacttccgccggaaacctgtgcgcttcttt accttcctggcactctacctgactgctgggagccttgtcttccttcactctggctttgtg ggccagcccgctgtctcggggaaccaggcgaaccccgctgctgcaggaggcccagctgag ggtgctgagctgtccttcttgggtgacatgcatctgggcagaggtttccgggacacaggt gaagcctcaagcattgctcgcaggtacggaccctggttcaagggcaaggatgggaatgag agagccaagcttggcgactacggtggagcctggagccgagccctcaaggggagggttgtc cgggagaaggaggaagagcgagccaagtacatcggctgctacctggatgacacccagagt cgggcccttcgaggagtgtccttttttgactacaaaaagatgaccatcttccgttgccag gacaactgtgctgaacggggttacctgtatggcgggctggagttcggcgccgagtgctac tgcggccacaagatccaggcgacgaacgtgagcgaggcagagtgcgacatggagtgcaag ggcgagcgaggcagcgtgtgcggcggcgccaaccgcctctctgtctaccggctgcagctg gcccaggagtcggcccgcaggtgtacagccctttgccccagctcctttgaatggtcctgc atgcgcttttctctccctcccactgagctgcgtctggttccagctgtatccctcattctc accagtcttcctccttctgaaactgagttggagcctgagcctcgccttcccatcctgatt ttctcagagcagactgtctctaatgatgatgatggtgatggaagtgcagtgttccggggc tgcttccgcaggcccgacaacctttccctggccttacccgtgacagctgccatgctgaac atgtctgtggacaaatgcgtggacttctgcactgagaaggagtacccgctggcagctctt gcaggcaccgcctgccactgtgggtttcccaccacccgattcccgctccatgacagagag gatgagcagctctgtgcccagaagtgcagcgcggaggagtttgagagctgcgggactcct agttacttcattgtgtaccagacacaacggaatccacacttactgttatctcctcttctg tctcctggggctaccagacatgatgataatgatggtgatgatgatgatgatgatgatgat ggtgatgattatgatgatggtgatgatcataatgatggtgatgattatgacgatggtgat gatgataatggtgatgatagtgataagagtgacgctgatagtgctgatgatagtgatgat gaaagtgatgatgacgatgatggtgatgagaaggaggaagaggaggaggcagcaacactt ggagtttcttga >gi568815586f:108095833_108348340|GENSCAN_predicted_peptide_7|276_aa MGDNRCMDRRFLPGKSKQLIALASFPGAGNTWARHLIELATGFYTGSYYFDGSLYNKVWK RVHSTSELRPHCPNADSSSALTSPVPHLWLREGFKGERDHWRSGRTICIKTHESGQKEIE AFDAAILLIRNPYKALMAEFNRKYGGHIGFAAHAHWKGKEWPEFVRNYAPWWATHTLDWL KFGKKVLVVHFEDLKQDLFVQLGRMVSLLGVAVREDRLLCVESQKDGNFKRSGLRKLEYD PYTADMQKTISAYIKMVDAALKGRNLTGVPDDYYPR >gi568815586f:108095833_108348340|GENSCAN_predicted_CDS_7|831_bp atgggagacaaccgttgcatggacagaaggttcctgccaggcaagtccaagcagctcatt gctttggccagcttcccaggtgctggcaacacgtgggctcgccacctcattgaattggcc acaggcttctacactggcagctactacttcgatggctccctctacaacaaagtctggaaa agggtccactccaccagtgagctgagacctcattgtcctaatgcagactcaagctctgcg ctcaccagtcctgttcctcacctctggctgcgggaagggtttaaaggtgagcgggaccac tggcgcagcggacggaccatctgcatcaagacgcacgaaagcggccagaaagagatcgag gccttcgacgccgccatcctgctcatccgcaacccctacaaagccctcatggctgagttc aaccgcaagtacggcggccacataggctttgctgcgcatgcccactggaagggcaaagag tggccagagttcgtgaggaactatgccccgtggtgggccactcacacactggactggctc aagtttggcaagaaggtgctggtggtgcactttgaggacctgaagcaggacctctttgtc cagctgggccggatggtcagcctgctgggcgtggctgtcagggaggaccggctgctctgt gtggagagccagaaggatggcaacttcaagcgctcagggctccggaagctcgagtatgac ccctatactgcggacatgcagaagaccatctctgcctacatcaagatggtggatgcagcc ctcaaagggcggaacctaacgggtgtccccgatgactactacccaagatga >gi568815586f:108095833_108348340|GENSCAN_predicted_peptide_8|196_aa MLGQLKFEPSLSEKLSRNILEVGTGDKFQTKIMVRASLPRRLHMKPQIRNLMEWMYPGTF YYNFENRPILSGWNTTWLCYKMKTKKDPSKPPLDARIFGGQVYSKPEHHPEMRFVDWFCN SRLHRDQDYLVIWYISWSPCSEYAGNVAEFLAKDGKVTLTIFVAHLYYFWEADYQEELHR WCQKKQSTCQHEDHEL >gi568815586f:108095833_108348340|GENSCAN_predicted_CDS_8|591_bp atgctgggtcagctgaaattcgagccctcactgtcagagaagctaagcagaaacatctta gaggtagggacaggggacaaatttcagacaaagatcatggtcagggcaagcctgccaaga aggctgcacatgaagcctcagatcagaaacctgatggagtggatgtatccaggcacattc tactacaactttgaaaacagacccatcctctcaggttggaacaccacctggctgtgctac aaaatgaaaacaaagaaggacccctcaaagccccctttggacgcaaggatctttggaggc caggtctattccaagcctgaacaccacccagagatgagattcgtagattggttctgcaac tcgaggctgcatcgtgaccaggactacctggtcatctggtacatctcctggagtccctgc tcagagtatgcagggaacgtggcagagttcctggccaaggatggcaaggtcaccctgacc atcttcgttgcccacctctactacttctgggaagcagattaccaagaggagcttcacaga tggtgtcagaaaaaacagtccacatgccagcatgaagatcatgaactatga >gi568815586f:108095833_108348340|GENSCAN_predicted_peptide_9|487_aa MSGPKERSWALLVESALGLTVFTIDSLWQCWKYQRLTQQGGSGSENWERMWCWWLRCQAT FPGQLRCACSTTSVITCSSQKKETGKNPVVTRDPKYLSTGDSLSSSNAVPTSATQRMEDE DYNTSISYGDEYPDYLDSIVVLEDLSPLEARVTRIFLVVVYSIVCFLGILGNGLVIIIAT FKMKKTVNMVWFLNLAVADFLFNVFLPIHITYAAMDYHWVFGTAMCKISNFLLIHNMFTS VFLLTIISSDRCISVLLPVWSQNHRSVRLAYMACMVIWVLAFFLSSPSLVFRDTANLHGK ISCFNNFSLSTPGSSSWPTHSQMDPVGYSRHMVVTVTRFLCGFLVPVLIITACYLTIVCK LQRNRLAKTKKPFKIIVTIIITFFLCWCPYHTLNLLELHHTAMPGSVFSLGLPLATALAI ANSCMNPILYVFMGQDFKKFKVALFSRLVNALSEDTGHSSYPSHRSFTKMSSMNERTSMN ERETGML >gi568815586f:108095833_108348340|GENSCAN_predicted_CDS_9|1464_bp atgtctggccccaaggagaggtcttgggctctgctggtggagagtgccctgggcctcact gtcttcaccatagacagtctctggcaatgttggaagtaccagcggctgactcagcaggga gggagtggaagtgagaactgggagaggatgtggtgttggtggctgcgctgccaggccaca tttcctggacagctgagatgtgcttgttctacaacaagcgtgatcacatgcagctcccag aaaaaagagaccggcaagaacccagtcgtcaccagggaccccaagtacctttccactggg gacagcctttcctcttccaacgcagtgcccacatcagccacccagagaatggaggatgaa gattacaacacttccatcagttacggtgatgaataccctgattatttagactccattgtg gttttggaggacttatcccccttggaagccagggtgaccaggatcttcctggtggtggtc tacagcatcgtctgcttcctcgggattctgggcaatggtctggtgatcatcattgccacc ttcaagatgaagaagacagtgaacatggtctggttcctcaacctggcagtggcagatttc ctgttcaacgtcttcctcccaatccatatcacctatgccgccatggactaccactgggtt ttcgggacagccatgtgcaagatcagcaacttccttctcatccacaacatgttcaccagc gtcttcctgctgaccatcatcagctctgaccgctgcatctctgtgctcctccctgtctgg tcccagaaccaccgcagcgttcgcctggcttacatggcctgcatggtcatctgggtcctg gctttcttcttgagttccccatctctcgtcttccgggacacagccaacctgcatgggaaa atatcctgcttcaacaacttcagcctgtccacacctgggtcttcctcgtggcccactcac tcccaaatggaccctgtggggtatagccggcacatggtggtgactgtcacccgcttcctc tgtggcttcctggtcccagtcctcatcatcacagcttgctacctcaccatcgtgtgcaaa ctgcagcgcaaccgcctggccaagaccaagaagcccttcaagattattgtgaccatcatc attaccttcttcctctgctggtgcccctaccacacactcaacctcctagagctccaccac actgccatgcctggctctgtcttcagcctgggtttgcccctggccactgcccttgccatt gccaacagctgcatgaaccccattctgtatgttttcatgggtcaggacttcaagaagttc aaggtggccctcttctctcgcctggtcaatgctctaagtgaagatacaggccactcttcc taccccagccatagaagctttaccaagatgtcatcaatgaatgagaggacttctatgaat gagagggagaccggcatgctttga