GENSCAN 1.0 Date run: 4-Nov-116 Time: 04:07:35 Sequence gi568815586r:108191844_108392960 : 201117 bp : 46.71% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3990 4371 382 2 1 88 94 313 0.963 28.93 1.02 Intr + 14446 14560 115 2 1 90 66 205 0.966 18.01 1.03 Intr + 18278 18466 189 2 0 17 83 315 0.630 22.60 1.04 Intr + 18623 18813 191 2 2 123 40 81 0.878 6.03 1.05 Intr + 32896 33017 122 2 2 96 105 188 0.887 21.51 1.06 Intr + 35147 35314 168 1 0 93 -24 142 0.618 3.64 1.07 Term + 35817 36101 285 2 0 69 51 514 0.711 41.20 1.08 PlyA + 36284 36289 6 1.05 2.00 Prom + 38364 38403 40 -5.26 2.01 Init + 39320 39326 7 1 1 89 78 0 0.927 0.31 2.02 Intr + 40888 41052 165 2 0 114 100 209 0.896 24.63 2.03 Intr + 44510 44541 32 0 2 73 81 4 0.418 -3.95 2.04 Intr + 45041 45076 36 1 0 113 80 2 0.289 0.36 2.05 Intr + 48464 48701 238 1 1 53 87 479 0.670 41.49 2.06 Term + 56148 56500 353 1 2 153 44 549 0.998 51.65 2.07 PlyA + 62986 62991 6 1.05 3.02 PlyA - 64048 64043 6 1.05 3.01 Sngl - 90521 89931 591 2 0 61 45 326 0.777 21.79 3.00 Prom - 96228 96189 40 -5.46 4.06 PlyA - 99070 99065 6 1.05 4.05 Term - 101116 99998 1119 1 0 114 42 1281 0.555 118.13 4.04 Intr - 118554 118515 40 2 1 96 66 31 0.006 -0.07 4.03 Intr - 132391 132257 135 0 0 64 99 73 0.154 5.68 4.02 Intr - 133997 133949 49 0 1 105 46 12 0.015 -3.56 4.01 Init - 141563 141443 121 2 1 63 60 117 0.440 5.75 4.00 Prom - 145122 145083 40 -3.86 5.00 Prom + 149334 149373 40 -4.26 5.01 Init + 157678 157715 38 0 2 65 68 94 0.644 2.72 5.02 Intr + 159573 159796 224 0 2 87 94 66 0.710 4.87 5.03 Intr + 169601 169881 281 0 2 60 59 99 0.312 1.40 5.04 Intr + 174500 174569 70 1 1 65 93 62 0.010 3.15 5.05 Intr + 176444 176583 140 0 2 86 75 75 0.321 6.18 5.06 Term + 177801 177926 126 2 0 94 46 30 0.307 -2.32 5.07 PlyA + 178545 178550 6 1.05 6.00 Prom + 196876 196915 40 -6.06 6.01 Sngl + 197609 198145 537 1 0 50 36 1015 0.998 88.58 6.02 PlyA + 199827 199832 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 175893 176022 130 2 1 103 82 75 0.967 8.72 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:108191844_108392960|GENSCAN_predicted_peptide_1|483_aa MAKLWFKFQRYFRRKPVRFFTFLALYLTAGSLVFLHSGFVGQPAVSGNQANPAAAGGPAE GAELSFLGDMHLGRGFRDTGEASSIARRYGPWFKGKDGNERAKLGDYGGAWSRALKGRVV REKEEERAKYIGCYLDDTQSRALRGVSFFDYKKMTIFRCQDNCAERGYLYGGLEFGAECY CGHKIQATNVSEAECDMECKGERGSVCGGANRLSVYRLQLAQESARRCTALCPSSFEWSC MRFSLPPTELRLVPAVSLILTSLPPSETELEPEPRLPILIFSEQTVSNDDDGDGSAVFRG CFRRPDNLSLALPVTAAMLNMSVDKCVDFCTEKEYPLAALAGTACHCGFPTTRFPLHDRE DEQLCAQKCSAEEFESCGTPSYFIVYQTQRNPHLLLSPLLSPGATRHDDNDGDDDDDDDD GDDYDDGDDHNDGDDYDDGDDDNGDDSDKSDADSADDSDDESDDDDDGDEKEEEEEAATL GVS >gi568815586r:108191844_108392960|GENSCAN_predicted_CDS_1|1452_bp atggccaagctctggttcaaattccagcggtacttccgccggaaacctgtgcgcttcttt accttcctggcactctacctgactgctgggagccttgtcttccttcactctggctttgtg ggccagcccgctgtctcggggaaccaggcgaaccccgctgctgcaggaggcccagctgag ggtgctgagctgtccttcttgggtgacatgcatctgggcagaggtttccgggacacaggt gaagcctcaagcattgctcgcaggtacggaccctggttcaagggcaaggatgggaatgag agagccaagcttggcgactacggtggagcctggagccgagccctcaaggggagggttgtc cgggagaaggaggaagagcgagccaagtacatcggctgctacctggatgacacccagagt cgggcccttcgaggagtgtccttttttgactacaaaaagatgaccatcttccgttgccag gacaactgtgctgaacggggttacctgtatggcgggctggagttcggcgccgagtgctac tgcggccacaagatccaggcgacgaacgtgagcgaggcagagtgcgacatggagtgcaag ggcgagcgaggcagcgtgtgcggcggcgccaaccgcctctctgtctaccggctgcagctg gcccaggagtcggcccgcaggtgtacagccctttgccccagctcctttgaatggtcctgc atgcgcttttctctccctcccactgagctgcgtctggttccagctgtatccctcattctc accagtcttcctccttctgaaactgagttggagcctgagcctcgccttcccatcctgatt ttctcagagcagactgtctctaatgatgatgatggtgatggaagtgcagtgttccggggc tgcttccgcaggcccgacaacctttccctggccttacccgtgacagctgccatgctgaac atgtctgtggacaaatgcgtggacttctgcactgagaaggagtacccgctggcagctctt gcaggcaccgcctgccactgtgggtttcccaccacccgattcccgctccatgacagagag gatgagcagctctgtgcccagaagtgcagcgcggaggagtttgagagctgcgggactcct agttacttcattgtgtaccagacacaacggaatccacacttactgttatctcctcttctg tctcctggggctaccagacatgatgataatgatggtgatgatgatgatgatgatgatgat ggtgatgattatgatgatggtgatgatcataatgatggtgatgattatgacgatggtgat gatgataatggtgatgatagtgataagagtgacgctgatagtgctgatgatagtgatgat gaaagtgatgatgacgatgatggtgatgagaaggaggaagaggaggaggcagcaacactt ggagtttcttga >gi568815586r:108191844_108392960|GENSCAN_predicted_peptide_2|276_aa MGDNRCMDRRFLPGKSKQLIALASFPGAGNTWARHLIELATGFYTGSYYFDGSLYNKVWK RVHSTSELRPHCPNADSSSALTSPVPHLWLREGFKGERDHWRSGRTICIKTHESGQKEIE AFDAAILLIRNPYKALMAEFNRKYGGHIGFAAHAHWKGKEWPEFVRNYAPWWATHTLDWL KFGKKVLVVHFEDLKQDLFVQLGRMVSLLGVAVREDRLLCVESQKDGNFKRSGLRKLEYD PYTADMQKTISAYIKMVDAALKGRNLTGVPDDYYPR >gi568815586r:108191844_108392960|GENSCAN_predicted_CDS_2|831_bp atgggagacaaccgttgcatggacagaaggttcctgccaggcaagtccaagcagctcatt gctttggccagcttcccaggtgctggcaacacgtgggctcgccacctcattgaattggcc acaggcttctacactggcagctactacttcgatggctccctctacaacaaagtctggaaa agggtccactccaccagtgagctgagacctcattgtcctaatgcagactcaagctctgcg ctcaccagtcctgttcctcacctctggctgcgggaagggtttaaaggtgagcgggaccac tggcgcagcggacggaccatctgcatcaagacgcacgaaagcggccagaaagagatcgag gccttcgacgccgccatcctgctcatccgcaacccctacaaagccctcatggctgagttc aaccgcaagtacggcggccacataggctttgctgcgcatgcccactggaagggcaaagag tggccagagttcgtgaggaactatgccccgtggtgggccactcacacactggactggctc aagtttggcaagaaggtgctggtggtgcactttgaggacctgaagcaggacctctttgtc cagctgggccggatggtcagcctgctgggcgtggctgtcagggaggaccggctgctctgt gtggagagccagaaggatggcaacttcaagcgctcagggctccggaagctcgagtatgac ccctatactgcggacatgcagaagaccatctctgcctacatcaagatggtggatgcagcc ctcaaagggcggaacctaacgggtgtccccgatgactactacccaagatga >gi568815586r:108191844_108392960|GENSCAN_predicted_peptide_3|196_aa MLGQLKFEPSLSEKLSRNILEVGTGDKFQTKIMVRASLPRRLHMKPQIRNLMEWMYPGTF YYNFENRPILSGWNTTWLCYKMKTKKDPSKPPLDARIFGGQVYSKPEHHPEMRFVDWFCN SRLHRDQDYLVIWYISWSPCSEYAGNVAEFLAKDGKVTLTIFVAHLYYFWEADYQEELHR WCQKKQSTCQHEDHEL >gi568815586r:108191844_108392960|GENSCAN_predicted_CDS_3|591_bp atgctgggtcagctgaaattcgagccctcactgtcagagaagctaagcagaaacatctta gaggtagggacaggggacaaatttcagacaaagatcatggtcagggcaagcctgccaaga aggctgcacatgaagcctcagatcagaaacctgatggagtggatgtatccaggcacattc tactacaactttgaaaacagacccatcctctcaggttggaacaccacctggctgtgctac aaaatgaaaacaaagaaggacccctcaaagccccctttggacgcaaggatctttggaggc caggtctattccaagcctgaacaccacccagagatgagattcgtagattggttctgcaac tcgaggctgcatcgtgaccaggactacctggtcatctggtacatctcctggagtccctgc tcagagtatgcagggaacgtggcagagttcctggccaaggatggcaaggtcaccctgacc atcttcgttgcccacctctactacttctgggaagcagattaccaagaggagcttcacaga tggtgtcagaaaaaacagtccacatgccagcatgaagatcatgaactatga >gi568815586r:108191844_108392960|GENSCAN_predicted_peptide_4|487_aa MSGPKERSWALLVESALGLTVFTIDSLWQCWKYQRLTQQGGSGSENWERMWCWWLRCQAT FPGQLRCACSTTSVITCSSQKKETGKNPVVTRDPKYLSTGDSLSSSNAVPTSATQRMEDE DYNTSISYGDEYPDYLDSIVVLEDLSPLEARVTRIFLVVVYSIVCFLGILGNGLVIIIAT FKMKKTVNMVWFLNLAVADFLFNVFLPIHITYAAMDYHWVFGTAMCKISNFLLIHNMFTS VFLLTIISSDRCISVLLPVWSQNHRSVRLAYMACMVIWVLAFFLSSPSLVFRDTANLHGK ISCFNNFSLSTPGSSSWPTHSQMDPVGYSRHMVVTVTRFLCGFLVPVLIITACYLTIVCK LQRNRLAKTKKPFKIIVTIIITFFLCWCPYHTLNLLELHHTAMPGSVFSLGLPLATALAI ANSCMNPILYVFMGQDFKKFKVALFSRLVNALSEDTGHSSYPSHRSFTKMSSMNERTSMN ERETGML >gi568815586r:108191844_108392960|GENSCAN_predicted_CDS_4|1464_bp atgtctggccccaaggagaggtcttgggctctgctggtggagagtgccctgggcctcact gtcttcaccatagacagtctctggcaatgttggaagtaccagcggctgactcagcaggga gggagtggaagtgagaactgggagaggatgtggtgttggtggctgcgctgccaggccaca tttcctggacagctgagatgtgcttgttctacaacaagcgtgatcacatgcagctcccag aaaaaagagaccggcaagaacccagtcgtcaccagggaccccaagtacctttccactggg gacagcctttcctcttccaacgcagtgcccacatcagccacccagagaatggaggatgaa gattacaacacttccatcagttacggtgatgaataccctgattatttagactccattgtg gttttggaggacttatcccccttggaagccagggtgaccaggatcttcctggtggtggtc tacagcatcgtctgcttcctcgggattctgggcaatggtctggtgatcatcattgccacc ttcaagatgaagaagacagtgaacatggtctggttcctcaacctggcagtggcagatttc ctgttcaacgtcttcctcccaatccatatcacctatgccgccatggactaccactgggtt ttcgggacagccatgtgcaagatcagcaacttccttctcatccacaacatgttcaccagc gtcttcctgctgaccatcatcagctctgaccgctgcatctctgtgctcctccctgtctgg tcccagaaccaccgcagcgttcgcctggcttacatggcctgcatggtcatctgggtcctg gctttcttcttgagttccccatctctcgtcttccgggacacagccaacctgcatgggaaa atatcctgcttcaacaacttcagcctgtccacacctgggtcttcctcgtggcccactcac tcccaaatggaccctgtggggtatagccggcacatggtggtgactgtcacccgcttcctc tgtggcttcctggtcccagtcctcatcatcacagcttgctacctcaccatcgtgtgcaaa ctgcagcgcaaccgcctggccaagaccaagaagcccttcaagattattgtgaccatcatc attaccttcttcctctgctggtgcccctaccacacactcaacctcctagagctccaccac actgccatgcctggctctgtcttcagcctgggtttgcccctggccactgcccttgccatt gccaacagctgcatgaaccccattctgtatgttttcatgggtcaggacttcaagaagttc aaggtggccctcttctctcgcctggtcaatgctctaagtgaagatacaggccactcttcc taccccagccatagaagctttaccaagatgtcatcaatgaatgagaggacttctatgaat gagagggagaccggcatgctttga >gi568815586r:108191844_108392960|GENSCAN_predicted_peptide_5|292_aa MGEVGALALGLGGACSAAAGCLLVHTFPHSFCVMLRREKGNGSPKKLPSIQQKKPVNHTR QKDVTRPRNHSPQKDSYICSDAFLQDAERGSQQARLQSVTAYILYRVPHSPQAQHQWGGV DLPPSEGIAKTHHKGINTCMGTERMLHRQREKSGWDIASVEASVNPTENAGAEMILQSQE EDAHIITFTKSSRKGIASHRLEREAAPGTFHRTLGPQGTQFANHYSSQMEKQAQKGKCLA RRRCKGDAPSQLPLGSGHQLQCQCWWEAPSITHQQTRARVGCHMPHSNHTAK >gi568815586r:108191844_108392960|GENSCAN_predicted_CDS_5|879_bp atgggagaagtgggggcgctggccctgggcctgggtggtgcctgctctgcagcagctggc tgcctcctggtccatacatttccacattccttttgtgtgatgttgaggagagagaaagga aatgggagccccaagaagttacctagcattcagcaaaagaaacctgtgaatcacacacgc caaaaagatgtcactcgccccagaaatcactcaccccagaaagattcctatatctgctca gatgcattccttcaggacgcagaacgggggagccaacaggcaaggctacagtctgttacc gcatatattctctatagagttcctcatagcccccaggcccaacaccagtggggtggagta gacttaccccctagtgaaggcattgcaaagacacatcacaaaggaatcaacacctgcatg gggacagaaaggatgctgcatcggcagagggagaaatctggctgggacatagcctcagtg gaggcttctgttaaccccacagagaatgctggagctgagatgatcctacagagccaggaa gaggatgctcacatcatcaccttcacgaaaagcagcagaaagggaattgcttctcaccga cttgagcgtgaagcagccccagggaccttccacagaactctggggccccagggaacccag tttgcaaaccactactctagtcagatggagaaacaggcccagaagggaaaatgtctggca aggaggcgatgcaaaggtgatgccccttcccagcttcccttgggttctggacaccagctc cagtgccagtgctggtgggaagctccgagcatcacccaccaacagaccagagccagagtg ggctgccacatgccacacagcaaccacacagcaaaataa >gi568815586r:108191844_108392960|GENSCAN_predicted_peptide_6|178_aa MITTTTIIITIITTTNTIMIITITIIIIPIIIIISAITTIISTIPTITINISITIITTTI ITTIIIITTTIITITTTITITTTIITITIIIIPIIIIITTITTIITTIPTITVIIIITII TITIITTIIIIITTIITITTTIITIIITTMTTIITIILIITTTIIIMAVLLIFMSLTL >gi568815586r:108191844_108392960|GENSCAN_predicted_CDS_6|537_bp atgatcactaccaccaccatcatcatcaccattatcaccactaccaacactatcatgatc atcactatcaccattattatcatccccatcatcattatcatttcagccatcaccaccatc atcagcactatccccaccatcaccatcaacatcagcatcactattatcaccaccaccatt atcacaaccatcatcattatcaccaccaccattatcaccattaccaccacaatcactatc actaccaccatcatcaccatcaccattattatcatccccatcattattatcattacgact atcaccaccatcatcaccactatccccactattacagtcatcatcatcatcactatcatc accatcaccattatcacaaccatcatcattatcatcaccaccattatcaccattactacc actatcatcaccatcataatcaccacaatgaccaccatcatcaccatcatcctcatcatc accaccaccatcatcatcatggcagtattactcatttttatgagcttgacactgtaa