GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:00:53 Sequence gi568815581r:48860884_49067232 : 206349 bp : 47.27% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1393 1421 29 2 2 121 131 -8 0.931 4.63 1.02 Term + 1955 2122 168 1 0 114 48 115 0.980 7.98 1.03 PlyA + 2934 2939 6 1.05 2.00 Prom + 4655 4694 40 -5.16 2.01 Init + 10668 10680 13 2 1 110 95 8 0.824 3.40 2.02 Intr + 20237 20437 201 0 0 114 72 167 0.981 16.96 2.03 Intr + 20816 20882 67 0 1 92 68 44 0.393 0.76 2.04 Intr + 22463 22616 154 2 1 43 117 41 0.303 2.57 2.05 Intr + 31747 31914 168 0 0 -13 31 187 0.116 3.04 2.06 Intr + 32526 32573 48 2 0 103 95 -8 0.417 0.28 2.07 Intr + 34273 34451 179 0 2 100 111 230 0.983 25.22 2.08 Term + 34772 34886 115 2 1 116 54 138 0.812 11.24 2.09 PlyA + 39204 39209 6 1.05 3.00 Prom + 40928 40967 40 -5.46 3.01 Init + 47621 47937 317 1 2 108 89 288 0.999 25.71 3.02 Intr + 49925 49997 73 2 1 90 95 69 0.998 7.21 3.03 Intr + 51951 52138 188 2 2 71 90 180 0.999 14.99 3.04 Intr + 55193 55304 112 2 1 89 110 89 0.999 11.58 3.05 Intr + 60277 60389 113 0 2 38 92 53 0.969 -0.12 3.06 Intr + 61964 62054 91 2 1 97 90 138 0.952 14.90 3.07 Term + 66081 66251 171 2 0 116 41 119 0.958 7.83 3.08 PlyA + 68152 68157 6 1.05 4.15 PlyA - 69235 69230 6 1.05 4.14 Term - 69729 69592 138 0 0 152 42 69 0.999 6.66 4.13 Intr - 70834 70760 75 1 0 106 72 110 0.983 10.91 4.12 Intr - 72463 72322 142 0 1 47 69 222 0.692 16.56 4.11 Intr - 75359 75287 73 0 1 114 39 19 0.853 -1.94 4.10 Intr - 76241 76137 105 0 0 87 78 104 0.974 9.49 4.09 Intr - 80179 80041 139 1 1 105 58 239 0.892 22.64 4.08 Intr - 81674 81585 90 2 0 72 42 119 0.976 5.89 4.07 Intr - 83092 83042 51 1 0 123 111 33 0.996 8.20 4.06 Intr - 83955 83798 158 1 2 46 83 82 0.056 3.23 4.05 Intr - 93528 93506 23 2 2 94 90 27 0.074 0.69 4.04 Intr - 100104 100003 102 2 0 51 91 115 0.864 7.39 4.03 Intr - 100936 100844 93 0 0 92 105 79 0.998 9.08 4.02 Intr - 103597 103427 171 0 0 102 61 167 0.998 14.46 4.01 Init - 106349 106264 86 2 2 73 91 196 0.999 16.69 4.00 Prom - 109339 109300 40 -5.46 5.00 Prom + 119337 119376 40 -3.46 5.01 Init + 136863 137037 175 2 1 110 111 465 0.970 50.51 5.02 Intr + 138226 138286 61 2 1 94 97 65 0.789 5.79 5.03 Intr + 164735 164783 49 2 1 109 100 44 0.967 6.38 5.04 Intr + 165583 165634 52 0 1 94 116 54 0.992 7.28 5.05 Intr + 171027 171090 64 1 1 116 73 96 0.896 8.68 5.06 Intr + 177285 177566 282 0 0 83 89 388 0.896 34.93 5.07 Intr + 179074 179208 135 1 0 97 111 112 0.999 14.08 5.08 Intr + 180495 180617 123 0 0 81 71 102 0.994 7.50 5.09 Intr + 181359 181494 136 0 1 64 75 171 0.999 13.97 5.10 Intr + 182545 182667 123 0 0 134 78 16 0.954 5.98 5.11 Intr + 183084 183203 120 2 0 74 53 200 0.637 15.79 5.12 Intr + 185007 185138 132 2 0 94 101 179 0.864 20.64 5.13 Intr + 185377 185490 114 0 0 106 88 212 0.998 23.64 5.14 Intr + 187680 187811 132 2 0 46 83 74 0.861 3.54 5.15 Term + 188469 188561 93 2 0 106 47 95 0.888 5.03 5.16 PlyA + 188881 188886 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 83851 83798 54 1 0 62 83 79 0.929 6.12 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:48860884_49067232|GENSCAN_predicted_peptide_1|65_aa XIQESSSPSPLSIKKCPICKADDICDHTLEQQQMQPLCFNCPICDKIFPATEKQIFEDHV FCHSL >gi568815581r:48860884_49067232|GENSCAN_predicted_CDS_1|198_bp ngtatccaagaaagttcttcccccagcccgctctccatcaagaaatgccctatctgcaaa gcagatgatatttgtgatcacaccttggagcaacagcagatgcagcccctttgtttcaat tgtccaatttgtgacaagatcttcccagctacagagaagcagatctttgaagaccacgtg ttctgccactctctctga >gi568815581r:48860884_49067232|GENSCAN_predicted_peptide_2|314_aa MPGLDRDAGTQHRTWAHFIQIMKVEETAASKCNWPAVTQLHDSKVEFLLPHWVLCGQQQA TCHHQEARHLTGILSSPSTGCETLRKSLNDALAGLCSSEPSLRYMTLTNQSVARKRNGQH SLQALPLCELLTHRTSQRRNTGPKPESPASIRMEAGIATKLLWRERKQLRKANKSGESMT STNGDAGILRPMRMEKVQDTWTEKMQTAGALFISPALPSYSNFPLQVARREFQTSVVSRD IDTAAKFIGAGAATVGVAGSGAGIGTVFGSLIIGYARNPSLKQQLFSYAILGFALSEAMG LFCLMVAFLILFAM >gi568815581r:48860884_49067232|GENSCAN_predicted_CDS_2|945_bp atgcctggcctagaccgagacgcgggcacccagcaccgcacctgggcccacttcatccag atcatgaaggtggaggagactgcagccagcaagtgcaactggccagcagtcacacagctc catgactccaaggttgagttcctgctgcctcactgggtcctctgtggccagcaacaagcc acgtgtcaccaccaagaggcccgacatcttacaggaatcctgtcttcaccatcaactggc tgcgagaccttgagaaagtcacttaatgatgcactggctgggctatgctctagtgaaccc agcctaagatacatgacgcttactaaccagtcagtagcaaggaaacggaatggccagcac tcgcttcaggctctgcctctgtgtgagttgctgacacacagaacatcgcagaggaggaac accgggcccaaaccggaaagccccgcctctatccgcatggaggcgggaattgccacgaag ctcctgtggagggagaggaagcagctgcggaaagccaataagagtggggaatcgatgacg tcaaccaatggggacgcggggatattacggccaatgagaatggagaaggtccaggacacg tggactgaaaaaatgcagaccgccggggcattattcatttctccagctctgccttcctac agcaacttcccactccaggtggccagacgggagttccagaccagtgttgtctcccgggac attgacacagcagccaagtttattggtgctggggcagccacagttggtgtggctggttca ggggctggcattggaaccgtgtttggcagcttgatcattggctatgccaggaacccgtct ctcaagcagcagctcttctcctatgccattcttggctttgccctgtctgaggccatgggg cttttctgtttgatggtcgccttcctcatcctcttcgccatgtga >gi568815581r:48860884_49067232|GENSCAN_predicted_peptide_3|354_aa MAESPTEEAATAGAGAAGPGASSVAGVVGVSGSGGGFGPPFLPDVWAAAAAAGGAGGPGS GLAPLPGLPPSAAAHGAALLSHWDPTLSSDWDGERTAPQCLLRIKRDIMSIYKEPPPGMF VVPDTVDMTKIHALITGPFDTPYEGGFFLFVFRCPPDYPIHPPRVKLMTTGNNTVRFNPN FYRNGKVCLSILGTWTGPAWSPAQSISSVLISIQSLMTENPYHNEPGFEQERHPGDSKNY NECIRHETIRVAVCDMMEGKCPCPEPLRGVMEKSFLEYYDFYEVACKDRLHLQGQTMQDP FGEKRGHFDYQSLLMRLGLIRQKVLERLHNENAEMDSDSSSSGTETDLHGSLRV >gi568815581r:48860884_49067232|GENSCAN_predicted_CDS_3|1065_bp atggcggagagtccgactgaggaggcggcaacggcgggcgccggggcggcgggccccggg gcgagcagcgttgctggtgttgttggcgttagcggcagcggcggcgggttcgggccgcct ttcctgccggatgtgtgggcggcggcggcggcagcgggcggggccgggggcccggggagc ggcctggctccgctgcccgggctcccgccctcagccgctgcccacggggccgcgctgctt agccactgggaccccacgctcagctccgactgggacggcgagcgcaccgcgccgcagtgt ctactccggatcaagcgggatatcatgtccatttataaggagcctcctccaggaatgttc gttgtacctgatactgttgacatgactaagattcatgcattgatcacaggcccatttgac actccttatgaagggggtttcttcctgttcgtgtttcggtgtccgcccgactatcccatc cacccacctcgggtcaaactgatgacaacgggcaataacacagtgaggtttaaccccaac ttctaccgcaatgggaaagtctgcttgagtattctaggtacatggactggacctgcctgg agcccagcccagagcatctcctcagtgctcatctctatccagtccctgatgactgagaac ccctatcacaatgagcccggctttgaacaggagagacatccaggagacagcaaaaactat aatgaatgtatccggcacgagaccatcagagttgcagtctgtgacatgatggaaggaaag tgtccctgtcctgaacccctacgaggggtgatggagaagtcctttctggagtattacgac ttctacgaggtggcctgcaaagatcgcctgcaccttcaaggccaaactatgcaggaccct tttggagagaagcggggccactttgactaccagtccctcttgatgcgcctgggactgata cgtcagaaagtgctggagaggctccataatgagaatgcagaaatggactctgatagcagt tcatctgggacagagacagaccttcatgggagcctgagggtttag >gi568815581r:48860884_49067232|GENSCAN_predicted_peptide_4|481_aa MVATKTFALLLLSLFLAVGLGEKKEGHFSALPSLPVGSHAKVSSPQPRGPRYAEGTFISD YSIAMDKIHQQDFVNWLLAQKGKKNDWKHNITQREARALELASQANRKEEEAVEPQSSPA KNPSDEDLLRDLLIQELLACLLDQTNLCRLRLRNRRERDDARLGLPPWGAGGGVRDVETR GPGSRAARGPRVGMHRRGVGAGAIAKKKLAEAKYKERGTVLAEDQLAQPFPSLYVLRVYP VIVIGEEVEEAVKQENGAMSKQLDMFKTNLEEFASKHKQEIRKNPEFRVQFQDMCATIGV DPLASGKGFWSEMLGVGDFYYELGVQIIEVCLALKHRNGGLITLEELHQQVLKGRGKFAQ DVSQDDLIRAIKKLKALGTGFGIIPVGGTYLIQSVPAELNMDHTVVLQLAEKNGYVTVSE IKASLKWETERARQVLEHLLKEGLAWLDLQAPGEAHYWLPALFTDLYSQEITAEEAREAL P >gi568815581r:48860884_49067232|GENSCAN_predicted_CDS_4|1446_bp atggtggccacgaagacctttgctctgctgctgctgtccctgttcctggcagtgggacta ggagagaagaaagagggtcacttcagcgctctcccctccctgcctgttggatctcatgct aaggtgagcagccctcaacctcgaggccccaggtacgcggaagggactttcatcagtgac tacagtattgccatggacaagattcaccaacaagactttgtgaactggctgctggcccaa aaggggaagaagaatgactggaaacacaacatcacccagagggaggctcgggcgctggag ctggccagtcaagctaataggaaggaggaggaggcagtggagccacagagctccccagcc aagaaccccagcgatgaagatttgctgcgggacttgctgattcaagagctgttggcctgc ttgctggatcagacaaacctctgcaggctcaggctgaggaaccgtcgtgaaagagatgac gcgcggctcgggcttccgccttggggagccggcggcggagtccgggacgtggagacccgg ggtcccggcagccgggcggcccgcgggcccagggtggggatgcaccgccgcggggtggga gctggcgccatcgccaagaagaaacttgcagaggccaagtataaggagcgagggacggtc ttggctgaggaccagctagcccagcccttcccttctctgtacgtgctccgagtttaccca gtgattgtgattggggaagaagtggaggaagccgttaagcaggaaaatggggctatgtca aagcagttggacatgttcaagaccaacctggaggaatttgccagcaaacacaagcaggag atccggaagaatcctgagttccgtgtgcagttccaggacatgtgtgcaaccattggcgtg gatccgctggcctctggaaaaggattttggtctgagatgctgggcgtgggggacttctat tacgaactaggtgtccaaattatcgaagtgtgcctggcgctgaagcatcggaatggaggt ctgataactttggaggaactacatcaacaggtgttgaagggaaggggcaagttcgcccag gatgtcagtcaagatgacctgatcagagccatcaagaaactaaaggcacttggcactggc ttcggcatcatccctgtgggcggcacttacctcattcagtctgttccagctgagctcaat atggatcacaccgtggtgctgcagctggcagagaagaatggctacgtgactgtcagtgag atcaaagccagtcttaaatgggagaccgagcgagcgcggcaagtgctggaacacctgctg aaggaagggttggcgtggctggacttacaggccccaggggaggcccactactggctgcca gctctcttcactgacctctactcccaggagattacagctgaggaggccagagaagccctc ccctga >gi568815581r:48860884_49067232|GENSCAN_predicted_peptide_5|596_aa MNKLYIGNLNESVTPADLEKVFAEHKISYSGQFLVKSGYAFVDCPDEHWAMKAIETFSGK VELQGKRLEIEHSVPKKQRSRKIQIRNIPPQLRWEVLDSLLAQYGTVENCEQVNTESETA VVNVTYSNREQTRQAIMKLNGHQLENHALKVSYIPDEQIAQGPENGRRGGFGSRGQPRQG SPVAAGAPAKQQQVDIPLRLLVPTQYVGAIIGKEGATIRNITKQTQSKIDVHRKENAGAA EKAISVHSTPEGCSSACKMILEIMHKEAKDTKTADEVPLKILAHNNFVGRLIGKEGRNLK KVEQDTETKITISSLQDLTLYNPERTITVKGAIENCCRAEQEIMKKVREAYENDVAAMSL QSHLIPGLNLAAVGLFPASSSAVPPPPSSVTGAAPYSSFMQAPEQEMVQVFIPAQAVGAI IGKKGQHIKQLSRFASASIKAQGRIYGKLKEENFFGPKEEVKLETHIRVPASAAGRVIGK GGKTVNELQNLTAAEVVVPRDQTPDENDQVIVKIIGHFYASQTYYSFSWGIKMKRGFVMM SSKLRPSTTGVPKPRATDWYRATQQEMAQRKIRDILAQVKQQHQKGQSNQAQARRK >gi568815581r:48860884_49067232|GENSCAN_predicted_CDS_5|1791_bp atgaacaagctttacatcggcaacctcaacgagagcgtgacccccgcggacttggagaaa gtgtttgcggagcacaagatctcctacagcggccagttcttggtcaaatccggctacgcc ttcgtggactgcccggacgagcactgggcgatgaaggccatcgaaactttctccgggaaa gtagaattacaaggaaaacgcttagagattgaacattcggtgcccaaaaaacaaaggagc cggaaaattcaaatccgaaatattccaccccagctccgatgggaagtactggacagcctg ctggctcagtatggtacagtagagaactgtgagcaagtgaacaccgagagtgagacggca gtggtgaatgtcacctattccaaccgggagcagaccaggcaagccatcatgaagctgaat ggccaccagttggagaaccatgccctgaaggtctcctacatccccgatgagcagatagca cagggacctgagaatgggcgccgagggggctttggctctcggggtcagccccgccagggc tcacctgtggcagcgggggccccagccaagcagcagcaagtggacatcccccttcggctc ctggtgcccacccagtatgtgggtgccattattggcaaggagggggccaccatccgcaac atcacaaaacagacccagtccaagatagacgtgcataggaaggagaacgcaggtgcagct gaaaaagccatcagtgtgcactccacccctgagggctgctcctccgcttgtaagatgatc ttggagattatgcataaagaggctaaggacaccaaaacggctgacgaggttcccctgaag atcctggcccataataactttgtagggcgtctcattggcaaggaaggacggaacctgaag aaggtagagcaagataccgagacaaaaatcaccatctcctcgttgcaagaccttaccctt tacaaccctgagaggaccatcactgtgaagggggccatcgagaattgttgcagggccgag caggaaataatgaagaaagttcgggaggcctatgagaatgatgtggctgccatgagcctg cagtctcacctgatccctggcctgaacctggctgctgtaggtcttttcccagcttcatcc agcgcagtcccgccgcctcccagcagcgttactggggctgctccctatagctcctttatg caggctcccgagcaggagatggtgcaggtgtttatccccgcccaggcagtgggcgccatc atcggcaagaaggggcagcacatcaaacagctctcccggtttgccagcgcctccatcaag gctcagggaagaatctatggcaaactcaaggaggagaacttctttggtcccaaggaggaa gtgaagctggagacccacatacgtgtgccagcatcagcagctggccgggtcattggcaaa ggtggaaaaacggtgaacgagttgcagaatttgacggcagctgaggtggtagtaccaaga gaccagacccctgatgagaacgaccaggtcatcgtgaaaatcatcggacatttctatgcc agtcagacatattattccttttcctggggtattaaaatgaagcgtggatttgtgatgatg tcttctaagttacgaccctctacaacaggagtccccaaaccccgggccacggactggtac cgggccacacagcaggagatggctcaacggaagatccgagacatcctggcccaggttaag cagcagcatcagaagggacagagtaaccaggcccaggcacggaggaagtga