GENSCAN 1.0 Date run: 3-Nov-116 Time: 12:46:52 Sequence gi568815581f:48897746_49149441 : 251696 bp : 46.42% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 10759 11075 317 0 2 108 89 288 0.999 25.71 1.02 Intr + 13063 13135 73 1 1 90 95 69 0.998 7.21 1.03 Intr + 15089 15276 188 1 2 71 90 180 0.999 14.99 1.04 Intr + 18331 18442 112 1 1 89 110 89 0.999 11.58 1.05 Intr + 23415 23527 113 2 2 38 92 53 0.969 -0.12 1.06 Intr + 25102 25192 91 1 1 97 90 138 0.952 14.90 1.07 Term + 29219 29389 171 1 0 116 41 119 0.958 7.83 1.08 PlyA + 31290 31295 6 1.05 2.15 PlyA - 32373 32368 6 1.05 2.14 Term - 32867 32730 138 2 0 152 42 69 0.999 6.66 2.13 Intr - 33972 33898 75 0 0 106 72 110 0.983 10.91 2.12 Intr - 35601 35460 142 2 1 47 69 222 0.692 16.56 2.11 Intr - 38497 38425 73 2 1 114 39 19 0.853 -1.94 2.10 Intr - 39379 39275 105 2 0 87 78 104 0.974 9.49 2.09 Intr - 43317 43179 139 0 1 105 58 239 0.892 22.64 2.08 Intr - 44812 44723 90 1 0 72 42 119 0.976 5.89 2.07 Intr - 46230 46180 51 0 0 123 111 33 0.996 8.20 2.06 Intr - 47093 46936 158 0 2 46 83 82 0.056 3.23 2.05 Intr - 56666 56644 23 1 2 94 90 27 0.074 0.69 2.04 Intr - 63242 63141 102 1 0 51 91 115 0.864 7.39 2.03 Intr - 64074 63982 93 2 0 92 105 79 0.998 9.08 2.02 Intr - 66735 66565 171 2 0 102 61 167 0.998 14.46 2.01 Init - 69487 69402 86 1 2 73 91 196 0.999 16.69 2.00 Prom - 72477 72438 40 -5.46 3.00 Prom + 82475 82514 40 -3.46 3.01 Init + 100001 100175 175 1 1 110 111 465 0.970 50.51 3.02 Intr + 101364 101424 61 1 1 94 97 65 0.789 5.79 3.03 Intr + 127873 127921 49 1 1 109 100 44 0.967 6.38 3.04 Intr + 128721 128772 52 2 1 94 116 54 0.992 7.28 3.05 Intr + 134165 134228 64 0 1 116 73 96 0.896 8.68 3.06 Intr + 140423 140704 282 2 0 83 89 388 0.896 34.93 3.07 Intr + 142212 142346 135 0 0 97 111 112 0.999 14.08 3.08 Intr + 143633 143755 123 2 0 81 71 102 0.994 7.50 3.09 Intr + 144497 144632 136 2 1 64 75 171 0.999 13.97 3.10 Intr + 145683 145805 123 2 0 134 78 16 0.954 5.98 3.11 Intr + 146222 146341 120 1 0 74 53 200 0.637 15.79 3.12 Intr + 148145 148276 132 1 0 94 101 179 0.864 20.64 3.13 Intr + 148515 148628 114 2 0 106 88 212 0.998 23.64 3.14 Intr + 150818 150949 132 1 0 46 83 74 0.861 3.54 3.15 Term + 151607 151699 93 1 0 106 47 95 0.886 5.03 3.16 PlyA + 152019 152024 6 1.05 4.00 Prom + 152370 152409 40 -4.16 4.01 Init + 166897 166943 47 0 2 62 94 39 0.193 2.09 4.02 Intr + 180273 180345 73 0 1 51 80 73 0.163 2.21 4.03 Intr + 184799 184918 120 1 0 65 72 41 0.025 0.89 4.04 Intr + 234978 235061 84 2 0 32 100 162 0.405 11.82 4.05 Intr + 243527 243702 176 1 2 44 85 96 0.236 3.64 4.06 Term + 244290 244431 142 0 1 86 38 104 0.555 2.60 4.07 PlyA + 244981 244986 6 -0.45 5.03 PlyA - 245094 245089 6 1.05 5.02 Term - 247861 247749 113 2 2 22 55 83 0.334 -2.88 5.01 Intr - 251011 250732 280 1 1 32 84 310 0.372 22.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 46989 46936 54 0 0 62 83 79 0.929 6.12 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:48897746_49149441|GENSCAN_predicted_peptide_1|354_aa MAESPTEEAATAGAGAAGPGASSVAGVVGVSGSGGGFGPPFLPDVWAAAAAAGGAGGPGS GLAPLPGLPPSAAAHGAALLSHWDPTLSSDWDGERTAPQCLLRIKRDIMSIYKEPPPGMF VVPDTVDMTKIHALITGPFDTPYEGGFFLFVFRCPPDYPIHPPRVKLMTTGNNTVRFNPN FYRNGKVCLSILGTWTGPAWSPAQSISSVLISIQSLMTENPYHNEPGFEQERHPGDSKNY NECIRHETIRVAVCDMMEGKCPCPEPLRGVMEKSFLEYYDFYEVACKDRLHLQGQTMQDP FGEKRGHFDYQSLLMRLGLIRQKVLERLHNENAEMDSDSSSSGTETDLHGSLRV >gi568815581f:48897746_49149441|GENSCAN_predicted_CDS_1|1065_bp atggcggagagtccgactgaggaggcggcaacggcgggcgccggggcggcgggccccggg gcgagcagcgttgctggtgttgttggcgttagcggcagcggcggcgggttcgggccgcct ttcctgccggatgtgtgggcggcggcggcggcagcgggcggggccgggggcccggggagc ggcctggctccgctgcccgggctcccgccctcagccgctgcccacggggccgcgctgctt agccactgggaccccacgctcagctccgactgggacggcgagcgcaccgcgccgcagtgt ctactccggatcaagcgggatatcatgtccatttataaggagcctcctccaggaatgttc gttgtacctgatactgttgacatgactaagattcatgcattgatcacaggcccatttgac actccttatgaagggggtttcttcctgttcgtgtttcggtgtccgcccgactatcccatc cacccacctcgggtcaaactgatgacaacgggcaataacacagtgaggtttaaccccaac ttctaccgcaatgggaaagtctgcttgagtattctaggtacatggactggacctgcctgg agcccagcccagagcatctcctcagtgctcatctctatccagtccctgatgactgagaac ccctatcacaatgagcccggctttgaacaggagagacatccaggagacagcaaaaactat aatgaatgtatccggcacgagaccatcagagttgcagtctgtgacatgatggaaggaaag tgtccctgtcctgaacccctacgaggggtgatggagaagtcctttctggagtattacgac ttctacgaggtggcctgcaaagatcgcctgcaccttcaaggccaaactatgcaggaccct tttggagagaagcggggccactttgactaccagtccctcttgatgcgcctgggactgata cgtcagaaagtgctggagaggctccataatgagaatgcagaaatggactctgatagcagt tcatctgggacagagacagaccttcatgggagcctgagggtttag >gi568815581f:48897746_49149441|GENSCAN_predicted_peptide_2|481_aa MVATKTFALLLLSLFLAVGLGEKKEGHFSALPSLPVGSHAKVSSPQPRGPRYAEGTFISD YSIAMDKIHQQDFVNWLLAQKGKKNDWKHNITQREARALELASQANRKEEEAVEPQSSPA KNPSDEDLLRDLLIQELLACLLDQTNLCRLRLRNRRERDDARLGLPPWGAGGGVRDVETR GPGSRAARGPRVGMHRRGVGAGAIAKKKLAEAKYKERGTVLAEDQLAQPFPSLYVLRVYP VIVIGEEVEEAVKQENGAMSKQLDMFKTNLEEFASKHKQEIRKNPEFRVQFQDMCATIGV DPLASGKGFWSEMLGVGDFYYELGVQIIEVCLALKHRNGGLITLEELHQQVLKGRGKFAQ DVSQDDLIRAIKKLKALGTGFGIIPVGGTYLIQSVPAELNMDHTVVLQLAEKNGYVTVSE IKASLKWETERARQVLEHLLKEGLAWLDLQAPGEAHYWLPALFTDLYSQEITAEEAREAL P >gi568815581f:48897746_49149441|GENSCAN_predicted_CDS_2|1446_bp atggtggccacgaagacctttgctctgctgctgctgtccctgttcctggcagtgggacta ggagagaagaaagagggtcacttcagcgctctcccctccctgcctgttggatctcatgct aaggtgagcagccctcaacctcgaggccccaggtacgcggaagggactttcatcagtgac tacagtattgccatggacaagattcaccaacaagactttgtgaactggctgctggcccaa aaggggaagaagaatgactggaaacacaacatcacccagagggaggctcgggcgctggag ctggccagtcaagctaataggaaggaggaggaggcagtggagccacagagctccccagcc aagaaccccagcgatgaagatttgctgcgggacttgctgattcaagagctgttggcctgc ttgctggatcagacaaacctctgcaggctcaggctgaggaaccgtcgtgaaagagatgac gcgcggctcgggcttccgccttggggagccggcggcggagtccgggacgtggagacccgg ggtcccggcagccgggcggcccgcgggcccagggtggggatgcaccgccgcggggtggga gctggcgccatcgccaagaagaaacttgcagaggccaagtataaggagcgagggacggtc ttggctgaggaccagctagcccagcccttcccttctctgtacgtgctccgagtttaccca gtgattgtgattggggaagaagtggaggaagccgttaagcaggaaaatggggctatgtca aagcagttggacatgttcaagaccaacctggaggaatttgccagcaaacacaagcaggag atccggaagaatcctgagttccgtgtgcagttccaggacatgtgtgcaaccattggcgtg gatccgctggcctctggaaaaggattttggtctgagatgctgggcgtgggggacttctat tacgaactaggtgtccaaattatcgaagtgtgcctggcgctgaagcatcggaatggaggt ctgataactttggaggaactacatcaacaggtgttgaagggaaggggcaagttcgcccag gatgtcagtcaagatgacctgatcagagccatcaagaaactaaaggcacttggcactggc ttcggcatcatccctgtgggcggcacttacctcattcagtctgttccagctgagctcaat atggatcacaccgtggtgctgcagctggcagagaagaatggctacgtgactgtcagtgag atcaaagccagtcttaaatgggagaccgagcgagcgcggcaagtgctggaacacctgctg aaggaagggttggcgtggctggacttacaggccccaggggaggcccactactggctgcca gctctcttcactgacctctactcccaggagattacagctgaggaggccagagaagccctc ccctga >gi568815581f:48897746_49149441|GENSCAN_predicted_peptide_3|596_aa MNKLYIGNLNESVTPADLEKVFAEHKISYSGQFLVKSGYAFVDCPDEHWAMKAIETFSGK VELQGKRLEIEHSVPKKQRSRKIQIRNIPPQLRWEVLDSLLAQYGTVENCEQVNTESETA VVNVTYSNREQTRQAIMKLNGHQLENHALKVSYIPDEQIAQGPENGRRGGFGSRGQPRQG SPVAAGAPAKQQQVDIPLRLLVPTQYVGAIIGKEGATIRNITKQTQSKIDVHRKENAGAA EKAISVHSTPEGCSSACKMILEIMHKEAKDTKTADEVPLKILAHNNFVGRLIGKEGRNLK KVEQDTETKITISSLQDLTLYNPERTITVKGAIENCCRAEQEIMKKVREAYENDVAAMSL QSHLIPGLNLAAVGLFPASSSAVPPPPSSVTGAAPYSSFMQAPEQEMVQVFIPAQAVGAI IGKKGQHIKQLSRFASASIKAQGRIYGKLKEENFFGPKEEVKLETHIRVPASAAGRVIGK GGKTVNELQNLTAAEVVVPRDQTPDENDQVIVKIIGHFYASQTYYSFSWGIKMKRGFVMM SSKLRPSTTGVPKPRATDWYRATQQEMAQRKIRDILAQVKQQHQKGQSNQAQARRK >gi568815581f:48897746_49149441|GENSCAN_predicted_CDS_3|1791_bp atgaacaagctttacatcggcaacctcaacgagagcgtgacccccgcggacttggagaaa gtgtttgcggagcacaagatctcctacagcggccagttcttggtcaaatccggctacgcc ttcgtggactgcccggacgagcactgggcgatgaaggccatcgaaactttctccgggaaa gtagaattacaaggaaaacgcttagagattgaacattcggtgcccaaaaaacaaaggagc cggaaaattcaaatccgaaatattccaccccagctccgatgggaagtactggacagcctg ctggctcagtatggtacagtagagaactgtgagcaagtgaacaccgagagtgagacggca gtggtgaatgtcacctattccaaccgggagcagaccaggcaagccatcatgaagctgaat ggccaccagttggagaaccatgccctgaaggtctcctacatccccgatgagcagatagca cagggacctgagaatgggcgccgagggggctttggctctcggggtcagccccgccagggc tcacctgtggcagcgggggccccagccaagcagcagcaagtggacatcccccttcggctc ctggtgcccacccagtatgtgggtgccattattggcaaggagggggccaccatccgcaac atcacaaaacagacccagtccaagatagacgtgcataggaaggagaacgcaggtgcagct gaaaaagccatcagtgtgcactccacccctgagggctgctcctccgcttgtaagatgatc ttggagattatgcataaagaggctaaggacaccaaaacggctgacgaggttcccctgaag atcctggcccataataactttgtagggcgtctcattggcaaggaaggacggaacctgaag aaggtagagcaagataccgagacaaaaatcaccatctcctcgttgcaagaccttaccctt tacaaccctgagaggaccatcactgtgaagggggccatcgagaattgttgcagggccgag caggaaataatgaagaaagttcgggaggcctatgagaatgatgtggctgccatgagcctg cagtctcacctgatccctggcctgaacctggctgctgtaggtcttttcccagcttcatcc agcgcagtcccgccgcctcccagcagcgttactggggctgctccctatagctcctttatg caggctcccgagcaggagatggtgcaggtgtttatccccgcccaggcagtgggcgccatc atcggcaagaaggggcagcacatcaaacagctctcccggtttgccagcgcctccatcaag gctcagggaagaatctatggcaaactcaaggaggagaacttctttggtcccaaggaggaa gtgaagctggagacccacatacgtgtgccagcatcagcagctggccgggtcattggcaaa ggtggaaaaacggtgaacgagttgcagaatttgacggcagctgaggtggtagtaccaaga gaccagacccctgatgagaacgaccaggtcatcgtgaaaatcatcggacatttctatgcc agtcagacatattattccttttcctggggtattaaaatgaagcgtggatttgtgatgatg tcttctaagttacgaccctctacaacaggagtccccaaaccccgggccacggactggtac cgggccacacagcaggagatggctcaacggaagatccgagacatcctggcccaggttaag cagcagcatcagaagggacagagtaaccaggcccaggcacggaggaagtga >gi568815581f:48897746_49149441|GENSCAN_predicted_peptide_4|213_aa MNISEINEVIRSALPSISSFFLDAGQELKNHRMRVQTIAQNPSATCEGGFWKMTDQMKQR SAVPAEASLDYSVCQQSDTWGGAVGGLAARLRDIRQSEGGGIRDDFGRILVIILVLGIVG FMFGSMFLQAVFSSPKPELPSPAPGVQKLKLLPEERLRNLFSYDGIWLFPKNQCKCEANK EQGGYNFQDAYGQSDLPAVKARRQAEFEHFQRR >gi568815581f:48897746_49149441|GENSCAN_predicted_CDS_4|642_bp atgaacatcagtgaaatcaatgaagtaatcagatctgcgctgcccagcatatcctcattc ttcctggatgcaggacaagagctcaagaaccaccgaatgcgggtacaaactatagcacag aacccttctgccacttgtgaaggaggcttctggaagatgacagaccagatgaagcagaga tcagcagttccagctgaggcttctttagactactcagtctgccaacaatcagacacatgg ggcggggcagtcggcggcctggctgctaggctccgtgacatccggcagtctgagggcggc gggattcgggatgacttcgggcggatattggtcataatcctggtacttggcattgttgga tttatgttcggaagcatgttccttcaagcagtgttcagcagccccaagccagaactccca agtcctgccccgggtgtccagaagctgaagcttctgcctgaggaacgtctcaggaacctc ttttcctacgatggaatctggctgttcccgaaaaatcagtgcaaatgtgaagccaacaaa gagcagggaggttacaactttcaggatgcctatggccagagcgacctcccagcggtgaaa gcgaggagacaggctgaatttgaacactttcagaggaggtaa >gi568815581f:48897746_49149441|GENSCAN_predicted_peptide_5|130_aa EGVTVAKKDVHMPKHLELADKKVPDLHVMKVVQSLKSGGFVQERFAWRHFYAYLTNKGIQ YLHDDLHLPPEIVLATLCHSCPETGRSLPEGLEGKKTLPAEVLAEGKGNTEWVVKEDNQY QLRPCDQLQK >gi568815581f:48897746_49149441|GENSCAN_predicted_CDS_5|393_bp gagggagtcacggtggccaagaaggatgtccacatgcctaagcacctggagctggcagac aagaaggtgcccgaccttcatgtcatgaaggttgtgcagtctctcaagtccggaggcttt gtgcaggaacggtttgcctggagacatttctacgcgtaccttaccaacaagggtatccag tatctccatgatgacctccatctgcccccagagattgtgcttgccaccttatgccacagc tgtccagagactggcaggtctctgcctgaaggtctggagggaaaaaaaacattacctgct gaggtgcttgctgaaggcaaggggaatacagaatgggtagtaaaagaagataatcaatac cagctacgaccatgtgaccagctgcagaaatga