GENSCAN 1.0 Date run: 4-Nov-116 Time: 08:13:59 Sequence gi568815582f:83708077_83909420 : 201344 bp : 44.31% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 9387 9460 74 1 2 96 72 110 0.465 9.33 1.02 Intr + 12586 12725 140 0 2 82 121 6 0.731 2.56 1.03 Intr + 16659 16724 66 0 0 95 80 58 0.222 3.72 1.04 Term + 23741 23825 85 2 1 67 48 66 0.010 -2.37 1.05 PlyA + 25623 25628 6 1.05 2.00 Prom + 29263 29302 40 -5.16 2.01 Init + 33680 34093 414 1 0 98 47 161 0.435 9.48 2.02 Intr + 36467 36537 71 1 2 83 67 31 0.362 -1.52 2.03 Intr + 40032 40174 143 0 2 63 89 178 0.553 15.40 2.04 Intr + 69077 69322 246 0 0 76 55 101 0.682 3.03 2.05 Intr + 71892 72125 234 1 0 115 98 212 0.978 22.56 2.06 Intr + 75178 75396 219 2 0 117 89 137 0.952 14.97 2.07 Intr + 82028 82086 59 0 2 76 109 11 0.029 0.60 2.08 Term + 86947 86964 18 0 0 109 45 22 0.054 -1.68 2.09 PlyA + 88044 88049 6 1.05 3.10 PlyA - 88223 88218 6 1.05 3.09 Term - 88741 88587 155 2 2 51 47 101 0.322 0.38 3.08 Intr - 94804 94710 95 0 2 81 97 17 0.281 1.51 3.07 Intr - 96486 96432 55 1 1 116 56 53 0.203 2.94 3.06 Intr - 101164 101090 75 2 0 120 86 74 0.376 9.89 3.05 Intr - 115953 115849 105 1 0 63 76 59 0.188 2.39 3.04 Intr - 120490 120344 147 2 0 97 77 86 0.117 8.61 3.03 Intr - 139401 139320 82 0 1 83 57 27 0.018 -1.69 3.02 Intr - 143466 143414 53 1 2 118 53 83 0.292 6.43 3.01 Init - 165216 165138 79 0 1 79 35 51 0.011 0.12 3.00 Prom - 175122 175083 40 -2.26 4.00 Prom + 176193 176232 40 -3.86 4.01 Init + 191069 191596 528 1 0 90 90 1203 0.529 113.55 4.02 Intr + 195420 195443 24 2 0 113 88 -1 0.098 0.62 4.03 Term + 195898 195945 48 0 0 45 37 107 0.153 -1.40 4.04 PlyA + 196068 196073 6 -1.75 5.02 PlyA - 196113 196108 6 -0.45 5.01 Term - 197542 197386 157 0 1 116 38 163 0.987 11.51 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582f:83708077_83909420|GENSCAN_predicted_peptide_1|121_aa XNRLRAAEKQAQAILLKDDTIRKKQEAAKTGCQTPLPRSTGFHTLRKEGIRLMRARNEID AIPNPGPVHPGSLSSVSVRSKMIDEVSAGTQPAGLQTGKCKMSKILVYKLAQRFVEDLNA A >gi568815582f:83708077_83909420|GENSCAN_predicted_CDS_1|366_bp ngaaacaggctcagggcagctgagaaacaggcccaggccatcctgctgaaggacgacacc atcaggaagaaacaggaagcagcaaagacagggtgccagacacctttgccacgctccact ggcttccatactctgagaaaagagggaatccgactcatgagagccagaaatgaaattgat gcaatcccaaacccagggccagtccatcctggaagcctgagcagtgtcagcgtcaggagc aagatgattgatgaagtgtcagcagggacacaaccagctggacttcagacaggaaaatgt aaaatgagcaaaattctcgtgtataaactagctcaaaggtttgtggaagatttaaatgct gcatga >gi568815582f:83708077_83909420|GENSCAN_predicted_peptide_2|467_aa MLRKHGSPTSGVPNPWATDGPNPWATTSDCYWSTNPIVNCTHKGSRRHTPYENLMPELRR NSFISKPSPTCSWKNCLPRNQSMVPKRLGTAALPHRKRVTPRSLQKGLVKPLISFTFPPI LEPLRAEAGSPWLWKSFQLQHQNIVPLGSAESDVSETFQGTVYSVYKDPAGWLNINPING TVDTTAVLDRESPFVDNSVYTALFLAIDSGSHISYQSNRVLSEDKSDQIVPGSLPFRKAS GGLVHSVELVQVPCDATQGPPGAHAILSALSSGGAAPPRHPMTLHVEPHGEGNPPATGTG TLLITLEDVNDNAPFIYPTVAEVCDDAKNLSVVILGASDKDLHPNTDPFKFEIHKQAVPD KVWKISKINNTHALVSLLQNLNKANYNLPIMVTDSGKPPMTNITDLRVQVCSCRNSKVDC NAAGALRFSLPSVLLLSLFSLALVWQLSHFLDEKTDPQKGLQVCENS >gi568815582f:83708077_83909420|GENSCAN_predicted_CDS_2|1404_bp atgctgcgcaaacatgggtcccctacatcaggggtccccaacccctgggccacggacggc cccaacccctgggccacaacatcagattgttattggagcacaaatcctattgtgaactgc acacacaagggatctaggcggcacactccttatgagaatctaatgcctgaactgaggagg aacagtttcatctccaagccatcccccacctgttcatggaaaaattgtcttccacgaaac cagtccatggtgccaaagaggttggggactgctgccctacctcacagaaagcgagtgacc ccaagaagcctgcagaagggacttgtcaaacctttgatctctttcaccttcccaccaata cttgagcccctgagggcagaggctggcagcccttggctctggaagtccttccagttgcag catcaaaacattgttcccctaggctctgctgagtcggatgtttctgaaacattccagggg accgtgtattctgtttacaaggacccagcaggttggctgaatattaaccccatcaatggg actgttgacaccacagctgtgctggaccgtgagtccccatttgtcgacaacagcgtgtac actgctctcttcctggcaattgacagtgggtctcacatttcctatcaatccaatcgtgtg ctttctgaagataagtcagatcagattgttcctggatcactgcctttccggaaagcctct ggtggcctcgtgcactctgtagaattggtacaggtcccttgtgatgctactcaaggccct ccaggagcccatgccatcctcagtgccctcagctctggaggagcagctccaccacggcac cccatgactttgcatgtagagccacatggggagggcaaccctcccgctacgggcactggg actttgctgataaccctggaggacgtgaatgacaatgccccgttcatttaccccacagta gctgaagtctgtgatgatgccaaaaacctcagtgtagtcattttgggagcatcagataag gatcttcacccgaatacagatcctttcaaatttgaaatccacaaacaagctgttcctgat aaagtctggaagatctccaagatcaacaatacacacgccctggtaagccttcttcaaaat ctgaacaaagcaaactacaacctgcccatcatggtgacagattcagggaaaccacccatg acgaatatcacagatctcagggtacaagtgtgctcctgcaggaattccaaagtggactgc aacgcggcaggggccctgcgcttcagcctgccctcagtcctgctcctcagcctcttcagc ttagctttggtctggcagttgtcccattttctagatgagaaaactgatcctcagaaaggt ctccaggtctgtgagaactcctga >gi568815582f:83708077_83909420|GENSCAN_predicted_peptide_3|281_aa MKAIYEKSTANIIINGEKLKAFPLRSGKYTFDIVTGAEGAVAEEVLHEAVYLLRLNREGV TLGTEWVLDLSKAGTAIGFSALPSSKEDLAWTDHFKGCFHEVESGGESEDEKGCRFLGQD ACQDMHLSNDQMFAEHLLRAGHCAGTELVKMTMFPDYSLKHNDFRETGLPIFFRHCYTLV GCLMLLLSNLHLMGGATAGTILQVLVSLRRRMLSRWHEEGLGHDSHHPSKADYDTLRSYH EVDKMSPWPMLRTGTSVFIDSLTNQRCTTDPKWSVKNVNLH >gi568815582f:83708077_83909420|GENSCAN_predicted_CDS_3|846_bp atgaaggccatatacgaaaagtccacagctaacatcataatcaacggagaaaaattgaaa gcttttcctctaagatctggtaaatacacctttgacattgtcacaggtgctgaaggagct gtggctgaggaggtcctccatgaagcagtctacctcctcaggctaaaccgggaaggtgtc acccttgggacagagtgggtacttgatctctctaaggcgggcactgccattggcttctct gctctcccatcttccaaggaggatctggcttggactgaccattttaaggggtgctttcat gaagtagaatctggaggtgagtcggaggatgaaaagggctgcaggttcctgggacaagat gcttgccaggatatgcatctatccaatgatcaaatgtttgctgagcatctacttcgtgcc gggcactgtgctggtacagaacttgtgaagatgacaatgtttccagactacagccttaaa cacaacgactttagggaaactggtcttccaattttctttcgacactgttacacgctggtg ggttgcctcatgctgctgctgagcaaccttcacctgatgggcggtgccacagcaggaaca atccttcaggtgctagtaagcctcaggaggaggatgctgtcccgctggcacgaggagggg ctggggcacgacagtcatcatccaagcaaagctgattatgacaccctcagatcatatcat gaggtggacaagatgagtccatggcccatgctgagaactggtacttctgtcttcatagac tcactcaccaaccagagatgtacgacggaccccaaatggtctgtcaaaaatgtgaacttg cactaa >gi568815582f:83708077_83909420|GENSCAN_predicted_peptide_4|199_aa MRGFGPGLTARRLLPLRLPPRPPGPRLASGQAAGALERAMDELLRRAVPPTPAYELREKT PAPAEGQCADFVSFYGGLAETAQRAELLGRLARGFGVDHGQVAEQSAGVLHLRQQQREAA VLLQAEDRLRYALVPRYRGLFHHISKLDGGVRFLVQLRADLLEAQALKLVEGPDVRSRAS GQGKPEQLLPVLELSNGDF >gi568815582f:83708077_83909420|GENSCAN_predicted_CDS_4|600_bp atgcgaggcttcgggccaggcttgacggccaggcgtctcctcccgctgcggttgcccccg cggccgcccgggccccggctggcgagcgggcaggcggccggcgccctggagcgggccatg gacgagctgctgcgccgcgcggtgccgccgacgccggcctacgagctgcgcgagaagaca ccggcgcccgccgagggtcagtgcgcggacttcgtgagcttctacggtgggctggccgag acggcccagcgggccgaactgctgggccgcctggcgcggggcttcggcgtggaccacggc caggtggcggagcagagcgccggcgtgctccatctgcgccagcagcagcgggaggcggcg gtgctgctgcaggccgaggaccggctgcgctacgcgctggtgccgcgctatcgcggcctc ttccaccacatcagcaagctggacggcggcgtgcgcttcctggtgcagctgcgggccgac ctgctggaggcgcaggccctcaagctggtggaggggccggacgtccggagcagggcttct ggacaagggaagcctgaacagctgctgccggtgttagaactgagcaacggagacttctaa >gi568815582f:83708077_83909420|GENSCAN_predicted_peptide_5|52_aa XLEGGVMAGIGAAILGHDVALEMKVFLGRATRFKSESTKSTKEQNQLWAAHL >gi568815582f:83708077_83909420|GENSCAN_predicted_CDS_5|159_bp nnactggaaggcggtgtgatggctggcattggagcagccatcctgggccacgacgtggct ttggagatgaaggtttttctcggcagagcaacaagattcaagtctgaatccaccaagagc accaaagagcagaatcagctctgggctgcccacctctag