GENSCAN 1.0 Date run: 4-Nov-116 Time: 18:37:19 Sequence gi568815584f:76339354_76598914 : 259561 bp : 48.08% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3141 3294 154 2 1 74 18 174 0.052 9.16 1.02 Term + 10962 11098 137 1 2 59 54 102 0.216 2.08 1.03 PlyA + 11904 11909 6 1.05 2.00 Prom + 11972 12011 40 -4.26 2.01 Init + 35119 35277 159 0 0 41 85 128 0.317 7.62 2.02 Intr + 36514 36609 96 0 0 66 59 58 0.131 0.91 2.03 Intr + 42685 42825 141 0 0 36 80 115 0.146 6.05 2.04 Intr + 57346 57455 110 0 2 119 18 36 0.421 -1.22 2.05 Intr + 57834 58092 259 0 1 94 80 72 0.525 4.47 2.06 Intr + 64254 64353 100 2 1 65 83 97 0.686 6.58 2.07 Term + 84098 84255 158 0 2 58 41 154 0.388 5.80 2.08 PlyA + 88991 88996 6 1.05 3.00 Prom + 89420 89459 40 -3.36 3.01 Init + 95419 95456 38 0 2 85 29 72 0.591 -1.50 3.02 Intr + 95582 95683 102 2 0 102 81 21 0.669 2.09 3.03 Intr + 99988 100397 410 1 2 112 100 797 0.938 77.21 3.04 Intr + 119806 119904 99 2 0 107 39 34 0.364 0.48 3.05 Intr + 123192 123308 117 1 0 97 116 197 0.999 23.84 3.06 Intr + 124975 125079 105 2 0 87 30 95 0.424 3.79 3.07 Intr + 142663 142773 111 2 0 109 84 127 0.994 14.75 3.08 Intr + 143245 143406 162 2 0 87 99 231 0.645 24.05 3.09 Intr + 152094 152363 270 1 0 142 107 648 0.990 69.61 3.10 Intr + 155277 155462 186 1 0 92 61 40 0.363 1.36 3.11 Intr + 158861 159099 239 0 2 96 70 515 0.799 47.83 3.12 Intr + 160203 160351 149 2 2 78 91 36 0.610 1.93 3.13 Intr + 172560 172635 76 0 1 69 83 100 0.557 7.12 3.14 Intr + 173047 173202 156 0 0 42 111 67 0.916 4.61 3.15 Term + 176663 176761 99 1 0 79 49 41 0.447 -2.57 3.16 PlyA + 176867 176872 6 1.05 4.00 Prom + 179519 179558 40 -3.16 4.01 Init + 197438 197587 150 1 0 115 80 112 0.506 11.35 4.02 Intr + 201299 201412 114 1 0 56 75 40 0.223 0.14 4.03 Intr + 216412 216448 37 0 1 93 92 26 0.150 1.34 4.04 Intr + 223519 223725 207 2 0 52 49 91 0.042 0.65 4.05 Intr + 225705 225733 29 1 2 114 82 -12 0.651 -1.37 4.06 Term + 230115 230219 105 2 0 135 49 55 0.871 4.71 4.07 PlyA + 230503 230508 6 1.05 5.00 Prom + 234881 234920 40 -2.76 5.01 Sngl + 253739 254284 546 1 0 65 45 186 0.344 8.10 5.02 PlyA + 255152 255157 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 27063 27464 402 2 0 66 45 170 0.896 6.77 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:76339354_76598914|GENSCAN_predicted_peptide_1|96_aa MLMLVLPLAAAVVISTRSGEEPEAQTGKERNGRGCLLKVIGYGAKDFWFEFENTVLDTEQ FKTALALEKFATFLERQDICLRMFVHPEKCQQAEGV >gi568815584f:76339354_76598914|GENSCAN_predicted_CDS_1|291_bp atgctgatgctggtgctcccacttgcagctgctgtggtcatctccactcgttcaggtgag gagcctgaagctcagacaggcaaagagaggaacgggcgtggctgtctgcttaaggttata ggctatggagccaaagacttctggtttgaatttgaaaacactgtgctggacacagaacaa ttcaaaacagcccttgccctcgagaagtttgctaccttcctagagagacaggacatatgc ctgcgaatgtttgtacatccagaaaaatgtcagcaagctgaaggggtttga >gi568815584f:76339354_76598914|GENSCAN_predicted_peptide_2|340_aa MPSLEAKDWGLFPNCNNGTLDVHFTNRSLAAVCLENGFQQGRKDAEKPVTEVLTHLAVGH SPPPPFNCEKAGLRALTPSTPTAQESVKPFCAANKAPVVEDVSEPRNIKSRVHPKRVSTV GNLCVHMCECVPCTCYQACASACSRTNPDRGLARRLNFRRLNSRSQRWRAFGFRSQIDLH LQRLPVGPEPPILQMPQEGACRGDPGPPSGQDPSQRHPAGQSPRASLGSRTSWSDRSDFP SLPSDATHAKQPKRQRDPAQLYISLLHEAPVPSEVVPESAREVEVVTGDSLNQDDQNSIE TDGKVDPLRTFVGSTGPQGLCGFCSWGAGGLLFSDLDPVL >gi568815584f:76339354_76598914|GENSCAN_predicted_CDS_2|1023_bp atgccttccttggaggccaaggactggggcctttttcctaattgcaacaacgggacattg gatgtgcattttacaaatcgctccctggctgctgtgtgcctggagaatggatttcagcag ggcagaaaggatgccgaaaaaccagtcactgaggtcctgacccacctggcggttggacat tcccccccacccccctttaactgcgagaaggcaggccttagggctctaacacccagtact cctactgctcaggagagtgtcaagcccttctgtgctgccaataaagcccctgttgtagag gatgtttctgaacctcgaaacatcaaaagccgtgtccaccccaagagagtttccaccgtg ggaaacctgtgtgtgcacatgtgtgaatgtgttccgtgcacatgctaccaagcctgtgcc tctgcctgcagcaggacaaacccagacagaggcttagcccggaggctcaacttccggagg ctcaactcccggagccagcgctggcgggccttcggattcaggtcacagatagacttgcac ctccagaggctgcctgtggggcctgagccccccatcttacagatgcctcaggagggggct tgtagaggtgacccaggaccccctagcggccaagaccccagtcaacggcacccagcaggc cagtcccccagggccagtctgggcagcagaacctcctggagtgaccgctctgactttcct tctctgccctctgatgctacccacgcaaaacagccaaagcgccagagagaccctgctcag ctctacatttccttgctgcacgaggccccggtgccctcagaggtggtccctgagagtgcc agagaggtggaggtggtgacaggtgactctttaaaccaggatgatcagaattcgatcgaa actgatggcaaggtggatcctttaagaacttttgtgggctccactggaccccaggggctc tgtgggttctgctcctggggcgctggtggactcctcttctcggaccttgatcctgtcctg tga >gi568815584f:76339354_76598914|GENSCAN_predicted_peptide_3|772_aa MRSLPGAALAQAWTQVNGTGSELQPFPSPRTQVNGTGSELQPFPSVRLLNRMSSDDRHLG SSCGSFIKTEPSSPSSGIDALSHHSPSGSSDASGGFGLALGTHANGLDSPPMFAGAGLGG TPCRKSYEDCASGIMEDSAIKCEYMLNAIPKRLCLVCGDIASGYHYGVASCEACKAFFKR TIQDEKNLFTSGVAGNMKSLGSVKQSCGCSLTSIRQWNIEYSCPATNECEITKRRRKSCQ ACRFMKCLKVGMLKEGPSEDKGPDLRRFVNTTWTIMCYRNWTFHFRLPFPGVRLDRVRGG RQKYKRRLDSESSPYLSLQISPPAKKPLTKIVSYLLVAEPDKLYAMPPPGMPEGDIKALT TLCDLADRELVVIIGWAKHIPGFSSLSLGDQMSLLQSAWMEILILGIVYRSLPYDDKLVY AEDYIMDEEHSRLAGLLELYRAILQLVRRYKKLKVEKEEFVTLKALALANSGFDKCIMTC FYNDVSYKIASRPCASPTQRGAFEPGIAEGFEVLSGGGGAKAVKAEEARCTKPDSMYIED LEAVQKLQDLLHEALQDYELSQRHEEPWRTGKLLLTLPLLRQTAAKAVQHFYSVKLQGKV PMHKLFLEMLEAKHHATGLVYQCHRRGALLPHLGYQRFGVAPLGTASTPALWQDPCSPPG CARSSCGDVFSVLSIGMELPSKAQLQVRFYTFGMFVDHGDGDCSQHNVYFSSTVKEGNRQ SPEVASHGSSRPAKKSAGHQRGEQQADRSAVHITSVTARDPTLFVQQYLTSQ >gi568815584f:76339354_76598914|GENSCAN_predicted_CDS_3|2319_bp atgaggtccctgcctggagcagctcttgcccaggcctggactcaagtgaatgggacaggc tctgagctccagccttttccttctccgaggactcaagtgaatgggacaggctctgagctc cagccttttccttctgtaaggctgctgaacaggatgtcctcggacgacaggcacctgggc tccagctgcggctccttcatcaagactgagccgtccagcccgtcctcgggcatcgatgcc ctcagccaccacagccccagtggctcgtccgacgccagcggcggctttggcctggccctg ggcacccacgccaacggtctggactcgccacccatgtttgcaggcgccgggctgggaggc accccatgccgcaagagctacgaggactgtgccagcggcatcatggaggactcggccatc aagtgcgagtacatgctcaacgccatccccaagcgcctgtgcctcgtgtgcggggacatt gcctctggctaccactacggcgtggcctcctgcgaggcttgcaaggccttcttcaagagg actatccaagatgagaaaaatctgtttacctccggggttgctgggaacatgaaatccttg ggatctgtaaagcagagctgtggctgctccctgacaagcattcgccagtggaacattgag tacagctgcccggccaccaacgagtgcgagatcaccaaacggaggcgcaagtcctgccag gcctgccgcttcatgaaatgcctcaaagtggggatgctgaaggaagggccctctgaagat aagggcccagacctaagacgctttgtcaatacaacttggaccatcatgtgctatagaaat tggaccttccacttccgtttaccatttccaggtgtgcgccttgatcgagtgcgtggaggc cgtcagaaatacaagcgacggctggactcagagagcagcccatacctgagcttacaaatt tctccacctgctaaaaagccattgaccaagattgtctcatacctactggtggctgagccg gacaagctctatgccatgcctccccctggtatgcctgagggggacatcaaggccctgacc actctctgtgacctggcagaccgagagcttgtggtcatcattggctgggccaagcacatc ccaggcttctcaagcctctccctgggggaccagatgagcctgctgcagagtgcctggatg gaaatcctcatcctgggcatcgtgtaccgctcgctgccctatgacgacaagctggtgtac gctgaggactacatcatggatgaggagcactcccgcctcgcggggctgctggagctctac cgggccatcctgcagctggtacgcaggtacaagaagctcaaggtggagaaggaggagttt gtgacgctcaaggccctggccctcgccaactccggttttgacaaatgcataatgacatgt ttctacaatgacgtatcatacaaaatagcatcaaggccctgtgcttcacctacacagagg ggagcatttgaaccaggcattgcagagggcttcgaggttctgtcaggtggaggaggagca aaagcagttaaagcagaggaagccaggtgcacaaagccagattccatgtacatcgaggat ctagaggctgtccagaagctgcaggacctgctgcacgaggcactgcaggactacgagctg agccagcgccatgaggagccctggaggacgggcaagctgctgctgacactgccgctgctg cggcagacggccgccaaggccgtgcagcacttctatagcgtcaaactgcagggcaaagtg cccatgcacaaactcttcctggagatgctggaggccaagcatcatgccacagggctagtg taccagtgccacaggaggggtgccctcctaccacacttgggctaccagaggtttggtgta gctcccctgggcaccgcgagcacgccagctctctggcaggacccctgcagtccccctggc tgtgccagaagtagctgtggggacgtcttcagtgtgctaagcatcggcatggagctgcct tcaaaggcacagctgcaggtgaggttttatacttttggtatgtttgttgatcacggtgat ggtgattgctctcaacacaatgtctacttctcctcgacggtcaaggagggaaatagacag agcccagaggtggccagtcatggttcctcaagacctgccaagaagagtgcaggccaccag cgtggggagcagcaggcagataggtccgcagttcacataacctccgtaactgcccgtgat ccaacgctgtttgtccaacagtatttaacaagtcagtaa >gi568815584f:76339354_76598914|GENSCAN_predicted_peptide_4|213_aa MEQGAALVGEARAAQEPTEAGEGSGMAGFSPEGCPAGRQLRPGEKSSTAQMLKETSHTCA SHLNRTGHLSKSIRYLACQVLTFSDLYEKISPKYPNLNNHDPLLAPLSLPEDRHTFTQGS QAELPVLPVPTQLSSARGHESGVRVANVSLYFITHMRHLITSLLDSGQRGQELRGEASQV WNSNLLYSRGFGALIQSTNNNNNSKRKSRASYT >gi568815584f:76339354_76598914|GENSCAN_predicted_CDS_4|642_bp atggagcagggggcggcgctcgtcggggaggctcgggctgcacaggaacccacggaggcg ggggaaggctcaggcatggcaggcttcagtcccgagggctgccccgcgggaaggcagcta aggcccggcgagaaatcgagcacagcgcagatgctgaaggaaacaagccacacctgtgcg tcccatctcaacagaactgggcacctgtcaaaatccatccgctacttagcttgtcaagtt ttgaccttctctgacctctatgagaaaatatctcccaaataccccaatttgaataaccac gatcccctccttgctcctctgagtctgcccgaggacagacacaccttcacccagggcagc caagcagagctacccgtgctgcctgtacctacacagctgtcatctgcccgtgggcatgag agtggggtccgtgtggccaatgtctccttgtattttattactcacatgcgacacttaatt accagccttctggattctggacaaagaggtcaggaactcaggggagaggcctcccaggtt tggaattcaaatttactctactcccgtggttttggagctctaatccaatcaactaacaac aacaacaacagcaaaaggaagtctagggccagttatacctaa >gi568815584f:76339354_76598914|GENSCAN_predicted_peptide_5|181_aa MGACVKGTESPPRSRNLSPDQVVCESSQPVIVLLSSSKQWEQRPSRNQFRTNAPGHQSPF VSKREKPPETHQNQGYQQHAVTWVDIGITHLQGLIHASEGGKMLLAHTEVPRGVLNGHFW VGMRLKEGPGTTLIPKLHKSPMGLCPAECLMTLGMQCVCHAGEAQSEGGALTSSWSALWA V >gi568815584f:76339354_76598914|GENSCAN_predicted_CDS_5|546_bp atgggagcatgtgtgaaaggcaccgagtcacctcctaggtcccgtaacctgtcccctgac caggttgtctgtgagtccagtcagccagtgattgtcttattgtcctcatcaaagcaatgg gaacaaaggccttctagaaaccagttcaggacaaacgcccctggacatcaaagtcccttt gtgtctaagagagagaagcccccagaaacccatcagaaccagggctaccagcagcacgca gtcacgtgggtggatattggtatcacccacctgcaaggcttgattcatgcttctgaaggt ggcaaaatgctcctggcccacacagaagtccctcggggagtgttgaatggccacttttgg gtcgggatgaggctgaaggagggccctggaacaactctcatccccaaactccacaagtct cctatgggtctgtgtccagcagaatgtctcatgacccttgggatgcagtgtgtatgccat gctggtgaagctcagagtgagggtggggctctcacctcttcatggtctgctctttgggct gtgtga